DRSC/TRiP Functional Genomics Resources




Protein Alignment upSET and Setd1a

DIOPT Version: 9

Sequence 1: NP_001261819.1  Gene: upSET / 39551  FlyBaseID: FBgn0036398  Length: 3146  Species: Drosophila melanogaster
Sequence 2: NP_821172.2  Gene: Setd1a / 233904  MGIID: 2446244  Length: 1716  Species: Mus musculus


Alignment Length: 1748  Identity: 311/1748 (17%)
Similarity: 523/1748 (29%)  Gaps: 601/1748 (34%)
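
The summary percentages can be reproduced from the raw counts. The sketch below is illustrative and not part of DIOPT: the helper names `percent_floor` and `alignment_counts` are hypothetical, and rounding down is an assumption inferred from the reported values (311/1748 is about 17.8%, shown as 17%). The toy aligned rows are made up for the example; similarity is not computed because it depends on a substitution matrix that the page does not name.

```python
def percent_floor(count: int, total: int) -> int:
    """Integer percentage rounded down (assumed rounding rule)."""
    return count * 100 // total


def alignment_counts(row_a: str, row_b: str) -> dict:
    """Count identical columns and gap columns in two aligned rows.

    Both rows must have the same length; '-' marks a gap.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    pairs = list(zip(row_a, row_b))
    identical = sum(1 for a, b in pairs if a == b and a != "-")
    gaps = sum(1 for a, b in pairs if a == "-" or b == "-")
    return {"length": len(pairs), "identical": identical, "gaps": gaps}


# Counts reported for the upSET / Setd1a alignment.
ALIGNMENT_LENGTH = 1748
REPORTED = {"identity": 311, "similarity": 523, "gaps": 601}

for name, count in REPORTED.items():
    print(f"{name}: {count}/{ALIGNMENT_LENGTH} ({percent_floor(count, ALIGNMENT_LENGTH)}%)")

# Toy aligned rows (hypothetical, not taken from the alignment).
print(alignment_counts("ACD-EF", "AC-GEF"))
```

With plain rounding instead of rounding down, the identity and similarity figures would read 18% and 30%, which is why the rounding rule matters when comparing against the page.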


- Green bases have known domain annotations that are detailed below.


  Fly   927 QAANGAAAVAAGTTLSGGLGSGLPMSEELQHRLA-------------SGLNGGFATGTGMSKKSK 978
            ::::..||..||||:.|..|:|.|.|::.....:             ....|...|..|.:..|:
Mouse   219 RSSSDTAAYPAGTTVGGTPGNGTPCSQDTNFSSSRQDTPSSFGQFTPQSSQGTPYTSRGSTPYSQ 283

  Fly   979 KTKENSGSTSTLKKTKKSAVGMG---GEKNASGSGTPTGSSGKTSKKSSKRKSKSGGDGSSGGGS 1040
            .:..:|.:|||..|.::|.....   ..::.|.|..|..::..||..::...:.|....||...|
Mouse   284 DSAYSSSTTSTSFKPRRSENSYQDSFSRRHFSTSSAPATTATATSATAAATAASSSSSSSSSSSS 348

  Fly  1041 SPALTAAEKHAANLRQWIENYE--YAVTNHYSPELRARLHAIQKQPSLLQSIQNTENKALRQIQQ 1103
            |   :::...|:..|....:|.  |...|.|.                    ::|.....|..::
Mouse   349 S---SSSSSSASQFRGSDSSYPAYYESWNRYQ--------------------RHTSYPPRRATRE 390

  Fly  1104 QLSTAGSAEQLEQRAQLIPYAGAKVLISSVDLSPHAPIHELRGKYMLTTQFRTQNPTVNMNTPPP 1168
            ..|.|..||...:|.                                                ||
Mouse   391 DPSGASFAENTAERF------------------------------------------------PP 407

  Fly  1169 S--NYLNSFKAHKTPGQFVFFYQLPGVEAPMQTLRPD----------------------GSVP-- 1207
            |  :||.......|...    |:.|..|||    .|:                      |..|  
Mouse   408 SYTSYLAPEPNRSTDQD----YRPPASEAP----PPEPPEPGGGGGGSGGGGGGGGGGGGGAPSP 464

  Fly  1208 --QVAQQPP-------SYLKGPEVCVDTRTYGNDARFVRRSCRPNAELQHYFEKGTLHLYIVALT 1263
              :.|:.||       |....||...::..:.                ||              :
Mouse   465 EREEARTPPRPASPARSGSPAPETTNESVPFA----------------QH--------------S 499

  Fly  1264 HIRAQTEITIRHEPHDLTAVEQKKSHAAVIQPTSTRCACDMGSDCLFALPLAVQQQLQAPPTQPR 1328
            .:.::.|:.::         ||:...:.:...|                               .
Mouse   500 SLDSRIEMLLK---------EQRSKFSFLASDT-------------------------------E 524

  Fly  1329 SSHRNKAAAAAAAAAAANSAAAIQLTMGLGVGATVAAGASVLPNSRNRSTSSSGESSQMGLNSPQ 1393
            ....|.:|...|..|.|      ::..|.|.|......|..  |..:.:.:.|||          
Mouse   525 EEEENSSAGPGARDAGA------EVPSGAGHGPCTPPPAPA--NFEDVAPTGSGE---------- 571

  Fly  1394 LGQLNLGFKTSVTATSLTAPVPGVHCNNSGGSSSSSNNSCSVSMSSVLHDSGICTSSSSPSVSIP 1458
                                 ||....:...:..:..:.||......:.|.   ....||.   |
Mouse   572 ---------------------PGAARESPKANGQNQASPCSSGEDMEISDD---DRGGSPP---P 609

  Fly  1459 SPTPTQMQSPTLQQHPQQIPQQQLSLLQRSPTQQHQQQILAALPTPMLTPMLSPQL-PKPAQQQA 1522
            :|||.|...|.....|...|....||....|  .||...|.. |.|...|  .|:. |.|.....
Mouse   610 APTPPQQPPPPPPPPPPPPPPYLASLPLGYP--PHQPAYLLP-PRPDGPP--PPEYPPPPPPPPP 669

  Fly  1523 HVV------------------LPQS--QQTSLLQQQQSQQSQEPLAVIAAAAAAQQPMATY---F 1564
            |:.                  :|.|  .||.:|.:....:..:.|    .||:|..|...:   |
Mouse   670 HIYDFVNSLELMDRLGAQWGGMPMSFQMQTQMLTRLHQLRQGKGL----TAASAGPPGGAFGEAF 730

  Fly  1565 VRQPQQQQQQQSPKPQALVAQQQHVVGAQQQQHF-----LQQQQKQQQQQMADEARM---AVSAL 1621
            :..|..|:.... .|.||..|.|...|:..::.:     :..:.........:|||:   ..:.:
Mouse   731 LPFPPPQEAAYG-LPYALYTQGQEGRGSYSREAYHLPLPMAAEPLPSSSVSGEEARLPHREEAEI 794

  Fly  1622 QTLHAAPTSHIVSPIKVAAVQQQSQPQQQQQNTHQQPHNQQAVQQQS----NQLQQQQSQQPNYP 1682
            ......|::..|..:....|       |:.::..|:..|::.|:..:    :|..:.:.::....
Mouse   795 AESKVLPSAGTVGRVLATLV-------QEMKSIMQRDLNRKMVENVAFGAFDQWWESKEEKAKPF 852

  Fly  1683 QSPQRQQKPQPVQHQPQIVISTGAQAIPATMPTKLSSP--------TKSAAPVISNNNITVSAQS 1739
            |:..:||..:..:.:                 .||..|        .||.    ....|...|..
Mouse   853 QNAAKQQAKEEDKEK-----------------MKLKEPGMLSLVDWAKSG----GITGIEAFAFG 896

  Fly  1740 SVVGG----------KKTPAKHPQQQQQQQQQPVTPVSAATAPAATPSSSESKEDD----VSASS 1790
            |.:.|          :|.|::..:..::::.:|.||             :|..|||    ..|..
Mouse   897 SGLRGALRLPSFKVKRKEPSEISEASEEKRPRPSTP-------------AEEDEDDPEREKEAGE 948

  Fly  1791 TTTPTTRTPAKDKPKQSREDRKLEAILRAIEKMEKQEARGKKDTRQSSGGKRQASNSPASPNKRN 1855
            ...|.|:.|.:|:                 |:.:.|....|..|..|.|  .:||...:|....:
Mouse   949 PGRPGTKPPKRDE-----------------ERGKTQGKHRKSFTLDSEG--EEASQESSSEKDED 994

  Fly  1856 SSNSISEDVETPTSTNSAAAAAQRRNKKKRKVSRSLNNNTNGLGSGGGSNNKRRKSIVVESDGES 1920
            ..:...||.|...:.::        .||:.:.|.         |....|::..:.|:..:||||:
Mouse   995 DDDEDEEDEEQEEAVDA--------TKKEAEASD---------GEDEDSDSSSQCSLYADSDGEN 1042

  Fly  1921 HALTNSE---------SEDQGQHPQSHHSGSEDQAAGLLLALAHNNSSPNEPFKS---------- 1966
            .:.::||         |........|..|.||::....::..|.......||..:          
Mouse  1043 GSTSDSESGSSSSSSSSSSSSSSSSSSESSSEEEEQSAVIPSASPPREVPEPLPAPDEKPETDGL 1107

  Fly  1967 ------PLSQSHSLPATPASVSSACLLIEAAMGPLQQQP------------APASASPSLAE--- 2010
                  |||:..:||..||             ||.::.|            .|..|:|.|.|   
Mouse  1108 VDSPVMPLSEKETLPTQPA-------------GPAEEPPPSVPQPPAEPPAGPPDAAPRLDERPS 1159

  Fly  2011 --FKYPPGGAKTKKSLMSSWFQQAEQQHASGLDSLVQAAMSEINGEREQLQRQPQGESLPAPALL 2073
              ....|...|.:|::..|..::|.....|      .||         .||.:..|     |...
Mouse  1160 SPIPLLPPPKKRRKTVSFSAAEEAPVPEPS------TAA---------PLQAKSSG-----PVSR 1204

  Fly  2074 KVEQFIHQAESTTAVPAREQLHLPLQNNSSVKKRWLRQ-AISEETTPVDELQQSQNQSVTATPSP 2137
            ||.:.:.          |...:||| :::|:.|.|..: |..........::.::.:..|.:.:.
Mouse  1205 KVPRVVE----------RTIRNLPL-DHASLVKSWPEEVARGGRNRAGGRVRSTEEEEATESGTE 1258

  Fly  2138 QPVPTVSPLANGFSTPLKKRRLVVVSNGTNVESDETHIDVIGEPKDEAEENVAMTELKVEIENHH 2202
            ..:..::.||   .|| .:|.|..:..|.:.|:.||        .||||.               
Mouse  1259 VDLAVLADLA---LTP-ARRGLATLPTGDDSEATET--------SDEAER--------------- 1296

  Fly  2203 QEQDDDVDILRSPSPGTHQIVAEDN-LVKIEPEDT-----------------SAAADD------- 2242
                        |||....|:.|.| .:.|:|..|                 |:.||:       
Mouse  1297 ------------PSPLLSHILLEHNYALAIKPPPTTPAPRPLEPAPALAALFSSPADEVLEAPEV 1349

  Fly  2243 VKIDVEREESQACDKFEEMVKVKREEEEQREKEI-----------------KQLQERQEHEQPKV 2290
            |..:.|..:.|...:..|....:.||:|:.|.|.                 :.|:......:|.:
Mouse  1350 VVAEAEEPKQQLQQQHPEQEGEEEEEDEEEESESSESSSSSSSDEEGAIRRRSLRSHTRRRRPPL 1414

  Fly  2291 EPAPVEPKLENTVAKAEPKVEPSQEIVSK-------KEPTKVEPKPGESLLRST--ATVTATPTA 2346
            .|.|..|          |..||..|....       .....:|..   |.||.|  ..:..|..|
Mouse  1415 PPPPPPP----------PSFEPRSEFEQMTILYDIWNSGLDLEDM---SYLRLTYERLLQQTSGA 1466

  Fly  2347 ATIAAT-----TLLDVSKVAFKTRPPLKLEDEP---QKKKPKLESILPA------PVATVPPVSV 2397
            ..:..|     |:.::|....|.||    :|.|   |....:.|...|.      ....|.|||.
Mouse  1467 DWLNDTHWVQHTITNLSTPKRKRRP----QDGPREHQTGSARSEGYYPISKKEKDKYLDVCPVSA 1527

  Fly  2398 PPIPAASNATTSAVTNTAAASLTTTTAPSSTKNLTEHDIQE-RLLSFHAANISYLQSR 2454
            ..:.......|:.|.:...:......:...|..:.:.|:.: ..|.|....:.:.:||
Mouse  1528 RQLEGGDTQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSR 1585

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence        Domain                                                                          Region       External ID  Identity
upSET   NP_001261819.1  PHD_MLL5                                                                        856..899     CDD:277025
                        SET                                                                             <1221..1273  CDD:214614   4/51 (8%)
Setd1a  NP_821172.2     RRM_Set1A                                                                       92..186      CDD:240992
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               194..367                  33/150 (22%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               380..499                  29/204 (14%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               516..670                  41/234 (18%)
                        Topoisomer_IB_N                                                                 <835..>883   CDD:322080   7/64 (11%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               849..869                  3/36 (8%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               911..1206                 73/376 (19%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               1230..1259                2/28 (7%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               1275..1297                9/56 (16%)
                        HCFC1-binding motif (HBM). /evidence=ECO:0000250|UniProtKB:O15047               1307..1311                2/3 (67%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               1355..1427                15/81 (19%)
                        Interaction with CFP1. /evidence=ECO:0000250|UniProtKB:O15047                   1424..1459                8/37 (22%)
                        N-SET                                                                           1434..1571   CDD:314603   26/143 (18%)
                        Interaction with ASH2L, RBBP5 and WDR5. /evidence=ECO:0000250|UniProtKB:O15047  1459..1546                19/90 (21%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               1480..1508                8/31 (26%)
                        WDR5 interaction motif (WIN). /evidence=ECO:0000250|UniProtKB:O15047            1501..1506                0/4 (0%)
                        SET                                                                             1577..1700   CDD:214614   2/9 (22%)
                        PostSET                                                                         1700..1716   CDD:214703
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              BLAST Result  Score    Score Type                     Cluster ID
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         0             0.000           Not matched by this tool.
RoundUp         1             1.030           -             avgDist  Average_Evolutionary_Distance  R4489
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
Total           1             1.030
