DRSC/TRiP Functional Genomics Resources

Protein Alignment Set2 and Setd1a

DIOPT Version: 9

Sequence 1: NP_572888.2  Gene: Set2 / 32301  FlyBaseID: FBgn0030486  Length: 2362  Species: Drosophila melanogaster
Sequence 2: NP_821172.2  Gene: Setd1a / 233904  MGIID: 2446244  Length: 1716  Species: Mus musculus


Alignment Length: 1770  Identity: 343/1770 (19%)
Similarity: 578/1770 (32%)  Gaps: 517/1770 (29%)
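
The percentages in the summary follow from dividing each count by the alignment length. The sketch below recomputes them from the figures reported on this page. The helper name `percent_of_alignment` is hypothetical, and truncating to a whole percent is an assumption, chosen only because it reproduces the displayed 19%, 32%, and 29% (578/1770 is about 32.7%, so rounding would give 33% instead).

```python
# Figures copied from the alignment summary on this page.
ALIGNMENT_LENGTH = 1770
COUNTS = {"identity": 343, "similarity": 578, "gaps": 517}


def percent_of_alignment(count, length=ALIGNMENT_LENGTH):
    """Return count/length as a whole-number percentage.

    Truncation (integer division) is an assumption about how the
    page derives its displayed values, not documented behaviour.
    """
    return (100 * count) // length


summary = {name: percent_of_alignment(c) for name, c in COUNTS.items()}

for name, pct in summary.items():
    print(f"{name}: {COUNTS[name]}/{ALIGNMENT_LENGTH} ({pct}%)")
```

Note that the denominator is the alignment length including gap columns, not the length of either protein, so these values are not directly comparable with per-domain identities, which use the domain's own aligned length.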


- Green bases have known domain annotations that are detailed below.


  Fly     5 GPPSNPSPVA---SRGRGRGRPPKVALSALGNTPPHINPSLKHADAEASPTAPEDQDSGQSECRR 66
            |.|.|.:|.:   :....|...|    |:.|...|..:....:....::|.:   |||..|....
Mouse   235 GTPGNGTPCSQDTNFSSSRQDTP----SSFGQFTPQSSQGTPYTSRGSTPYS---QDSAYSSSTT 292

  Fly    67 SSRKKIIKFDVRDLLNKNRKAHKIQIEAR---IDSNPSTGHSQSGTTAASTSMSTATASAASASS 128
            |:     .|..|    ::..:::.....|   ..|.|:|..:.:..|||:|:.|::::|::|:||
Mouse   293 ST-----SFKPR----RSENSYQDSFSRRHFSTSSAPATTATATSATAAATAASSSSSSSSSSSS 348

  Fly   129 AATVSRLFSMFEMSHQSLPP--------------PP---------------------PPPTALEI 158
            :::.|...|.|..|..|.|.              ||                     ||.....:
Mouse   349 SSSSSSSASQFRGSDSSYPAYYESWNRYQRHTSYPPRRATREDPSGASFAENTAERFPPSYTSYL 413

  Fly   159 FAKP--------RPTQSLIVAQVTSEPSAVGGA----------------HPVQTMAGLP--PVTP 197
            ..:|        ||..|........||...||.                .|.:..|..|  |.:|
Mouse   414 APEPNRSTDQDYRPPASEAPPPEPPEPGGGGGGSGGGGGGGGGGGGGAPSPEREEARTPPRPASP 478

  Fly   198 RKRGRPRKSQLADAAIIPTVIVPSCSDSDTNS---------------TSTTTSNMSSDSGELPGF 247
            .:.|.|......::       ||....|..:|               .::.|.....:|...|| 
Mouse   479 ARSGSPAPETTNES-------VPFAQHSSLDSRIEMLLKEQRSKFSFLASDTEEEEENSSAGPG- 535

  Fly   248 PIQKPKSKLRVSLKRLKLGGRLESSDSGNSPSSSSP------EVEPPALQDENAMDERPKQE-QN 305
                           .:..|....|.:|:.|.:..|      :|.|....:..|..|.||.. ||
Mouse   536 ---------------ARDAGAEVPSGAGHGPCTPPPAPANFEDVAPTGSGEPGAARESPKANGQN 585

  Fly   306 LSRMVDAEENSDSDSQIIFIEIETESPKGEEEQEEGRPVEVEPQDLIDIDMELAKQEPTPDPEED 370
            .:....:.|:         :||..:...|...     |....||           |.|.|.|...
Mouse   586 QASPCSSGED---------MEISDDDRGGSPP-----PAPTPPQ-----------QPPPPPPPPP 625

  Fly   371 LDEIMVEVLSGPPSLWSADDEAEEEEDATVQRATPPGKEPAA-------------DSCSSAPRRS 422
            ...        ||.|.|........:.|.:....|.|..|..             |..:|.....
Mouse   626 PPP--------PPYLASLPLGYPPHQPAYLLPPRPDGPPPPEYPPPPPPPPPHIYDFVNSLELMD 682

  Fly   423 RRSAPLSG----------------SSRQGKTLEETFA-----------------EIAAESSKQIL 454
            |..|...|                ..||||.|....|                 :.||......|
Mouse   683 RLGAQWGGMPMSFQMQTQMLTRLHQLRQGKGLTAASAGPPGGAFGEAFLPFPPPQEAAYGLPYAL 747

  Fly   455 -----EAEESQDQEEQHILIDLIEDTLSESEVTSSVSPTIEHMVVEEVVVEENQLVDEADEI--- 511
                 |...|..:|..|:.:.:..:.|..|.| |.....:.|.  ||..:.|::::..|..:   
Mouse   748 YTQGQEGRGSYSREAYHLPLPMAAEPLPSSSV-SGEEARLPHR--EEAEIAESKVLPSAGTVGRV 809

  Fly   512 ---LDSKQEFVIKKVFSES--DNIAASLNKDIFEPKVETKATCGEVVPRPEMVTEDVYITEGIAA 571
               |..:.:.::::..:..  :|:|.......:|.| |.||                        
Mouse   810 LATLVQEMKSIMQRDLNRKMVENVAFGAFDQWWESK-EEKA------------------------ 849

  Fly   572 TLEKSAVVTKPTTEMIAETKLSDEVVIEPPLKDESDPKQTEVELPESKPAVNIPKSERILSAEVE 636
                     || .:..|:.:..:|           |.::.:::.|.....|:..||..|...|..
Mouse   850 ---------KP-FQNAAKQQAKEE-----------DKEKMKLKEPGMLSLVDWAKSGGITGIEAF 893

  Fly   637 TTSSPLVPPECCTLESVSGPVLLETSLSTEEKSNENVETTPLKTEAAKEDSP-PAAPEEEASNSS 700
            ...|.|           .|.:.| .|...:.|....:      :||::|..| |:.|.||..:..
Mouse   894 AFGSGL-----------RGALRL-PSFKVKRKEPSEI------SEASEEKRPRPSTPAEEDEDDP 940

  Fly   701 EEPNFLLEDYESNQEQVAEDEMMKCNNQKGQKQTPLPEMKEPEKPVAETVSKKEKAMENPARSSP 765
            |......|......:....||.......|.:|...|.  .|.|:...|:.|:|::..::..... 
Mouse   941 EREKEAGEPGRPGTKPPKRDEERGKTQGKHRKSFTLD--SEGEEASQESSSEKDEDDDDEDEED- 1002

  Fly   766 AIVDKKVRAGEMEKKVVKSTKGTVPEKKMDSKKSCAAVTPAKQKESGKSAKEAILKKETEKEKSS 830
               :::..|.:..||..:::.|  .::..||...|:....: ..|:|.::       ::|...||
Mouse  1003 ---EEQEEAVDATKKEAEASDG--EDEDSDSSSQCSLYADS-DGENGSTS-------DSESGSSS 1054

  Fly   831 AKLDSSSPNTLDKKGKDTAQWSPQLQTLPKSSTKPPQE-----SAPSVISKTTSNQPAP------ 884
            :...|||.::.....:.:::...|...:|.:|  ||:|     .||....:|.....:|      
Mouse  1055 SSSSSSSSSSSSSSSESSSEEEEQSAVIPSAS--PPREVPEPLPAPDEKPETDGLVDSPVMPLSE 1117

  Fly   885 KEEQHAAKKGLSDNSPPSVLKAKEKAVSGFVECDAMFKAMDLANAQLRLDEK----------NKK 939
            ||.......|.::..||||.:...:..:|            ..:|..||||:          .||
Mouse  1118 KETLPTQPAGPAEEPPPSVPQPPAEPPAG------------PPDAAPRLDERPSSPIPLLPPPKK 1170

  Fly   940 KLKKVP-TKVEAPPKVEPPTAVPVPGQKKSLSGKTSLRRNTVYE--------DSPNLERNSSPSS 995
            :.|.|. :..|..|..||.||.|:  |.|| ||..|.:...|.|        |..:|.::.....
Mouse  1171 RRKTVSFSAAEEAPVPEPSTAAPL--QAKS-SGPVSRKVPRVVERTIRNLPLDHASLVKSWPEEV 1232

  Fly   996 DSAQANTSAGKLKPSKVKKKINPRRSTICEAAKDLRSSSSSSTPTREVAASSPVSTSSDSSSKRN 1060
            .....|.:.|:::.::.::..............||     :.||.|...|:.|....|:::...:
Mouse  1233 ARGGRNRAGGRVRSTEEEEATESGTEVDLAVLADL-----ALTPARRGLATLPTGDDSEATETSD 1292

  Fly  1061 GSKRTTSDLDGGSKLDQRRYTICEDRQPETAIPVPLTKRRFSMHPKASANPLHDTLLQTAGKKRG 1125
            .::|.:..|  ...|.:..|.:.....|.|..|.||                             
Mouse  1293 EAERPSPLL--SHILLEHNYALAIKPPPTTPAPRPL----------------------------- 1326

  Fly  1126 RKEGKESLSRQNSLDSSSSASQGAPKKKALKSAEILSAALLETESSESTSSGSKMSRWDVQTSPE 1190
              |...:|:   :|.||       |..:.|::.|::.|   |.|..:....         |..||
Mouse  1327 --EPAPALA---ALFSS-------PADEVLEAPEVVVA---EAEEPKQQLQ---------QQHPE 1367

  Fly  1191 LEAANPFGDIAKFIEDGVNLLKRDKVDEDQRKEGQDEVKREADPEEDEFAQRVANMET-----PA 1250
            .|.                  :.::.||::..|..:.....:..||....:|.....|     |.
Mouse  1368 QEG------------------EEEEEDEEEESESSESSSSSSSDEEGAIRRRSLRSHTRRRRPPL 1414

  Fly  1251 TTPTPSPTQSNPEDSASTTTVLKEL----------------------ETGGG--VRRSHRI---- 1287
            ..|.|.|....|.......|:|.::                      :|.|.  :..:|.:    
Mouse  1415 PPPPPPPPSFEPRSEFEQMTILYDIWNSGLDLEDMSYLRLTYERLLQQTSGADWLNDTHWVQHTI 1479

  Fly  1288 --------KQKPQ-GPRASQGRGVASVALAPISMDE--QLAELANIEAINEQFLRSEGLNTFQLL 1341
                    |::|| |||..|.....|....|||..|  :..::..:.|...:...::|.|     
Mouse  1480 TNLSTPKRKRRPQDGPREHQTGSARSEGYYPISKKEKDKYLDVCPVSARQLEGGDTQGTN----- 1539

  Fly  1342 KENFYRCARQVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGP---LCSNGARCTNK 1403
                    |.:|:..:|                        .|.|:...|.   :.|:..:....
Mouse  1540 --------RVLSERRSE------------------------QRRLLSAIGTSAIMDSDLLKLNQL 1572

  Fly  1404 RFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHLYSKDRNRHYYFM 1468
            :|::.:   .|..|:.....|:.|...|...|.::||||:.|.....:.|:..|.::.....|..
Mouse  1573 KFRKKK---LRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLF 1634

  Fly  1469 ALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEEITFDYQYLRYG 1533
            .:..:.:||||..||::|:|||.|.||...:..|:..:.:|..:|.:||...||||:||::....
Mouse  1635 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLED 1699

  Fly  1534 RDAQRCYCEAANCRG 1548
            .... |.|...:|||
Mouse  1700 NKIP-CLCGTESCRG 1713

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence     Region      External ID Identity      Domain
Set2    NP_572888.2  1358..1410  CDD:197795  6/54 (11%)    AWS
                     1414..1533  CDD:214614  38/118 (32%)  SET
                     1535..1551  CDD:214703  5/14 (36%)    PostSET
                     2014..2043  CDD:278809  -             WW
                     2270..2348  CDD:285448  -             SRI
Setd1a  NP_821172.2  92..186     CDD:240992  -             RRM_Set1A
                     194..367    -           35/147 (24%)  Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                     380..499    -           21/125 (17%)  Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                     516..670    -           36/202 (18%)  Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                     <835..>883  CDD:322080  12/93 (13%)   Topoisomer_IB_N
                     849..869    -           6/64 (9%)     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                     911..1206   -           76/333 (23%)  Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                     1230..1259  -           2/28 (7%)     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                     1275..1297  -           3/21 (14%)    Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                     1307..1311  -           0/3 (0%)      HCFC1-binding motif (HBM). /evidence=ECO:0000250|UniProtKB:O15047
                     1355..1427  -           16/98 (16%)   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                     1424..1459  -           3/34 (9%)     Interaction with CFP1. /evidence=ECO:0000250|UniProtKB:O15047
                     1434..1571  CDD:314603  27/173 (16%)  N-SET
                     1459..1546  -           20/99 (20%)   Interaction with ASH2L, RBBP5 and WDR5. /evidence=ECO:0000250|UniProtKB:O15047
                     1480..1508  -           8/27 (30%)    Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                     1501..1506  -           0/4 (0%)      WDR5 interaction motif (WIN). /evidence=ECO:0000250|UniProtKB:O15047
                     1577..1700  CDD:214614  38/125 (30%)  SET
                     1700..1716  CDD:214703  5/15 (33%)    PostSET
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information (BLAST Result, Score, Score Type, Cluster ID)
Compara         1             0.930           - - C167848414
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
RoundUp         1             1.030           - avgDist Average_Evolutionary_Distance R4489
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
Total           3             2.870
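
The closing figures under the tool table read as a simple score of 3 and a weighted score of 2.870, which is consistent with summing the per-tool rows: three tools report a match and the remaining fifteen contribute zero. The sketch below checks that arithmetic using only the values listed on this page. The variable names are illustrative, and treating the last row as a plain column sum is an inference from the numbers, not a statement of how DIOPT computes its scores.

```python
from decimal import Decimal

# (simple score, weighted score) for the tools that reported a match.
# Tools marked "Not matched by this tool." contribute 0 and 0.000.
MATCHED = {
    "Compara": (1, Decimal("0.930")),
    "Phylome": (1, Decimal("0.910")),
    "RoundUp": (1, Decimal("1.030")),
}

# Decimal keeps the three-decimal formatting exact, avoiding
# binary floating-point noise in the weighted total.
total_simple = sum(simple for simple, _ in MATCHED.values())
total_weighted = sum(
    (weighted for _, weighted in MATCHED.values()), Decimal("0")
)

print(f"simple score: {total_simple}, weighted score: {total_weighted}")
```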
