DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment NSD and Setd1a

DIOPT Version :9

Sequence 1:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster
Sequence 2:NP_821172.2 Gene:Setd1a / 233904 MGIID:2446244 Length:1716 Species:Mus musculus


Alignment Length:1509 Identity:293/1509 - (19%)
Similarity:454/1509 - (30%) Gaps:471/1509 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly     6 DAHSEIEGDAAHGNVL-----CNSASDSLTATDEVAAG---------NDESVATEGDDVEIPRDT 56
            ||.:|:...|.||...     .|....:.|.:.|..|.         |..|..:.|:|:||..|.
Mouse   538 DAGAEVPSGAGHGPCTPPPAPANFEDVAPTGSGEPGAARESPKANGQNQASPCSSGEDMEISDDD 602

  Fly    57 NNSTPVRLLDKPGQNPVQNGAQPAAEESELESQRQTPVQKQQQQRVSMVNRKRD----------- 110
            ...:|......|.|.|......|......|.|   .|:.....|...::..:.|           
Mouse   603 RGGSPPPAPTPPQQPPPPPPPPPPPPPPYLAS---LPLGYPPHQPAYLLPPRPDGPPPPEYPPPP 664

  Fly   111 ------------LINLQSALSPKYIGYANANSPTPLSDSDDTIRTTR-RRVNQAAALNNSSAGET 162
                        .:.|...|..::.|       .|:|....|...|| .::.|...|..:|||  
Mouse   665 PPPPPHIYDFVNSLELMDRLGAQWGG-------MPMSFQMQTQMLTRLHQLRQGKGLTAASAG-- 720

  Fly   163 LAHDNASPRTPGGGGGGG---------------------GDDSANQLLSKTYMSPIEKLLIKNGA 206
                     .|||..|..                     |.:.......:.|..|:.  :.....
Mouse   721 ---------PPGGAFGEAFLPFPPPQEAAYGLPYALYTQGQEGRGSYSREAYHLPLP--MAAEPL 774

  Fly   207 SSPNSTGFEAGSEDLGIRPIVRKHVKRKMKRVPKAKVTLELDEKN--QQEVDEKSVKTEPIDEEV 269
            .|.:.:|.||.........|....|......|.:...||..:.|:  |::::.|.|:..... ..
Mouse   775 PSSSVSGEEARLPHREEAEIAESKVLPSAGTVGRVLATLVQEMKSIMQRDLNRKMVENVAFG-AF 838

  Fly   270 DRTDEAPTQEAQTTAISIKSETEAEHKAAVDVHIKQEDTIRLDIVNNPVESTSIVITEEPKDLEK 334
            |:..|:..::|:....:.|.:.:.|.|.    .:|.::...|.:|                |..|
Mouse   839 DQWWESKEEKAKPFQNAAKQQAKEEDKE----KMKLKEPGMLSLV----------------DWAK 883

  Fly   335 S-----TEELAFALPLAS-----STEVDLKSPPDLSSTALATSIKSPSSVSIDSAKGLSIVTDPG 389
            |     .|..||...|..     |.:|..|.|.::|.   |:..|.|                  
Mouse   884 SGGITGIEAFAFGSGLRGALRLPSFKVKRKEPSEISE---ASEEKRP------------------ 927

  Fly   390 WPTYQVGDLFWGKVFSYCFWPCMVCPDPLGQIVGNMPSHPQRSSLDNANVPIQVHVRFFADNGRR 454
                                               .||.|.....|:                  
Mouse   928 -----------------------------------RPSTPAEEDEDD------------------ 939

  Fly   455 NWIKPENLLTFAGLKAFDDMREELRIKHGPKSAKYRQMVPKRTKVVIWRQAIEEAQAMTQIPYSD 519
                ||              ||:...:.|....|    .|||          :|.:..||     
Mouse   940 ----PE--------------REKEAGEPGRPGTK----PPKR----------DEERGKTQ----- 967

  Fly   520 RLEKFYQTYENVVTLNRQKRKRTKYMMQDTSDVGSSLYDSTDNLHNKQGTQLLAVKRERSESPFS 584
                  ..:....||:.:..:.:    |::|.......|..|....:|...:.|.|:|...|...
Mouse   968 ------GKHRKSFTLDSEGEEAS----QESSSEKDEDDDDEDEEDEEQEEAVDATKKEAEASDGE 1022

  Fly   585 PAFSPVKSKNEKRAKRRKLSNGTEADTGSNSMAVTPSQTETTVDSSAYENPEFRQLLSAVMEYVM 649
            ...|...|:....|.... .||:.:|:.|.|.:.:.|.:.::..||:.|:....:..|||:....
Mouse  1023 DEDSDSSSQCSLYADSDG-ENGSTSDSESGSSSSSSSSSSSSSSSSSSESSSEEEEQSAVIPSAS 1086

  Fly   650 MNRS--------DEKVEKVLLSVVSNIWSLKQIQLRELERDLASGEIEEPLGS----SVVGRGSG 702
            ..|.        |||.|...| |.|.:..|.:   :|......:|..|||..|    ........
Mouse  1087 PPREVPEPLPAPDEKPETDGL-VDSPVMPLSE---KETLPTQPAGPAEEPPPSVPQPPAEPPAGP 1147

  Fly   703 VGTIKRLSNR-------LMTMMVRRSMTPVVTPSTTPAPSEPDRRLSEPPKTKKPVNRPIEEVIE 760
            .....||..|       |.....||...........|.| ||........|:..||:|.:..|:|
Mouse  1148 PDAAPRLDERPSSPIPLLPPPKKRRKTVSFSAAEEAPVP-EPSTAAPLQAKSSGPVSRKVPRVVE 1211

  Fly   761 DI---LQLDSKYLFRGLSREPICKYCYQAGSDLVRCSRTCSSWLHADCLERKVTGAPMPKIGSRK 822
            ..   |.||...|.:                          ||      ..:|......:.|.|.
Mouse  1212 RTIRNLPLDHASLVK--------------------------SW------PEEVARGGRNRAGGRV 1244

  Fly   823 ALVIPPTSKSPSPDEDHVTADAKEV-VAVGTSLVCHECNVG---EPEGCVICHQVESPAVPSTPR 883
            .          |.:|:..|....|| :||...|.......|   .|.|      .:|.|..::..
Mouse  1245 R----------STEEEEATESGTEVDLAVLADLALTPARRGLATLPTG------DDSEATETSDE 1293

  Fly   884 KEDSS---SHTPIEDKLLTCSQPMCGKRFHTSCCKYWPQASSSKHSARCPRHVCHTCVSDDPSGK 945
            .|..|   ||..:|.......:|              |..:.:      ||.:       :|:..
Mouse  1294 AERPSPLLSHILLEHNYALAIKP--------------PPTTPA------PRPL-------EPAPA 1331

  Fly   946 FQQLGSSKLAKCVRCPATYHQLSKCIPAGTQMLNTTNIICPRHNIAKADAHVNVLWCYICVKGGE 1010
            ...|.||                   || .::|....::     :|:|:.....|......:.||
Mouse  1332 LAALFSS-------------------PA-DEVLEAPEVV-----VAEAEEPKQQLQQQHPEQEGE 1371

  Fly  1011 LVCCETCPIAVHAHCRNIPIKTNESYICEECESGRLPLYGEIVWAKFNNF--RWWPAIILPPTEV 1073
                           .....:..||...|...|......|.|......:.  |..|.:..||...
Mouse  1372 ---------------EEEEDEEEESESSESSSSSSSDEEGAIRRRSLRSHTRRRRPPLPPPPPPP 1421

  Fly  1074 PSNILKKAHGENDFVVRFFGTHDHGWISRRRVYLYIEGDTGDGHKTKSQLFR--------NYTTG 1130
            ||   .:...|.:.:...:...:.|.......||.:         |..:|.:        |.|..
Mouse  1422 PS---FEPRSEFEQMTILYDIWNSGLDLEDMSYLRL---------TYERLLQQTSGADWLNDTHW 1474

  Fly  1131 VEEASRFLPIIKARRQEQD--MERQSGNKLHPPPYVKIKTNKAVPPLRFSQNLEDLSTCNCLPVD 1193
            |:.....|...|.:|:.||  .|.|:|:......|          |:...:..:.|..|   ||.
Mouse  1475 VQHTITNLSTPKRKRRPQDGPREHQTGSARSEGYY----------PISKKEKDKYLDVC---PVS 1526

  Fly  1194 EHPC-GPEAGCLNRMLFNECNPEYCKAGSLCENRMFEQRKSPRL------------EVVYMNERG 1245
            .... |.:....||:|               ..|..|||   ||            :::.:|:..
Mouse  1527 ARQLEGGDTQGTNRVL---------------SERRSEQR---RLLSAIGTSAIMDSDLLKLNQLK 1573

  Fly  1246 F---------------GLVNREPIAVGDFVIEYVGEVINHAEFQRRMEQKQRDRDENYYFLGVEK 1295
            |               ||...||||..:.||||||:.|.......|.::..::...:.|...|:.
Mouse  1574 FRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDH 1638

  Fly  1296 DFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSELTFNYLWDDLMNNSK 1360
            |.||||...||||||:||.|.|||..:..|:....::.|::.:.|.|:.|:|::|.:.  :.::|
Mouse  1639 DTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFP--LEDNK 1701

  Fly  1361 KACFCGAKRCSGEI 1374
            ..|.||.:.|.|.:
Mouse  1702 IPCLCGTESCRGSL 1715

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
NSDNP_733239.1 MSH6_like 391..508 CDD:99898 13/116 (11%)
PHD2_NSD 867..932 CDD:277040 10/67 (15%)
PHD3_NSD 933..988 CDD:277041 7/54 (13%)
PHD4_NSD 1001..1041 CDD:277042 5/39 (13%)
WHSC1_related 1047..1141 CDD:99899 18/103 (17%)
AWS 1183..1233 CDD:197795 12/50 (24%)
SET 1234..1354 CDD:214614 44/146 (30%)
Setd1aNP_821172.2 RRM_Set1A 92..186 CDD:240992
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 194..367
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 380..499
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 516..670 27/134 (20%)
Topoisomer_IB_N <835..>883 CDD:322080 10/68 (15%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 849..869 4/23 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 911..1206 77/421 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1230..1259 6/38 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1275..1297 6/27 (22%)
HCFC1-binding motif (HBM). /evidence=ECO:0000250|UniProtKB:O15047 1307..1311 1/3 (33%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1355..1427 15/89 (17%)
Interaction with CFP1. /evidence=ECO:0000250|UniProtKB:O15047 1424..1459 5/43 (12%)
N-SET 1434..1571 CDD:314603 32/176 (18%)
Interaction with ASH2L, RBBP5 and WDR5. /evidence=ECO:0000250|UniProtKB:O15047 1459..1546 22/114 (19%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1480..1508 8/27 (30%)
WDR5 interaction motif (WIN). /evidence=ECO:0000250|UniProtKB:O15047 1501..1506 1/4 (25%)
SET 1577..1700 CDD:214614 40/124 (32%)
PostSET 1700..1716 CDD:214703 6/16 (38%)
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 00.000 Not matched by this tool.
eggNOG 00.000 Not matched by this tool.
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
RoundUp 1 1.030 - avgDist Average_Evolutionary_Distance R4489
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 00.000 Not matched by this tool.
TreeFam 00.000 Not matched by this tool.
21.940

Return to query results.
Submit another query.