DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment NSD and Setd1a

DIOPT Version :10

Sequence 1:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster
Sequence 2:NP_821172.2 Gene:Setd1a / 233904 MGIID:2446244 Length:1716 Species:Mus musculus


Alignment Length:1509 Identity:293/1509 - (19%)
Similarity:454/1509 - (30%) Gaps:471/1509 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly     6 DAHSEIEGDAAHGNVL-----CNSASDSLTATDEVAAG---------NDESVATEGDDVEIPRDT 56
            ||.:|:...|.||...     .|....:.|.:.|..|.         |..|..:.|:|:||..|.
Mouse   538 DAGAEVPSGAGHGPCTPPPAPANFEDVAPTGSGEPGAARESPKANGQNQASPCSSGEDMEISDDD 602

  Fly    57 NNSTPVRLLDKPGQNPVQNGAQPAAEESELESQRQTPVQKQQQQRVSMVNRKRD----------- 110
            ...:|......|.|.|......|......|.|   .|:.....|...::..:.|           
Mouse   603 RGGSPPPAPTPPQQPPPPPPPPPPPPPPYLAS---LPLGYPPHQPAYLLPPRPDGPPPPEYPPPP 664

  Fly   111 ------------LINLQSALSPKYIGYANANSPTPLSDSDDTIRTTR-RRVNQAAALNNSSAGET 162
                        .:.|...|..::.|       .|:|....|...|| .::.|...|..:|||  
Mouse   665 PPPPPHIYDFVNSLELMDRLGAQWGG-------MPMSFQMQTQMLTRLHQLRQGKGLTAASAG-- 720

  Fly   163 LAHDNASPRTPGGGGGGG---------------------GDDSANQLLSKTYMSPIEKLLIKNGA 206
                     .|||..|..                     |.:.......:.|..|:.  :.....
Mouse   721 ---------PPGGAFGEAFLPFPPPQEAAYGLPYALYTQGQEGRGSYSREAYHLPLP--MAAEPL 774

  Fly   207 SSPNSTGFEAGSEDLGIRPIVRKHVKRKMKRVPKAKVTLELDEKN--QQEVDEKSVKTEPIDEEV 269
            .|.:.:|.||.........|....|......|.:...||..:.|:  |::::.|.|:..... ..
Mouse   775 PSSSVSGEEARLPHREEAEIAESKVLPSAGTVGRVLATLVQEMKSIMQRDLNRKMVENVAFG-AF 838

  Fly   270 DRTDEAPTQEAQTTAISIKSETEAEHKAAVDVHIKQEDTIRLDIVNNPVESTSIVITEEPKDLEK 334
            |:..|:..::|:....:.|.:.:.|.|.    .:|.::...|.:|                |..|
Mouse   839 DQWWESKEEKAKPFQNAAKQQAKEEDKE----KMKLKEPGMLSLV----------------DWAK 883

  Fly   335 S-----TEELAFALPLAS-----STEVDLKSPPDLSSTALATSIKSPSSVSIDSAKGLSIVTDPG 389
            |     .|..||...|..     |.:|..|.|.::|.   |:..|.|                  
Mouse   884 SGGITGIEAFAFGSGLRGALRLPSFKVKRKEPSEISE---ASEEKRP------------------ 927

  Fly   390 WPTYQVGDLFWGKVFSYCFWPCMVCPDPLGQIVGNMPSHPQRSSLDNANVPIQVHVRFFADNGRR 454
                                               .||.|.....|:                  
Mouse   928 -----------------------------------RPSTPAEEDEDD------------------ 939

  Fly   455 NWIKPENLLTFAGLKAFDDMREELRIKHGPKSAKYRQMVPKRTKVVIWRQAIEEAQAMTQIPYSD 519
                ||              ||:...:.|....|    .|||          :|.:..||     
Mouse   940 ----PE--------------REKEAGEPGRPGTK----PPKR----------DEERGKTQ----- 967

  Fly   520 RLEKFYQTYENVVTLNRQKRKRTKYMMQDTSDVGSSLYDSTDNLHNKQGTQLLAVKRERSESPFS 584
                  ..:....||:.:..:.:    |::|.......|..|....:|...:.|.|:|...|...
Mouse   968 ------GKHRKSFTLDSEGEEAS----QESSSEKDEDDDDEDEEDEEQEEAVDATKKEAEASDGE 1022

  Fly   585 PAFSPVKSKNEKRAKRRKLSNGTEADTGSNSMAVTPSQTETTVDSSAYENPEFRQLLSAVMEYVM 649
            ...|...|:....|.... .||:.:|:.|.|.:.:.|.:.::..||:.|:....:..|||:....
Mouse  1023 DEDSDSSSQCSLYADSDG-ENGSTSDSESGSSSSSSSSSSSSSSSSSSESSSEEEEQSAVIPSAS 1086

  Fly   650 MNRS--------DEKVEKVLLSVVSNIWSLKQIQLRELERDLASGEIEEPLGS----SVVGRGSG 702
            ..|.        |||.|...| |.|.:..|.:   :|......:|..|||..|    ........
Mouse  1087 PPREVPEPLPAPDEKPETDGL-VDSPVMPLSE---KETLPTQPAGPAEEPPPSVPQPPAEPPAGP 1147

  Fly   703 VGTIKRLSNR-------LMTMMVRRSMTPVVTPSTTPAPSEPDRRLSEPPKTKKPVNRPIEEVIE 760
            .....||..|       |.....||...........|.| ||........|:..||:|.:..|:|
Mouse  1148 PDAAPRLDERPSSPIPLLPPPKKRRKTVSFSAAEEAPVP-EPSTAAPLQAKSSGPVSRKVPRVVE 1211

  Fly   761 DI---LQLDSKYLFRGLSREPICKYCYQAGSDLVRCSRTCSSWLHADCLERKVTGAPMPKIGSRK 822
            ..   |.||...|.:                          ||      ..:|......:.|.|.
Mouse  1212 RTIRNLPLDHASLVK--------------------------SW------PEEVARGGRNRAGGRV 1244

  Fly   823 ALVIPPTSKSPSPDEDHVTADAKEV-VAVGTSLVCHECNVG---EPEGCVICHQVESPAVPSTPR 883
            .          |.:|:..|....|| :||...|.......|   .|.|      .:|.|..::..
Mouse  1245 R----------STEEEEATESGTEVDLAVLADLALTPARRGLATLPTG------DDSEATETSDE 1293

  Fly   884 KEDSS---SHTPIEDKLLTCSQPMCGKRFHTSCCKYWPQASSSKHSARCPRHVCHTCVSDDPSGK 945
            .|..|   ||..:|.......:|              |..:.:      ||.:       :|:..
Mouse  1294 AERPSPLLSHILLEHNYALAIKP--------------PPTTPA------PRPL-------EPAPA 1331

  Fly   946 FQQLGSSKLAKCVRCPATYHQLSKCIPAGTQMLNTTNIICPRHNIAKADAHVNVLWCYICVKGGE 1010
            ...|.||                   || .::|....::     :|:|:.....|......:.||
Mouse  1332 LAALFSS-------------------PA-DEVLEAPEVV-----VAEAEEPKQQLQQQHPEQEGE 1371

  Fly  1011 LVCCETCPIAVHAHCRNIPIKTNESYICEECESGRLPLYGEIVWAKFNNF--RWWPAIILPPTEV 1073
                           .....:..||...|...|......|.|......:.  |..|.:..||...
Mouse  1372 ---------------EEEEDEEEESESSESSSSSSSDEEGAIRRRSLRSHTRRRRPPLPPPPPPP 1421

  Fly  1074 PSNILKKAHGENDFVVRFFGTHDHGWISRRRVYLYIEGDTGDGHKTKSQLFR--------NYTTG 1130
            ||   .:...|.:.:...:...:.|.......||.:         |..:|.:        |.|..
Mouse  1422 PS---FEPRSEFEQMTILYDIWNSGLDLEDMSYLRL---------TYERLLQQTSGADWLNDTHW 1474

  Fly  1131 VEEASRFLPIIKARRQEQD--MERQSGNKLHPPPYVKIKTNKAVPPLRFSQNLEDLSTCNCLPVD 1193
            |:.....|...|.:|:.||  .|.|:|:......|          |:...:..:.|..|   ||.
Mouse  1475 VQHTITNLSTPKRKRRPQDGPREHQTGSARSEGYY----------PISKKEKDKYLDVC---PVS 1526

  Fly  1194 EHPC-GPEAGCLNRMLFNECNPEYCKAGSLCENRMFEQRKSPRL------------EVVYMNERG 1245
            .... |.:....||:|               ..|..|||   ||            :::.:|:..
Mouse  1527 ARQLEGGDTQGTNRVL---------------SERRSEQR---RLLSAIGTSAIMDSDLLKLNQLK 1573

  Fly  1246 F---------------GLVNREPIAVGDFVIEYVGEVINHAEFQRRMEQKQRDRDENYYFLGVEK 1295
            |               ||...||||..:.||||||:.|.......|.::..::...:.|...|:.
Mouse  1574 FRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDH 1638

  Fly  1296 DFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSELTFNYLWDDLMNNSK 1360
            |.||||...||||||:||.|.|||..:..|:....::.|::.:.|.|:.|:|::|.:.  :.::|
Mouse  1639 DTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFP--LEDNK 1701

  Fly  1361 KACFCGAKRCSGEI 1374
            ..|.||.:.|.|.:
Mouse  1702 IPCLCGTESCRGSL 1715

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
NSDNP_733239.1 PWWP_NSD_rpt1 395..514 CDD:438972 14/118 (12%)
PHD2_NSD 867..932 CDD:277040 10/67 (15%)
PHD3_NSD 933..988 CDD:277041 7/54 (13%)
PHD4_NSD 1001..1041 CDD:277042 5/39 (13%)
PWWP_NSD_rpt2 1048..1143 CDD:438963 18/104 (17%)
AWS 1183..1233 CDD:197795 12/50 (24%)
SET_NSD 1233..1375 CDD:380950 50/169 (30%)
Setd1aNP_821172.2 Interaction with WDR82. /evidence=ECO:0000250|UniProtKB:O15047 60..89
RRM_Set1A 92..186 CDD:409964
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 194..367
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 380..499
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 516..670 27/134 (20%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 849..869 4/23 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 911..1206 77/421 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1230..1259 6/38 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1275..1297 6/27 (22%)
HCFC1-binding motif (HBM). /evidence=ECO:0000250|UniProtKB:O15047 1307..1311 1/3 (33%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1355..1427 15/89 (17%)
Interaction with CFP1. /evidence=ECO:0000250|UniProtKB:O15047 1424..1459 5/43 (12%)
N-SET 1434..1571 CDD:463344 32/176 (18%)
Interaction with ASH2L, RBBP5 and WDR5. /evidence=ECO:0000250|UniProtKB:O15047 1459..1546 22/114 (19%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1480..1508 8/27 (30%)
WDR5 interaction motif (WIN). /evidence=ECO:0000250|UniProtKB:O15047 1501..1506 1/4 (25%)
RxxxRR motif. /evidence=ECO:0000250|UniProtKB:P38827 1546..1551 3/7 (43%)
SET_SETD1 1565..1712 CDD:380946 46/148 (31%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.