DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment NSD and SETD1B

DIOPT Version :10

Sequence 1:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster
Sequence 2:NP_001340274.1 Gene:SETD1B / 23067 HGNCID:29187 Length:1966 Species:Homo sapiens


Alignment Length:1454 Identity:280/1454 - (19%)
Similarity:442/1454 - (30%) Gaps:526/1454 - (36%)


- Green bases have known domain annotations that are detailed below.


  Fly    33 DEVAA------GNDESVATEGDDVEI-----------------PRDTNNSTPVRLLDKPGQNPVQ 74
            |.:|:      |..|.:..||..:.|                 |.||.:|...:.| :|      
Human   926 DRIASCLLESWGKGEGLGYEGLGLGIGLRGAIRLPSFKVKRKEPPDTTSSGDQKRL-RP------ 983

  Fly    75 NGAQPAAEESELESQRQTPVQKQQQQRVSMVNRKRDLINLQSALSPKYIGYANANSPTPLSDSDD 139
               ..:.:|.:.||:|:               |.||:                |::|..|:..|.
Human   984 ---STSVDEEDEESERE---------------RDRDM----------------ADTPCELAKRDP 1014

  Fly   140 TIRTTRRRVNQAAALNNSSAGETLAHDNASPRTPGGGGGGGGDDSANQLLSKTYMSPIEKLLIKN 204
            .....|||  .|..|...|.||....::.|..:........|..:.         ||......|.
Human  1015 KGVGVRRR--PARPLELDSGGEEDEKESLSASSSSSASSSSGSSTT---------SPSSSASDKE 1068

  Fly   205 GASSPNSTGFEAGSEDLGIRPIVRKHVKRKMKRVPKAKVTLELDEKNQQEVDEKSVKTEPID--- 266
            ..........||..|:       .:.|.|..........|.:.|:.:....|....:.:..|   
Human  1069 EEQESTEEEEEAEEEE-------EEEVPRSQLSSSSTSSTSDKDDDDDDSDDRDESENDDEDTAL 1126

  Fly   267 EEVDRTDEAPTQEAQTTAI-------------SIKSETEAEHKAAVDVHIKQEDTIRLDIVNNPV 318
            .|....||..:.|.:|.:|             |..||.|:..:::......:|:.:..:  ....
Human  1127 SEASEKDEGDSDEEETVSIVTSKAEATSSSESSESSEFESSSESSPSSSEDEEEVVARE--EEEE 1189

  Fly   319 ESTSIVITEE------PKDLEKSTEELAFA--LP----LASSTEVDLKS--------PPDLSSTA 363
            |....::.||      |:|.|:..||.|.|  .|    |....|||:::        |..|....
Human  1190 EEEEEMVAEESMASAGPEDFEQDGEEAALAPGAPAVDSLGMEEEVDIETEAVAPEERPSMLDEPP 1254

  Fly   364 LATSIKSPSSVSIDSAK-----GLS----IVTDPGWPTYQV-------------GDLFWGKVFSY 406
            |...::.|:    ||.:     |||    ::..|..|..:|             .||   :|...
Human  1255 LPVGVEEPA----DSREPPEEPGLSQEGAMLLSPEPPAKEVEARPPLSPERAPEHDL---EVEPE 1312

  Fly   407 CFWPCMVCPDPLGQIV--------GNMPSHPQRSSLDNANVPIQVHVRFFADNGRRNWIKPENLL 463
               |.|:.|.||...:        .:.|..|:.:...:.:||                  ||.| 
Human  1313 ---PPMMLPLPLQPPLPPPRPPRPPSPPPEPETTDASHPSVP------------------PEPL- 1355

  Fly   464 TFAGLKAFDDMREELRIKHGPKSAKYRQMVPKRTKVVIWRQAIEEAQAMTQIPYSDRLEKFYQTY 528
                        .|....|.|...                .::.::|:...:|.:...|      
Human  1356 ------------AEDHPPHTPGLC----------------GSLAKSQSTETVPATPGGE------ 1386

  Fly   529 ENVVTLNRQKRKRTKYMMQDTSDVGSSLYDSTDNLHNKQGTQLLAVKRERSESPFS-PAFSPVKS 592
                              ...|...|.|..|:..:..               |||| ||.||   
Human  1387 ------------------PPLSGGSSGLSLSSPQVPG---------------SPFSYPAPSP--- 1415

  Fly   593 KNEKRAKRRKLSNGTEADTGSNSMAVTPSQTETTVDSSAYENPEFRQLLSAVMEYVMMNRSDEKV 657
                     .||:|....|.....:.||:.:|.:       .|                      
Human  1416 ---------SLSSGGLPRTPGRDFSFTPTFSEPS-------GP---------------------- 1442

  Fly   658 EKVLLSVVSNIWSLKQIQLRELERDLASGEIEEPLGSSVV---GRGSGVGTIKRLSNRLMTMMVR 719
              :||.|         ..|....||..||    ||.|.|:   |....:.....|...|..::..
Human  1443 --LLLPV---------CPLPTGRRDERSG----PLASPVLLETGLPLPLPLPLPLPLALPAVLRA 1492

  Fly   720 RSMTPVVTPSTTPAPSEPDRRLSEPPKTKKPVNRPIEEVIEDILQLDSKYLFRGLSREPICKYCY 784
            ::..|...|...|||     ..|.||..|:...|| ......:|.||..     |.|.|      
Human  1493 QARAPTPLPPLLPAP-----LASCPPPMKRKPGRP-RRSPPSMLSLDGP-----LVRPP------ 1540

  Fly   785 QAGSDLVRCSRTCSSWLHADCLERKVTGAPMPKIGSRKALVIPPTSKSP----SPDEDHVTADAK 845
             ||:.|                             .|:.|::|...::|    :.|...||.|.:
Human  1541 -AGAAL-----------------------------GRELLLLPGQPQTPVFPSTHDPRTVTLDFR 1575

  Fly   846 EVVAVGTSLVCHECNVGEPEGCVICHQVESPAVPSTPRKEDSSSHTPIEDKLLTCSQPMCGKRFH 910
                          |.|.|          :|..|..|:........|:|...|...:        
Human  1576 --------------NAGIP----------APPPPLPPQPPPPPPPPPVEPTKLPFKE-------- 1608

  Fly   911 TSCCKYWPQASSSKHSARCPRHVCHTCVSDDPSGKFQQLGSSKLAKCVRCPATYHQLSKCIPAGT 975
              ....||    |:.....||.      .|:.:.::.:|..|: ....|.|...|: ....|||:
Human  1609 --LDNQWP----SEAIPPGPRG------RDEVTEEYMELAKSR-GPWRRPPKKRHE-DLVPPAGS 1659

  Fly   976 QMLNTTN-IICPRHNIAKADAHVNVLW---------CYICVKGGELVCCETCPIAVHAHCRNIPI 1030
            ..|:... :..||....:.....:: |         .::||                        
Human  1660 PELSPPQPLFRPRSEFEEMTILYDI-WNGGIDEEDIRFLCV------------------------ 1699

  Fly  1031 KTNESYICEECESGRLPLYGEIVWAKFNNFRW-----WPAIILPPTEVPSNILKKAHGENDFVVR 1090
             |.|..:.::                 |...|     |  :..|.|.:.|...||         |
Human  1700 -TYERLLQQD-----------------NGMDWLNDTLW--VYHPSTSLSSAKKKK---------R 1735

  Fly  1091 FFGTHDHGWISRRRVYLYIEGDTGDGHKTKSQLFRNYTTGVEEASRFLPIIKARRQEQDMERQSG 1155
            ..|..:|              .||   ..:|:.|  ||...::..|:|...:|...|...:.| |
Human  1736 DDGIREH--------------VTG---CARSEGF--YTIDKKDKLRYLNSSRASTDEPPADTQ-G 1780

  Fly  1156 NKLHPPPYVKIKTNKAVPPLRFSQNLEDLSTCNCLPVDEHPCGPEAGCLNRMLFNECNPEYCKAG 1220
            ..:...|:...:....    |.|:....||:..      ..|..:....|::.|.:...::||: 
Human  1781 MSIPAQPHASTRAGSE----RRSEQRRLLSSFT------GSCDSDLLKFNQLKFRKKKLKFCKS- 1834

  Fly  1221 SLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRMEQKQRDRD 1285
                                 :...:||...||||..:.||||||:.|.......|.::.:.:..
Human  1835 ---------------------HIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGI 1878

  Fly  1286 ENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSELTFNY 1350
            .:.|...|:.|.||||...||.|||:||||.|||..:..||....::.|::.:.|.||.|:|::|
Human  1879 GSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDY 1943

  Fly  1351 LWDDLMNNSKKACFCGAKRCSGEI 1374
            .:.  :.:.|..|.||::.|.|.:
Human  1944 KFP--IEDVKIPCLCGSENCRGTL 1965

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
NSDNP_733239.1 PWWP_NSD_rpt1 395..514 CDD:438972 20/139 (14%)
PHD2_NSD 867..932 CDD:277040 10/64 (16%)
PHD3_NSD 933..988 CDD:277041 11/55 (20%)
PHD4_NSD 1001..1041 CDD:277042 5/48 (10%)
PWWP_NSD_rpt2 1048..1143 CDD:438963 19/99 (19%)
AWS 1183..1233 CDD:197795 7/49 (14%)
SET_NSD 1233..1375 CDD:380950 48/142 (34%)
SETD1BNP_001340274.1 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..26
Interaction with WDR82. /evidence=ECO:0000269|PubMed:37030068 68..98
RRM_Set1B 101..193 CDD:409965
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 235..302
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 357..660
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 675..719
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 963..1462 126/711 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1501..1541 16/57 (28%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1555..1606 14/74 (19%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1636..1668 8/33 (24%)
N-SET 1679..1821 CDD:463344 34/225 (15%)
WDR5 interaction motif (WIN). /evidence=ECO:0000269|PubMed:22266653, ECO:0000269|PubMed:22665483 1745..1750 1/7 (14%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1767..1800 6/37 (16%)
RxxxRR motif. /evidence=ECO:0000250|UniProtKB:P38827 1798..1803 1/4 (25%)
SET_SETD1 1815..1962 CDD:380946 50/170 (29%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.