DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set2 and NSD1

DIOPT Version :9

Sequence 1:NP_572888.2 Gene:Set2 / 32301 FlyBaseID:FBgn0030486 Length:2362 Species:Drosophila melanogaster
Sequence 2:NP_071900.2 Gene:NSD1 / 64324 HGNCID:14234 Length:2696 Species:Homo sapiens


Alignment Length:2951 Identity:546/2951 - (18%)
Similarity:912/2951 - (30%) Gaps:1120/2951 - (37%)


- Green bases have known domain annotations that are detailed below.


  Fly    11 SPVASRGRGRGRPPKVALS---ALGNTPP---HINPSLKH---------------ADAEASPTAP 54
            :|:.......|.|..:|:.   :..|:|.   .:..::|:               .|:|..|..|
Human   109 TPIVCTSLSPGGPTALAMKQEPSCNNSPELQVKVTKTIKNGFLHFENFTCVDDADVDSEMDPEQP 173

  Fly    55 EDQDSGQSEC-RRSSRKKIIKFDVRDLLNKNRKAHKIQIEARIDSNPSTGHSQSGTTAASTSMST 118
            ..:|....|. ..:.......::     .|:....|:.:.:..||.|.:.|....:.....:..|
Human   174 VTEDESIEEIFEETQTNATCNYE-----TKSENGVKVAMGSEQDSTPESRHGAVKSPFLPLAPQT 233

  Fly   119 ATAS-------AASASSAATVSRLFSM-------------FEMSHQSLPPPPPPPTALEIFAKPR 163
            .|..       ..|...||.:...||:             ..:|.|..|                
Human   234 ETQKNKQRNEVDGSNEKAALLPAPFSLGDTNITIEEQLNSINLSFQDDP---------------- 282

  Fly   164 PTQSLIVAQVTSEPSAVG------GAHPVQTMAGLPPVTPRKRGRPRKSQLADAAII-------- 214
                      .|..|.:|      |.....|...||...|:|:..|.|.::.|  :|        
Human   283 ----------DSSTSTLGNMLELPGTSSSSTSQELPFCQPKKKSTPLKYEVGD--LIWAKFKRRP 335

  Fly   215 --PTVIVPSCSDSDTNSTSTTTSNMSSDSGELPGFPIQKPKSKLRVSLKRLKLGGRLESSDSGNS 277
              |..|   |||                       |:....||::||.:|   ..|....::...
Human   336 WWPCRI---CSD-----------------------PLINTHSKMKVSNRR---PYRQYYVEAFGD 371

  Fly   278 PSSSSPEVEPPAL---QDENAMDERP-------KQEQNLSRMVDAEENSDSDSQIIFIEIETESP 332
            ||..: .|...|:   :..:..:|.|       ::|:.....|..:..|..::.:...| :.:.|
Human   372 PSERA-WVAGKAIVMFEGRHQFEELPVLRRRGKQKEKGYRHKVPQKILSKWEASVGLAE-QYDVP 434

  Fly   333 KGEEEQEEGRPVEVEPQDL-IDIDMELAKQEPTPDPEEDLDEIMVEVLSGPPSL-WSADDEAEEE 395
            ||.:.:      :..|..: :|.:.::..::.|.|||.:.|.::...|.   || :.::..|:|:
Human   435 KGSKNR------KCIPGSIKLDSEEDMPFEDCTNDPESEHDLLLNGCLK---SLAFDSEHSADEK 490

  Fly   396 EDATVQRATPPGKEPAADSCSSAPRRSRRSA--PLSGSSRQGKTLEETFAEIAAESSKQILEAEE 458
            |            :|.|.|      |:|:|:  |...|.::|        .|..|:.|     :|
Human   491 E------------KPCAKS------RARKSSDNPKRTSVKKG--------HIQFEAHK-----DE 524

  Fly   459 SQDQEEQHILIDLIEDTLSESEVTSSVSPTIEHMVVEEVVVEENQLVDEADEILDSKQEFVIKKV 523
            .:.:..:::.::.|...:|:::.::.:|                                     
Human   525 RRGKIPENLGLNFISGDISDTQASNELS------------------------------------- 552

  Fly   524 FSESDNIAASLNKDIFEPKVETKATCGEVVPRPEMVTED----VYITEGIAATLEK-SAVVTKPT 583
                 .||.||......|.....::||:...:.|..|.:    :.:.||  |.:.| |....||.
Human   553 -----RIANSLTGSNTAPGSFLFSSCGKNTAKKEFETSNGDSLLGLPEG--ALISKCSREKNKPQ 610

  Fly   584 TEMIA-------------ETKLSDEVVI----------EPPLKDESDPKQTEVELPE----SKPA 621
            ..::.             |.|.||.:.|          ..|::..|:...:.:|:|:    ::..
Human   611 RSLVCGSKVKLCYIGAGDEEKRSDSISICTTSDDGSSDLDPIEHSSESDNSVLEIPDAFDRTENM 675

  Fly   622 VNIPKSERILSAEVETTSSPLVPPECCTLESVSGPVLLETSLSTEEKSNENVETTPLKTEAAKED 686
            :::.|:|:|..:....|::.:...:...:.:.....|:..:.|.|    ...||:.:.....|..
Human   676 LSMQKNEKIKYSRFAATNTRVKAKQKPLISNSHTDHLMGCTKSAE----PGTETSQVNLSDLKAS 736

  Fly   687 SPPAAPEEEASNSSEEPNFLLEDYESNQEQVAEDEMMKCNNQKGQKQTPLPEMKEPEKPVAETVS 751
            :....|:.:.:|.:..|.|.|....|     :|:.::|    .|.....|...|. ::|...::.
Human   737 TLVHKPQSDFTNDALSPKFNLSSSIS-----SENSLIK----GGAANQALLHSKS-KQPKFRSIK 791

  Fly   752 KKEKAMENPARSSPAIVDKKVRAGEMEKKVVKS-TKGT-------------------VPEKKMDS 796
            .|.|  |||..:.|.::::     |...|...| |||:                   :.||..||
Human   792 CKHK--ENPVMAEPPVINE-----ECSLKCCSSDTKGSPLASISKSGKVDGLKLLNNMHEKTRDS 849

  Fly   797 KKSCAAVTP---AKQKE-SGKSAKEAILKKETEKEK-----SSAKLDSSSP-------NTL---- 841
            .....||..   ::.|| |.:|..|.:....|.|..     |||...:..|       :||    
Human   850 SDIETAVVKHVLSELKELSYRSLGEDVSDSGTSKPSKPLLFSSASSQNHIPIEPDYKFSTLLMML 914

  Fly   842 ----DKKGKDTAQWSPQLQTLPKS------STKPPQESAPSVIS-------------KTTSNQPA 883
                |.|.|:....:.|.....:|      ||..|...:..::|             ...|..|:
Human   915 KDMHDSKTKEQRLMTAQNLVSYRSPGRGDCSTNSPVGVSKVLVSGGSTHNSEKKGDGTQNSANPS 979

  Fly   884 PKEEQHAAKKGLSDNSPPSVLKAKEKAVSGFVECDAM-----------FKAMDLANAQL------ 931
            |.....|....||.:.|..:...::...||....|.:           .|..|..:||:      
Human   980 PSGGDSALSGELSASLPGLLSDKRDLPASGKSRSDCVTRRNCGRSKPSSKLRDAFSAQMVKNTVN 1044

  Fly   932 --RLDEKNKKKLKKVPT-------------------KVEAPPKVEPPTAVPVPGQ---------- 965
              .|..:.|:||.::|:                   ..|.|.|.:|   :.:.|.          
Human  1045 RKALKTERKRKLNQLPSVTLDAVLQGDRERGGSLRGGAEDPSKEDP---LQIMGHLTSEDGDHFS 1106

  Fly   966 --------KKSLSGKTSLRRNTVYEDSPNLERNSSPSSDSAQANTSAGKLKPSKVKKKINPRRST 1022
                    |:|..||.| .:...:|:....|.:|..:|::.:.| ...::.|.|..:::|.||:.
Human  1107 DVHFDSKVKQSDPGKIS-EKGLSFENGKGPELDSVMNSENDELN-GVNQVVPKKRWQRLNQRRTK 1169

  Fly  1023 ICEAAKDLRSSSSSSTPTREVAASSPVSTSSDSSSKRNGSKRTTS---------DLDGGSKLDQR 1078
            ..:.....:...:|....|.:..|.||....|...:.    ||.|         :.:....||..
Human  1170 PRKRMNRFKEKENSECAFRVLLPSDPVQEGRDEFPEH----RTPSASILEEPLTEQNHADCLDSA 1230

  Fly  1079 --RYTICE-------DRQPETAIPVPLTKRRFSMHPKASANPLHDTLLQTAGKKRGRKEGKESLS 1134
              |..:|:       |.:.|..||        |:.|:|.   |.:..:::. |||.||..|..|.
Human  1231 GPRLNVCDKSSASIGDMEKEPGIP--------SLTPQAE---LPEPAVRSE-KKRLRKPSKWLLE 1283

  Fly  1135 RQNSLDSSSSASQGAPKKKALKSAEILS--AALLETESSESTSSGSKMSRWDVQTS--------P 1189
            .....|...     |||||..|..|.:.  ::..|.||..:....|..::...:.|        |
Human  1284 YTEEYDQIF-----APKKKQKKVQEQVHKVSSRCEEESLLARGRSSAQNKQVDENSLISTKEEPP 1343

  Fly  1190 ELEAANPF--GDIAK---------------FIEDGVNLLKRDKVDEDQ---RKEGQDEVKREADP 1234
            .||...||  |.:|:               .:.....:..|..::.::   :..|..|.||:..|
Human  1344 VLEREAPFLEGPLAQSELGGGHAELPQLTLSVPVAPEVSPRPALESEELLVKTPGNYESKRQRKP 1408

  Fly  1235 EEDEFAQRVANMETPATTP---------------------TPSPTQSNPEDSASTTTVL------ 1272
            .:....   :|...|...|                     |.|...|..:|....||.:      
Human  1409 TKKLLE---SNDLDPGFMPKKGDLGLSKKCYEAGHLENGITESCATSYSKDFGGGTTKIFDKPRK 1470

  Fly  1273 ----------------------KELETGGGVRRSHRIKQKPQ-----------GPRAS---QG-R 1300
                                  ||:....|....||....|:           |..||   || |
Human  1471 RKRQRHAAAKMQCKKVKNDDSSKEIPGSEGELMPHRTATSPKETVEEGVEHDPGMPASKKMQGER 1535

  Fly  1301 GVASVALAPISMD-EQLAELANIEA------------INE----QFLRSE---GLNTFQLLKEN- 1344
            |..:.....:..: |:|.||...||            :.|    :|:.:|   |::|..:.|:: 
Human  1536 GGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNECRTGIHTCFVCKQSG 1600

  Fly  1345 -------------FY--------------------------------------------RCAR-- 1350
                         ||                                            ||.|  
Human  1601 EDVKRCLLPLCGKFYHEECVQKYPPTVMQNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCP 1665

  Fly  1351 ---------------------------------------------QVSQENAEMQC--------- 1361
                                                         .|..|...:.|         
Human  1666 VAYHANDFCLAAGSKILASNSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFH 1730

  Fly  1362 ----------------DC----------------------------------------------- 1363
                            ||                                               
Human  1731 RECLNIDIPEGNWYCNDCKAGKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFP 1795

  Fly  1364 -------------------FLTGD--------------------------EE--AQGHL------ 1375
                               ::.||                          ||  ||..|      
Human  1796 VLFFGSNDYLWTHQARVFPYMEGDVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQED 1860

  Fly  1376 -------------------------------------------SCG--AGCINRMLMIECGP-LC 1394
                                                       .||  :.||||||:.||.| :|
Human  1861 RKNDKKPPPYKHIKVNRPIGRVQIFTADLSEIPRCNCKATDENPCGIDSECINRMLLYECHPTVC 1925

  Fly  1395 SNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHLYSK 1459
            ..|.||.|:.|.:.|.....:|||.::|.|:..:..|..|||:.|||||:||.||...|.....:
Human  1926 PAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQE 1990

  Fly  1460 DRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEEIT 1524
            ....::|.:.|..:.:|||..|||.:|::||.|.||.|||||:|||:.|:|.|::..|:.|.|:|
Human  1991 HDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELT 2055

  Fly  1525 FDYQYLRYGRDAQRCYCEAANCRGWIGGEPDSDEGEQLDEESDSDAEMDEEELEAEPEEGQP--- 1586
            |:|.....|.....|.|.|.||.|::|..|                            :.||   
Human  2056 FNYNLECLGNGKTVCKCGAPNCSGFLGVRP----------------------------KNQPIAT 2092

  Fly  1587 -RKSAKAKAKSKLKAKLPLATGRKRKEQTKPKDRE---YKAGRWLKPSATGSSSSAEKPPKKPKV 1647
             .||.|.|.|.:         |::|.:....|:||   :..|      ..|...|.:| |..|||
Human  2093 EEKSKKFKKKQQ---------GKRRTQGEITKEREDECFSCG------DAGQLVSCKK-PGCPKV 2141

  Fly  1648 NKFQAMLEDPDVVEELSLLRRGGLKNQQDTLRFSRCLVRAKLLKTRLALLRVLTHGELPCRRLFL 1712
              :.|        :.|:|.:|...|.:   ..:.:|.:..|         ...:..|: |...|.
Human  2142 --YHA--------DCLNLTKRPAGKWE---CPWHQCDICGK---------EAASFCEM-CPSSFC 2183

  Fly  1713 DYHGLRLLHAWISENGNDDQLREALLDTLESLPI-PNRTMLSDSRVYQSVQLWSNSLEQQLAVVP 1776
            ..|...:|  :||:       .:..|...|..|. ||.....:.|.|....:       .|...|
Human  2184 KQHREGML--FISK-------LDGRLSCTEHDPCGPNPLEPGEIREYVPPPV-------PLPPGP 2232

  Fly  1777 QEKQAALHKRMVALLQKWQALPEIFRIPKRERIEQMKEHEREADRQQKHVHASTALEDQRERESS 1841
            ....|.....|.|...|....|                   .||..|....:..||....:|...
Human  2233 STHLAEQSTGMAAQAPKMSDKP-------------------PADTNQMLSLSKKALAGTCQRPLL 2278

  Fly  1842 NDRFRQDRFRRDTTSSRIGKPIRMSGNNTIC-TITTQQKGSNGAP--DGMTRNDNRRRSDIGPPS 1903
            .:|..:   |.|:....:.|...::|:.|.. ::.:.|:..:..|  .|.....:.:.|.:..||
Human  2279 PERPLE---RTDSRPQPLDKVRDLAGSGTKSQSLVSSQRPLDRPPAVAGPRPQLSDKPSPVTSPS 2340

  Fly  1904 EQRRTLSKELRRSLFERKVALDEAERRVCTEDRLEHELRCEFFGADINTDPKQLPFYQKTDTNEW 1968
            ......|:.|.|.|......||::                  .||                    
Human  2341 SSPSVRSQPLERPLGTADPRLDKS------------------IGA-------------------- 2367

  Fly  1969 FNSDDVPVPAPPRTELLTKALLSPDIDVGQGATDVEYKLPPGVDPLPPAWNWQVTSD-------- 2025
                     |.||.:.|.|.                 .:|.|: .|||.....:||.        
Human  2368 ---------ASPRPQSLEKT-----------------SVPTGL-RLPPPDRLLITSSPKPQTSDR 2405

  Fly  2026 -GDIYYYNLRERISQWEPPSPEQRL----QTLLEENTTQQPLHELQIDPAVLENELIQVDTDYVG 2085
             .|..:.:|.:|:     |.||:.|    |||:.:....:|:.:                     
Human  2406 PTDKPHASLSQRL-----PPPEKVLSAVVQTLVAKEKALRPVDQ--------------------- 2444

  Fly  2086 SLSAKSLAQYIEAKVRERRDL---RRSKLVSIRLISPRRDEDRLYNQLESRKYKENKEKIRRRKE 2147
            :..:|:.|    |.|.:..||   ::.:..|...::|:.||.  ...|||..:..:|        
Human  2445 NTQSKNRA----ALVMDLIDLTPRQKERAASPHQVTPQADEK--MPVLESSSWPASK-------- 2495

  Fly  2148 LYRRRKIEVLPDAV------DEIPVPGKAL----------------------PIQPYLFSSDEEE 2184
                 .:..:|.||      |.:...|||.                      |.:.:|:   |..
Human  2496 -----GLGHMPRAVEKGCVSDPLQTSGKAAAPSEDPWQAVKSLTQARLLSQPPAKAFLY---EPT 2552

  Fly  2185 TKVAAIEQPAAEEEQDSLNMAPSTSHAAMAALG----KAVAQPTG--LGTVGKRKLPMPPSVTVK 2243
            |:.:......||:....|:.:|.....|...:|    .|:|..:|  ..::||....:|      
Human  2553 TQASGRASAGAEQTPGPLSQSPGLVKQAKQMVGGQQLPALAAKSGQSFRSLGKAPASLP------ 2611

  Fly  2244 KHRQEQRSKKVKSSQSP--LTATSAR 2267
                .:..|.|.:.|||  |...|:|
Human  2612 ----TEEKKLVTTEQSPWALGKASSR 2633

Known Domains:


Indicated by green bases in alignment.

Software error:

Illegal division by zero at /www/www.flyrnai.org/docroot/cgi-bin/DRSC_prot_align.pl line 591.

For help, please send mail to the webmaster (ritg@hms.harvard.edu), giving this error message and the time and date of the error.

GeneSequenceDomainRegion External IDIdentity
Set2NP_572888.2 AWS 1358..1410 CDD:197795 27/222 (12%)
SET 1414..1533 CDD:214614 50/118 (42%)
PostSET 1535..1551 CDD:214703 6/15 (40%)
WW 2014..2043 CDD:278809 8/37 (22%)
SRI 2270..2348 CDD:285448
NSD1NP_071900.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 207..252 7/44 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 281..311 8/55 (15%)
MSH6_like 319..429 CDD:99898 24/141 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 487..514 11/44 (25%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 872..891 3/18 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 936..1035 17/98 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1067..1093 4/28 (14%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1112..1134 6/22 (27%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1243..1272 8/40 (20%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1294..1344 11/49 (22%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1382..1428 9/48 (19%)
ING <1431..1587 CDD:331088 28/155 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1480..1534 10/53 (19%)
PHD1_NSD1_2 1545..1587 CDD:277118 9/41 (22%)
PHD2_NSD1 1592..1638 CDD:277120 4/45 (9%)
PHD3_NSD1 1639..1692 CDD:277123 3/52 (6%)
PHD4_NSD1 1709..1748 CDD:277126 3/38 (8%)
WHSC1_related 1754..1848 CDD:99899 3/93 (3%)
AWS 1891..1941 CDD:197795 17/49 (35%)
SET 1942..2065 CDD:214614 50/122 (41%)
S-adenosyl-L-methionine binding 1952..1954 0/1 (0%)
S-adenosyl-L-methionine binding 1994..1997 0/2 (0%)