DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment upSET and NSD1

DIOPT Version :9

Sequence 1:NP_001261819.1 Gene:upSET / 39551 FlyBaseID:FBgn0036398 Length:3146 Species:Drosophila melanogaster
Sequence 2:NP_071900.2 Gene:NSD1 / 64324 HGNCID:14234 Length:2696 Species:Homo sapiens


Alignment Length:3045 Identity:519/3045 - (17%)
Similarity:927/3045 - (30%) Gaps:1026/3045 - (33%)


- Green bases have known domain annotations that are detailed below.


  Fly   450 VNGTQMTDELSARILQSMAQKSFSQQQRFHQVPATGSGNMPPPTQIIYSNSTSNGA--------- 505
            :||..|  :||.  :...:|.::.|......:|.....::.....:.|.|.:::|:         
Human    43 LNGCTM--QLST--VSGTSQNAYGQDSPSCYIPLRRLQDLASMINVEYLNGSADGSESFQDPEKS 103

  Fly   506 ----------AASSPGG------------NASGNMLLAHYQAAGTKPVSSA-----SFITVTGT- 542
                      .:.||||            |.|..:     |...||.:.:.     :|..|... 
Human   104 DSRAQTPIVCTSLSPGGPTALAMKQEPSCNNSPEL-----QVKVTKTIKNGFLHFENFTCVDDAD 163

  Fly   543 --------PPVTV--------------ATTPSVSISSHGFASGSAAISSYMSSATAARRQSVSAP 585
                    .|||.              ||....:.|.:|.   ..|:.|...|...:|..:|.:|
Human   164 VDSEMDPEQPVTEDESIEEIFEETQTNATCNYETKSENGV---KVAMGSEQDSTPESRHGAVKSP 225

  Fly   586 SSRAVSLERKQHHQQLQHDVIGGGRKAP-------------TVIEYYNKHGVNSIVGSSNNLAQS 637
            ..........|.::| :::|.|...||.             |:.|..|...::......::.:..
Human   226 FLPLAPQTETQKNKQ-RNEVDGSNEKAALLPAPFSLGDTNITIEEQLNSINLSFQDDPDSSTSTL 289

  Fly   638 NSMSNLAGPRSNSGSGFATTTPTP-------ATPLHLT---------------PVNV---PV--- 674
            .:|..|.|..|:|     |:...|       :|||...               |..:   |:   
Human   290 GNMLELPGTSSSS-----TSQELPFCQPKKKSTPLKYEVGDLIWAKFKRRPWWPCRICSDPLINT 349

  Fly   675 ---------------HVEA-APPSSPALVKGSSQPPAQPQQQQQQAHPLGPNQLNANDEELYIEE 723
                           :||| ..||..|.|.|.:....:.:.|.::                    
Human   350 HSKMKVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFEGRHQFEE-------------------- 394

  Fly   724 VRPVPVLTQDLRLQQLHAIMQDHTYASQQQQQQPQQAAGD-TTNPGAAQQVQQPQQWSLGGIGVT 787
               :|||.:  |.:|     ::..|    :.:.||:.... ..:.|.|:|...|:          
Human   395 ---LPVLRR--RGKQ-----KEKGY----RHKVPQKILSKWEASVGLAEQYDVPK---------- 435

  Fly   788 VSGSQGTPTAVGGYCSYFGQQIARSQADDDAHSAISSSSRMGLASTDIDPGEETETAPEAEAEDD 852
              ||:......|.                           :.|.|.:..|.|:....||:| .|.
Human   436 --GSKNRKCIPGS---------------------------IKLDSEEDMPFEDCTNDPESE-HDL 470

  Fly   853 SVTRCICELTHDDGYMICCDKCSAWQHVDCMGIDRQNIPEEYMCELCQPRAVDKARARALQRQKR 917
            .:..|:..|..|.            :|    ..|.:..|       |......|:.....:...:
Human   471 LLNGCLKSLAFDS------------EH----SADEKEKP-------CAKSRARKSSDNPKRTSVK 512

  Fly   918 KEHMLLVA-TQAANGAAAVAAGTTLSGGLGSGLPMSEELQHRLASGLNGG--------------- 966
            |.|:...| .....|......|.....|..|....|.||. |:|:.|.|.               
Human   513 KGHIQFEAHKDERRGKIPENLGLNFISGDISDTQASNELS-RIANSLTGSNTAPGSFLFSSCGKN 576

  Fly   967 -----FATGTG-----------MSKKSKKTKENSGSTSTLKKTKKSAVGMGGEKNASGSGT--PT 1013
                 |.|..|           :||.|::..:...|.....|.|...:|.|.|:..|.|.:  .|
Human   577 TAKKEFETSNGDSLLGLPEGALISKCSREKNKPQRSLVCGSKVKLCYIGAGDEEKRSDSISICTT 641

  Fly  1014 GSSGKTSKKSSKRKSKSGGDGSSGGGSSPALTAAEKHAANL-----RQWIENYEYAVTNHYSPEL 1073
            ...|.:.....:..|:|       ..|...:..|.....|:     .:.|:...:|.||      
Human   642 SDDGSSDLDPIEHSSES-------DNSVLEIPDAFDRTENMLSMQKNEKIKYSRFAATN------ 693

  Fly  1074 RARLHAIQKQPSLLQSIQNTENKALRQIQQQLSTAGSAEQLEQRAQLIPYAGAKVLISSVDLSPH 1138
             .|:.|.||     ..|.|:....|      :....|||...:.:|          ::..||...
Human   694 -TRVKAKQK-----PLISNSHTDHL------MGCTKSAEPGTETSQ----------VNLSDLKAS 736

  Fly  1139 APIHE---------LRGKYMLTTQFRTQNPTVNMNTPPPSNYLNSFKAHKTPGQFVFFYQLPGVE 1194
            ..:|:         |..|:.|::...::|..:.      ....|....|....|..|        
Human   737 TLVHKPQSDFTNDALSPKFNLSSSISSENSLIK------GGAANQALLHSKSKQPKF-------- 787

  Fly  1195 APMQTLRPDGSVPQVAQQPP-----SYLKGPEVCVDTRTYGNDARFVRRSCRPNA----ELQHYF 1250
               ::::.......|..:||     ..||    |..:.|.|:....:.:|.:.:.    ...|..
Human   788 ---RSIKCKHKENPVMAEPPVINEECSLK----CCSSDTKGSPLASISKSGKVDGLKLLNNMHEK 845

  Fly  1251 EKGTLHLYIVALTHIRAQ-TEITIRHEPHDLT-AVEQKKSHAAVIQPTSTRCACDMGSDCLFALP 1313
            .:.:..:....:.|:.:: .|::.|....|:: :...|.|...:....|::....:..|..|:..
Human   846 TRDSSDIETAVVKHVLSELKELSYRSLGEDVSDSGTSKPSKPLLFSSASSQNHIPIEPDYKFSTL 910

  Fly  1314 LAVQQQLQAPPTQPRS--------SHRNKAAAAAAAAAAANSAAAIQLTMGLGVGATVAAGASVL 1370
            |.:.:.:....|:.:.        |:|    :......:.||.        :||...:.:|.|  
Human   911 LMMLKDMHDSKTKEQRLMTAQNLVSYR----SPGRGDCSTNSP--------VGVSKVLVSGGS-- 961

  Fly  1371 PNSRNRSTSSSGESSQMGLN-SPQLGQLNLGFKTSVTATSLTAPVPGVHCNNSGGSSSSSNNSCS 1434
                ..::...|:.:|...| ||..|.       |..:..|:|.:||:..:.....:|..:.|..
Human   962 ----THNSEKKGDGTQNSANPSPSGGD-------SALSGELSASLPGLLSDKRDLPASGKSRSDC 1015

  Fly  1435 VSMSSVLHDSGICTSSSSPSVSIPSPTPTQMQSPTLQQHP---------QQIPQQQL-SLLQ--- 1486
            |:..:       | ..|.||..:......||...|:.:..         .|:|...| ::||   
Human  1016 VTRRN-------C-GRSKPSSKLRDAFSAQMVKNTVNRKALKTERKRKLNQLPSVTLDAVLQGDR 1072

  Fly  1487 ----------RSPTQQHQQQILAAL---------------------PTPMLTPMLS------PQL 1514
                      ..|:::...||:..|                     |..:....||      |:|
Human  1073 ERGGSLRGGAEDPSKEDPLQIMGHLTSEDGDHFSDVHFDSKVKQSDPGKISEKGLSFENGKGPEL 1137

  Fly  1515 PKPAQQQ------AHVVLPQSQQTSLLQQ-----------QQSQQSQEPLAVIAAAAAAQQPMAT 1562
            ......:      .:.|:|:.:...|.|:           ::.:.|:....|:..:         
Human  1138 DSVMNSENDELNGVNQVVPKKRWQRLNQRRTKPRKRMNRFKEKENSECAFRVLLPS--------- 1193

  Fly  1563 YFVRQPQQQQQQQSPK---PQALVAQQQHVVGAQQQQHFLQQQQKQQQQQMADEARMAVSALQTL 1624
                .|.|:.:.:.|:   |.|.:.::.    ..:|.|.........:..:.|::..::..::..
Human  1194 ----DPVQEGRDEFPEHRTPSASILEEP----LTEQNHADCLDSAGPRLNVCDKSSASIGDMEKE 1250

  Fly  1625 HAAPTSHIVSPIKVAAVQQQSQ---------------------PQQQQQNTHQQPHNQQAVQQQS 1668
            ...|:....:.:...||:.:.:                     |:::|:...:|.|...:..::.
Human  1251 PGIPSLTPQAELPEPAVRSEKKRLRKPSKWLLEYTEEYDQIFAPKKKQKKVQEQVHKVSSRCEEE 1315

  Fly  1669 NQLQQQQSQQPN----------YPQSPQRQQKPQPVQHQP--QIVISTGAQAIPATMPTKLSSPT 1721
            :.|.:.:|...|          ..:.|...::..|....|  |..:..|...:|....:...:|.
Human  1316 SLLARGRSSAQNKQVDENSLISTKEEPPVLEREAPFLEGPLAQSELGGGHAELPQLTLSVPVAPE 1380

  Fly  1722 KSAAPVISNNNITVSAQSSVVGGKKTPAKHPQQQQQQQQQPVTPV--SAATAPAATPSSSE---S 1781
            .|..|.:.:..:.|          |||..:   :.::|::|...:  |....|...|...:   |
Human  1381 VSPRPALESEELLV----------KTPGNY---ESKRQRKPTKKLLESNDLDPGFMPKKGDLGLS 1432

  Fly  1782 K--------EDDVSASSTTTPTT-----RTPAKDKPKQSREDRKLEAILRAIEKMEKQEARGKKD 1833
            |        |:.::.|..|:.:.     .|...|||::.:..|      .|..||:.::.:....
Human  1433 KKCYEAGHLENGITESCATSYSKDFGGGTTKIFDKPRKRKRQR------HAAAKMQCKKVKNDDS 1491

  Fly  1834 TRQSSG--GKRQASNSPASPNKRNSSNSISEDVETPTST----NSAAAAAQRRN--KKKRKVSRS 1890
            :::..|  |:.....:..|| |......:..|...|.|.    .....||.:.|  :...|:...
Human  1492 SKEIPGSEGELMPHRTATSP-KETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGEL 1555

  Fly  1891 LNNNTNGLGS-------------GGGSNNKRRKSI----VVESDGES------------------ 1920
            |.......|:             |....|:.|..|    |.:..||.                  
Human  1556 LLCEAQCCGAFHLECLGLTEMPRGKFICNECRTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECV 1620

  Fly  1921 -----------------------HALTNSE-SEDQG------QHPQSHHSGSEDQAAGLLLALAH 1955
                                   ||...:. |..:|      :.|.::|:.....|||..:..::
Human  1621 QKYPPTVMQNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYHANDFCLAAGSKILASN 1685

  Fly  1956 NNSSPNE--PFKSPLSQSH-------------SL---PATPASVSSACLLIEAAMGPLQQQPAPA 2002
            :...||.  |.:...:..|             ||   .:.||:....||.|:...|........|
Human  1686 SIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEGNWYCNDCKA 1750

  Fly  2003 SASPSLAE--------FKYPPGGAKTKKSLMSS----------------------WFQQA----- 2032
            ...|...|        :::.|......:::.|:                      |..||     
Human  1751 GKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFGSNDYLWTHQARVFPY 1815

  Fly  2033 -------EQQHASGLDSLVQAAM-------SEINGERE--QLQRQPQGESLPAP-ALLKVEQFIH 2080
                   :.:...|:|...:.|:       .|:..::|  |||...:.:..|.| ..:||.:.|.
Human  1816 MEGDVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQEDRKNDKKPPPYKHIKVNRPIG 1880

  Fly  2081 QAESTTA----VPAREQLHLPLQNNSSVKKRWLRQAISEETTPV--DELQQSQNQSVTATPSPQP 2139
            :.:..||    :| |.......:|...:....:.:.:..|..|.  ....:.|||..:....|: 
Human  1881 RVQIFTADLSEIP-RCNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPE- 1943

  Fly  2140 VPTVSPLANGFSTPLKKRRLVVVSNGTNVESDETHIDVIGEPKDEAEENVAMTELKVEIENHHQE 2204
            |.....|..|:....|          |:::..|...:.:||..||.|....:        .:.||
Human  1944 VEIFRTLQRGWGLRTK----------TDIKKGEFVNEYVGELIDEEECRARI--------RYAQE 1990

  Fly  2205 QDD--------DVDILRSPSPGTHQIVAEDNLVKI-----EPE----------DTSA---AADDV 2243
            .|.        |.|.:....|       :.|..:.     :|.          ||..   |..|:
Human  1991 HDITNFYMLTLDKDRIIDAGP-------KGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDI 2048

  Fly  2244 KIDVE------------------------------REESQACDKFEEMVKVKREEEEQREKEIKQ 2278
            |...|                              |.::|.....|:..|.|::::.:|..:.:.
Human  2049 KAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPKNQPIATEEKSKKFKKKQQGKRRTQGEI 2113

  Fly  2279 LQERQEH----------------EQPKVEPAPVEPKLENTVAKAEPKVEPSQEIVSKKEPTKVEP 2327
            .:||::.                ..|||..|......:....|.|...... :|..|:..:..|.
Human  2114 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQC-DICGKEAASFCEM 2177

  Fly  2328 KPGESLLRSTATVTATPTAATIAATTLLDVSKVAFKTRPPLKLEDEPQKKKP----KLESILPAP 2388
            .|.....:.              ...:|.:||:..:..   ..|.:|....|    ::...:|  
Human  2178 CPSSFCKQH--------------REGMLFISKLDGRLS---CTEHDPCGPNPLEPGEIREYVP-- 2223

  Fly  2389 VATVPPVSVPPIPAASNATTSAVTNTAAASLTTTTAPSSTKNLTEHDIQERLLSFHAANISYLQS 2453
                |||.:||.|:...|..|  |..||.:...:..|.:..|        ::||.          
Human  2224 ----PPVPLPPGPSTHLAEQS--TGMAAQAPKMSDKPPADTN--------QMLSL---------- 2264

  Fly  2454 RNKKATAA-----LTSASPSQKSNS---------SSGGSGTESKK--SSK-----------DKDE 2491
             :|||.|.     |....|.::::|         ...||||:|:.  ||:           .:.:
Human  2265 -SKKALAGTCQRPLLPERPLERTDSRPQPLDKVRDLAGSGTKSQSLVSSQRPLDRPPAVAGPRPQ 2328

  Fly  2492 KRDKEKQL-KKSKKEKKKSKDKEKQKAAV------NVNSTSQIVDTKKKTTQP------------ 2537
            ..||...: ..|.....:|:..|:.....      ::.:.|....:.:||:.|            
Human  2329 LSDKPSPVTSPSSSPSVRSQPLERPLGTADPRLDKSIGAASPRPQSLEKTSVPTGLRLPPPDRLL 2393

  Fly  2538 --SKPDSKSSIAPV---------LVPPSLPVATANGKT---KHTAYNNVDQQQQQQMRRRTMSMC 2588
              |.|..::|..|.         .:||...|.:|..:|   |..|...|||..|.: .|..:.|.
Human  2394 ITSSPKPQTSDRPTDKPHASLSQRLPPPEKVLSAVVQTLVAKEKALRPVDQNTQSK-NRAALVMD 2457

  Fly  2589 ITPVTPTPVVTPSPLHGTPPSTKKRQTNFEQELTKPNSQILSSSILLNSSKGLG-LP-------- 2644
            :..:||......:..|...|...::....|           |||  ..:||||| :|        
Human  2458 LIDLTPRQKERAASPHQVTPQADEKMPVLE-----------SSS--WPASKGLGHMPRAVEKGCV 2509

  Fly  2645 ---------LAAPT------VVSVPTA-VQQQQHRKENNHQEATPASG----------GPMSLAA 2683
                     .|||:      |.|:..| :..|...|...::..|.|||          ||:|.:.
Human  2510 SDPLQTSGKAAAPSEDPWQAVKSLTQARLLSQPPAKAFLYEPTTQASGRASAGAEQTPGPLSQSP 2574

  Fly  2684 AIASGKLNAISRRRESMCGSRQQQALIAAALKKEKKEKKKSKKKDREKQKHDKQKGKEKEREKDK 2748
            .:.        ::.:.|.|.:|..||.|           ||.:..|...|               
Human  2575 GLV--------KQAKQMVGGQQLPALAA-----------KSGQSFRSLGK--------------- 2605

  Fly  2749 EKDNKQKTNHIQKPAHPTTVPANSMPISAPAPVPVLVPTPVTTPKAAPIPVLITQPTPS----PI 2809
                                        |||.:|......|||.::   |..:.:.:..    ||
Human  2606 ----------------------------APASLPTEEKKLVTTEQS---PWALGKASSRAGLWPI 2639

  Fly  2810 HVTQPLVNNCSTKVASLPFYNTIY--GKLQDPSTPTSSPIPVTNTMPSLAEYLES 2862
            ...|.|..:|.:..::.....|.:  |:.|||. |..:.:|..|..||..:..||
Human  2640 VAGQTLAQSCWSAGSTQTLAQTCWSLGRGQDPK-PEQNTLPALNQAPSSHKCAES 2693

Known Domains:


Indicated by green bases in alignment.

Software error:

Illegal division by zero at /www/www.flyrnai.org/docroot/cgi-bin/DRSC_prot_align.pl line 591.

For help, please send mail to the webmaster (ritg@hms.harvard.edu), giving this error message and the time and date of the error.

GeneSequenceDomainRegion External IDIdentity
upSETNP_001261819.1 PHD_MLL5 856..899 CDD:277025 6/42 (14%)
SET <1221..1273 CDD:214614 7/56 (13%)
NSD1NP_071900.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 207..252 10/45 (22%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 281..311 7/34 (21%)
MSH6_like 319..429 CDD:99898 21/143 (15%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 487..514 4/33 (12%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 872..891 3/18 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 936..1035 25/131 (19%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1067..1093 3/25 (12%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1112..1134 3/21 (14%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1243..1272 3/28 (11%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1294..1344 8/49 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1382..1428 11/58 (19%)
ING <1431..1587 CDD:331088 29/162 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1480..1534 10/54 (19%)
PHD1_NSD1_2 1545..1587 CDD:277118 5/41 (12%)
PHD2_NSD1 1592..1638 CDD:277120 3/45 (7%)
PHD3_NSD1 1639..1692 CDD:277123 10/52 (19%)
PHD4_NSD1 1709..1748 CDD:277126 8/38 (21%)
WHSC1_related 1754..1848 CDD:99899 10/93 (11%)
AWS 1891..1941 CDD:197795 8/50 (16%)
SET 1942..2065 CDD:214614 26/148 (18%)
S-adenosyl-L-methionine binding 1952..1954 0/1 (0%)
S-adenosyl-L-methionine binding 1994..1997 0/2 (0%)