DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment trx and Nsd1

DIOPT Version :10

Sequence 1:NP_476769.1 Gene:trx / 41737 FlyBaseID:FBgn0003862 Length:3726 Species:Drosophila melanogaster
Sequence 2:NP_001388465.1 Gene:Nsd1 / 306764 RGDID:1307748 Length:2689 Species:Rattus norvegicus


Alignment Length:2489 Identity:453/2489 - (18%)
Similarity:757/2489 - (30%) Gaps:931/2489 - (37%)


- Green bases have known domain annotations that are detailed below.


  Fly  1633 MSQVIQQSNCDELDIAYKELLSEQFPWFQNETKACTDALE-----------------EDMFE--- 1677
            |:|....:|..||.::..:.....|..|.|.|  |.|..:                 |::||   
  Rat   126 MNQEPTCNNPPELQLSVTKTTKNGFLHFDNFT--CVDDADVDSEMDPEQPVTEDESIEEIFEETQ 188

  Fly  1678 ---SCSGGNYEDLQDTGGVSASVYNEHSTSQAESRSGVLD---IPLEEVDDFGSCGIKMRLDTRM 1736
               ||   |||. :...||..::.:|.. |..|||.|.::   :||....:      |.:...| 
  Rat   189 TNASC---NYEP-KSENGVEMAMGSEQD-SMPESRHGAVEQPFVPLAPQTE------KQKNKQR- 241

  Fly  1737 CLFCRKSGEGLSGEEARLLYCGHDCWVHTNCAMWSAEVFEEIDGSLQNVHSAVARGRMIKCTVCG 1801
                                                   .|:|||.:......|...:      |
  Rat   242 ---------------------------------------SEVDGSNEKTALLPAPNSL------G 261

  Fly  1802 NRGATVGCNVRSCGEHYHYPCARSIDCAFLTDKSMYCPAHAKNGNALKANG--SPSVTYESNFEV 1864
            :...|:       .|.::     ||:.:|..|..   .:.:..||.|:..|  |||.:.|..|  
  Rat   262 DTNVTI-------EEQFN-----SINLSFQDDPD---SSTSTLGNMLELPGTSSPSTSQELPF-- 309

  Fly  1865 SRPVYVELDRKRKKLIEPARVQFHIGSLEVRQLGAIVPRF-----------SDSYEAVVPINFLC 1918
            ..|         ||...|  :::.:|.|       |..:|           ||      |:....
  Rat   310 CHP---------KKKSTP--LKYEVGDL-------IWAKFKRRPWWPCRICSD------PLINTH 350

  Fly  1919 SRLYWSSKEPWKIVEYTVRTTIQNSSSTLTA-----LDVGRNY--TVDHTNPNSKEVQLGMAQ-- 1974
            |::..:::.|::  ||.|......|.....|     :..||:.  .:.......|:.:.|...  
  Rat   351 SKMKVANRRPYR--EYYVEAFGDPSEKAWVAGKAIVMFEGRHQFEELPVLRKRGKQKEKGYRHKV 413

  Fly  1975 ----IARWHTSLARSEFLE--NGGTDWSGEFPNPNSCVP-----------PDENTEEEPQQQADL 2022
                :::|..|:..:|..:  .|        |....|||           |.|:...:|..:.||
  Rat   414 PQKILSKWEASVGLAEQCDVPRG--------PKTQKCVPSSAKLDSEEDMPFEDCANDPDSEHDL 470

  Fly  2023 LPPELKDAIFEDLPHELLDGISMLDIFLYDDKTDLFAISEQSKDGTQAMTSN--------QAQNQ 2079
            |......::..|..|..            |:|....|.|...|.......::        :||.:
  Rat   471 LLNGCLKSLAFDSEHSA------------DEKEKPCAKSRVRKSSDNIKRTSVKKGLMPFEAQKE 523

  Fly  2080 NQQ------------AGGANSVSICDEDTRNSNTSLGNGWPASNPVEDAMLSAARNSSQVQMLKT 2132
            .::            :||.:.....:|.:|.:|:..|:   ::.|.:....|..:|:::.. .:|
  Rat   524 ERRGKSPENLGLDFLSGGVSDKQASNELSRIANSLTGS---SAAPGQFLFSSCGQNTAKTD-FET 584

  Fly  2133 LAWPKLDGNSAMATAIKRRKLSKNLAEGVFLTLSSQQRNKKEMATV-AGVSRRQSISETSVEGVA 2196
            .....|.|.|..|...|.....|.|..|...:      :|.::..| ||...::|.|.:    |.
  Rat   585 PNCDSLSGLSESALISKHSGEKKKLQPGQVCS------SKVQLCYVGAGDEEKRSDSVS----VC 639

  Fly  2197 TTSGSVRSKSFTWSAAKRYFEKSEGREEAAKMRIMQMDGVDDSITEFRIISGDGNLSTAQFSGQV 2261
            |||                                     ||         |..:|...:.:.:.
  Rat   640 TTS-------------------------------------DD---------GSSDLDPTEHNSEF 658

  Fly  2262 KCDRCQCTYRNYDAFQRHLPSCSPTMSSNETESDVSGQGMTNNATQIS----AESLNELQKQLLA 2322
                             |......|.:.::||:.:|   |..|.|:.|    ...:.|.||.|:.
  Rat   659 -----------------HKSVLEVTDALDKTENALS---MHKNETKYSRYPATNRVKEKQKSLIT 703

  Fly  2323 NAGGLNYLQSATSFPQVQSLGSLGQFGLQGLQQLQLQPQSLGSGFFLSQPNPATQANTDDLQIYA 2387
            |: ..::|..:|...:.         |...:.|:.|....:.|..    |.|..:...|.|....
  Rat   704 NS-HTDHLMDSTKTVEP---------GTAEISQVNLSDLKISSPI----PKPQPEFRNDGLTTKF 754

  Fly  2388 NSLQSL----AANLGGGFTLAQPTVTAP---AQPQLIAVSTNPDGTQQFIQIPQTMQATTTPTAT 2445
            |:...:    :...||   ||..|: .|   .||:..::..      :..:.|..  |.|:.|:.
  Rat   755 NAPPGIRNENSLTKGG---LANQTL-LPLKCRQPKFRSIKC------KHKESPTA--AETSATSE 807

  Fly  2446 YQTLQATNTDKKIMLPLTAAGKPLKTVATKAAQQAAVKQRQLKSGHQVKPIQAKLQPHPQQH--- 2507
            ..:|:..::|..        |.|:.:: :|:.:...:|  .|.:.|:.....:.::....:|   
  Rat   808 DLSLKCCSSDTN--------GSPMTSI-SKSGKGEGLK--LLNNMHEKTRDSSDIETAVVKHVLS 861

  Fly  2508 --QQQQQTQVQQPITVMGQNLLQPQLLFQS-STQTQAPQIILPQAQ------------------- 2550
              ::.....:.:.::..|.:.....|||.| |:|...|  |.|..:                   
  Rat   862 ELKELSYRSLSEDVSDSGTSKSSKPLLFSSASSQNHIP--IEPDYKFSTLLMMLKDMHDSKTKEQ 924

  Fly  2551 ----PQNIISFVT---GDGSQGQPLQYISIPTAGEYKPQPQPTATPTFLTTAPGAGATYLQTDAS 2608
                .||:.|:.|   ||.|.|.|                                         
  Rat   925 RLMTAQNVASYRTPDRGDCSSGSP----------------------------------------- 948

  Fly  2609 GNLVLTTTPSNSGLQMLTAQSLQAQPQVIGTLIQPQTIQLGGGA-DGNQPGSNQQPLILGGTGGG 2672
                                        :||   .:.:.|||.. :..:||.:.|..:....|||
  Rat   949 ----------------------------VGT---SKVLVLGGSTHNSEKPGDSTQDSVRLSPGGG 982

  Fly  2673 SSGLEFATTSPQVILATQPMYYGLETIVQNTVMSSQQFVSTAMPGMLSQNASFSATTTQVFQASK 2737
            .|.|....:|...:|.:.          :..:.:..:..|..:|......|..|...........
  Rat   983 DSALSGELSSSLSVLPSD----------KRDLPACGKIRSNCIPRRNCGRAKLSPKLRVTISTQM 1037

  Fly  2738 IEPIVD--------------LPAGYVVLNNTGDASSAGTFLNAASVLQQQTQDDTTTQ------I 2782
            .:|.|:              |||..:..|..|:..|.|: :|..  |:...||....:      :
  Rat  1038 AKPSVNPKALKTERKRKLSRLPAVTLAANGLGNKESGGS-VNGP--LKGGAQDPAKEEPLQQMDL 1099

  Fly  2783 LQNANFQFQSVPTSS--------------------GASTSMDYTSPVMVTAKIP----------- 2816
            |:|....|.|....|                    |:..:::...|..|...:|           
  Rat  1100 LRNEETHFDSKVKQSDPDKILEKEPSFENRKGPEVGSEINIENDEPHGVDQVVPKKRWQRLNQRR 1164

  Fly  2817 --PVTQIKRTNAQAKAAGISGVGKVPPQP-------QVVNKVLPTSIVTQQSQVQVKNSNLKQSQ 2872
              |..:..|...:..:.|..|| .:|..|       .:..:..||||        :::|....:.
  Rat  1165 PKPGKRASRFREKENSEGAFGV-LLPGDPVQKGRDDYLEQRAPPTSI--------LEDSAADPNH 1220

  Fly  2873 VKGKAASGTGTTCGAPPSIASKPLQKKTNMIRPIHKLEVKPKVMKPTPKVQNQNHSLLQQQQQQQ 2937
            |....:.|.........|::...|:|:|.:  |    .:.|:...|.|.|:::...|        
  Rat  1221 VSHSESVGPRLNVCDKSSVSMGDLEKETGI--P----SLTPQTKIPEPAVRSEKKRL-------- 1271

  Fly  2938 PQLQQQIPAVVVNQVPKVTISQQRIPAQTQQQQLQQAQMIHIPQQQQPLQQQQVQVQPSMPIITL 3002
                                   |.|::...:..::...|..|:::|...|:||....|      
  Rat  1272 -----------------------RKPSKWLLEYTEEYDQIFAPKKKQKKVQEQVHKVSS------ 1307

  Fly  3003 AEAPVVQSQFVMEPQALEQQELANRVQHFSTSSSSSSSNCSLPTNVVNPMQQQAPSTTSSSTTRP 3067
                           ..|.:.|..|.:       ||:.|            :|....:..||   
  Rat  1308 ---------------RCEDESLLARCR-------SSAQN------------KQVDENSLIST--- 1335

  Fly  3068 TNRVLPMQQRQEPAPLSNECPVVSSPTPPKPVEQPIIHQMTSASVSKCYAQKSTLPSPVYEAELK 3132
                     ::||..|..|.|.         :|.|::              :|.|  .|..|||.
  Rat  1336 ---------KEEPPVLEREAPF---------LEGPLV--------------QSDL--GVAHAELP 1366

  Fly  3133 VSSVLESIVPDVTMDAILEEQPVTESIYTEGLYEKNSPGESKTEQLLLQQQQREQLNQQLVNN-- 3195
            ..::...:.|:|:....||.:.:.  :.|.|.||              .::||:...:.|.:|  
  Rat  1367 QLTLSVPVAPEVSPRPTLESEELL--VKTPGNYE--------------GKRQRKPTKKLLESNDL 1415

  Fly  3196 --GYLLDKHTFQVEPMDTDVYREEDLEEEEDEDDDFSLKMATSACNDHEMSDSE-EPAVKD---K 3254
              |::..|                       .|...|.|...|...::.:.||. .|.:|:   .
  Rat  1416 DPGFMPKK-----------------------GDLGLSRKCCESGHLENGVGDSRATPHLKEFGGG 1457

  Fly  3255 ISKILD------------------NLTNDDCADSIATATTMEVDASAGYQQMVEDVLATTAAQSA 3301
            .::|.|                  .:..:|.|        .|..:||..:.|:.    .|||  :
  Rat  1458 TTRIFDKPRKRKRQRHGTARVHYKRVKKEDSA--------RETPSSAEGELMIH----RTAA--S 1508

  Fly  3302 PTEEFEGALE-TAAVEAAATYINEMADAHVLDLKQLQNGVEL----------------------E 3343
            |.|..|..:| ...:.|:.....|......|.....||..:|                      |
  Rat  1509 PKEILEEGIEHDPGMSASKRLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLTE 1573

  Fly  3344 LRRRK---EEQRT-------VSQEQEQSKAAIVP--------TAAAPEPPQPIQEPKKMTGPHLL 3390
            :.|.|   .|.||       ..|..|..|..::|        ......||..:|........|:.
  Rat  1574 MPRGKFICNECRTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYPPTVVQNKGFRCPLHIC 1638

  Fly  3391 YEIQSEDGFTYKSSSITEIWEKVFEAVQ--VARRAHGLTPLPEGPLADMGGIQMIGLKTNALKYL 3453
            ....:.:.....:|.     .::...|:  ||..|:...      ||       .|.|..|...:
  Rat  1639 ITCHAANPANVSASK-----GRLMRCVRCPVAYHANDFC------LA-------AGSKILASNSI 1685

  Fly  3454 IEQLPGVEKC-SKYTPKYHKRNG---NVSTAANGAHGGN-LGGSSASAA-----LSV-------- 3500
            |        | :.:||:...||.   |||.....:.||: |...|..||     |::        
  Rat  1686 I--------CPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEGNWY 1742

  Fly  3501 -----SGGDSH--------------------------------------------GLLDY----- 3511
                 :|...|                                            |..||     
  Rat  1743 CNDCKAGKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFGSNDYLWTHQ 1807

  Fly  3512 ----------------------GSDQDELEENAYDCARCEPYSNRSEYDMFSWLASRHRKQP--- 3551
                                  |:.:..|:|.|   ||.|....:.|...........:|.|   
  Rat  1808 ARVFPYMEGDVSSKDKMGKGVDGTYKKALQEAA---ARFEELKAQKELRQLQEDRKNDKKPPPYK 1869

  Fly  3552 ----------IQVFVQPSDNELVPR---RGTGSNLPMAMKYRTL--------------------- 3582
                      :|:|.  :|...:||   :.|..| |..:....:                     
  Rat  1870 HIKVNRPIGRVQIFT--ADLSEIPRCNCKATDEN-PCGIDSECINRMLLYECHPTVCPAGGRCQN 1931

  Fly  3583 ----KETYKDYVGVFRSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGI-GCY 3642
                |..|.| |.:||:...|.||....||:.||.|.||.||||.......|.||.....| ..|
  Rat  1932 QCFSKRQYPD-VEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFY 1995

  Fly  3643 MFKIDDNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFP- 3706
            |..:|.:.::||..:||.|||:||||:|||.::...:.|...:.:|||..|..|.|||::|... 
  Rat  1996 MLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLEC 2060

  Fly  3707 FEDEKIPCSCGSKRCRKYL 3725
            ..:.|..|.||:..|..:|
  Rat  2061 LGNGKTVCKCGAPNCSGFL 2079

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
trxNP_476769.1 NR_DBD_like 762..>856 CDD:413390
PRK13914 <1006..>1224 CDD:237555
PHD1_KMT2A_like 1268..1344 CDD:276981
PHD 1346..1390 CDD:214584
PHD3_KMT2A_like 1423..1479 CDD:276983
ePHD_KMT2A_like 1737..1841 CDD:277134 12/103 (12%)
FYRN 1890..1937 CDD:461787 11/57 (19%)
FYRC 3388..3476 CDD:197781 17/93 (18%)
SET_KMT2A_2B 3575..3726 CDD:380947 57/178 (32%)
Nsd1NP_001388465.1 PWWP_NSD1_rpt1 318..433 CDD:438989 22/131 (17%)
TNG2 <1452..1585 CDD:227367 26/146 (18%)
PHD1_NSD1_2 1543..1585 CDD:277118 7/41 (17%)
PHD2_NSD1 1590..1636 CDD:277120 7/45 (16%)
PHD3_NSD1 1637..1690 CDD:277123 12/78 (15%)
PHD4_NSD1 1707..1746 CDD:277126 7/38 (18%)
PWWP_NSD1_rpt2 1753..1848 CDD:438992 11/97 (11%)
AWS 1899..1937 CDD:465559 2/38 (5%)
SET_NSD1 1939..2080 CDD:380987 56/142 (39%)
PHD5_NSD1 2118..2160 CDD:277129
C5HCH 2159..2208 CDD:465605
PHA03247 <2221..2593 CDD:223021
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.