DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment egg and Nsd1

DIOPT Version :10

Sequence 1:NP_611966.3 Gene:egg / 37962 FlyBaseID:FBgn0086908 Length:1262 Species:Drosophila melanogaster
Sequence 2:NP_001388465.1 Gene:Nsd1 / 306764 RGDID:1307748 Length:2689 Species:Rattus norvegicus


Alignment Length:1595 Identity:294/1595 - (18%)
Similarity:462/1595 - (28%) Gaps:649/1595 - (40%)


- Green bases have known domain annotations that are detailed below.


  Fly    35 PVRKGENSL---------------ESPAEQAAKDVEIEELTHSEAIAATGSTRKQCPYGGKAPDE 84
            |..:.||||               ..|..::.|....|..|.:|..|.:.....:|    .:.|.
  Rat   758 PGIRNENSLTKGGLANQTLLPLKCRQPKFRSIKCKHKESPTAAETSATSEDLSLKC----CSSDT 818

  Fly    85 PGK-LADESEDRKGENTKAI--------ASSPVLVAV-------------------DSDSSVELI 121
            .|. :...|:..|||..|.:        .||.:..||                   .|||.....
  Rat   819 NGSPMTSISKSGKGEGLKLLNNMHEKTRDSSDIETAVVKHVLSELKELSYRSLSEDVSDSGTSKS 883

  Fly   122 ESPVKFSSANESEKDPPKPD-----------AVNEAAAKEAEEMTDSSISS-------PTSESFP 168
            ..|:.||||:.....|.:||           .::::..||...||..:::|       ..|...|
  Rat   884 SKPLLFSSASSQNHIPIEPDYKFSTLLMMLKDMHDSKTKEQRLMTAQNVASYRTPDRGDCSSGSP 948

  Fly   169 EKDEKT----NKENEQEPPGMEVDQDVEESISRPAEEYKIENTLKGHKRISLTEIEEHKIVDKKD 229
            ....|.    ...:..|.||......|..|   |...   ::.|.|....||:.:..    ||:|
  Rat   949 VGTSKVLVLGGSTHNSEKPGDSTQDSVRLS---PGGG---DSALSGELSSSLSVLPS----DKRD 1003

  Fly   230 DVLEVELEKGTAP-----KAAEDEKLNALLSDGDVFYDKECVNCNCTKLHKQYVLANM--ATLNF 287
            .....::.....|     :|....||...:|   ....|..||....|..::..|:.:  .||..
  Rat  1004 LPACGKIRSNCIPRRNCGRAKLSPKLRVTIS---TQMAKPSVNPKALKTERKRKLSRLPAVTLAA 1065

  Fly   288 YQVLRKSSKQQF---LCMGCHDTAMDLYEEYAGQLMAKQPLLLKDF---HQDHADFVALDSSDEE 346
            ..:..|.|....   |..|..|.|.:            :||...|.   .:.|.|.....|..::
  Rat  1066 NGLGNKESGGSVNGPLKGGAQDPAKE------------EPLQQMDLLRNEETHFDSKVKQSDPDK 1118

  Fly   347 EEEKQPEKSDFSKNKLQLIEDELDDAIKNVLNK----VDFTAQLSWSKTILQAKADHLERQFALA 407
            ..||:|   .|...|...:..|:     |:.|.    ||                          
  Rat  1119 ILEKEP---SFENRKGPEVGSEI-----NIENDEPHGVD-------------------------- 1149

  Fly   408 DVELEKVQTTADKMHCALYNSCPVAHKHLPTL-DIEPSDYVHEVPPPGEIVRPPIQLGETYYAVK 471
                   |....|....|....|...|..... :.|.|:....|..||:    |:|.|...|..:
  Rat  1150 -------QVVPKKRWQRLNQRRPKPGKRASRFREKENSEGAFGVLLPGD----PVQKGRDDYLEQ 1203

  Fly   472 NKAIASWVSIKVIEFTESTAINGNTMKSYKIRYLNTPYQMIKTVTAKHIAYFEPPPVRLTIGTRV 536
            .....|        ..|.:|.:.|                       |:::.|....||.:    
  Rat  1204 RAPPTS--------ILEDSAADPN-----------------------HVSHSESVGPRLNV---- 1233

  Fly   537 IAYFDGTTLSRG---KDKGVVQSAFYPGI-----IAEPLKQANRYR------YLIFYDDGYTQ-Y 586
               .|.:::|.|   |:.|:      |.:     |.||..::.:.|      :|:.|.:.|.| :
  Rat  1234 ---CDKSSVSMGDLEKETGI------PSLTPQTKIPEPAVRSEKKRLRKPSKWLLEYTEEYDQIF 1289

  Fly   587 VPHRDVRLVCQASEKVWEDVHAASRDF--------IQKYVEKYSVDRPMVQCTRGQSMTTESNGT 643
            .|.:       ..:||.|.||..|...        .:...:...||...:..|:.:....|....
  Rat  1290 APKK-------KQKKVQEQVHKVSSRCEDESLLARCRSSAQNKQVDENSLISTKEEPPVLEREAP 1347

  Fly   644 WLYARVIDIDCSL----------------------------VLMQFEGDKNHTEWIYRGSLRLGP 680
            :|...::..|..:                            :|::..|:       |.|..:..|
  Rat  1348 FLEGPLVQSDLGVAHAELPQLTLSVPVAPEVSPRPTLESEELLVKTPGN-------YEGKRQRKP 1405

  Fly   681 VFRETQNNMNSSSAQQLRVPR-----------------------RTEPFIR-----YTKEMESSS 717
                |:..:.|:......:|:                       |..|.::     .|:..:...
  Rat  1406 ----TKKLLESNDLDPGFMPKKGDLGLSRKCCESGHLENGVGDSRATPHLKEFGGGTTRIFDKPR 1466

  Fly   718 KVNQQMRAFAR------------KSSASAQNNALAAASSAATP----AGGRTNAGGVSTSNSASA 766
            |..:|....||            :.:.|:....|....:||:|    ..|..:..|:|.|.....
  Rat  1467 KRKRQRHGTARVHYKRVKKEDSARETPSSAEGELMIHRTAASPKEILEEGIEHDPGMSASKRLQG 1531

  Fly   767 VR----HLNNSTIYVDDENRPK-GHVVYFTAK------------RNLPPKMYKCHECSPNCLFKI 814
            .|    .|..:..    :|..| |.::...|:            ..:|...:.|:||...     
  Rat  1532 ERGGGAALKENVC----QNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNECRTG----- 1587

  Fly   815 VHRLDSYSPLAKPLLSGWERLVMRQK---TKKSVVYKGP-CGKSLRSLAEVHRYLRATENVLNVD 875
            :|..                .|.:|.   .|:.::   | |||...... |.:|....     |.
  Rat  1588 IHTC----------------FVCKQSGEDVKRCLL---PLCGKFYHEEC-VQKYPPTV-----VQ 1627

  Fly   876 NFDFTPDLK-CLAEYSIDPSIVKDTDISKGQEKMAIPL-VNYYDNTLPPPCTYAKQRI------- 931
            |..|...|. |:..::.:|:   :...|||:....:.. |.|:.|..   |..|..:|       
  Rat  1628 NKGFRCPLHICITCHAANPA---NVSASKGRLMRCVRCPVAYHANDF---CLAAGSKILASNSII 1686

  Fly   932 ------PTEGV----HLNLDEEF-------LLCCD-CE-------------------DDCSDKSK 959
                  |..|.    |:|:...|       ||||| |.                   :||....|
  Rat  1687 CPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEGNWYCNDCKAGKK 1751

  Fly   960 CACWQLTVAGV-RY-------CNP----------------------------------------- 975
            ....::....| ||       |:|                                         
  Rat  1752 PHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFGSNDYLWTHQARVFPYMEG 1816

  Fly   976 ----------------KKPIEEIGYQYKRL---------------------HEHV----PTG--- 996
                            ||.::|...:::.|                     ::|:    |.|   
  Rat  1817 DVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQEDRKNDKKPPPYKHIKVNRPIGRVQ 1881

  Fly   997 IYECN----SRCKCK----------KNCLNRVVQFSLE--------------------MKLQVFK 1027
            |:..:    .||.||          ..|:||::.:...                    ..:::|:
  Rat  1882 IFTADLSEIPRCNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIFR 1946

  Fly  1028 TSNRGWGLRCVNDIPKGAFICIYAGHLLTETMANEGGQDAGDEYFADLDYIEVAEQLKEGYESEV 1092
            |..||||||...||.||.|:..|.|.|:                                     
  Rat  1947 TLQRGWGLRTKTDIKKGEFVNEYVGELI------------------------------------- 1974

  Fly  1093 DHSDPDAEEDNGGPDAEDDDDFRPNYHYQRKIKRSSRSGSTQNSSTQSSELDSQERAVINFNPNA 1157
                             |:::.|....|                        :||..:.||. ..
  Rat  1975 -----------------DEEECRARIRY------------------------AQEHDITNFY-ML 1997

  Fly  1158 DLDETVRENSVRRLFGKDEAPYIMDAKTTGNLGRYFNHSCSPNLFVQNVFVDTHDLRFPWVAFFS 1222
            .||             ||.   |:||...||..|:.||.|.||...|...|: .|.|   |..|:
  Rat  1998 TLD-------------KDR---IIDAGPKGNYARFMNHCCQPNCETQKWSVN-GDTR---VGLFA 2042

  Fly  1223 AAHIRSGTELTWNYNYEVGVVPGKVLYCQCGAPNC 1257
            .:.|::|||||:|||.|. :..||.: |:||||||
  Rat  2043 LSDIKAGTELTFNYNLEC-LGNGKTV-CKCGAPNC 2075

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
eggNP_611966.3 Tudor_SETDB1_rpt1 532..622 CDD:410453 21/112 (19%)
Tudor_SETDB1_rpt2 630..684 CDD:410548 9/81 (11%)
HMT_MBD 823..879 CDD:238689 11/59 (19%)
SET_SETDB1 900..1262 CDD:380915 106/530 (20%)
Nsd1NP_001388465.1 PWWP_NSD1_rpt1 318..433 CDD:438989
TNG2 <1452..1585 CDD:227367 23/136 (17%)
PHD1_NSD1_2 1543..1585 CDD:277118 7/45 (16%)
PHD2_NSD1 1590..1636 CDD:277120 12/70 (17%)
PHD3_NSD1 1637..1690 CDD:277123 11/58 (19%)
PHD4_NSD1 1707..1746 CDD:277126 7/38 (18%)
PWWP_NSD1_rpt2 1753..1848 CDD:438992 9/94 (10%)
AWS 1899..1937 CDD:465559 3/37 (8%)
SET_NSD1 1939..2080 CDD:380987 63/238 (26%)
PHD5_NSD1 2118..2160 CDD:277129
C5HCH 2159..2208 CDD:465605
PHA03247 <2221..2593 CDD:223021
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.