DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment egg and Ehmt1

DIOPT Version :10

Sequence 1:NP_611966.3 Gene:egg / 37962 FlyBaseID:FBgn0086908 Length:1262 Species:Drosophila melanogaster
Sequence 2:XP_006233681.2 Gene:Ehmt1 / 362078 RGDID:1307588 Length:1309 Species:Rattus norvegicus


Alignment Length:1520 Identity:296/1520 - (19%)
Similarity:498/1520 - (32%) Gaps:548/1520 - (36%)


- Green bases have known domain annotations that are detailed below.


  Fly    38 KGENSLESPAEQAAKDVEIEELTHSEAIAATGSTRKQCPYGGKAPDEPGKLADESEDRKGENTKA 102
            :|..:|.:.|..||         .:||:.|...|::.|....:...|...:|.:    :|...|.
  Rat     5 RGGATLRARAMAAA---------DAEAVLAKQETKQDCCMKTELLREDTLMAAD----EGSTEKQ 56

  Fly   103 IASSPVLVAVDSDSSVELIESPVKFSSANESEKDPPKPDAVNEAAAKEAEEMTDSSISSPTSESF 167
            ...:|:....:::.|.|        .|.:.|..:.||....|..|:  .:|.| :.:|.......
  Rat    57 AGETPMAADGETNGSCE--------KSGDISHPNAPKHTQENTRAS--PQEGT-NRVSRVAENGV 110

  Fly   168 PEKDEKTNKENEQEPPGMEVDQDVEESI--------SRPA---EEYKIENTLK----GHKRISL- 216
            .|:|.:..|:|.     :..|..::.|:        ::||   :..:..|||.    ||...:| 
  Rat   111 SERDTEVGKQNH-----VTADDFMQTSVIGSNGYFLNKPALQGQPLRTPNTLNSSLPGHAAKTLP 170

  Fly   217 ---------------------------TEIEEHKIVDKKDDVLEVELEKGTAPKAAEDEKLNALL 254
                                       .:.|:.|......|| .|...:.|.||:...  |:|..
  Rat   171 GGASKCRTPSALPQTPTTAPTVPGEGSADTEDRKPTASGTDV-RVHRARKTMPKSILG--LHAAS 232

  Fly   255 SDGDVFYD----KECVNCNCTKLHKQYVLANMATLN----FYQVLRKSSKQQFLC---------- 301
            .|.....|    ||.:|.|.::..:|.:|.....|:    ..|....::|.|..|          
  Rat   233 KDHREVQDHKEPKEDINRNISECGRQQLLPTFPALHQSLPQNQCYMATTKSQTACLPFVLAAAVS 297

  Fly   302 ------MGCHD-------------TAMDLYEEYAGQLM-AKQPLLLKD--FHQDHADFVALDSSD 344
                  ||.:.             |.:::::......: ||...:|.|  .|. :.:.:.:||.:
  Rat   298 RKKKRRMGTYSLVPKKKTKVLKQRTVIEMFKSITHSTVGAKGEKVLDDSALHV-NGESLEMDSEE 361

  Fly   345 EEEEEKQPEKSDFSKNKLQLIEDELDDAIKNVL---NKVDFTAQLSWSKTILQAKADHLERQFAL 406
            |:.||               :|||.|...:...   .:...|::.|.|:|...||.|        
  Rat   362 EDSEE---------------LEDEEDRGAEQAAAFPTEDSRTSKESMSETDRAAKMD-------- 403

  Fly   407 ADVELEKVQTTADKMHCALYNSCPVAHKHLPTLDIEPSDYVHEVPPPGE--------IVRPPIQL 463
            .|.|.|:                                   |.|..||        .:.....:
  Rat   404 GDSEEEQ-----------------------------------ESPDTGEDEDGGDESDLSSESSI 433

  Fly   464 GETYYAVKNKAIASWV---------SIKVIEFTESTAINGNTMKSYKIRYLNTPYQMIKTVTAKH 519
            .:.:...:.|..:.|:         |.|    ..|:.:.....||       :|..|.:......
  Rat   434 KKKFLKRRGKTDSPWIKPARKRRRRSRK----KPSSMLGSEACKS-------SPGSMEQAALGDS 487

  Fly   520 IAYFEPPPVRLTIGTRVIAYFDGTTLSRGKDKGVVQSAFYPGIIAEPLKQANRYRYLIFYDDGYT 584
            ..|.|.....|.:..|      |...|:.:::|:...   |.::.               .||..
  Rat   488 AGYMEVSLDSLDLRVR------GILSSQTENEGLANG---PDVLE---------------TDGLH 528

  Fly   585 QYVP----------HRDVRLV----CQASEKVWEDVHAASRDFIQKYVEKYSVDRP--------- 626
            : ||          .|::..:    |.|:|.|..::...:..     |.||.:.||         
  Rat   529 E-VPLCSCRMETPKSREISTLANNQCMATESVDHELGRCTNS-----VVKYELMRPSNKAPLLVL 587

  Fly   627 -------MVQ----------CTRGQSMTTESNGTWLY-------ARV------------------ 649
                   ||:          ||.|..|..:...:..:       :||                  
  Rat   588 CEDHRGRMVKHQCCPGCGYFCTAGNFMECQPESSISHRFHKDCASRVNNASYCPHCGEETSKAKE 652

  Fly   650 -----IDIDCSLVL-------MQFEGDKNHTEWIYRGSLRLGPVFRETQNNMNSSSAQQLRVPRR 702
                 .|...::.|       :..||..:.|    .||:...|...::|:.:  :.|.:...|..
  Rat   653 VTIAKADTTSTVTLAPGQEKSLAAEGRADTT----TGSIAGAPEDEKSQSTV--TQAPECFDPAG 711

  Fly   703 TEPFIRYT--------KE-MESS-----SKVNQQMRAFARKSSASAQNNALAAA----------- 742
            ...|:|.|        || :||:     |:..:::|...::...||:...|...           
  Rat   712 PAGFVRPTSGFSQGPGKETLESALIALDSEKPKKLRFHPKQLYFSARQGELQKVLLMLVDGIDPN 776

  Fly   743 -----SSAATPAGGRTNAGGV---------------------------STSNSASAVRHLNNSTI 775
                 .|..:|......||.|                           :.:|...||::|..:..
  Rat   777 FKMEHQSKRSPLHAAAEAGHVDICHMLVQAGANIDTCSEDQRTPLMEAAENNHLDAVKYLIKAGA 841

  Fly   776 YVDDENRPKGHVVYFTAKR---NLPPKMYKCHECSPNC---------LFKIVHRLDSYSPLAKPL 828
            .||.::......::..||:   ::...:....:...||         ::...::   :..|.|.|
  Rat   842 QVDPKDAEGSTCLHLAAKKGHYDVVQYLLSNGQMDVNCQDDGGWTPMIWATEYK---HVDLVKLL 903

  Fly   829 LSGWERLVMRQKTKK---------------SVVYKGPC--------GKS-LRSLAEVHRY----- 864
            ||....:.:|...:.               .::....|        |.| |...|..:||     
  Rat   904 LSKGSDINIRDNEENICLHWAAFSGCVDIAEILLAAKCDLHAVNIHGDSPLHIAARENRYDCVVL 968

  Fly   865 LRATENVLNVDNFDFTPDLKC----------------LAEYSID-PSIVKDT---DISKGQEKMA 909
            ..:.::.:.:.|.:....|:|                |.:.:.| |..|:.|   ||::|.|::.
  Rat   969 FLSRDSDVTLKNKEGETPLQCASLNSQVWSALQMSKALQDSAPDKPVAVEKTVSRDIARGYERIP 1033

  Fly   910 IPLVNYYDNTL-PPPCTYAKQRIPTEGVHLNLDEEFLLCCDCEDDCSDKSKCACWQLTVAGVRYC 973
            ||.||..|:.| |....|..|...|..::::.:...|..|.|.|||| .|.|.|.||::.    |
  Rat  1034 IPCVNAVDSELCPTNYKYVSQNCVTSPMNIDRNITHLQYCVCVDDCS-SSTCMCGQLSMR----C 1093

  Fly   974 NPKKPIEEIGYQYKRLHEHVPTGIYECNSRCKCKKNCLNRVVQFSLEMKLQVFKTSNRGWGLRCV 1038
            ...|.    |......:...|..|:|||..|.|.:||.|||||..|..:||:::|.:.|||:|.:
  Rat  1094 WYDKD----GRLLPEFNMAEPPLIFECNHACSCWRNCRNRVVQNGLRARLQLYRTQDMGWGVRSL 1154

  Fly  1039 NDIPKGAFICIYAGHLLTETMANEGGQDAGDEYFADLDYIEVAEQLKEGYESEVDHSDPDAEEDN 1103
            .|||.|.|:|.|.|.|::::.|:...:|:   |..|||                           
  Rat  1155 QDIPLGTFVCEYVGELISDSEADVREEDS---YLFDLD--------------------------- 1189

  Fly  1104 GGPDAEDDDDFRPNYHYQRKIKRSSRSGSTQNSSTQSSELDSQERAVINFNPNADLDETVRENSV 1168
                                                                             
  Rat  1190 ----------------------------------------------------------------- 1189

  Fly  1169 RRLFGKDEAPYIMDAKTTGNLGRYFNHSCSPNLFVQNVFVDTHDLRFPWVAFFSAAHIRSGTELT 1233
                .||...|.:||:..||:.|:.||.|.|||....||:...|||||.:||||...|::|.:|.
  Rat  1190 ----NKDGEVYCIDARFYGNVSRFINHHCEPNLVPVRVFMSHQDLRFPRIAFFSTRLIQAGEQLG 1250

  Fly  1234 WNYNYEVGVVPGKVLYCQCGAPNCR 1258
            ::|......|.||:..|:||:|.||
  Rat  1251 FDYGERFWDVKGKLFSCRCGSPKCR 1275

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
eggNP_611966.3 Tudor_SETDB1_rpt1 532..622 CDD:410453 16/103 (16%)
Tudor_SETDB1_rpt2 630..684 CDD:410548 14/90 (16%)
HMT_MBD 823..879 CDD:238689 14/84 (17%)
SET_SETDB1 900..1262 CDD:380915 104/360 (29%)
Ehmt1XP_006233681.2 EHMT_ZBD 527..657 CDD:411018 20/135 (15%)
ANKYR 731..987 CDD:440430 36/258 (14%)
ANK repeat 785..814 CDD:293786 4/28 (14%)
ANK repeat 816..847 CDD:293786 6/30 (20%)
ANK repeat 849..881 CDD:293786 4/31 (13%)
ANK repeat 883..914 CDD:293786 5/33 (15%)
ANK repeat 916..947 CDD:293786 1/30 (3%)
ANK repeat 949..980 CDD:293786 6/30 (20%)
SET_EHMT1 1047..1277 CDD:380933 93/337 (28%)

Return to query results.
Submit another query.