DRSC/TRiP Functional Genomics Resources



Protein Alignment EHMT1 and Set2

DIOPT Version: 10

Sequence 1: XP_011517323.1  Gene: EHMT1 / 79813  HGNC ID: 24650  Length: 1301  Species: Homo sapiens
Sequence 2: NP_572888.2  Gene: Set2 / 32301  FlyBase ID: FBgn0030486  Length: 2362  Species: Drosophila melanogaster


Alignment Length: 1472
Identity: 295/1472 (20%)
Similarity: 478/1472 (32%)
Gaps: 464/1472 (31%)
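
The summary percentages are each count divided by the alignment length. The sketch below recomputes them from the counts reported above; the variable and function names are illustrative and not part of DIOPT.

```python
from fractions import Fraction

# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 1472
summary_counts = {"identity": 295, "similarity": 478, "gaps": 464}


def percent(count, total):
    """Return count/total as a percentage (float)."""
    return float(Fraction(count, total) * 100)


# Unrounded percentages for each summary statistic.
percentages = {
    name: percent(count, ALIGNMENT_LENGTH)
    for name, count in summary_counts.items()
}

for name, value in percentages.items():
    print(f"{name}: {value:.2f}%")
```

The gap fraction works out to roughly 31.5%, which the page displays as 31%.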


- Residues highlighted in green on the original page have known domain annotations, which are detailed under Known Domains below.


Human    11 AVPARGEPQQDCCV--------KTELLG--------EET--PMAAD------EGSAEKQAGEAHM 51
            |.|...||..|.|.        ...|.|        |||  .:||:      |....:...|.|:
  Fly   403 ATPPGKEPAADSCSSAPRRSRRSAPLSGSSRQGKTLEETFAEIAAESSKQILEAEESQDQEEQHI 467

Human    52 AADGETNGSCENSDASSHANAAKH-------TQDSARVNPQDG--TNTLTRIAENGVSERDSEAA 107
            ..|...:...|:...||.:...:|       .:::..|:..|.  .:....:.:...||.|:.||
  Fly   468 LIDLIEDTLSESEVTSSVSPTIEHMVVEEVVVEENQLVDEADEILDSKQEFVIKKVFSESDNIAA 532

Human   108 KQNHVTADDFVQTSVIGSNGYILNKPALQAQPLRTTSTLASSLPGHAAKTLPGGAGKGRT----- 167
            ..|....:..|:|.  .:.|.::.:|.:..:.:..|..:|::|...|..|.|.......|     
  Fly   533 SLNKDIFEPKVETK--ATCGEVVPRPEMVTEDVYITEGIAATLEKSAVVTKPTTEMIAETKLSDE 595

Human   168 --------------------PSAFP------------------QTPAAPP----------ATLGE 184
                                |.:.|                  .:|..||          ..|.|
  Fly   596 VVIEPPLKDESDPKQTEVELPESKPAVNIPKSERILSAEVETTSSPLVPPECCTLESVSGPVLLE 660

Human   185 GSADTEDRKLP-APGADVKVHRARKTMPKSVVGLHA--ASKDPR---EVREARDHKEPKEEINKN 243
            .|..||::... .....:|...|::..|.:.....|  :|::|.   |..|:...:..::|:.|.
  Fly   661 TSLSTEEKSNENVETTPLKTEAAKEDSPPAAPEEEASNSSEEPNFLLEDYESNQEQVAEDEMMKC 725

Human   244 ISDFGRQQLLPPFPSLHQSLPQNQCYMATTKSQTACLPFVLAAAVSRKKKRRMGTYSLVPKKKTK 308
            .:..|::|  .|.|.:.:  |:..                :|..||:|:|....     |.:.:.
  Fly   726 NNQKGQKQ--TPLPEMKE--PEKP----------------VAETVSKKEKAMEN-----PARSSP 765

Human   309 VLKQRTVI--EMFKSITHSTVGSKGEKDLGASSLHVNGESLEMDSDEDDSEELEEDDGHGAEQAA 371
            .:..:.|.  ||.|.:..||.|:..||              :|||.:              ..||
  Fly   766 AIVDKKVRAGEMEKKVVKSTKGTVPEK--------------KMDSKK--------------SCAA 802

Human   372 AFPTEDSRTSKESMSEADRAQKMDGESEEEQESVDTGEEEEGGDESDLSSESSIKKKFLKRKGKT 436
            ..|.:               ||..|:|.:|.......|:|:...:.|.||.:::.|     |||.
  Fly   803 VTPAK---------------QKESGKSAKEAILKKETEKEKSSAKLDSSSPNTLDK-----KGKD 847

Human   437 DSPWIKPARKRRRRSRKKPSGALGSESYKSSAGSAEQTAPGD-----STGYMEVS----LDSLDL 492
            .:.|....:...:.|.|.|..:..|...|:   ::.|.||.:     ..|..:.|    |.:.:.
  Fly   848 TAQWSPQLQTLPKSSTKPPQESAPSVISKT---TSNQPAPKEEQHAAKKGLSDNSPPSVLKAKEK 909

Human   493 RVKGILSS----QAEGLANGPDVLETDGLQEVPLCSCRMETPKSREITTL--ANNQCMATESVDH 551
            .|.|.:..    :|..|||....|:....:::.....::|.|...|..|.  ...|   .:|:..
  Fly   910 AVSGFVECDAMFKAMDLANAQLRLDEKNKKKLKKVPTKVEAPPKVEPPTAVPVPGQ---KKSLSG 971

Human   552 ELGRCTNSVVKYE----LMRPSNKAPLLVLCEDHRGRM----VKHQCCPGCGYFCTAGNFMECQP 608
            :.....|:|  ||    |.|.|:.:..........|::    ||.:..|.....|.|...:....
  Fly   972 KTSLRRNTV--YEDSPNLERNSSPSSDSAQANTSAGKLKPSKVKKKINPRRSTICEAAKDLRSSS 1034

Human   609 ESSISHR---------FHKDCASRVNNASYCPHCGEESSK--AKEVTIAKADTTSTVTPVPGQEK 662
            .||...|         ...|.:|:.|.:.......:..||  .:..||.:.....|..|||    
  Fly  1035 SSSTPTREVAASSPVSTSSDSSSKRNGSKRTTSDLDGGSKLDQRRYTICEDRQPETAIPVP---- 1095

Human   663 GSALEGRADTTTGSAAGPPLSEDDKLQGAASHVPEGFDPTGPAGLGRPTPGLSQG--PGKETLES 725
               |..|..:....|:..|| .|..||                     |.|..:|  .|||:| |
  Fly  1096 ---LTKRRFSMHPKASANPL-HDTLLQ---------------------TAGKKRGRKEGKESL-S 1134

Human   726 ALIALDSEK------PKKLRFHPKQLYFSARQGELQKVLLMLVDGID---PNFKMEHQN-KRSP- 779
            ...:|||..      |||     |.|    :..|:....|:..:..:   ...||...: :.|| 
  Fly  1135 RQNSLDSSSSASQGAPKK-----KAL----KSAEILSAALLETESSESTSSGSKMSRWDVQTSPE 1190

Human   780 LHAAAEAGHVDICHMLVQAGANI---DTCSEDQRTPLMEAAENNHLEAVKYLIKAGALVDPKDAE 841
            |.||...|  ||. ..::.|.|:   |...||||   .|..:....||           ||::.|
  Fly  1191 LEAANPFG--DIA-KFIEDGVNLLKRDKVDEDQR---KEGQDEVKREA-----------DPEEDE 1238

Human   842 GSTCLHLAAKKGHYEVVQYLLSNGQMDVNCQDDGGWTPMIWATEYKHVDLVKLLLSKGSDINIRD 906
                  .|.:..:.|......:......|.:|....|.::           |.|.:.|.      
  Fly  1239 ------FAQRVANMETPATTPTPSPTQSNPEDSASTTTVL-----------KELETGGG------ 1280

Human   907 NEENICLHWAAFSGCVDIAEILLAAKCDLHAVNIHGDSPLHIAARENRYDCVVLFLSRDSDVTLK 971
                                                      ..|.:|               :|
  Fly  1281 ------------------------------------------VRRSHR---------------IK 1288

Human   972 NK-EGETPLQCASLNSQVWSALQMSKALQDSAPDRPSPVERIVSRDIARGYERIPIPCVNAVDSE 1035
            .| :|....|...:.|...:.:.|.:.|.:.|.                         :.|::.:
  Fly  1289 QKPQGPRASQGRGVASVALAPISMDEQLAELAN-------------------------IEAINEQ 1328

Human  1036 PCPS----NYKYVSQNCVTSPMNIDRNITHLQYCVCI---DDCSSSNCMCGQLSMRCWYDKDGRL 1093
            ...|    .::.:.:|.......:.:....:| |.|.   |:.:..:..||.             
  Fly  1329 FLRSEGLNTFQLLKENFYRCARQVSQENAEMQ-CDCFLTGDEEAQGHLSCGA------------- 1379

Human  1094 LPEFNMAEPPLIFECNHACSCWRNCRNRVVQNGLRARLQLYRTRDMGWGVRSLQDIPPGTFVCEY 1158
                ......|:.||...||....|.|:..|.......:::||...|.|:.:...||||.|:.||
  Fly  1380 ----GCINRMLMIECGPLCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEY 1440

Human  1159 VGELISDSEADVREEDSYLFDLDNK--------DGEVYCIDARFYGNVSRFINHHCEPNLVPVRV 1215
            |||:|...|.:.|:   :|:..|..        .||. .|||...||:||:|||.|:|| ...:.
  Fly  1441 VGEVIDSEEFERRQ---HLYSKDRNRHYYFMALRGEA-VIDATSKGNISRYINHSCDPN-AETQK 1500

Human  1216 FMAHQDLRFPRIAFFSTRLIEAGEQLGFDY--------GERFWDIKGKLFSCRCGSPKCR----- 1267
            :..:.:|   ||.|||.:.|:.||::.|||        .:|          |.|.:..||     
  Fly  1501 WTVNGEL---RIGFFSVKPIQPGEEITFDYQYLRYGRDAQR----------CYCEAANCRGWIGG 1552

Human  1268 -----------HSSAALAQRQASAAQEAQEDGLPDTSSAAAA 1298
                       ..|.:.|:......:...|:|.|..|:.|.|
  Fly  1553 EPDSDEGEQLDEESDSDAEMDEEELEAEPEEGQPRKSAKAKA 1594
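
As a rough check, per-block counts can be read off the match line between the two sequences. The sketch below does this for the first block (Human 11-51, Fly 403-467). It assumes that `|` marks an identical pair, `:` a similar pair, `.` an aligned mismatch, and `-` a gap; that reading is inferred from the alignment itself, not from DIOPT documentation, and the function name is hypothetical.

```python
# First alignment block, copied from the alignment above.
human = "AVPARGEPQQDCCV--------KTELLG--------EET--PMAAD------EGSAEKQAGEAHM"
match = "|.|...||..|.|.        ...|.|        |||  .:||:      |....:...|.|:"
fly = "ATPPGKEPAADSCSSAPRRSRRSAPLSGSSRQGKTLEETFAEIAAESSKQILEAEESQDQEEQHI"


def block_counts(seq1, match_line, seq2):
    """Count identities, similarities, and gaps in one alignment block.

    Assumption: '|' = identical, ':' = similar, '-' = gap.
    Similarity is taken to include identical positions.
    """
    identical = match_line.count("|")
    similar = identical + match_line.count(":")
    gaps = seq1.count("-") + seq2.count("-")
    return {"identity": identical, "similarity": similar, "gaps": gaps}


block_totals = block_counts(human, match, fly)
print(block_totals)
```

Summing such counts over all blocks should approximate the totals in the summary, provided the symbol assumption holds.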

Known Domains:


Domain residues are highlighted in green in the alignment on the original page.

Gene   Sequence        Domain      Region      External ID  Identity
EHMT1  XP_011517323.1  EHMT_ZBD    517..647    CDD:411018   29/150 (19%)
                       ANKYR       723..979    CDD:440430   48/270 (18%)
                       ANK repeat  745..771    CDD:293786   3/28 (11%)
                       ANK repeat  777..806    CDD:293786   11/32 (34%)
                       ANK repeat  808..839    CDD:293786   8/30 (27%)
                       ANK repeat  841..873    CDD:293786   4/31 (13%)
                       ANK repeat  875..906    CDD:293786   4/30 (13%)
                       ANK repeat  908..939    CDD:293786   0/30 (0%)
                       ANK repeat  941..972    CDD:293786   2/30 (7%)
                       SET_EHMT1   1039..1269  CDD:380933   67/268 (25%)
Set2   NP_572888.2     valS        <709..832   CDD:237855   36/190 (19%)
                       AWS         1358..1410  CDD:197795   14/69 (20%)
                       SET_SETD2   1410..1551  CDD:380949   51/158 (32%)
                       WW          2014..2043  CDD:459800
                       SRI         2266..2355  CDD:462404
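On the original page, a blue background indicates that a domain is not in the aligned region. Such domains have no identity value; here they are WW and SRI of Set2, which lie beyond the last aligned Fly residue (1594).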
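
Each Identity entry for a domain is a count of identical positions over the aligned length of that domain. The sketch below recomputes the percentages for a few entries from the Known Domains listing; the helper name is illustrative, and rounding to the nearest integer is an assumption that happens to match the displayed values for these entries.

```python
# Identity fractions copied from the Known Domains listing.
domain_identity = {
    "EHMT_ZBD": "29/150",
    "ANKYR": "48/270",
    "SET_EHMT1": "67/268",
    "AWS": "14/69",
    "SET_SETD2": "51/158",
}


def identity_percent(fraction_text):
    """Convert a 'matches/length' string into a percentage (float)."""
    matches, length = (int(part) for part in fraction_text.split("/"))
    return 100 * matches / length


# Assumption: percentages are rounded to the nearest integer.
rounded = {
    name: round(identity_percent(text))
    for name, text in domain_identity.items()
}

for name, value in rounded.items():
    print(f"{name}: {value}%")
```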
