DRSC/TRiP Functional Genomics Resources




Protein Alignment E(z) and Ehmt1

DIOPT Version: 10

Sequence 1: NP_001261682.1  Gene: E(z) / 39203  FlyBaseID: FBgn0000629  Length: 765  Species: Drosophila melanogaster
Sequence 2: XP_006498485.1  Gene: Ehmt1 / 77683  MGIID: 1924933  Length: 1312  Species: Mus musculus


Alignment Length: 837
Identity: 176/837 (21%)
Similarity: 271/837 (32%)
Gaps: 261/837 (31%)
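The percentages in the summary above follow directly from the reported counts divided by the alignment length. A minimal sketch of that arithmetic is shown here; the helper name `percent` and the round-to-nearest-integer rule are assumptions, since the page does not state how DIOPT rounds.

```python
# Recompute the summary percentages from the counts reported on this page.
# Assumption: percentages are rounded to the nearest whole number.

def percent(count: int, length: int) -> int:
    """Return count/length as a whole-number percentage."""
    return round(100 * count / length)

alignment_length = 837
stats = {"Identity": 176, "Similarity": 271, "Gaps": 261}

# Map each statistic name to its percentage of the alignment length.
summary = {name: percent(n, alignment_length) for name, n in stats.items()}

for name, pct in summary.items():
    print(f"{name}: {stats[name]}/{alignment_length} ({pct}%)")
```

Under that rounding assumption, the recomputed values agree with the 21%, 32%, and 31% figures reported by the page.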


- Green bases have known domain annotations that are detailed below.


  Fly    77 TSYNGIPSGP--------QKVPICVINAVTPIPTMYTWAPTQQNFMVEDETVLHNIPYMGDEVLD 133
            |...|:.|||        |:||:|.....||              ...:.:.|.|...|..|.:|
Mouse   513 TENEGLASGPDVLGTDGLQEVPLCSCRMETP--------------KSREISTLANNQCMATESVD 563

  Fly   134 KD-GKFIEELIKNYDGKVHGDKDP----------------------------SFMDDAIFVELVH 169
            .: |:....::| |:.....:|.|                            :||:......:.|
Mouse   564 HELGRCTNSVVK-YELMRPSNKAPLLVLCEDHRGRMVKHQCCPGCGYFCTAGNFMECQPESSISH 627

  Fly   170 ALMR-------------------SYSKELEEA----------APGTATAIKTETLAKSKQGEDDG 205
            ...:                   |.:||:..|          |||...::..|..|.:..|...|
Mouse   628 RFHKDCASRVNNASYCPHCGEEASKAKEVTIAKADTTSTVTLAPGQEKSLAAEGRADTTTGSIAG 692

  Fly   206 V-VDVDADGESPMKLEKTDSKG---------DLTEVEKKETEEPLETEDADVKPDVEEVKDKLPF 260
            . .|..:...:|...|..|..|         .|::...|||     .|.|.:..|.|:.| ||.|
Mouse   693 APEDERSQSTAPQAPECFDPAGPAGLVRPTSGLSQGPGKET-----LESALIALDSEKPK-KLRF 751

  Fly   261 -PAPIIFQAISANFPDKGTAQELKEKYIELTEHQDPERPQECTPNIDGIKAESVSRERTMHSF-- 322
             |..:.|.|...         ||::..:.|.:..||           ..|.|..|:...:|:.  
Mouse   752 HPKQLYFSARQG---------ELQKVLLMLVDGIDP-----------NFKMEHQSKRSPLHAAAE 796

  Fly   323 --HTLFCRRCFKYDCFLHRHHVQGLQGHAGPNLQKRRYPELKPFAEPCSNSCYMLIDGMKEKLAA 385
              |...|....:                ||.|:......:..|..|...|:   .:|.:|..:.|
Mouse   797 AGHVDICHMLVQ----------------AGANIDTCSEDQRTPLMEAAENN---HLDAVKYLIKA 842

  Fly   386 DSKTPPID----SCNEASSEDSNDSNSQF--SNKDFNHENSKDNGLTVNSAAV----AEINSIMA 440
            .::..|.|    :|...:::..:....|:  ||...:.....|.|.|....|.    .|:..::.
Mouse   843 GAQVDPKDAEGSTCLHLAAKKGHYDVVQYLLSNGQMDVNCQDDGGWTPMIWATEYKHVELVKLLL 907

  Fly   441 GM---MNITSTQ---CVWTGADQALYRVLHKVYLKNYCAIAHNMLTKTC---------RQVYEFA 490
            ..   :||...:   |            ||.........||..:|...|         ......|
Mouse   908 SKGSDINIRDNEENIC------------LHWAAFSGCVDIAEILLAAKCDLHAVNIHGDSPLHIA 960

  Fly   491 QKE------------DAEFSFEDLRQDFTPPRKKKKKQRLWS-LHCRK----------IQLKKDS 532
            .:|            |::.:.:: ::..||.:......::|| |...|          :.::|..
Mouse   961 ARENRYDCVVLFLSRDSDVTLKN-KEGETPLQCASLSSQVWSALQMSKALRDSAPDKPVAVEKTV 1024

  Fly   533 SSNHVYNY----TPCDH--PGHPCDMNCSCIQTQNFCEKFCNCSSDCQN------RFPGCRCKAQ 585
            |.:....|    .||.:  ....|..|...:..        ||.:...|      ....|.|...
Mouse  1025 SRDIARGYERIPIPCVNAVDSELCPTNYKYVSQ--------NCVTSPMNIDRNITHLQYCVCVDD 1081

  Fly   586 CNTKQCPCYLAVREC---------------DPDL---C-QACGADQFKLTKITCKNVCVQRGLHK 631
            |::..|.|......|               :|.|   | .||...:      .|:|..||.||..
Mouse  1082 CSSSTCMCGQLSMRCWYDKDGRLLPEFNMAEPPLIFECNHACSCWR------NCRNRVVQNGLRA 1140

  Fly   632 HLLMAPSDIAGWGIFLKEGAQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFNLNND----F 692
            .|.:..:...|||:...:......|:.||.||:||..|||.|.:      .|:||:|:|.    :
Mouse  1141 RLQLYRTQDMGWGVRSLQDIPLGTFVCEYVGELISDSEADVREE------DSYLFDLDNKDGEVY 1199

  Fly   693 VVDATRKGNKIRFANHSINPNCY-AKVMMVTGD---HRIGIFAKRAIQPGEELFFDY 745
            .:||...||..||.||...||.. .:|.|...|   .||..|:.|.||.||:|.|||
Mouse  1200 CIDARFYGNVSRFINHHCEPNLVPVRVFMSHQDLRFPRIAFFSTRLIQAGEQLGFDY 1256

Known Domains:


Indicated by green bases in alignment.

Gene   Sequence        Domain       Region      External ID  Identity
E(z)   NP_001261682.1  PRC2_HTH_1   151..291    CDD:436286   39/207 (19%)
                       SANT         452..492    CDD:238096   7/48 (15%)
                       preSET_CXC   579..610    CDD:408079   11/49 (22%)
                       SET_EZH2     628..747    CDD:380995   47/126 (37%)
Ehmt1  XP_006498485.1  EHMT_ZBD     530..660    CDD:411018   22/144 (15%)
                       ANKYR        734..990    CDD:440430   55/308 (18%)
                       ANK repeat   788..817    CDD:293786   6/44 (14%)
                       ANK repeat   819..850    CDD:293786   7/33 (21%)
                       ANK repeat   852..884    CDD:293786   4/31 (13%)
                       ANK repeat   886..917    CDD:293786   6/30 (20%)
                       ANK repeat   919..950    CDD:293786   7/42 (17%)
                       ANK repeat   952..983    CDD:293786   3/30 (10%)
                       SET_EHMT1    1050..1280  CDD:380933   66/227 (29%)
