DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set2 and EHMT1

DIOPT Version :10

Sequence 1:NP_572888.2 Gene:Set2 / 32301 FlyBaseID:FBgn0030486 Length:2362 Species:Drosophila melanogaster
Sequence 2:XP_011517323.1 Gene:EHMT1 / 79813 HGNCID:24650 Length:1301 Species:Homo sapiens


Alignment Length:1472 Identity:295/1472 - (20%)
Similarity:478/1472 - (32%) Gaps:464/1472 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly   403 ATPPGKEPAADSCSSAPRRSRRSAPLSGSSRQGKTLEETFAEIAAESSKQILEAEESQDQEEQHI 467
            |.|...||..|.|.        ...|.|        |||  .:||:      |....:...|.|:
Human    11 AVPARGEPQQDCCV--------KTELLG--------EET--PMAAD------EGSAEKQAGEAHM 51

  Fly   468 LIDLIEDTLSESEVTSSVSPTIEHMVVEEVVVEENQLVDEADEILDSKQEFVIKKVFSESDNIAA 532
            ..|...:...|:...||.:...:|       .:::..|:..|.  .:....:.:...||.|:.||
Human    52 AADGETNGSCENSDASSHANAAKH-------TQDSARVNPQDG--TNTLTRIAENGVSERDSEAA 107

  Fly   533 SLNKDIFEPKVETK--ATCGEVVPRPEMVTEDVYITEGIAATLEKSAVVTKPTTEMIAETKLSDE 595
            ..|....:..|:|.  .:.|.::.:|.:..:.:..|..:|::|...|..|.|.......|     
Human   108 KQNHVTADDFVQTSVIGSNGYILNKPALQAQPLRTTSTLASSLPGHAAKTLPGGAGKGRT----- 167

  Fly   596 VVIEPPLKDESDPKQTEVELPESKPAVNIPKSERILSAEVETTSSPLVPPECCTLESVSGPVLLE 660
                                |.:.|                  .:|..||          ..|.|
Human   168 --------------------PSAFP------------------QTPAAPP----------ATLGE 184

  Fly   661 TSLSTEEKSNENVETTPLKTEAAKEDSPPAAPEEEASNSSEEPNFLLEDYESNQEQVAEDEMMKC 725
            .|..||::... .....:|...|::..|.:.....|  :|::|.   |..|:...:..::|:.|.
Human   185 GSADTEDRKLP-APGADVKVHRARKTMPKSVVGLHA--ASKDPR---EVREARDHKEPKEEINKN 243

  Fly   726 NNQKGQKQ--TPLPEMKE--PEKP----------------VAETVSKKEKAMEN-----PARSSP 765
            .:..|::|  .|.|.:.:  |:..                :|..||:|:|....     |.:.:.
Human   244 ISDFGRQQLLPPFPSLHQSLPQNQCYMATTKSQTACLPFVLAAAVSRKKKRRMGTYSLVPKKKTK 308

  Fly   766 AIVDKKVRAGEMEKKVVKSTKGTVPEK--------------KMDSKK--------------SCAA 802
            .:..:.|.  ||.|.:..||.|:..||              :|||.:              ..||
Human   309 VLKQRTVI--EMFKSITHSTVGSKGEKDLGASSLHVNGESLEMDSDEDDSEELEEDDGHGAEQAA 371

  Fly   803 VTPAK---------------QKESGKSAKEAILKKETEKEKSSAKLDSSSPNTLDK-----KGKD 847
            ..|.:               ||..|:|.:|.......|:|:...:.|.||.:::.|     |||.
Human   372 AFPTEDSRTSKESMSEADRAQKMDGESEEEQESVDTGEEEEGGDESDLSSESSIKKKFLKRKGKT 436

  Fly   848 TAQWSPQLQTLPKSSTKPPQESAPSVISKT---TSNQPAPKEEQHAAKKGLSDNSPPSVLKAKEK 909
            .:.|....:...:.|.|.|..:..|...|:   ::.|.||.:     ..|..:.|    |.:.:.
Human   437 DSPWIKPARKRRRRSRKKPSGALGSESYKSSAGSAEQTAPGD-----STGYMEVS----LDSLDL 492

  Fly   910 AVSGFVECDAMFKAMDLANAQLRLDEKNKKKLKKVPTKVEAPPKVEPPTAVPVPGQ---KKSLSG 971
            .|.|.:..    :|..|||....|:....:::.....::|.|...|..|.  ...|   .:|:..
Human   493 RVKGILSS----QAEGLANGPDVLETDGLQEVPLCSCRMETPKSREITTL--ANNQCMATESVDH 551

  Fly   972 KTSLRRNTV--YEDSPNLERNSSPSSDSAQANTSAGKLKPSKVKKKINPRRSTICEAAKDLRSSS 1034
            :.....|:|  ||    |.|.|:.:..........|::    ||.:..|.....|.|...:....
Human   552 ELGRCTNSVVKYE----LMRPSNKAPLLVLCEDHRGRM----VKHQCCPGCGYFCTAGNFMECQP 608

  Fly  1035 SSSTPTREVAASSPVSTSSDSSSKRNGSKRTTSDLDGGSKLDQRRYTICEDRQPETAIPVP---- 1095
            .||...|         ...|.:|:.|.:.......:..||  .:..||.:.....|..|||    
Human   609 ESSISHR---------FHKDCASRVNNASYCPHCGEESSK--AKEVTIAKADTTSTVTPVPGQEK 662

  Fly  1096 ---LTKRRFSMHPKASANPL-HDTLLQ---------------------TAGKKRGRKEGKESL-S 1134
               |..|..:....|:..|| .|..||                     |.|..:|  .|||:| |
Human   663 GSALEGRADTTTGSAAGPPLSEDDKLQGAASHVPEGFDPTGPAGLGRPTPGLSQG--PGKETLES 725

  Fly  1135 RQNSLDSSSSASQGAPKK-----KAL----KSAEILSAALLETESSESTSSGSKMSRWDVQTSPE 1190
            ...:|||..      |||     |.|    :..|:....|:..:..:   ...||...: :.|| 
Human   726 ALIALDSEK------PKKLRFHPKQLYFSARQGELQKVLLMLVDGID---PNFKMEHQN-KRSP- 779

  Fly  1191 LEAANPFG--DIA-KFIEDGVNLLKRDKVDEDQR---KEGQDEVKREA-----------DPEEDE 1238
            |.||...|  ||. ..::.|.|:   |...||||   .|..:....||           ||::.|
Human   780 LHAAAEAGHVDICHMLVQAGANI---DTCSEDQRTPLMEAAENNHLEAVKYLIKAGALVDPKDAE 841

  Fly  1239 ------FAQRVANMETPATTPTPSPTQSNPEDSASTTTVL-----------KELETGGG------ 1280
                  .|.:..:.|......:......|.:|....|.::           |.|.:.|.      
Human   842 GSTCLHLAAKKGHYEVVQYLLSNGQMDVNCQDDGGWTPMIWATEYKHVDLVKLLLSKGSDINIRD 906

  Fly  1281 ------------------------------------------VRRSHR---------------IK 1288
                                                      ..|.:|               :|
Human   907 NEENICLHWAAFSGCVDIAEILLAAKCDLHAVNIHGDSPLHIAARENRYDCVVLFLSRDSDVTLK 971

  Fly  1289 QKPQGPRASQGRGVASVALAPISMDEQLAELAN-------------------------IEAINEQ 1328
            .| :|....|...:.|...:.:.|.:.|.:.|.                         :.|::.:
Human   972 NK-EGETPLQCASLNSQVWSALQMSKALQDSAPDRPSPVERIVSRDIARGYERIPIPCVNAVDSE 1035

  Fly  1329 FLRSEGLNTFQLLKENFYRCARQVSQENAEMQ-CDCFLTGDEEAQGHLSCGA------------- 1379
            ...|    .::.:.:|.......:.:....:| |.|.   |:.:..:..||.             
Human  1036 PCPS----NYKYVSQNCVTSPMNIDRNITHLQYCVCI---DDCSSSNCMCGQLSMRCWYDKDGRL 1093

  Fly  1380 ----GCINRMLMIECGPLCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEY 1440
                ......|:.||...||....|.|:..|.......:::||...|.|:.:...||||.|:.||
Human  1094 LPEFNMAEPPLIFECNHACSCWRNCRNRVVQNGLRARLQLYRTRDMGWGVRSLQDIPPGTFVCEY 1158

  Fly  1441 VGEVIDSEEFERRQ---HLYSKDRNRHYYFMALRGEA-VIDATSKGNISRYINHSCDPN-AETQK 1500
            |||:|...|.:.|:   :|:..|..        .||. .|||...||:||:|||.|:|| ...:.
Human  1159 VGELISDSEADVREEDSYLFDLDNK--------DGEVYCIDARFYGNVSRFINHHCEPNLVPVRV 1215

  Fly  1501 WTVNGEL---RIGFFSVKPIQPGEEITFDYQYLRYGRDAQR----------CYCEAANCRGWIGG 1552
            :..:.:|   ||.|||.:.|:.||::.|||        .:|          |.|.:..||     
Human  1216 FMAHQDLRFPRIAFFSTRLIEAGEQLGFDY--------GERFWDIKGKLFSCRCGSPKCR----- 1267

  Fly  1553 EPDSDEGEQLDEESDSDAEMDEEELEAEPEEGQPRKSAKAKA 1594
                       ..|.:.|:......:...|:|.|..|:.|.|
Human  1268 -----------HSSAALAQRQASAAQEAQEDGLPDTSSAAAA 1298

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set2NP_572888.2 valS <709..832 CDD:237855 36/190 (19%)
AWS 1358..1410 CDD:197795 14/69 (20%)
SET_SETD2 1410..1551 CDD:380949 51/158 (32%)
WW 2014..2043 CDD:459800
SRI 2266..2355 CDD:462404
EHMT1XP_011517323.1 EHMT_ZBD 517..647 CDD:411018 29/150 (19%)
ANKYR 723..979 CDD:440430 48/270 (18%)
ANK repeat 745..771 CDD:293786 3/28 (11%)
ANK repeat 777..806 CDD:293786 11/32 (34%)
ANK repeat 808..839 CDD:293786 8/30 (27%)
ANK repeat 841..873 CDD:293786 4/31 (13%)
ANK repeat 875..906 CDD:293786 4/30 (13%)
ANK repeat 908..939 CDD:293786 0/30 (0%)
ANK repeat 941..972 CDD:293786 2/30 (7%)
SET_EHMT1 1039..1269 CDD:380933 67/268 (25%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.