DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set2 and Ehmt1

DIOPT Version :10

Sequence 1:NP_572888.2 Gene:Set2 / 32301 FlyBaseID:FBgn0030486 Length:2362 Species:Drosophila melanogaster
Sequence 2:XP_006498485.1 Gene:Ehmt1 / 77683 MGIID:1924933 Length:1312 Species:Mus musculus


Alignment Length:1502 Identity:300/1502 - (19%)
Similarity:496/1502 - (33%) Gaps:482/1502 - (32%)


- Green bases have known domain annotations that are detailed below.


  Fly   358 LAKQEPTPD---PEEDLDEIMVEVLSGPPSLWSADDEAEEEEDATVQRATPPGKEP-AAD----- 413
            |||||...|   ..|.|.|       |..:..:||:.:.|:::         |:.| |||     
Mouse    25 LAKQETKQDCCMKTELLRE-------GKDTPMAADEGSTEKQE---------GETPMAADGETNG 73

  Fly   414 SCS--------SAPRRSR---RSAPLSGSSRQGKTLEETFAEIAAESSKQILEAEESQDQEEQHI 467
            ||.        :||:.::   |::|..|::|..:..|...:|...|..||            .|:
Mouse    74 SCEKSGDPSHLNAPKHTQENTRASPQEGTNRVSRVAENGVSERDTEVGKQ------------NHV 126

  Fly   468 LIDLIEDTLSESEVTSS----VSPTIEHMVVEEVVVEENQLVDEADEILDSKQEFVIKKVFSESD 528
            ..|   |.:..|.:.|:    ..|.::...:....:..:.|...|.:.|....        |:..
Mouse   127 TAD---DFMQTSVIGSNGYFLNKPALQGQPLRTPNILTSSLPGHAAKTLPGGA--------SKCR 180

  Fly   529 NIAASLNKDIFEPKVETKATCGEVVPRPEMVTEDVYITEGIAATLEKSAVVTKPTTEMIAETKLS 593
            .::|........|.|..:.:......:|.....||.:... ..|:.||.:      .:.|.:|..
Mouse   181 TLSALPQTPTTAPTVPGEGSADTEDRKPTASGTDVRVHRA-RKTMPKSIL------GLHAASKDH 238

  Fly   594 DEVVIEPPLKDESDPKQT---------EVELPESKPAV--NIPKSERILSAEVETTSSPLVPPEC 647
            .||      :|..:||:.         ..:|..:.||:  ::|:::..::    ||.|       
Mouse   239 REV------QDHKEPKEDINRNISECGRQQLLPTFPALHQSLPQNQCYMA----TTKS------- 286

  Fly   648 CTLESVSGPVLLETSLSTEEKSNENV--------------------------ETTPLKTEAAKED 686
               ::...|.:|..::|.::|.....                          .|...|.|.|.:|
Mouse   287 ---QTACLPFVLAAAVSRKKKRRMGTYSLVPKKKTKVLKQRTVIEMFKSITHSTVGAKGEKALDD 348

  Fly   687 SP--------PAAPEEEASNSSEEPN---------FLLEDYESNQEQVAE-DEMMKCNNQKGQKQ 733
            |.        ....|:|.|:..|:..         |..||..:::|.::| |...|.:....::|
Mouse   349 SALHVNGESLEMDSEDEDSDELEDDEDHGAEQAAAFPTEDSRTSKESMSETDRAAKMDGDSEEEQ 413

  Fly   734 TPLPEMKE-----PEKPVAETVSKKEKAMENPARS-SPAIVDKKVRAGEMEKKVVKSTKGTVPEK 792
            .. |:..|     .|..::...|.|:|.::...:: ||.|...:.|.....||         |..
Mouse   414 ES-PDTGEDEDGGDESDLSSESSIKKKFLKRRGKTDSPWIKPARKRRRRSRKK---------PSS 468

  Fly   793 KMDSKKSCAAVTPAKQKESGKSA-------------KEAILKKETEKEKSSAKLDSSSPNTLDKK 844
            .:.|:...::....:|...|.||             ...||..:||.|..     :|.|:.|...
Mouse   469 MLGSEACKSSPGSMEQAALGDSAGYMEVSLDSLDLRVRGILSSQTENEGL-----ASGPDVLGTD 528

  Fly   845 GKDTAQWSPQLQTLPKSSTKPPQESAPSVISKTTSNQPAPKEEQHAAKKGLSDNS---------- 899
            |         ||.:|..|.:  .|:..|....|.:|......|....:.|...||          
Mouse   529 G---------LQEVPLCSCR--METPKSREISTLANNQCMATESVDHELGRCTNSVVKYELMRPS 582

  Fly   900 ---PPSVL-------KAKEKAVSG---------FVEC------------DAMFKAMDLANAQLRL 933
               |..||       ..|.:...|         |:||            |...:..:.:......
Mouse   583 NKAPLLVLCEDHRGRMVKHQCCPGCGYFCTAGNFMECQPESSISHRFHKDCASRVNNASYCPHCG 647

  Fly   934 DEKNKKK---LKKVPTKVEAPPKVEPPTAVPVPGQKKSLS--GKTSLRRNTVYEDSPNLERNSSP 993
            :|.:|.|   :.|..|         ..|....|||:|||:  |:......:: ..:|..||:.|.
Mouse   648 EEASKAKEVTIAKADT---------TSTVTLAPGQEKSLAAEGRADTTTGSI-AGAPEDERSQST 702

  Fly   994 SSDSAQANTSAGKLKPSKVKKKINPRRSTICEAAKDLRSSSSSSTPTREVAASSPVSTSSDSSSK 1058
            :..:.:....||   |:.:   :.|             :|..|..|.:|...|:.::..|:...|
Mouse   703 APQAPECFDPAG---PAGL---VRP-------------TSGLSQGPGKETLESALIALDSEKPKK 748

  Fly  1059 ------------RNG----------------------SKRTTSD--------------LDGGSKL 1075
                        |.|                      |||:...              :..|:.:
Mouse   749 LRFHPKQLYFSARQGELQKVLLMLVDGIDPNFKMEHQSKRSPLHAAAEAGHVDICHMLVQAGANI 813

  Fly  1076 DQRRYTICED-RQP--ETAIPVPLTKRRFSMHPKASANPLH---DTLLQTAGKKRGRKEGKESLS 1134
            |    |..|| |.|  |.|....|...::.:...|..:|..   .|.|..|.|| |..:..:.|.
Mouse   814 D----TCSEDQRTPLMEAAENNHLDAVKYLIKAGAQVDPKDAEGSTCLHLAAKK-GHYDVVQYLL 873

  Fly  1135 RQNSLDSSSSASQG-APKKKALKSAEILSAALLETESSESTSSGSKMS---RW-------DV--- 1185
            ....:|.:.....| .|...|.:...:....||.::.|:.....::.:   .|       |:   
Mouse   874 SNGQMDVNCQDDGGWTPMIWATEYKHVELVKLLLSKGSDINIRDNEENICLHWAAFSGCVDIAEI 938

  Fly  1186 --QTSPELEAANPFGDIAKFIEDGVN-------LLKRDKVDEDQRKEGQDEVKREADPEEDEFAQ 1241
              ....:|.|.|..||....|....|       .|.||.....:.|||                 
Mouse   939 LLAAKCDLHAVNIHGDSPLHIAARENRYDCVVLFLSRDSDVTLKNKEG----------------- 986

  Fly  1242 RVANMETPATTPTPSPTQSNPEDSASTTTVLKELETGGGVRRSHRIKQKPQGPRASQGRGVA-SV 1305
                 |||.            :.::.::.|...|:....:|.|  ...||.....:..|.:| ..
Mouse   987 -----ETPL------------QCASLSSQVWSALQMSKALRDS--APDKPVAVEKTVSRDIARGY 1032

  Fly  1306 ALAPI----SMDEQLAELANIEAINEQFLRSEGLNTFQLLKENFYRCARQVSQENAEMQ-CDCFL 1365
            ...||    ::|.:|..                 ..::.:.:|.......:.:....:| |.|. 
Mouse  1033 ERIPIPCVNAVDSELCP-----------------TNYKYVSQNCVTSPMNIDRNITHLQYCVCV- 1079

  Fly  1366 TGDEEAQGHLSCGA-----------------GCINRMLMIECGPLCSNGARCTNKRFQQHQCWPC 1413
              |:.:.....||.                 ......|:.||...||....|.|:..|.......
Mouse  1080 --DDCSSSTCMCGQLSMRCWYDKDGRLLPEFNMAEPPLIFECNHACSCWRNCRNRVVQNGLRARL 1142

  Fly  1414 RVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQ---HLYSKDRNRHYYFMALRGEA- 1474
            :::||:..|.|:.:...||.|.|:.|||||:|...|.:.|:   :|:..|..        .||. 
Mouse  1143 QLYRTQDMGWGVRSLQDIPLGTFVCEYVGELISDSEADVREEDSYLFDLDNK--------DGEVY 1199

  Fly  1475 VIDATSKGNISRYINHSCDPN-AETQKWTVNGEL---RIGFFSVKPIQPGEEITFDYQYLRYGRD 1535
            .|||...||:||:|||.|:|| ...:.:..:.:|   ||.|||.:.||.||::.|||        
Mouse  1200 CIDARFYGNVSRFINHHCEPNLVPVRVFMSHQDLRFPRIAFFSTRLIQAGEQLGFDY-------- 1256

  Fly  1536 AQR----------CYCEAANCRGWIGGEPDSDEGEQLDEESDSDAEMDEEELEA--EPEE-GQPR 1587
            .:|          |.|.::.||                   .|.|.:.:.:..|  ||:| |.|.
Mouse  1257 GERFWDVKGKLFSCRCGSSKCR-------------------HSSAALAQRQASAAQEPQENGLPD 1302

  Fly  1588 KSAKAKA 1594
            .|:.|.|
Mouse  1303 TSSAAAA 1309

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set2NP_572888.2 valS <709..832 CDD:237855 29/142 (20%)
AWS 1358..1410 CDD:197795 14/69 (20%)
SET_SETD2 1410..1551 CDD:380949 51/158 (32%)
WW 2014..2043 CDD:459800
SRI 2266..2355 CDD:462404
Ehmt1XP_006498485.1 EHMT_ZBD 530..660 CDD:411018 24/131 (18%)
ANKYR 734..990 CDD:440430 51/282 (18%)
ANK repeat 788..817 CDD:293786 4/32 (13%)
ANK repeat 819..850 CDD:293786 8/30 (27%)
ANK repeat 852..884 CDD:293786 8/32 (25%)
ANK repeat 886..917 CDD:293786 6/30 (20%)
ANK repeat 919..950 CDD:293786 4/30 (13%)
ANK repeat 952..983 CDD:293786 7/30 (23%)
SET_EHMT1 1050..1280 CDD:380933 66/267 (25%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.