DRSC/TRiP Functional Genomics Resources




Protein Alignment G9a and Kmt2d

DIOPT Version: 9

Sequence 1: NP_001259088.1  Gene: G9a / 30971  FlyBaseID: FBgn0040372  Length: 1657  Species: Drosophila melanogaster
Sequence 2: XP_017450810.1  Gene: Kmt2d / 100362634  RGDID: 2324324  Length: 5544  Species: Rattus norvegicus


Alignment Length: 1992
Identity:   345/1992 (17%)
Similarity: 593/1992 (29%)
Gaps:       715/1992 (35%)
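
The percentages above can be reproduced from the raw counts. The sketch below assumes the report truncates rather than rounds (593/1992 is about 29.8%, yet 29% is shown); that rule is inferred from the numbers on this page, not from DIOPT documentation, and the function name is hypothetical.

```python
def truncated_percent(count: int, total: int) -> int:
    """Return count/total as a whole percentage, truncated toward zero.

    Assumption: truncation (not rounding) is inferred from the reported
    values on this page; it is not a documented DIOPT rule.
    """
    return (100 * count) // total


# Counts copied from the alignment summary above.
alignment_length = 1992
stats = {"Identity": 345, "Similarity": 593, "Gaps": 715}

for name, count in stats.items():
    pct = truncated_percent(count, alignment_length)
    print(f"{name}: {count}/{alignment_length} ({pct}%)")
```

With ordinary rounding, Similarity and Gaps would come out as 30% and 36%, so the truncation assumption matters when comparing against the reported figures.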


- Green bases have known domain annotations that are detailed below.


  Fly    45 LANNQFASKEKKHKDKEEEERKEARNQEEIEDIKALLADVVD---------AAAVKLE------- 93
            :.:....:::::.:.::::::::.:.|..:..:..|...::.         .|...|:       
  Rat  3848 MGHRLLTAQQQQQQQQQQQQQQQQQQQGSMTGLSQLQQGMMSHGGQPKMSVQALGSLQQQQQQQQ 3912

  Fly    94 --EEEAQNAEKVEPHTKCEIEEEGRKEMEYDQDVAKQDSEMEKKQNGKATSITVKMESNERAEKH 156
              :::.|..::::...:.:::::.:::|   |.:.:|..:::::|          .:..::.:.|
  Rat  3913 QLQQQQQQMQQLQQQQQQQLQQQQQQQM---QQLQQQQQQLQQQQ----------QQQQQQQQLH 3964

  Fly   157 ATEIATTSTERWENESFKTEQQNKKAAEKEEEPILAATQKLEAN----------AEPLTTTRIEV 211
            ..:......:..:.:....:||.:.:...:...:|:..|:.:..          .:||.......
  Rat  3965 LQQQLHQQQQLQQQQLQLQQQQQQMSLLNQNRTLLSPQQQQQQQQQVTLGPGMPVKPLQHFSSPG 4029

  Fly   212 AVASPLVVSSASVKLAADA----TNQMRAATSAGAATL--ADKNVQVSPGGTRRSRRTPRPI--- 267
            |:...|:::.....:|..|    ..:..:|...|..|:  ..:::.|.||..:.|......:   
  Rat  4030 ALGPALLLTGKEQNIAETALPSEVTEGPSAHQGGPPTVGTTPESMSVEPGEVKPSISGDSQLLLV 4094

  Fly   268 -DTPTSVTDEHVQVENKKFGKSEQY-----TDCSSHLERFTLDDNT---------AIVRLQLKSE 317
             ..||||   .:|...:..|:.:|.     |...||.::.....::         |...:.|..:
  Rat  4095 QSQPTSV---QLQPPLRLPGQPQQVNLLHTTGGGSHGQQLGSGSSSEAPSGPHLLAQPSVSLGEQ 4156

  Fly   318 PDKPSLTALSPEE-----------------NSAPAPKRGRGRARKIRPDAEVETSEVILPCEDSL 365
            |...:...|..::                 .|.|||:.|:|     .|...|      :|....|
  Rat  4157 PGPMAQNLLGSQQPLGLERPIQNNTGSQPPKSGPAPQSGQG-----PPGVGV------MPTVGQL 4210

  Fly   366 GEKKPGRKRKLPD----EPIDQQQLSDLVVVKTEQ-----------EELGDAP------LG---- 405
            ..:..|...|.|.    .|..||||..|::.:..|           :|.|..|      ||    
  Rat  4211 RAQLQGVLAKTPQLRHLSPQQQQQLQALLMQRQLQQSQAVRQMPPGQESGTQPSPLQGLLGCQPQ 4275

  Fly   406 ----DVKRMRRSVRLGNRLHADGSPWEEVKTEALHPQPSAELSFAEVTSEILPLAVLDEKTPPKK 466
                ...::.....||......|.|        ..|.|...||...|...:.|       |||..
  Rat  4276 PGVFSASQIGPLQELGAGSRPQGPP--------RLPVPQGALSTGPVLGPVHP-------TPPPS 4325

  Fly   467 RGRKAKTPCVKLESETSCGLPFANGNKKTNSSGGCELQLPKRSKRRIKPTPKIL----------- 520
            ..::.|.|. :|.|.::...|...|..|..   |...:||   ..|:.|....|           
  Rat  4326 SPQEPKRPS-QLPSPSAQLTPTHPGTPKPQ---GPASELP---PGRVSPAAAQLADAFFGKGLGP 4383

  Fly   521 --ENDEL----RCEFETKHIERMTQWESAAAVDGDF--------ETPTTGGNGSNSSTSRQKSDK 571
              .:|.|    :.|..:....|:.|.....|.:...        |.|...|    :.|.::::: 
  Rat  4384 WDPSDNLPEAQKPEQSSLAAGRLEQVNGQVAHEPSHLSIKQEPREEPCALG----AQTVKREAN- 4443

  Fly   572 SDGSNFEGGPGHPAGTSAIKKRLFSKSQRDIENYGAAMLAKSKLPPCPDVEQFLNDIKASRINAN 636
                      |.|||.......|.....|  ...|..:|.|            |...|..::.|.
  Rat  4444 ----------GEPAGAPGTSNHLLLAGSR--SEAGHLLLQK------------LLRAKNVQLGAG 4484

  Fly   637 RSPE--ERKLNKKQQRKLAKQKEKHLKHLGLQKNHRDEPSDNDSSNTDNEFFPTTRVQVGKPSVT 699
            |.||  ..::|.....||:.|::|      ||           .:::..|.....:..:.||.  
  Rat  4485 RGPEGLRAEINGHVDSKLSGQEQK------LQ-----------GTSSSKEDAAARKPLMAKPK-- 4530

  Fly   700 LRVRNSVTKELPTTATLKSRRNPVVQAAKLTRRIGARAAGEVTEAAR---ASVPISTPDAEQLHS 761
             ||:.       |:..|.|.|.      ||.:..|.||...:.:..:   :.:|::.|......|
  Rat  4531 -RVQK-------TSDRLASSRK------KLRKEDGVRANEALLKQLKQELSQLPLTEPTITANFS 4581

  Fly   762 L-------------------------------------DTSIQADVTPIRDLDMRPSTSRVSKFI 789
            |                                     ..::....||...|...|..|...|.:
  Rat  4582 LFAPFGSGCLVGGQSQLRGAFGSGALHTGPDYYSQLLTKNNLSNPPTPPSSLPPTPPPSVQQKMV 4646

  Fly   790 C-------LCQKPSQYYARNAPDSSYCCAIDHIDDQKIGCCNELSSEVHNLLRPSQRVSYMILCD 847
            .       |.:.|..  |.:|.||..... |..:.:.:.....|.:..||     |.....:..|
  Rat  4647 NGVTPSEELGEHPKD--AASAQDSERTLK-DAAEVKSLDLLAALPTPPHN-----QTEDVRMESD 4703

  Fly   848 EHKKRLQSHNCCAGCGIFCTQGKFVLCKQQHFFHPDCAQRFILSTSYEKELGDEE---------- 902
            |..                                |.....:.::|.|..||:|.          
  Rat  4704 EDS--------------------------------DSPDSIVPASSPESILGEEAPRFPQLGSGR 4736

  Fly   903 -DQGVKFSSPVLVLKCPHCGLDTPERTSTVTMKCQSLPVFLRTQKYKIKPARLTTSSHLTQFGTV 966
             :|..:..|||:.:        .| ||        |:|||..|:.|.:.              .:
  Rat  4737 WEQDTRALSPVIPI--------IP-RT--------SIPVFPDTKPYGVL--------------DL 4770

  Fly   967 ENANTPGATARNKG-GLSTAVTLSAASSPASKTNGAQRGRAGTSNSNSRHALNSINFAQLIPESV 1030
            |....|.|||..|| |...:|.|:.:::.|...||..                 :..|:|:...:
  Rat  4771 EVPGKPPATAWEKGKGSEVSVMLTVSAAAAKNLNGVM-----------------VAVAELLSMKI 4818

  Fly  1031 MNVVLRGHVVSASGRVTAEFTPRDMYYAVQNDDLERVAEILAADFNVLTPIREYLNGTCLHLVAH 1095
            .|                      .|..:..|...|..         |.|.:....|        
  Rat  4819 PN----------------------SYEVLFPDGPARAG---------LEPKKGETEG-------- 4844

  Fly  1096 SGTLQMAYLLLCKGAS------SPDFVNIVD-----YELRTALMCAVMNEKCDMLNLFLQCGADV 1149
            ||..:       ||.|      .||::...|     |.|::.|         |:|:|..|     
  Rat  4845 SGGKE-------KGLSGRGPDTGPDWLKQFDAVLPGYTLKSQL---------DILSLLKQ----- 4888

  Fly  1150 AIKGPDGKTSLH--IAAQLGNLEATQLIVDSYRTSRNITSFLS----------FIDAQDEGGWTA 1202
              :.|..:.|:.  ....:.||:..||............|.|:          .::.|.|.    
  Rat  4889 --ESPAPEPSIQHSYTYNVSNLDVRQLSAPPPEEPSPPPSPLAPSPASPPAEPMVELQAEP---- 4947

  Fly  1203 MVWAAELGHTDIVRLASLPQA--------------------------VFLKLINIFLFISFLLNQ 1241
               .||......:.|||.|:|                          |..|.:.:.|.|.....:
  Rat  4948 ---PAEPPIPSPLPLASSPEATRPKPRARPPDESEDSRPPRLKKWKGVRWKRLRLLLTIQKGSGR 5009

  Fly  1242 DAD----------------PNICDNDN-------------------------------NTVLHWS 1259
            ..|                |:....||                               |..| ||
  Rat  5010 QEDEREVAEFMEQLGTALRPSKVPRDNRRCCFCHEEGDGATDGPARLLNLDLDLWVHLNCAL-WS 5073

  Fly  1260 T----------------LHNDGLDTITVLLQSGADCNVQNVEGDTPLHIAC-------------- 1294
            |                ||...|...::..::||..:...:......|.||              
  Rat  5074 TEVYETQGGALMNVEVALHRGLLTKCSLCQRTGATGSCNRMRCPNVYHFACAIRAKCMFFKDKTM 5138

  Fly  1295 ---RHSVTRMCIALIANGADL----MIKNKAEQLPFDCIPNEESECGRTVGFNM---------QM 1343
               .|.:...|...:::.|..    :.:::.:|:.......|.....|..|...         ||
  Rat  5139 LCPMHKIKGPCEQELSSFAVFRRVYIERDEVKQIASIIQRGERLHMFRVGGLVFHAIGQLLPHQM 5203

  Fly  1344 RSFR------PLG---------LRT------FVVCADASNGREARPIQVVRNELAMSENEDEADS 1387
            ..|.      |:|         |||      :......:|||....|:|:...|......|.:..
  Rat  5204 ADFHSATALYPVGYEATRIYWSLRTNNRRCCYRCSISENNGRPEFVIKVIEQGLEDLVFTDASPQ 5268

  Fly  1388 LMWPDFRYVTQCIIQQNSVQIDRRVSQ---------------MRICSCLDSCSSDRCQCNGASSQ 1437
            .:|.  |.:......:....:.|...:               :||...|....|  ||       
  Rat  5269 AVWN--RIIEPVAAMRKEADMLRLFPEYLKGEELFGLTVHAVLRIAESLPGVES--CQ------- 5322

  Fly  1438 NWYTAESRLNADFNY------EDPAVIFECNDVCGCNQLSCK----------------NRVVQN- 1479
                     |..|.|      |.|.:|    :..||.:...|                ::..|: 
  Rat  5323 ---------NYLFRYGRHPLMELPLMI----NPTGCARSEPKILTHYKRPHTLNSTSMSKAYQST 5374

  Fly  1480 ---GTRTPL--QIVE-------------------CEDQAKGWGVRALANVPKGTFVGSYTGEILT 1520
               .|.||.  |.|.                   ...:.:|.|:.|..::.|.|.|..|.|.|:.
  Rat  5375 FTGETNTPYSKQFVHSKSSQYRRLRTEWKNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIR 5439

  Fly  1521 AMEADRR-------TDDSYYFDLDNGHCIDANYYGNVTRFFNHSCEPNVLPVRVFYEHQDYRFPK 1578
            ...|:||       ....|.|.::|.|.|||...|...|:.||||.||.:...|.::.:|    |
  Rat  5440 NEVANRREKIYEEQNRGIYMFRINNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKED----K 5500

  Fly  1579 IAFFSCRDIDAGEEICFDYGEKFWRVEHRSCVGCRCLTTTCK 1620
            |...|.|.|..|||:.:||...|...:|:  :.|.|....|:
  Rat  5501 IIIISSRRIPKGEELTYDYQFDFEDDQHK--IPCHCGAWNCR 5540

Known Domains:


Indicated by green bases in alignment.

Gene   Sequence        Domain           Region      External ID  Identity
G9a    NP_001259088.1  ATP-synt_B       150..248    CDD:304375   14/113 (12%)
                       Ank_2            1056..1152  CDD:289560   21/106 (20%)
                       ANK              1088..1217  CDD:238125   27/151 (18%)
                       ANK repeat       1088..1120  CDD:293786   7/37 (19%)
                       ANK repeat       1124..1153  CDD:293786   5/28 (18%)
                       Ank_2            1127..1249  CDD:289560   28/175 (16%)
                       ANK              1155..1306  CDD:238125   39/268 (15%)
                       ANK repeat       1155..1196  CDD:293786   7/52 (13%)
                       ANK repeat       1199..1249  CDD:293786   13/91 (14%)
                       Ank_2            1205..1316  CDD:289560   31/220 (14%)
                       ANK repeat       1251..1283  CDD:293786   12/78 (15%)
                       ANK repeat       1285..1316  CDD:293786   6/51 (12%)
                       PreSET           1357..1466  CDD:128744   22/129 (17%)
                       SET              1495..1602  CDD:214614   39/113 (35%)
Kmt2d  XP_017450810.1  ePHD1_KMT2D      134..217    CDD:277165
                       PHD1_KMT2C_like  228..273    CDD:276984
                       PHD_SF           275..320    CDD:304600
                       PHD3_KMT2D       1335..1385  CDD:277072
                       PHD5_KMT2C_like  1386..1432  CDD:276988
                       PHD5_KMT2D       1463..1513  CDD:277074
                       HMG              1979..2030  CDD:197700
                       ePHD2_KMT2D      5039..5145  CDD:277168   14/106 (13%)
                       FYRN             5189..5239  CDD:283589   9/49 (18%)
                       FYRC             5247..5334  CDD:197781   16/106 (15%)
                       SET              5404..5526  CDD:214614   40/125 (32%)

Blue background indicates that the domain is not in the aligned region.
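
Because the background colour does not survive in plain text, the same information can be recovered by comparing each domain's region with the aligned residue range. The sketch below is illustrative only: the function names are hypothetical, and the aligned range for the rat sequence (3848..5540) is an assumption read off the first and last residue numbers printed in the alignment rows.

```python
def parse_region(region: str) -> tuple[int, int]:
    """Parse a 'start..end' region string into integer coordinates."""
    start, end = region.split("..")
    return int(start), int(end)


def in_aligned_region(region: str, aligned: tuple[int, int]) -> bool:
    """True if the domain overlaps the aligned residue range at all."""
    start, end = parse_region(region)
    return start <= aligned[1] and end >= aligned[0]


# Assumption: aligned range taken from the first and last 'Rat' residue
# numbers shown in the alignment (3848 and 5540).
RAT_ALIGNED = (3848, 5540)

# Kmt2d domain regions copied from the Known Domains table.
kmt2d_domains = {
    "ePHD1_KMT2D": "134..217",
    "HMG": "1979..2030",
    "ePHD2_KMT2D": "5039..5145",
    "SET": "5404..5526",
}

for name, region in kmt2d_domains.items():
    status = "aligned" if in_aligned_region(region, RAT_ALIGNED) else "not in aligned region"
    print(f"{name:12s} {region:12s} {status}")
```

On this page, the domains that fall outside the assumed range are exactly the rows that carry no Identity value.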


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result, Score, Score Type, Cluster ID)
Compara         1             0.930           - - C166351987
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
Total           1             0.930
