DRSC/TRiP Functional Genomics Resources



Protein Alignment G9a and set-15

DIOPT Version: 10

Sequence 1:NP_001259088.1 Gene:G9a / 30971 FlyBaseID:FBgn0040372 Length:1657 Species:Drosophila melanogaster
Sequence 2:NP_001370771.1 Gene:set-15 / 187813 WormBaseID:WBGene00020006 Length:611 Species:Caenorhabditis elegans


Alignment Length:453 Identity:103/453 - (22%)
Similarity:161/453 - (35%) Gaps:121/453 - (26%)
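The percentages above can be reproduced from the raw counts. The short Python sketch below assumes the page truncates to a whole percent rather than rounding; this is inferred only from the numbers shown (103/453 is about 22.7% but is reported as 22%), not from any DIOPT documentation. The helper name `percent_truncated` is hypothetical.

```python
# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 453
COUNTS = {"Identity": 103, "Similarity": 161, "Gaps": 121}


def percent_truncated(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumed page behavior)."""
    return 100 * count // total


for label, count in COUNTS.items():
    exact = 100 * count / ALIGNMENT_LENGTH
    shown = percent_truncated(count, ALIGNMENT_LENGTH)
    print(f"{label}: {count}/{ALIGNMENT_LENGTH} = {exact:.1f}% (shown as {shown}%)")
```

Rounding to the nearest whole percent would instead give 23%, 36%, and 27%, so the truncated values are slightly conservative.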


- Green residues have known domain annotations that are detailed below.


  Fly  1277 ADCNVQN-VEGDTPLHIACRHSVTRMCIALIANG-----ADLMIKNKAE-----QLPFDCIPNEE 1330
            |:|:|:| ::.|..|.    ....|.|..||...     ||:|..:.|.     .:|....|.:.
 Worm   174 AECSVRNAIKSDPELD----GFYVRTCGKLINKTVLNTMADMMDNHMATTRLLWNMPRHEFPVDF 234

  Fly  1331 SECGRTVGFNMQMRSFRPLGLRTFV----------VCADASNGREARPIQVVRNELAMSENEDEA 1385
            .|  :.|.|.:.: |:.|...|.||          ...|.. |...|.|.:....:......:..
 Worm   235 HE--KFVDFALNV-SYTPPKKRIFVEKKLFEKSISTFTDIP-GETHRTISLGNTFVYKYTERNVV 295

  Fly  1386 DSLMWPDFRYVTQCIIQQNSVQIDRRVSQMRICSCLDSCSSDRC---------QCNGASSQNWYT 1441
            |...:|:...:|:...:.|:|        :| |.|..|.|..:|         |.|....:....
 Worm   296 DVEKYPELHALTEEAERPNNV--------IR-CECCSSGSIKKCWNNPDCFCFQMNLKLRKEQDP 351

  Fly  1442 AESRLNADFNYEDPAVI-----------FECNDVCGCNQLSCKNRVVQNGTRTPLQIVE-----C 1490
            ..::|..||:..||..|           |.|::.|.|.. .|.|.:    |..|.:.:.     .
 Worm   352 KNNKLLTDFSTFDPVFIRERNHFFDTIGFACSENCACGG-KCTNNI----TLLPEKNINKFEIYR 411

  Fly  1491 EDQAKGWGVRALANVPKGTFVGSYTGEILTAMEADRRTDDSYYFDLDN-GH-------------- 1540
            :::..|:.:|.|.::|.||.|..:|||::.....| ..|..|.|::.| .|              
 Worm   412 KNEIMGFAIRTLNSIPAGTPVMEFTGELMDFDILD-NIDQDYAFEIVNEAHNLHETLPNFNKRWS 475

  Fly  1541 ---------------CIDANYYGNVTRFFNHSCEPNVLPVRVFYEHQDYRFPKIAFFSCRDIDAG 1590
                           .::....|||.|...|||:||:..||||.:.......|:...:..||..|
 Worm   476 ENFKSSLKKQLARPWFVNPKRIGNVARICCHSCQPNMAMVRVFQKGFSPAHCKLLLVTLEDIFPG 540

  Fly  1591 EEICFDYGEKFWRVEHRSCVGCRCLTTTCKYASQSSSTNASPTNATTAPENETGTLSSTNTEK 1653
            .|:.||||..:.. |.:.  ||.|....|                   |..|:..:.|..|:|
 Worm   541 VELTFDYGPGYLN-ELKG--GCLCERIGC-------------------PNTESFGILSRATKK 581
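Identity and gap counts for any stretch of the alignment can be checked directly from the two aligned rows, without relying on the match line. The Python sketch below is illustrative only: the function name `column_stats` is hypothetical, and the two strings are the first 24 columns of the last block above (Fly 1591 onward, Worm 541 onward). A column counts as identical when both rows carry the same residue, and as a gap when either row has `-`.

```python
def column_stats(row1: str, row2: str) -> dict:
    """Count identical columns and gap columns in two aligned rows."""
    if len(row1) != len(row2):
        raise ValueError("aligned rows must have the same length")
    pairs = list(zip(row1, row2))
    identical = sum(a == b and a != "-" for a, b in pairs)
    gaps = sum(a == "-" or b == "-" for a, b in pairs)
    return {"length": len(pairs), "identical": identical, "gaps": gaps}


# First 24 alignment columns of the final block, copied from the rows above.
fly = "EEICFDYGEKFWRVEHRSCVGCRC"
worm = "VELTFDYGPGYLN-ELKG--GCLC"

stats = column_stats(fly, worm)
print(stats)
```

Summing these per-block counts over all blocks should reproduce the Identity and Gaps totals in the summary, provided the rows are copied with their gap characters intact.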

Known Domains:


Indicated by green residues in alignment.

Gene    Sequence        Domain      Region      External ID  Identity
G9a     NP_001259088.1  PTZ00121    <68..>188   CDD:173412
                        EHMT_ZBD    786..933    CDD:411018
                        ANKYR       1063..1322  CDD:440430   14/55 (25%)
                        ANK repeat  1088..1120  CDD:293786
                        ANK repeat  1124..1153  CDD:293786
                        ANK repeat  1155..1196  CDD:293786
                        ANK repeat  1199..1249  CDD:293786
                        ANK repeat  1251..1283  CDD:293786   3/5 (60%)
                        ANK repeat  1285..1316  CDD:293786   9/35 (26%)
                        SET_EHMT    1391..1622  CDD:380941   69/285 (24%)
set-15  NP_001370771.1  SET         405..554    CDD:214614   38/150 (25%)

Blue background indicates that the domain is not in the aligned region.
