DRSC/TRiP Functional Genomics Resources


Protein Alignment E(z) and set-2

DIOPT Version: 10

Sequence 1: NP_001261682.1  Gene: E(z) / 39203  FlyBaseID: FBgn0000629  Length: 765  Species: Drosophila melanogaster
Sequence 2: NP_498039.1  Gene: set-2 / 175662  WormBaseID: WBGene00004782  Length: 1510  Species: Caenorhabditis elegans


Alignment Length: 650
Identity: 131/650 (20%)
Similarity: 212/650 (32%)
Gaps: 206/650 (31%)
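
The summary percentages are consistent with truncating count/length to a whole number rather than rounding (212/650 is about 32.6% but is shown as 32%). A minimal Python sketch under that assumption follows; the helper name `truncated_percent` is hypothetical, and the truncation rule is inferred from the numbers on this page, not taken from DIOPT documentation.

```python
def truncated_percent(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage, truncated toward zero.

    Truncation (not rounding) is an assumption inferred from this page's values.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    return (100 * count) // total


# Counts copied from the alignment summary.
ALIGNMENT_LENGTH = 650
summary = {"Identity": 131, "Similarity": 212, "Gaps": 206}

percents = {name: truncated_percent(n, ALIGNMENT_LENGTH) for name, n in summary.items()}

for name, n in summary.items():
    print(f"{name}: {n}/{ALIGNMENT_LENGTH} ({percents[name]}%)")
```

Integer arithmetic avoids floating-point edge cases when a ratio falls close to a whole percent.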


- Green residues have known domain annotations, which are detailed below.


  Fly   173 RSYSKELEEAAPGTATAIKTETLAKSKQGEDDGVVDVDADGESPMKLEKTD---SKGDLTEVEKK 234
            :|..::|..::..::|...|.|...|.:...........|||.|.|..:||   .:....|.|::
 Worm   984 KSRKRKLIMSSDESSTTGSTATSVVSSRQSSLEPQQEKTDGEPPKKKSQTDFISERVSKIEGEER 1048

  Fly   235 ETEEPLETEDADVKPDVEEVKDKLPFPAPIIFQAISANFPDKGTAQELKEKYIELTEHQDPERPQ 299
            ...||:||.                  .|||           |.:..|..|.:    |.:.....
 Worm  1049 PLPEPVETS------------------GPII-----------GDSSYLPYKIV----HWEKAGII 1080

  Fly   300 ECTPNIDGIKAESVSRERTMHSFHTLFCRRCFKYDCFLHRHHVQGLQGHAGPNLQKRRYPELKPF 364
            |.....:.|:|         |.:|......|:.                   .:...|.|:::.|
 Worm  1081 EMNLPANSIRA---------HEYHPFTTEHCYF-------------------GIDDPRQPKIQIF 1117

  Fly   365 -AEPCSNSCYMLIDGMKEKLAADSKTPPIDSCNEA-------------SSEDSNDSNSQFSNKD- 414
             ..||.:.     .|.:......:...|||:..|.             :.:.......|...|| 
 Worm  1118 DHSPCKSE-----PGSEPLKITPAPWGPIDNVAETGPLIYMDVVTAPKTVQKKQKPRKQVFEKDP 1177

  Fly   415 ---FNHENSKDNG------LTVNSAAVAEINSIMAGMMNITSTQCVWTGADQALYRVLHKVYLKN 470
               :....:|...      .|....:..|...|:....::...:..|                  
 Worm  1178 YEYYEPPPTKRPAPPPRFKKTFKPRSEEEKKKIIGDCEDLPDLEDQW------------------ 1224

  Fly   471 YCAIAHNMLTKTCRQVYEFAQKEDAEFSFEDLRQD-----FTPPRKKK----------------- 513
            |...|.|.:....:...|...|:...|. |.||.:     ..|.|.||                 
 Worm  1225 YLRAALNEMQSEVKSADELPWKKMLTFK-EMLRSEDPLLRLNPIRSKKGLPDAFYEDEELDGVIP 1288

  Fly   514 -----KKQRLWSLHCRKIQLKKDSSSNHVYNYTPCDHPGHP----CDMNCSCIQTQNFCEK---- 565
                 .:.|.:    .|:.:|:..|...    .| |:..||    .:.:.:.|:.|:...|    
 Worm  1289 VAAGCSRARPY----EKMTMKQKRSLVR----RP-DNESHPTAIFSERDETAIRHQHLASKDMRL 1344

  Fly   566 ----FCNCSSDCQNRFPGCRCKAQCNTKQCPCYLAVRECDPDLCQACGADQFKLTKITCKNVCVQ 626
                ......|..|.|                                   ||:.::..:.    
 Worm  1345 LQRRLLTSLGDANNDF-----------------------------------FKINQLKFRK---- 1370

  Fly   627 RGLHKHLLMAPSDIAGWGIFLKEGAQKNEFISEYCGEIISQDEADRRGKVYDK--YMCSFLFNLN 689
                |.:..|.|.|.|||::..|....:|.|.||.|:.|....|:.|.|.|::  ...|:||.::
 Worm  1371 ----KMIKFARSRIHGWGLYAMESIAPDEMIVEYIGQTIRSLVAEEREKAYERRGIGSSYLFRID 1431

  Fly   690 NDFVVDATRKGNKIRFANHSINPNCYAKVMMVTGDHRIGIFAKRAIQPGEELFFDYRYGPTEQLK 754
            ...|:|||::||..||.|||..|||||||:.:.|:.||.|:::..|:.|||:.:||:: |.|..|
 Worm  1432 LHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKF-PIEDDK 1495
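
In each block, the middle line annotates the columns between the Fly and Worm rows. Comparing it against the sequences shows that `|` sits under identical residues. The `:` and `.` symbols mark non-identical pairs, presumably graded similarity; which of them count toward the Similarity total is not stated on this page. The Python sketch below tallies the symbols for the first seven columns of the first block (`count_match_symbols` is a hypothetical helper name).

```python
from collections import Counter


def count_match_symbols(match_line: str) -> Counter:
    """Count annotation symbols in one match line; spaces (unmarked columns) are skipped."""
    return Counter(ch for ch in match_line if not ch.isspace())


# First seven aligned columns of the first block (Fly 173.., Worm 984..).
fly = "RSYSKEL"
match = ":|..::|"
worm = "KSRKRKL"

counts = count_match_symbols(match)

# Cross-check: '|' should appear exactly where the two residues are identical.
identical = sum(a == b for a, b in zip(fly, worm))

print(dict(counts), "identical columns:", identical)
```

Applying the same count to every match line and summing the `|` totals should reproduce the reported identity count, provided the lines are copied with their column alignment intact.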

Known Domains:


Indicated by green residues in the alignment.

Gene   Sequence        Domain      Region      External ID  Identity
E(z)   NP_001261682.1  PRC2_HTH_1  151..291    CDD:436286   25/120 (21%)
                       SANT        452..492    CDD:238096   5/39 (13%)
                       preSET_CXC  579..610    CDD:408079   0/30 (0%)
                       SET_EZH2    628..747    CDD:380995   49/120 (41%)
set-2  NP_498039.1     RRM_SF      124..213    CDD:473069
                       SET_SETD1   1359..1506  CDD:380946   54/180 (30%)
Blue background indicates that the domain is not in the aligned region.
