DRSC/TRiP Functional Genomics Resources




Protein Alignment Set2 and Ezh1

DIOPT Version: 10

Sequence 1: NP_572888.2 Gene: Set2 / 32301 FlyBaseID: FBgn0030486 Length: 2362 Species: Drosophila melanogaster
Sequence 2: NP_001100521.1 Gene: Ezh1 / 303547 RGDID: 1305028 Length: 747 Species: Rattus norvegicus


Alignment Length: 749 Identity: 164/749 (21%)
Similarity: 242/749 (32%) Gaps: 258/749 (34%)
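
The percentages in this summary can be reproduced from the raw counts. The sketch below is illustrative only and is not DIOPT's own code. The helper name `percent` is hypothetical, and truncation to a whole number is an assumption: it is consistent with the reported 21% identity, whereas rounding to the nearest integer would give 22%.

```python
# Illustrative sketch: recompute the summary percentages from the counts.
# Assumption: percentages are truncated to whole numbers, not rounded.

def percent(count: int, length: int) -> int:
    """Return count/length as a whole-number percentage, truncated."""
    return count * 100 // length

alignment_length = 749
counts = {"Identity": 164, "Similarity": 242, "Gaps": 258}

# Map each statistic to its truncated percentage of the alignment length.
summary = {name: percent(n, alignment_length) for name, n in counts.items()}

for name, n in counts.items():
    print(f"{name}: {n}/{alignment_length} ({summary[name]}%)")
```

All three statistics share the alignment length (749 columns, gaps included) as their denominator, not the length of either input sequence.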


- Green bases have known domain annotations that are detailed below.


  Fly   880 NQPAPKEEQ--HAAKKGLSDNSP---PSVLKAKEKAVSGFVECDAMFKAMDLANAQLRLDEKNKK 939
            ||.:.:||:  :....|..|:|.   |...|.|..|:.|                       |||
  Rat   184 NQYSDEEEEGHNDTSDGKQDDSKEDLPVTRKRKRHAIEG-----------------------NKK 225

  Fly   940 KLKK-VPTKV--EAPPKVEPPTAVPVPGQKKSLSGKTSLRRNTVYEDSPNLERNSSPSSDSAQAN 1001
            ..|| .|..:  .|...:.|...||.       ..|...|..|...|...|....:|:.|...| 
  Rat   226 SSKKQFPNDMIFSAIASMFPENGVPD-------DMKERYRELTEMSDPNALPPQCTPNIDGPNA- 282

  Fly  1002 TSAGKLKPSKVKKKINPRRSTICEAA--KDLRSSSSSSTP------TREVAASSPVSTSSDSSSK 1058
                  |..:.::.::...:..|...  .|.......:||      .||:.. .|....:|....
  Rat   283 ------KSVQREQSLHSFHTLFCRRCFKYDCFLHPFHATPNVYKRKNREIKI-EPEPCGADCFLW 340

  Fly  1059 RNGSKRTTSDLDGGSKLDQRRYTICEDRQPETAIPVPLTKRRFSMHPKASA---NPLHDTLLQTA 1120
            ..|:|......:..||...||                  :||   ||..||   |.....:.:| 
  Rat   341 LEGAKEYAMLHNPRSKCSGRR------------------RRR---HPVVSASCSNTSASAMAET- 383

  Fly  1121 GKKRGRKEGKESLSRQNSLDSSSSAS----QGAPKKKALKSAEILSAALLETESSESTSSGSKMS 1181
                  |||.......|...||||.:    |...|:||..:...|......:|..|.|.:...:.
  Rat   384 ------KEGDSDRDTGNDWASSSSEANSRCQTPTKQKASPAPPQLCVVEAPSEPVEWTGAEESLF 442

  Fly  1182 RWDVQTSPELEAANPFGDIAKFIEDGVNLLKRDKVDEDQRKEGQDEVKREADPEEDEFAQRVANM 1246
            |....|     ..|.|..||:.:  |....|                      :..:||.:.:.:
  Rat   443 RVFHGT-----YFNNFCSIARLL--GTKTCK----------------------QVFQFAVKESLI 478

  Fly  1247 ETPATTPTPSPTQSNPEDSASTTTVLKELETGGGVRRSHRIKQKPQGPRASQGRGVASVALAPIS 1311
            ....|....:|:|..                    :|.||:                        
  Rat   479 LKLPTDELMNPSQKK--------------------KRKHRL------------------------ 499

  Fly  1312 MDEQLAELANIEAINEQFLRSEGLNTFQLLKENFYRCARQVSQENAEMQCD----CFLTGDEEAQ 1372
               ..|....|:      |:.:. |:.|:.  |:..|      ::.:..||    |.:|.:    
  Rat   500 ---WAAHCRKIQ------LKKDN-NSTQVY--NYQPC------DHPDRPCDSTCPCIMTQN---- 542

  Fly  1373 GHLSCGAGCINRMLMIECGPLCSN---GARC-TNKRFQQHQC--------------------WPC 1413
               .|...|       :|.|.|.|   |.|| |....:|..|                    |.|
  Rat   543 ---FCEKFC-------QCSPDCQNRFPGCRCKTQCNTKQCPCYLAVRECDPDLCLTCGASEHWDC 597

  Fly  1414 RVFRTEKKGC----GITAELLIPP----------------GEFIMEYVGEVIDSEEFERRQHLYS 1458
            :|  ...|.|    |:...||:.|                .|||.||.||:|..:|.:||..:| 
  Rat   598 KV--VSCKNCSIQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY- 659

  Fly  1459 KDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEEI 1523
             |:....:...|..:.|:|||.|||..|:.|||.:||...:...|||:.|||.|:.:.||.|||:
  Rat   660 -DKYMSSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEEL 723

  Fly  1524 TFDYQYLRYGRDAQRCYCEAANCRGWIGGEPDSD 1557
            .|||:|            ..|:...::|.|.::|
  Rat   724 FFDYRY------------SQADALKYVGIERETD 745

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence        Domain           Region      External ID  Identity
Set2  NP_572888.2     valS             <709..832   CDD:237855
                      AWS              1358..1410  CDD:197795   15/59 (25%)
                      SET_SETD2        1410..1551  CDD:380949   54/180 (30%)
                      WW               2014..2043  CDD:459800
                      SRI              2266..2355  CDD:462404
Ezh1  NP_001100521.1  EZH2_WD-Binding  39..68      CDD:463308
                      PRC2_HTH_1       159..262    CDD:436286   24/107 (22%)
                      SANT             433..474    CDD:238096   12/69 (17%)
                      preSET_CXC       560..591    CDD:408079   6/30 (20%)
                      SET_EZH1         608..743    CDD:380994   50/148 (34%)
Blue background indicates that the domain is not in the aligned region.
