DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set2 and EZH1

DIOPT Version :10

Sequence 1:NP_572888.2 Gene:Set2 / 32301 FlyBaseID:FBgn0030486 Length:2362 Species:Drosophila melanogaster
Sequence 2:NP_001308008.1 Gene:EZH1 / 2145 HGNCID:3526 Length:753 Species:Homo sapiens


Alignment Length:924 Identity:185/924 - (20%)
Similarity:305/924 - (33%) Gaps:288/924 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly   748 ETVSKKEKAMENPARSSPAIV--DKKVRAGEMEKKVVKSTKGTVPEKKMDSKKSCAAVTPAKQKE 810
            |..||.|  :.||. :|..|.  .:||::..|..:.:|..:..:      ..|:......||.:|
Human     2 EDYSKME--IPNPP-TSKCITYWKRKVKSEYMRLRQLKRLQANM------GAKALYVANFAKVQE 57

  Fly   811 SGKSAKEAILKKETEKEKSSAKLDSSSPNTLDKKGKDTAQWSPQLQTLPKSSTKPPQESAPSVIS 875
                 |..||.:|.:|                            |:..|..|.||  .|....:.
Human    58 -----KTQILNEEWKK----------------------------LRVQPVQSMKP--VSGHPFLK 87

  Fly   876 KTTSNQPAPK-EEQHAAKKGLSDNSPPSVLKAKEKAVSGF-VECDAMFKAMDLANAQLRLDEKNK 938
            |.|.....|. ..||...:.|:..:...::.:.......| ||.:.:     |.|.....||   
Human    88 KCTIESIFPGFASQHMLMRSLNTVALVPIMYSWSPLQQNFMVEDETV-----LCNIPYMGDE--- 144

  Fly   939 KKLKKVPTKVEAPPKVEPPTAV---------PVPGQKKSLSGKTSLRRNTVYEDSPNLERNSSPS 994
                         .|.|..|.:         .|.|:::.:.|.. |..:.|:.:..:.....|..
Human   145 -------------VKEEDETFIEELINNYDGKVHGEEEMIPGSV-LISDAVFLELVDALNQYSDE 195

  Fly   995 SDSAQANTSAGKLKPSKVKKKINPRRSTICEAAKDLRSSSSSSTPTREVAASSPVSTSSDSSSKR 1059
            .:....:||.||...||                :||                 ||:         
Human   196 EEEGHNDTSDGKQDDSK----------------EDL-----------------PVT--------- 218

  Fly  1060 NGSKRTTSDLDGGSKLDQRRYTICEDRQPETAIPVPLTKRRFSMHPKASANPLHDTLLQTAGKKR 1124
              .||....::|..|..::::       |...|...:.    ||.|:   |.:.|.:     |:|
Human   219 --RKRKRHAIEGNKKSSKKQF-------PNDMIFSAIA----SMFPE---NGVPDDM-----KER 262

  Fly  1125 GRKEGKESLSRQNSLDSSSSASQGAPKKKALKSAEILSAALLET-------------ESSESTSS 1176
            .|:  ...:|..|:|....:.:...|..|:::..:.|.:  ..|             ....:|.:
Human   263 YRE--LTEMSDPNALPPQCTPNIDGPNAKSVQREQSLHS--FHTLFCRRCFKYDCFLHPFHATPN 323

  Fly  1177 GSKMSRWDVQTSPELEAANPFGDIAKFIEDGVN---LLKRDKVDEDQRKEGQDEV---------- 1228
            ..|....:::..||     |.|.....:.:|..   :|...:.....|:..:..:          
Human   324 VYKRKNKEIKIEPE-----PCGTDCFLLLEGAKEYAMLHNPRSKCSGRRRRRHHIVSASCSNASA 383

  Fly  1229 -----KREADPEED---EFAQRVANMETPATTPTPSPTQSNPEDSASTTTVLKELETGGGVRRSH 1285
                 .:|.|.:.|   ::|...:...:...|||.......|..........:.:|..|......
Human   384 SAVAETKEGDSDRDTGNDWASSSSEANSRCQTPTKQKASPAPPQLCVVEAPSEPVEWTGAEESLF 448

  Fly  1286 RIKQKPQGPRASQGRGVASVA-LAPISMDEQLAELANIEAINEQFLRSEGLNTFQLLKENF---- 1345
            |:.      ..:......|:| |......:|:.:.|..|::..:....|.:|..|..|...    
Human   449 RVF------HGTYFNNFCSIARLLGTKTCKQVFQFAVKESLILKLPTDELMNPSQKKKRKHRLWA 507

  Fly  1346 YRCAR-QVSQENAEMQ-----------------CDCFLTGDEEAQGHLSCGAGCINRMLMIECGP 1392
            ..|.: |:.::|:..|                 |.|.:|.:       .|...|       :|.|
Human   508 AHCRKIQLKKDNSSTQVYNYQPCDHPDRPCDSTCPCIMTQN-------FCEKFC-------QCNP 558

  Fly  1393 LCSN---GARC-TNKRFQQHQC--------------------WPCRVFRTEKKGC----GITAEL 1429
            .|.|   |.|| |....:|..|                    |.|:|  ...|.|    |:...|
Human   559 DCQNRFPGCRCKTQCNTKQCPCYLAVRECDPDLCLTCGASEHWDCKV--VSCKNCSIQRGLKKHL 621

  Fly  1430 LIPP----------------GEFIMEYVGEVIDSEEFERRQHLYSKDRNRHYYFMALRGEAVIDA 1478
            |:.|                .|||.||.||:|..:|.:||..:|  |:....:...|..:.|:||
Human   622 LLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY--DKYMSSFLFNLNNDFVVDA 684

  Fly  1479 TSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEEITFDYQYLRYGRDAQRCYCEA 1543
            |.|||..|:.|||.:||...:...|||:.|||.|:.:.||.|||:.|||:|            ..
Human   685 TRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY------------SQ 737

  Fly  1544 ANCRGWIGGEPDSD 1557
            |:...::|.|.::|
Human   738 ADALKYVGIERETD 751

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set2NP_572888.2 valS <709..832 CDD:237855 21/85 (25%)
AWS 1358..1410 CDD:197795 15/72 (21%)
SET_SETD2 1410..1551 CDD:380949 54/180 (30%)
WW 2014..2043 CDD:459800
SRI 2266..2355 CDD:462404
EZH1NP_001308008.1 EZH2_WD-Binding 45..74 CDD:463308 10/61 (16%)
PRC2_HTH_1 165..268 CDD:436286 29/168 (17%)
SANT 439..480 CDD:238096 8/46 (17%)
preSET_CXC 566..597 CDD:408079 6/30 (20%)
SET_EZH1 614..749 CDD:380994 50/148 (34%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.