DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set1 and EZH2

DIOPT Version :10

Sequence 1:NP_001015221.1 Gene:Set1 / 3354971 FlyBaseID:FBgn0040022 Length:1641 Species:Drosophila melanogaster
Sequence 2:XP_011514185.1 Gene:EZH2 / 2146 HGNCID:3527 Length:759 Species:Homo sapiens


Alignment Length:889 Identity:182/889 - (20%)
Similarity:304/889 - (34%) Gaps:287/889 - (32%)


- Green bases have known domain annotations that are detailed below.


  Fly   871 ISDTARSKIKGPVPIQESDSKSHTSGLNS----KRKGSASSFFSSSSSSTSSEAEYEAIDCVEKA 931
            :..|.:...|||| ......||....|..    :|.....|.|||:........|.         
Human     9 MGQTGKKSEKGPV-CWRKRVKSEYMRLRQLKRFRRADEVKSMFSSNRQKILERTEI--------- 63

  Fly   932 RTSEEDSPRGYGQRNLNQRTTTIRNRNLVGTMDVINVRNLCSGSNEFKKENVTKRTKKNIYSDTD 996
                           |||..   :.|.:.....:.:|.:| .|:.|.           ::.||.|
Human    64 ---------------LNQEW---KQRRIQPVHILTSVSSL-RGTREC-----------SVTSDLD 98

  Fly   997 EDNDRTLFPA----LKEKNISTILSDLEEISKDSCIGLDENGI--EPTILRKIPNTPKLNEECRR 1055
                   ||.    ||..|....:..:...|.     |.:|.:  :.|:|.   |.|.:.:|.  
Human    99 -------FPTQVIPLKTLNAVASVPIMYSWSP-----LQQNFMVEDETVLH---NIPYMGDEV-- 146

  Fly  1056 SLTPVPPPGYNEEEIKKKVDCK----QKPSFEYDRIY----------------SDSEEEKEYQER 1100
                :...|...||:.|..|.|    ::..|..|.|:                .|.::.:|.:|:
Human   147 ----LDQDGTFIEELIKNYDGKVHGDRECGFINDEIFVELVNALGQYNDDDDDDDGDDPEEREEK 207

  Fly  1101 RKRNTEYMAQME----REF------------------LEEQEKRIEKSLDKNLQS--PNNIVKNN 1141
            :|...::....|    |:|                  .||.:::.::..::.|..  |.....|.
Human   208 QKDLEDHRDDKESRPPRKFPSDKIFEAISSMFPDKGTAEELKEKYKELTEQQLPGALPPECTPNI 272

  Fly  1142 NSPRNKNDETRKTAIS----QTRSCFESASKVDTTLVNIISVENDINEFGPHEEGDVLTNGCNKM 1202
            :.|..|:.:..::..|    ..|.||    |.|..|               |.:       ||..
Human   273 DGPNAKSVQREQSLHSFHTLFCRRCF----KYDCFL---------------HRK-------CNYS 311

  Fly  1203 Y--TNSKGKTKRTQSPVYSEGGSSQASQASQVALEHCYSLP-------------------PHSVS 1246
            :  |.:..|.|.|::.:.::....|..|..:.|.|...:|.                   |::.|
Human   312 FHATPNTYKRKNTETALDNKPCGPQCYQHLEGAKEFAAALTAERIKTPPKRPGGRRRGRLPNNSS 376

  Fly  1247 LGDYPSGKVNETKNI-LKREAENIAIVSQMTRTGPGRPRKDPICIQKKKRDLAPRMSNVKS---- 1306
            ....|:..|.|:|:. ..|||.        |.||.....|:    :::|:|.....|...|    
Human   377 RPSTPTINVLESKDTDSDREAG--------TETGGENNDKE----EEEKKDETSSSSEANSRCQT 429

  Fly  1307 --KMTPNGD-----EWPDLAHKNVHFVPCDMY-------------KTRDQNEEMVILYTFLTKGI 1351
              ||.||.:     ||.. |..::..|....|             ||..|..|..:..:.:....
Human   430 PIKMKPNIEPPENVEWSG-AEASMFRVLIGTYYDNFCAIARLIGTKTCRQVYEFRVKESSIIAPA 493

  Fly  1352 DAEDINFIKMSYLDHLHKEPYAMFLNNTHWVDHCTTDRAFWPPPSKKRRKDDELIRHKTGCARTE 1416
            .|||::                                   .||.||:||      |:...|...
Human   494 PAEDVD-----------------------------------TPPRKKKRK------HRLWAAHCR 517

  Fly  1417 GFYKLDVRE--KAKHKYHYAKANTEDSFNEDR----SDEPTALTNHHHNKLI-------SKMQGI 1468
               |:.:::  .:.|.|:|...      :..|    |..|..:..:...|..       ::..|.
Human   518 ---KIQLKKDGSSNHVYNYQPC------DHPRQPCDSSCPCVIAQNFCEKFCQCSSECQNRFPGC 573

  Fly  1469 SREARSNQRR---------------LLTAFGSMGESELLKFNQLKFR---KKQLKFAKSAIHDWG 1515
            ..:|:.|.::               |........:|:.:.......:   ||.|..|.|.:..||
Human   574 RCKAQCNTKQCPCYLAVRECDPDLCLTCGAADHWDSKNVSCKNCSIQRGSKKHLLLAPSDVAGWG 638

  Fly  1516 LFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFIN 1580
            :|..:|:..:|.:.||.|::|....||.|...|:.  ...|:||.::.:.::|||:.||..||.|
Human   639 IFIKDPVQKNEFISEYCGEIISQDEADRRGKVYDK--YMCSFLFNLNNDFVVDATRKGNKIRFAN 701

  Fly  1581 HSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLED 1624
            ||.||||||||:.:..:.:|.|::|:.|...||:.:||::...|
Human   702 HSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGEELFFDYRYSQAD 745

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set1NP_001015221.1 RRM_Set1 99..191 CDD:409745
U2AF_lg 255..>386 CDD:273727
N-SET 1352..1496 CDD:463344 24/171 (14%)
SET_SETD1 1490..1637 CDD:380946 48/138 (35%)
EZH2XP_011514185.1 EZH2_WD-Binding 47..76 CDD:463308 9/55 (16%)
PRC2_HTH_1 166..257 CDD:436286 12/90 (13%)
preSET_CXC 572..603 CDD:408079 4/30 (13%)
SET_EZH2 622..741 CDD:380995 47/120 (39%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.