DRSC/TRiP Functional Genomics Resources


Protein Alignment Set1 and EZH1

DIOPT Version: 10

Sequence 1: NP_001015221.1  Gene: Set1 / 3354971  FlyBaseID: FBgn0040022  Length: 1641  Species: Drosophila melanogaster
Sequence 2: NP_001308008.1  Gene: EZH1 / 2145  HGNCID: 3526  Length: 753  Species: Homo sapiens


Alignment Length: 841  Identity: 179/841 (21%)
Similarity: 277/841 (32%)  Gaps: 279/841 (33%)
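
The summary percentages can be reproduced from the raw counts. The sketch below assumes the page truncates to a whole percent rather than rounding, which is consistent with 277/841 (about 32.9%) being shown as 32%. The function name `percent_truncated` is hypothetical and not part of DIOPT.

```python
def percent_truncated(count: int, total: int) -> int:
    """Whole-number percentage, truncated toward zero (assumed page behaviour)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return (100 * count) // total


# Counts copied from the alignment summary above.
alignment_length = 841
counts = {"Identity": 179, "Similarity": 277, "Gaps": 279}

for label, count in counts.items():
    pct = percent_truncated(count, alignment_length)
    print(f"{label}: {count}/{alignment_length} ({pct}%)")
```

Rounding to the nearest percent instead would give 33% for similarity, so the choice matters when comparing these figures with other tools.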


- Green residues have known domain annotations, which are detailed below.


  Fly   840 PTKRNFLERDLSDQEEMVQRSDSDKEDSNVEISDTARSKIKGPVPIQESDSKSHTSGLNSKRKGS 904
            |.::||:..|.:....:....|..||:....|.:.          |...|.|.|  |......||
Human   122 PLQQNFMVEDETVLCNIPYMGDEVKEEDETFIEEL----------INNYDGKVH--GEEEMIPGS 174

  Fly   905 ASSFFSSSSSSTSSEAEY-EAIDCVEKARTSEEDSPRGYGQRNLNQRTTTIRNRNLVGTMDVINV 968
            .          ..|:|.: |.:|.:.:....||:   |:..                 |.|    
Human   175 V----------LISDAVFLELVDALNQYSDEEEE---GHND-----------------TSD---- 205

  Fly   969 RNLCSGSNEFKKEN--VTKRTKKNIYSDTDEDNDRTLFPALKEKNISTILS---------DLEEI 1022
                 |..:..||:  ||::.|::.. :.::.:.:..||  .:...|.|.|         |::|.
Human   206 -----GKQDDSKEDLPVTRKRKRHAI-EGNKKSSKKQFP--NDMIFSAIASMFPENGVPDDMKER 262

  Fly  1023 SKDSCIGLDENGIEPTILRKI--PNTPKLNEE----------CRRSLTPVPPPGYNEEEIKKKVD 1075
            .::.....|.|.:.|.....|  ||...:..|          |||..               |.|
Human   263 YRELTEMSDPNALPPQCTPNIDGPNAKSVQREQSLHSFHTLFCRRCF---------------KYD 312

  Fly  1076 CKQKPSFEYDRIYSDSEEEKEYQERRKRNTEYMAQMERE------FLEEQEKRIEKSLDKNLQSP 1134
            |...|......:|            :::|.|  .::|.|      ||             .|:..
Human   313 CFLHPFHATPNVY------------KRKNKE--IKIEPEPCGTDCFL-------------LLEGA 350

  Fly  1135 NNIVKNNNSPRNK-NDETRKTAISQTRSCFE-SASKVDTTLVNIISVENDINEFGPHEEGDVLTN 1197
            ......:| ||:| :...|:.....:.||.. |||.|..|                 :|||...:
Human   351 KEYAMLHN-PRSKCSGRRRRRHHIVSASCSNASASAVAET-----------------KEGDSDRD 397

  Fly  1198 GCNKMYTNSKGKTKRTQSPVYSEGGSSQASQA-SQVALEHCYSLPP-------------HSVSLG 1248
            ..|...::|.....|.|:|.     ..:||.| .|:.:....|.|.             |.....
Human   398 TGNDWASSSSEANSRCQTPT-----KQKASPAPPQLCVVEAPSEPVEWTGAEESLFRVFHGTYFN 457

  Fly  1249 DYPS-GKVNETKNILKREAENIAIVSQMTRTGPGRPRKDPICIQKKKRDLAPRMSNVKSKMTPNG 1312
            ::.| .::..||..  ::....|:...:....|.....:|  .|||||         |.::    
Human   458 NFCSIARLLGTKTC--KQVFQFAVKESLILKLPTDELMNP--SQKKKR---------KHRL---- 505

  Fly  1313 DEWPDLAHKNVHFVPCDMYKTRDQNEEMVILYTFLTKGIDAEDINFIKMSYLDHLHKE-----PY 1372
              |  .||       |...:.:..|....: |.:               ...||..:.     |.
Human   506 --W--AAH-------CRKIQLKKDNSSTQV-YNY---------------QPCDHPDRPCDSTCPC 543

  Fly  1373 AMFLNNTHWVDHCTTDRAFWPPPSKKRRKDDELIRHKTGCARTEGFYKLDVREKAKHKYHYAKAN 1437
            .|..|.......|.       |..:.|...   .|.||.|...:....|.|||            
Human   544 IMTQNFCEKFCQCN-------PDCQNRFPG---CRCKTQCNTKQCPCYLAVRE------------ 586

  Fly  1438 TEDSFNEDRSDEPTALT----NHHHNKLISKMQGISREARSNQRRLLTAFGSMGESELLKFNQLK 1498
                     .|....||    .|...|:      :|.:..|.||.|                   
Human   587 ---------CDPDLCLTCGASEHWDCKV------VSCKNCSIQRGL------------------- 617

  Fly  1499 FRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDM 1563
              ||.|..|.|.:..||.|..|.:..:|.:.||.|::|....||.|...|:.  ..||:||.::.
Human   618 --KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFNLNN 678

  Fly  1564 ETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLED 1624
            :.::|||:.||..||.|||.||||||||:.:..:.:|.|::|:.|...||:.:||::...|
Human   679 DFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 739
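
Each block of the alignment has three rows: the fly sequence, a match line, and the human sequence. As a minimal sketch, the final block above can be tallied column by column. It assumes that `|` on the match line marks identical residues, and that `:` and `.` mark weaker classes of match whose exact definition is not stated on this page. The function name `tally_block` is hypothetical.

```python
from collections import Counter

# Final alignment block, copied from the page (Fly 1564-1624, Human 679-739).
fly = "ETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLED"
match = ":.::|||:.||..||.|||.||||||||:.:..:.:|.|::|:.|...||:.:||::...|"
human = "DFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD"


def tally_block(top: str, mid: str, bottom: str) -> dict:
    """Count columns, gaps, identical residue pairs, and match-line symbols."""
    if not (len(top) == len(mid) == len(bottom)):
        raise ValueError("all three rows must have the same number of columns")
    pairs = list(zip(top, bottom))
    return {
        "columns": len(top),
        # A column is a gap if either sequence has '-' there.
        "gaps": sum(a == "-" or b == "-" for a, b in pairs),
        # Identity computed directly from the residues, ignoring the match line.
        "identical": sum(a == b and a != "-" for a, b in pairs),
        # Raw symbol counts from the match line, for cross-checking.
        "symbols": Counter(mid),
    }


result = tally_block(fly, match, human)
print(result["columns"], result["gaps"], result["identical"])
print(dict(result["symbols"]))
```

Comparing `result["identical"]` with the number of `|` symbols is a quick check that a block was copied without shifting columns.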

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence        Domain           Region      External ID  Identity
Set1  NP_001015221.1  RRM_Set1         99..191     CDD:409745
                      U2AF_lg          255..>386   CDD:273727
                      N-SET            1352..1496  CDD:463344   26/152 (17%)
                      SET_SETD1        1490..1637  CDD:380946   49/135 (36%)
EZH1  NP_001308008.1  EZH2_WD-Binding  45..74      CDD:463308
                      PRC2_HTH_1       165..268    CDD:436286   26/146 (18%)
                      SANT             439..480    CDD:238096   5/42 (12%)
                      preSET_CXC       566..597    CDD:408079   11/54 (20%)
                      SET_EZH1         614..749    CDD:380994   52/149 (35%)
Blue background indicates that the domain is not in the aligned region.
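
For downstream use, a domain row can be split into its fields. This is a rough sketch: it assumes a row has the form `Domain Region CDD:ID [identical/aligned (pct%)]`, and it treats a `<` or `>` in the region as a marker of a partial domain hit, which is an assumption that this page does not confirm. The names `DomainRow`, `parse_row` and `percent_rounded` are hypothetical. The percentages in this table appear to be rounded to the nearest whole number (52/149 is about 34.9% and is shown as 35%).

```python
import re
from typing import NamedTuple, Optional


class DomainRow(NamedTuple):
    name: str
    start: int
    end: int
    partial: bool              # True if the region carried '<' or '>' (assumed meaning)
    cdd_id: str
    identical: Optional[int]   # None when the domain lies outside the aligned region
    aligned: Optional[int]


ROW = re.compile(
    r"^(?P<name>\S+)\s+(?P<start>[<>]?\d+)\.\.(?P<end>[<>]?\d+)\s+CDD:(?P<cdd>\d+)"
    r"(?:\s+(?P<ident>\d+)/(?P<aligned>\d+)\s+\(\d+%\))?\s*$"
)


def parse_row(line: str) -> DomainRow:
    """Parse one domain row; raises ValueError if the layout is not recognised."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised domain row: {line!r}")
    start, end = m["start"], m["end"]
    has_ident = m["ident"] is not None
    return DomainRow(
        name=m["name"],
        start=int(start.lstrip("<>")),
        end=int(end.lstrip("<>")),
        partial=start[0] in "<>" or end[0] in "<>",
        cdd_id=m["cdd"],
        identical=int(m["ident"]) if has_ident else None,
        aligned=int(m["aligned"]) if has_ident else None,
    )


def percent_rounded(count: int, total: int) -> int:
    """Whole-number percentage, rounded half up using integer arithmetic."""
    return (200 * count + total) // (2 * total)


row = parse_row("SET_EZH1 614..749 CDD:380994 52/149 (35%)")
print(row.name, row.start, row.end, percent_rounded(row.identical, row.aligned))
```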