DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set1 and CLF

DIOPT Version :10

Sequence 1:NP_001015221.1 Gene:Set1 / 3354971 FlyBaseID:FBgn0040022 Length:1641 Species:Drosophila melanogaster
Sequence 2:NP_179919.1 Gene:CLF / 816870 AraportID:AT2G23380 Length:902 Species:Arabidopsis thaliana


Alignment Length:1139 Identity:203/1139 - (17%)
Similarity:365/1139 - (32%) Gaps:452/1139 - (39%)


- Green bases have known domain annotations that are detailed below.


  Fly   671 PSESASRLFSNGAYVHSEYLKAVASFNFDSFSKPYDYNKGALSDQNDGIRQKVKQVIGYIVEELK 735
            ||.||:|                        |:|   .|.:.:::.....::|.:||    |.||
plant     7 PSSSATR------------------------SEP---PKDSPAEERGPASKEVSEVI----ESLK 40

  Fly   736 QILKRD----VNKRMIEITAFKHFETWWDEHTSKARSKPLFEKADSTVNTPL----NCIKDTSYN 792
            :.|..|    :.||:             ||:     .|.||....|.:.:.:    :| ||.|  
plant    41 KKLAADRCISIKKRI-------------DEN-----KKNLFAITQSFMRSSMERGGSC-KDGS-- 84

  Fly   793 EKNPDINLLI----------------NAHREVADFQSYSSIGLRAAMP--------KLPSFRRIR 833
                  :||:                |.:|.|.|..:.|.:...:::|        |:|..:|:.
plant    85 ------DLLVKRQRDSPGMKSGIDESNNNRYVEDGPASSGMVQGSSVPVKISLRPIKMPDIKRLS 143

  Fly   834 KHPSPIPTKRNFLERDLSDQEEMVQR--------------SDSDKE--DSNVEISDTARSKIKGP 882
            .:.:.:...||  :|...||..:.:|              |||::|  |...|..|.        
plant   144 PYTTWVFLDRN--QRMTEDQSVVGRRRIYYDQTGGEALICSDSEEEAIDDEEEKRDF-------- 198

  Fly   883 VPIQESD----SKSHTSGLNSKRKGSASSFFSSSSSSTSSEAEYEAIDCVEKARTSEEDSPRGYG 943
              ::..|    ......||:.......:||.|.|:|..             |||.......:...
plant   199 --LEPEDYIIRMTLEQLGLSDSVLAELASFLSRSTSEI-------------KARHGVLMKEKEVS 248

  Fly   944 QRNLNQRTTTIRNRNLVGTMD-----------VINVR------------------------NLCS 973
            :...||..:::.|:::.|.:|           |.:.|                        ||..
plant   249 ESGDNQAESSLLNKDMEGALDSFDNLFCRRCLVFDCRLHGCSQDLIFPAEKPAPWCPPVDENLTC 313

  Fly   974 GSNEFK----------------KENV------TKRTKKNIYSDTDEDNDRTLFPALKEKNISTIL 1016
            |:|.:|                |...      ||.|.....|..:....:| ||:  |...|...
plant   314 GANCYKTLLKSGRFPGYGTIEGKTGTSSDGAGTKTTPTKFSSKLNGRKPKT-FPS--ESASSNEK 375

  Fly  1017 SDLEEISKDSCIGLDENGI-EPTILRKIPNTPKLNEECRRSLTPVPPPGYNEEEIKKKVDCKQKP 1080
            ..||  :.||     |||: :.|...|:.::||:....||                         
plant   376 CALE--TSDS-----ENGLQQDTNSDKVSSSPKVKGSGRR------------------------- 408

  Fly  1081 SFEYDRIYSDSEEEKEYQERRKRNTEYMAQMEREFLEEQEKRIEKSLDKNLQSPNNIVKNNNSPR 1145
                              ..||||...:|:......::::|:.|.|      ..::|...:.||.
plant   409 ------------------VGRKRNKNRVAERVPRKTQKRQKKTEAS------DSDSIASGSCSPS 449

  Fly  1146 NKNDETRKTAISQTRSCFESAS------------------KVDTTLVNIISVENDINEFGPHEE- 1191
            :...:..:.|.|.::...:|.:                  |.|..:.....|.::::..|..|. 
plant   450 DAKHKDNEDATSSSQKHVKSGNSGKSRKNGTPAEVSNNSVKDDVPVCQSNEVASELDAPGSDESL 514

  Fly  1192 ------GDVLTNG---CNKMYTNSKGKTKRTQSPVYSEGGSSQASQASQVA------LEHCYSLP 1241
                  |:.::.|   .||::       :..:..::.:|.......:..:|      .:.|:.:.
plant   515 RKEEFMGETVSRGRLATNKLW-------RPLEKSLFDKGVEIFGMNSCLIARNLLSGFKSCWEVF 572

  Fly  1242 PHSV------------SLGDYPSGKVNETKNILKREAENIAIVSQMTRTGPGRPRKDPICIQKKK 1294
            .:..            .|....|.|.:...|::..:                        ::::.
plant   573 QYMTCSENKASFFGGDGLNPDGSSKFDINGNMVNNQ------------------------VRRRS 613

  Fly  1295 RDLAPRMSNVKSKMTPNGDEWPDLAHKNV-------------HFVPCDMYKTRDQNEEMVILYTF 1346
            |.|..|....:.|.|     |...|:.::             .|.||:                 
plant   614 RFLRRRGKVRRLKYT-----WKSAAYHSIRKRITEKKDQPCRQFNPCN----------------- 656

  Fly  1347 LTKGIDAEDINFIKMSYLDHLHKEPYAMFLNNTHWVDHCTTDRAFWPPPSKKRRKDDELIRHKTG 1411
                        .|::    ..|| ....||.|....:|                         |
plant   657 ------------CKIA----CGKE-CPCLLNGTCCEKYC-------------------------G 679

  Fly  1412 CARTEGFYKLDVREKAKHKY---HYAKANTED----SFNEDRSDEPTALTN-----HHHNKLISK 1464
            |.::           .|:::   |.||:....    .|..||..:|....|     ...:..:..
plant   680 CPKS-----------CKNRFRGCHCAKSQCRSRQCPCFAADRECDPDVCRNCWVIGGDGSLGVPS 733

  Fly  1465 MQGISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADEMVI 1529
            .:|.:.|.| |.:.||.                  :::::....|.:..||.|....::..|.:.
plant   734 QRGDNYECR-NMKLLLK------------------QQQRVLLGISDVSGWGAFLKNSVSKHEYLG 779

  Fly  1530 EYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITI 1594
            ||.|::|....||.|...|:.  ...|:||.::.:.::||.:.|:..:|.|||..||||||||.:
plant   780 EYTGELISHKEADKRGKIYDR--ENCSFLFNLNDQFVLDAYRKGDKLKFANHSPEPNCYAKVIMV 842

  Fly  1595 ESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIP 1628
            ..:.::.|::|:.|...||:.|||::  |.::.|
plant   843 AGDHRVGIFAKERILAGEELFYDYRY--EPDRAP 874

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set1NP_001015221.1 RRM_Set1 99..191 CDD:409745
U2AF_lg 255..>386 CDD:273727
N-SET 1352..1496 CDD:463344 24/155 (15%)
SET_SETD1 1490..1637 CDD:380946 41/139 (29%)
CLFNP_179919.1 preSET_CXC 690..721 CDD:408079 8/30 (27%)
SET_EZH 752..868 CDD:380917 39/117 (33%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.