DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set1 and SUVH3

DIOPT Version :10

Sequence 1:NP_001015221.1 Gene:Set1 / 3354971 FlyBaseID:FBgn0040022 Length:1641 Species:Drosophila melanogaster
Sequence 2:NP_565056.1 Gene:SUVH3 / 843641 AraportID:AT1G73100 Length:669 Species:Arabidopsis thaliana


Alignment Length:737 Identity:137/737 - (18%)
Similarity:228/737 - (30%) Gaps:281/737 - (38%)


- Green bases have known domain annotations that are detailed below.


  Fly  1045 NTPKLNEECRRSLTPVP------------------------------PPGYNEEEIKKKVDCKQK 1079
            :||.||:   ...||:|                              |.|....:.|:|.....:
plant    70 DTPDLNQ---TQNTPIPSFVPPLRSYRTPTKTNGPSSSSGTKRGVGRPKGTTSVKKKEKKTVANE 131

  Fly  1080 PSFEYD---RIYSDSEEEKEYQERRKRNTEYMAQMEREFLEEQEKRIEKSLDKNLQSP------N 1135
            |:.:..   :..||.:......||...|...::.:...|...:.:..:....|:..|.      :
plant   132 PNLDVQVVKKFSSDFDSGISAAEREDGNAYLVSSVLMRFDAVRRRLSQVEFTKSATSKAAGTLMS 196

  Fly  1136 NIVKNNNSPR---NKNDETRKTAISQTRSCFESASKVDTTLVNIISVENDINEFGPHEEG---DV 1194
            |.|:.|...|   ....|......|:...|.     |...:..:..::..|::.|..||.   .:
plant   197 NGVRTNMKKRVGTVPGIEVGDIFFSRIEMCL-----VGLHMQTMAGIDYIISKAGSDEESLATSI 256

  Fly  1195 LTNGCNKMYTNSKGKTKRTQSPVYS-EGG--------SSQASQASQVALEH-------------- 1236
            :::|      ..:|:.:..:|.:|| :||        |.|..:...:|||:              
plant   257 VSSG------RYEGEAQDPESLIYSGQGGNADKNRQASDQKLERGNLALENSLRKGNGVRVVRGE 315

  Fly  1237 ---------------CYSLPPHSVSLGDYPSGKVNETKNILKREAENIAIVSQMTRTGPGRPRKD 1286
                           .||:....|..|  .|| .|..|..|.|:              ||:|   
plant   316 EDAASKTGKIYIYDGLYSISESWVEKG--KSG-CNTFKYKLVRQ--------------PGQP--- 360

  Fly  1287 PI-----CIQKKKRDLAPRMSNVKSKMTPNGDEWPDLAHKNVHFVPCDMYKTRDQNEEMVILYTF 1346
            |.     .:||.|..|..|...:...:|...:..|      |..|       .|.:|:       
plant   361 PAFGFWKSVQKWKEGLTTRPGLILPDLTSGAESKP------VSLV-------NDVDED------- 405

  Fly  1347 LTKGIDAEDINFIKMSYLDHLHKEPYAMFLNNTHWVDHCTTDRAFWPPPSKKRRKDDELIRHKTG 1411
              ||          .:|..:.....|:.....|..|..|:...:..|.            .|...
plant   406 --KG----------PAYFTYTSSLKYSETFKLTQPVIGCSCSGSCSPG------------NHNCS 446

  Fly  1412 CARTEGFYKLDVREKAKHKYHYAKANTEDSFNEDRSDEPTALTNHHHNKLISKMQGISR------ 1470
            |.|                             ::..|.|      :.|.:|.    :||      
plant   447 CIR-----------------------------KNDGDLP------YLNGVIL----VSRRPVIYE 472

  Fly  1471 -------EARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADEMV 1528
                   .|....|.:.|..                 |.:|:..|:....|||.:.:.:.|...:
plant   473 CGPTCPCHASCKNRVIQTGL-----------------KSRLEVFKTRNRGWGLRSWDSLRAGSFI 520

  Fly  1529 IEYVGQMIRPVVAD---LRETK------YEAIGIGSSYLFRIDMETI------------------ 1566
            .||.|:     |.|   ||..:      ::...:.:|:.:..:.|.:                  
plant   521 CEYAGE-----VKDNGNLRGNQEEDAYVFDTSRVFNSFKWNYEPELVDEDPSTEVPEEFNLPSPL 580

  Fly  1567 -IDATKCGNLARFINHSCNPNCYAKVITIESEKKIVI----YSKQPIGINEEITYDYKF----PL 1622
             |.|.|.||:|||:||||:||.:.:.:..|...:.||    ::.:.|....|:||||..    ..
plant   581 LISAKKFGNVARFMNHSCSPNVFWQPVIREGNGESVIHIAFFAMRHIPPMAELTYDYGISPTSEA 645

  Fly  1623 EDEKI-----PCLCGAQGCRGT 1639
            .||.:     .||||::.|||:
plant   646 RDESLLHGQRTCLCGSEQCRGS 667

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set1NP_001015221.1 RRM_Set1 99..191 CDD:409745
U2AF_lg 255..>386 CDD:273727
N-SET 1352..1496 CDD:463344 18/156 (12%)
SET_SETD1 1490..1637 CDD:380946 45/187 (24%)
SUVH3NP_565056.1 SRA 203..359 CDD:197742 32/183 (17%)
SET 410..638 CDD:394802 56/300 (19%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.