DRSC/TRiP Functional Genomics Resources

Protein Alignment Sin3A and WRKY19

DIOPT Version: 10

Sequence 1: NP_001246282.1  Gene: Sin3A / 36382  FlyBaseID: FBgn0022764  Length: 2066  Species: Drosophila melanogaster
Sequence 2: NP_001118968.1  Gene: WRKY19 / 826810  AraportID: AT4G12020  Length: 1895  Species: Arabidopsis thaliana


Alignment Length: 2178
Identity:   390/2178 (17%)
Similarity: 629/2178 (28%)
Gaps:       871/2178 (39%)
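
Note on the percentages: the three summary values match the raw counts only if the fraction is truncated rather than rounded (390/2178 is about 17.9%, reported as 17%). The per-domain Identity values in the Known Domains table, by contrast, match ordinary rounding (48/230 is about 20.9%, reported as 21%). The sketch below reproduces both conventions. The function names are hypothetical and this is not DIOPT's own code, only arithmetic consistent with the numbers on this page.

```python
def percent_floor(count: int, total: int) -> int:
    """Integer percentage, truncated toward zero (matches the summary lines)."""
    return (100 * count) // total


def percent_round(count: int, total: int) -> int:
    """Integer percentage, rounded half up (matches the domain table)."""
    return (200 * count + total) // (2 * total)


if __name__ == "__main__":
    alignment_length = 2178
    for label, count in [("Identity", 390), ("Similarity", 629), ("Gaps", 871)]:
        print(f"{label}: {count}/{alignment_length} "
              f"({percent_floor(count, alignment_length)}%)")
```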


- Residues shown in green on the original page have known domain annotations, which are detailed below.


  Fly    82 HGGAQTIAYLPSTTPTATNLKTTTSIVDSTTAGGPVGAGAQVAVGVG----------SAAG---- 132
            |||.:...||                  ..|.|........:|.|.|          ||.|    
plant   137 HGGGRRCEYL------------------GCTKGAEGSTDFCIAHGGGRRCNHEDCTRSAWGRTEF 183

  Fly   133 ----GGGVVVST---GSTGTQTLQYTTSYSVASIQAGGTLKANTADGANTVQ-------IHVTGG 183
                |||....|   |.:.:..|.:..::       ||..|.:..|.....:       :|  ||
plant   184 CVKHGGGARCKTYGCGKSASGPLPFCRAH-------GGGKKCSHEDCTGFARGRSGLCLMH--GG 239

  Fly   184 G----AANNPASAQTVSS--SSQTGTIRQRTISGTQTVATAVGNLATISQQQPVQQSPLGKAQTP 242
            |    ..|...||:.:|.  .|..|..|.::|..|                    :...|.....
plant   240 GKRCQRENCTKSAEGLSGLCISHGGGRRCQSIGCT--------------------KGAKGSKMFC 284

  Fly   243 PSSVVANSIPVGGTTPPQGQSGNATPRLKVEDALSYLDQVKYQYADQPQIYNNFLDIMKEFKSHC 307
            .:.:....:.:.|    .|..|..|    ..|||:||..||.::.|..: |:.||:::.:.|...
plant   285 KACITKRPLTIDG----GGNMGGVT----TGDALNYLKAVKDKFEDSEK-YDTFLEVLNDCKHQG 340

  Fly   308 IDTPGVIERVSTLFKGHTELIYGFNMFLPPGYKIEIHSDALGCSVPVVSMPSPPGAPTSTGTVHM 372
            :||.|||.|:..|||||.:|:.|||.:|...|:|.|        :|....|              
plant   341 VDTSGVIARLKDLFKGHDDLLLGFNTYLSKEYQITI--------LPEDDFP-------------- 383

  Fly   373 LTGNSSMSGAGHIAIKTTNAATLTPAAGAGAAAAAAAVAQIQSAGAVNLMTHGGASLTQTTIHAL 437
                                                 :..:........||:..|...|...:  
plant   384 -------------------------------------IDFLDKVEGPYEMTYQQAQTVQANAN-- 409

  Fly   438 QQATPPQSQSPGGGHVHVSVTNSTAANAVVPGQPGISVSAHNVPQNYSRDRERATITPTGQVAGA 502
               ..||::.|          :|:|..:...|||.|..||.:                    :..
plant   410 ---MQPQTEYP----------SSSAVQSFSSGQPQIPTSAPD--------------------SSL 441

  Fly   503 AANVNASASIVVGGPPTPNSLSELSPHGGAGGGPGAGAAQHNLHHIQQAHQSILLGETGQQNQPV 567
            .|..|.|...::             .|          .:|..|:..:|.:......:.||:    
plant   442 LAKSNTSGITII-------------EH----------MSQQPLNVDKQVNDGYNWQKYGQK---- 479

  Fly   568 EFNHAITYVNKIK-NRFQNQPAKYKKFLEILHDYQREQKVMKEGSLNQGKMLTEQEVYTQVAKLF 631
                      |:| ::|   |..|.|...:             |..::.|:  |:.:..|||::.
plant   480 ----------KVKGSKF---PLSYYKCTYL-------------GCPSKRKV--ERSLDGQVAEIV 516

  Fly   632 GQDEDLLREFGQFLPDATNHQSGQYMSKSASVHNDHGKRPTATLSGGAHITMSSASPAPSGSPLH 696
            .:|..             ||:.           .:.||..:.|...|:...::..|...:.|...
plant   517 YKDRH-------------NHEP-----------PNQGKDGSTTYLSGSSTHINCMSSELTASQFS 557

  Fly   697 LGATTLPQIDKSAHAAAIGNLSAVNTSVSIKTYNNNQQQQNHVIGSGLNATRNDILFEKDYHAGL 761
            ...|.:.|.:.::.|..|..:|..         ::|::..|.....|          |||..   
plant   558 SNKTKIEQQEAASLATTIEYMSEA---------SDNEEDSNGETSEG----------EKDED--- 600

  Fly   762 QQQAHQRGAGVGGHHHLAGTAAGANIGRPGVGASVMVSYDKEHRNNHHVQKYVGHAPNQNLTHGH 826
            :.:..:|...|....  ...|:...:..|.|........|                   ||..|:
plant   601 EPEPKRRITEVQVSE--LADASDRTVREPRVIFQTTSEVD-------------------NLDDGY 644

  Fly   827 NAKKSPSYGIPSVIGSMPHISDNSLDRSSPGISYATPPLPSGPHGQHNSGSATRRPGDDSLVGHY 891
            ..:|   ||...|.|:                          |:.:.:|..      |..:|..|
plant   645 RWRK---YGQKVVKGN--------------------------PYPRFSSSK------DYDVVIRY 674

  Fly   892 ASGAPPAKRPKPYCRDVSFSEASSKCTISDAAFFDKVRKAL--RSPEVYDNF--------LRCLT 946
                                   .:..||:..|...:|.:|  |...||:.|        .|.|.
plant   675 -----------------------GRADISNEDFISHLRASLCRRGISVYEKFNEVDALPKCRVLI 716

  Fly   947 LFNQEIVSKTELLGL-----------------VSPFLMKFPDLLRWFTDFL-------------G 981
            :........:.||.:                 :||:            ||:             .
plant   717 IVLTSTYVPSNLLNILEHQHTEDRVVYPIFYRLSPY------------DFVCNSKNYERFYLQDE 769

  Fly   982 PPSGQPAGGLIDGMPLAATQRQGGGSSNSSHDRGTSHQSAAEYVQDVDLSSCKRLGASYCALPQS 1046
            |...|.|...|..||         |.:       .:.:|.:|.:.::...:.|.|          
plant   770 PKKWQAALKEITQMP---------GYT-------LTDKSESELIDEIVRDALKVL---------- 808

  Fly  1047 TVPKKCSG---------------RTALCREVLNDKWVSFPTWASEDSTFVTSRKTQFEETIYRNI 1096
                 ||.               .:.||.|.|:.:  |...|.:     |...||...|.|:|.|
plant   809 -----CSADKVNMIGMDMQVEEILSLLCIESLDVR--SIGIWGT-----VGIGKTTIAEEIFRKI 861

  Fly  1097 ---------------------HRTEDERFELDLVIEVNSATIRVLE--------NLQKKMSRMST 1132
                                 |....|.| |..|:||....||:.:        .||:|...:..
plant   862 SVQYETCVVLKDLHKEVEVKGHDAVRENF-LSEVLEVEPHVIRISDIKTSFLRSRLQRKRILVIL 925

  Fly  1133 EELSKFH-LDDHLGGTSQTIHQRAIHRIYGDKSGEIITGMKKNPFVAV----------------- 1179
            ::::.:. :|..||    |::      .:|..|..|:|...:..||..                 
plant   926 DDVNDYRDVDTFLG----TLN------YFGPGSRIIMTSRNRRVFVLCKIDHVYEVKPLDIPKSL 980

  Fly  1180 -------------PIVLKRLKVKEEEWREAQ-------KTFNKQWREQNEK------YYLKSL-D 1217
                         |.|.|.|.::..::....       .:.:::|.:.:::      .|:..: :
plant   981 LLLDRGTCQIVLSPEVYKTLSLELVKFSNGNPQVLQFLSSIDREWNKLSQEVKTTSPIYIPGIFE 1045

  Fly  1218 HQAINFKPNDMKALRSKSLFNEIETLYDERHDQEDDAME----PFGPH-----LVLPYKDKTILD 1273
            ........|:      :.:|.:|...:: |.|:::.||.    .|..|     ||    ||::|.
plant  1046 KSCCGLDDNE------RGIFLDIACFFN-RIDKDNVAMLLDGCGFSAHVGFRGLV----DKSLLT 1099

  Fly  1274 DAANLLIHHVK--RQTGIQKQEKQKIKQIIRQFVPD-------LFFAP--RQPLSDDERDDAFP- 1326
            .:.:.|:..:.  :.||         ::|:||...|       |:.|.  |....:|....|.. 
plant  1100 ISQHNLVDMLSFIQATG---------REIVRQESADRPGDRSRLWNADYIRHVFINDTGTSAIEG 1155

  Fly  1327 -FLVDDNTKMDVDSPLGRTESSTRNAKSTPSESASPARSNASTSSVTPAGIKKETDDSKATTGSF 1390
             ||...|.|.|.:..:.....:.|..|.    ..|.|......|  .|.|::......:.....:
plant  1156 IFLDMLNLKFDANPNVFEKMCNLRLLKL----YCSKAEEKHGVS--FPQGLEYLPSKLRLLHWEY 1214

  Fly  1391 APASSATASSATPVDDATPSTSSAAA----AASAASSSTVSGTEGKPKDDPLSSHKEEGAGSTSS 1451
            .|.|| ...|..|.:....:..|:.|    ....|...|.:.:..|.|...|         |.|.
plant  1215 YPLSS-LPKSFNPENLVELNLPSSCAKKLWKGKKARFCTTNSSLEKLKKMRL---------SYSD 1269

  Fly  1452 GVATSPRQAQDTAGAGVDVEIKLEHPADFSNPKLLPPHAHGQREDESYTLFFANNNWYL----FL 1512
            .:...||.:..|         .||| .|......|              |..:.:..||    ||
plant  1270 QLTKIPRLSSAT---------NLEH-IDLEGCNSL--------------LSLSQSISYLKKLVFL 1310

  Fly  1513 RLHAILCDRLH---VMYERARLLAIEEERCRVNRRESTATALRLKPKPEI--QVEDYY------- 1565
            .|..  |.:|.   .|.:...|..:....|.           :|...|||  .|::.|       
plant  1311 NLKG--CSKLENIPSMVDLESLEVLNLSGCS-----------KLGNFPEISPNVKELYMGGTMIQ 1362

  Fly  1566 --------------------------PTFLDMLKNV----LDGNMDSNTFEDTMREMFGIYAYIS 1600
                                      ||.:..||::    |.|.:....|.|:.|.|    ..:.
plant  1363 EIPSSIKNLVLLEKLDLENSRHLKNLPTSIYKLKHLETLNLSGCISLERFPDSSRRM----KCLR 1423

  Fly  1601 FTLDKVVSNAVRQLQYCVTERAALDCVELFATEQRRGCTGGFCRDAHKTFDREMSYQRKAESILN 1665
            | || :....:::|...::...|||  ||...:.||........:|:.|                
plant  1424 F-LD-LSRTDIKELPSSISYLTALD--ELLFVDSRRNSPVVTNPNANST---------------- 1468

  Fly  1666 EENCFKVYIYKIDCRVTIELLDSEPEEVDKPAALKAQKFSKYVERLANPALGGGGNTGRSDSALG 1730
                              ||:.||               |..:|.|..||              .
plant  1469 ------------------ELMPSE---------------SSKLEILGTPA--------------D 1486

  Fly  1731 NDSVVDGSDIKTEADEDTAEL--RYRE-------GGIGGAGKARFLVRNKRRSKLLEEQVRQLFE 1786
            |:.||.|:..||...|.|..:  :.||       ..:||..|.       .|..:|:.|......
plant  1487 NEVVVGGTVEKTRGIERTPTILVKSREYLIPDDVVAVGGDIKG-------LRPPVLQLQPAMKLS 1544

  Fly  1787 H----------------------------RRNRIEQHGTAASLEPISVDSAISSNSTTGGGGSGT 1823
            |                            |...:|...|.|...|:. |....|.:...|..|.|
plant  1545 HIPRGSTWDFVTHFAPPETVAPPSSSSEAREEEVETEETGAMFIPLG-DKETCSFTVNKGDSSRT 1608

  Fly  1824 NNNN-----NNNNNNNSWGGKRLERLGAPQVGLAEQYAFNDRDEISTNISNGRCFVTSKNLKLLK 1883
            .:|.     :..:....|  ::.:.||...:|           .:...||....|...|.:.|| 
plant  1609 ISNTSPIYASEGSFITCW--QKGQLLGRGSLG-----------SVYEGISADGDFFAFKEVSLL- 1659

  Fly  1884 YDAVRRAKKSHCRVTQAKYAHFQTYVNKWLQQ------HVSEQQQQNCVDWLLGKTADQINASGW 1942
                          .|...||      :|:||      .:|:.|.||.|.: .|.|.|:.|...:
plant  1660 --------------DQGSQAH------EWIQQVEGGIALLSQLQHQNIVRY-RGTTKDESNLYIF 1703

  Fly  1943 GSASKTKTVQQKDTSKTPYRIYNRYRVVQSAVS 1975
                 .:.|.|....|    :|.|.::..|.||
plant  1704 -----LELVTQGSLRK----LYQRNQLGDSVVS 1727
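
Each block above is a triple: the fly sequence, a match line, and the plant sequence. The sketch below tallies the columns of one such triple. It assumes the common pairwise-alignment convention that `|` marks an identical pair and that `:` and `.` mark similar pairs, and it assumes that similarity counts include identities. Neither assumption is stated on this page, so the totals may not reproduce the reported 390/629/871 exactly. The function name `count_columns` is hypothetical.

```python
def count_columns(seq1: str, match: str, seq2: str) -> dict:
    """Tally identical, similar and gapped columns for one alignment block.

    seq1 and seq2 are the aligned residue strings (gaps written as '-');
    match is the line between them, with the leading label columns removed.
    """
    width = len(seq1)
    # Trailing spaces of the match line are often stripped in plain text,
    # so pad (or trim) it to the width of the sequences.
    match = match.ljust(width)[:width]
    identical = match.count("|")
    # Assumption: ':' and '.' both count as similar, on top of identities.
    similar = identical + match.count(":") + match.count(".")
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")
    return {"length": width, "identical": identical,
            "similar": similar, "gaps": gaps}


if __name__ == "__main__":
    # Toy example, not taken from the alignment above.
    print(count_columns("ACD-EF", "||: .", "ACEGQ-"))
```

Summing these per-block tallies over all blocks gives whole-alignment counts that can be compared against the summary lines.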

Known Domains:


Domain regions are shown in green in the alignment on the original page.

Gene    Sequence        Domain                             Region      External ID  Identity
Sin3A   NP_001246282.1  Sin3                               238..1719   CDD:227889   292/1682 (17%)
WRKY19  NP_001118968.1  Sin3                               308..>444   CDD:227889   48/230 (21%)
                        PAH                                324..368    CDD:460645   19/44 (43%)
                        WRKY                               467..523    CDD:214815   15/100 (15%)
                        WRKY                               640..>661   CDD:214815   8/49 (16%)
                        PLN03210                           663..>1448  CDD:215633   169/955 (18%)
                        leucine-rich repeat                1260..1282  CDD:275380   7/39 (18%)
                        leucine-rich repeat                1283..1306  CDD:275380   8/37 (22%)
                        leucine-rich repeat                1307..1329  CDD:275380   6/23 (26%)
                        leucine-rich repeat                1330..1350  CDD:275380   6/30 (20%)
                        leucine-rich repeat                1351..1373  CDD:275380   2/21 (10%)
                        leucine-rich repeat                1398..1421  CDD:275380   6/26 (23%)
                        leucine-rich repeat                1422..1444  CDD:275380   4/23 (17%)
                        Protein Kinases, catalytic domain  1626..1877  CDD:473864   33/146 (23%)
