DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Polr2A and rpoC

DIOPT Version :10

Sequence 1:NP_511124.1 Gene:Polr2A / 32100 FlyBaseID:FBgn0003277 Length:1887 Species:Drosophila melanogaster
Sequence 2:NP_418415.1 Gene:rpoC / 948487 ECOCYCID:EG10895 Length:1407 Species:Escherichia coli


Alignment Length:1627 Identity:366/1627 - (22%)
Similarity:607/1627 - (37%) Gaps:466/1627 - (28%)


- Green bases have known domain annotations that are detailed below.


  Fly    17 VQFGILSPDEIRRMSVTEGGVQFAETM--EGGRPKLGGLMDPR--------------------QG 59
            ::..:.|||.||..|.  |.|:..||:  ...:|:..||...|                    :|
E. coli    20 IKIALASPDMIRSWSF--GEVKKPETINYRTFKPERDGLFCARIFGPVKDYECLCGKYKRLKHRG 82

  Fly    60 VIDRTSRCQTCAGNMTECP---GHFGHIDLAKPVFHIGFITKTIKILRCVCFYCSKMLVSPHNPK 121
            ||     |:.|...:|:..   ...|||:||.|..||.|       |:.:......:|..|    
E. coli    83 VI-----CEKCGVEVTQTKVRRERMGHIELASPTAHIWF-------LKSLPSRIGLLLDMP---- 131

  Fly   122 IKEIVMKSRGQPRKRLAYVYDLCKGKTICEGG----EDMDLTKENQQPDPNKKPGHG-----GCG 177
            :::|         :|:.|.    :...:.|||    |...:..|.|..|..::.|..     |..
E. coli   132 LRDI---------ERVLYF----ESYVVIEGGMTNLERQQILTEEQYLDALEEFGDEFDAKMGAE 183

  Fly   178 HYQPSIRRTGLD-----LTAEWKHQNEDSQEKKIVVSAERVWEILKHITDEECFILGMDPKYARP 237
            ..|..::...|:     |..|....|.:::.||:.          |.|...|.|:...:    :|
E. coli   184 AIQALLKSMDLEQECEQLREELNETNSETKRKKLT----------KRIKLLEAFVQSGN----KP 234

  Fly   238 DWMIVTVLPVPPLAVRPAVVMFGAAKNQDDLTHKLSDIIKANNELRKNEASGAAAHVIQENIKML 302
            :|||:|||||.|..:||.|.:.|......||......:|..||.|::.....|...:::...:||
E. coli   235 EWMILTVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRML 299

  Fly   303 QFHVATLVDNDMPGMPRAMQKSGK-PLKAIKARLKGKEGRIRGNLMGKRVDFSARTVITPDPNLR 366
            |..|..|:||...|  ||:..|.| |||::...:|||:||.|.||:|||||:|.|:|||..|.||
E. coli   300 QEAVDALLDNGRRG--RAITGSNKRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPYLR 362

  Fly   367 IDQVGVPRSIAQNLTFPELVTPFNIDRMQELVRRGNSQYPGAKYIVRDNGERIDLRFHPKSSDLH 431
            :.|.|:|:.:|..|..|.:.....:..:...::.........:.:|.|..:.: :|.||      
E. coli   363 LHQCGLPKKMALELFKPFIYGKLELRGLATTIKAAKKMVEREEAVVWDILDEV-IREHP------ 420

  Fly   432 LQCGYKVERHLRDDDLVIFNRQPTLHKMSMMGHRVKVLPWSTFRMNLSCTSPYNADFDGDEMNLH 496
                            |:.||.||||::.:......::.....:::....:.||||||||:|.:|
E. coli   421 ----------------VLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVH 469

  Fly   497 VPQSMETRAEVENIHITPRQIITPQANKPVMGIVQDTLTAVRKMTKRDVFITREQVMNLLMFLPT 561
            ||.::|.:.|...:.::...|::|...:|::...||.:..:..|| ||....:.:.|.|      
E. coli   470 VPLTLEAQLEARALMMSTNNILSPANGEPIIVPSQDVVLGLYYMT-RDCVNAKGEGMVL------ 527

  Fly   562 WDAKMPQPCILKPRPLWTG-KQIFSLIIPGNVNM---IRTHSTHPDEEDEGP-YKWISPGDTKVM 621
                             || |:...|...|..::   ::...|..:::..|. ....|..||.| 
E. coli   528 -----------------TGPKEAERLYRSGLASLHARVKVRITEYEKDANGELVAKTSLKDTTV- 574

  Fly   622 VEHGELIMGILCKK---------SLGTSA-GSLLHICFLELGHDIAGRFYGNIQTVINNWLLFEG 676
               |..|:.::..|         :||..| ..:|:.|:..||......|...|......:....|
E. coli   575 ---GRAILWMIVPKGLPYSIVNQALGKKAISKMLNTCYRILGLKPTVIFADQIMYTGFAYAARSG 636

  Fly   677 HSIGIGDTIADPQTYNEIQQAIKKAKDDVINVIQKAHNMELEPTPGNTLRQTFENKVNRILNDAR 741
            .|:||.|.:. |:..:||   |.:|:.:|.. ||:.....| .|.|...     |||..|...|.
E. coli   637 ASVGIDDMVI-PEKKHEI---ISEAEAEVAE-IQEQFQSGL-VTAGERY-----NKVIDIWAAAN 690

  Fly   742 DKTG----------------GSAKKSLTEYNNLKAMVVSGSKGSNINISQVIACVGQQNVEGKRI 790
            |:..                |..:|.:: :|::..|..||::||...|.|:   .|.:.:..|  
E. coli   691 DRVSKAMMDNLQTETVINRDGQEEKQVS-FNSIYMMADSGARGSAAQIRQL---AGMRGLMAK-- 749

  Fly   791 PYGFRKRTLPHFIKDDYGPESRGFVENSYLAGLTPSEFYFHAMGGREGLIDTAVKTAETGYIQRR 855
            |.|....|          |.:..|.|     ||...:::....|.|:||.|||:|||.:||:.||
E. coli   750 PDGSIIET----------PITANFRE-----GLNVLQYFISTHGARKGLADTALKTANSGYLTRR 799

  Fly   856 LIKAMESVMVNYDGTVRNSVGQLIQLRYGEDGLCGELVEFQNMPTVKLSNKSFEKRFKFDWSNER 920
            |:...:.::|..|.                   ||........|.::..:.....|       :|
E. coli   800 LVDVAQDLVVTEDD-------------------CGTHEGIMMTPVIEGGDVKEPLR-------DR 838

  Fly   921 LMKKVFTDDVIKEMTDSSEAIQELEAEWDRLVSDRDSLRQIFPNGESKVVLPCN-LQRMIWNVQK 984
            ::.:|..:||:|.                               |.:.:::|.| |....|    
E. coli   839 VLGRVTAEDVLKP-------------------------------GTADILVPRNTLLHEQW---- 868

  Fly   985 IFHINKRLPTDLSPIRVIKGVKTLLERCVIVTGNDRISKQANENATLLFQCLIRSTLCTKYVSEE 1049
             ..:.:....|...:|.:....|....|....|.|                |.|           
E. coli   869 -CDLLEENSVDAVKVRSVVSCDTDFGVCAHCYGRD----------------LAR----------- 905

  Fly  1050 FRLSTEAFEWLVGEIETRFQQAQANPGEMVGALAAQSLGEPATQMTLNTFHFAGVSSKNV----- 1109
                        |.|        .|.||.:|.:||||:|||.||:|:.|||..|.:|:..     
E. coli   906 ------------GHI--------INKGEAIGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAESSI 950

  Fly  1110 ------TLGVPRLKEIINISKK----PKAPSLTVFLTGGAARDAEKAKNVLCRLEHTTLRKVTAN 1164
                  ::.:..:|.::|.|.|    .:...|.:....|..:::.|........:....:.....
E. coli   951 QVKNKGSIKLSNVKSVVNSSGKLVITSRNTELKLIDEFGRTKESYKVPYGAVLAKGDGEQVAGGE 1015

  Fly  1165 TAIYYDPDPQRTVISEDQEFVNVYYEMPDFDPTRISPWLLRIELDRKRMTDK-----KLTMEQIA 1224
            |...:||... .||:|...||. :.:|  .|...|:           |.||:     .|.:...|
E. coli  1016 TVANWDPHTM-PVITEVSGFVR-FTDM--IDGQTIT-----------RQTDELTGLSSLVVLDSA 1065

  Fly  1225 EKINVGFGEDLNCIFNDDNADKLVLRIRIMNNEEN-----------KFQDEDEAVDKMED----- 1273
            |:  ...|:||.            ..::|::.:.|           ::....:|:.::||     
E. coli  1066 ER--TAGGKDLR------------PALKIVDAQGNDVLIPGTDMPAQYFLPGKAIVQLEDGVQIS 1116

  Fly  1274 --DMFLRC-IEANMLSDMT-----------------------LQGIEAIGKVYMHLPQTDSKKRI 1312
              |...|. .|:....|:|                       :.||.:.||      :|..|:|:
E. coli  1117 SGDTLARIPQESGGTKDITGGLPRVADLFEARRPKEPAILAEISGIVSFGK------ETKGKRRL 1175

  Fly  1313 VITETGEF----KAIGEW----LLETDGTSMMKVLSERDVDPIRTSSNDICEIFQVLGIEAVRKS 1369
            |||.....    :.|.:|    :.|.:......|:|:....|        .:|.::.|:.||.:.
E. coli  1176 VITPVDGSDPYEEMIPKWRQLNVFEGERVERGDVISDGPEAP--------HDILRLRGVHAVTRY 1232

  Fly  1370 VEKEMNAVLQFYGLYVNYRHLALLCDVMTAKG--------------------------------- 1401
            :..|:..|.:..|:.:|.:|:.::...|..|.                                 
E. coli  1233 IVNEVQDVYRLQGVKINDKHIEVIVRQMLRKATIVNAGSSDFLEGEQVEYSRVKIANRELEANGK 1297

  Fly  1402 -------HLMAITRHGINRQDTGALMRCSFEETVDVLMDAAAHAETDPMRGVSENIIMGQLPKMG 1459
                   .|:.||:..:..:  ..:...||:||..||.:||...:.|.:||:.||:|:|:|...|
E. coli  1298 VGATYSRDLLGITKASLATE--SFISAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAG 1360

  Fly  1460 TG 1461
            ||
E. coli  1361 TG 1362

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Polr2ANP_511124.1 RNAP_II_RPB1_N 15..868 CDD:259848 236/921 (26%)
RNA_pol_Rpb1_6 890..1071 CDD:461511 23/181 (13%)
RNAP_II_Rpb1_C 1050..1468 CDD:132720 107/522 (20%)
Herpes_BLLF1 <1498..1862 CDD:282904
PTZ00449 <1707..1880 CDD:185628
rpoCNP_418415.1 PRK00566 16..1381 CDD:234794 366/1627 (22%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.