DRSC/TRiP Functional Genomics Resources


Protein Alignment Polr2A and NRPC1

DIOPT Version: 10

Sequence 1: NP_511124.1  Gene: Polr2A / 32100  FlyBaseID: FBgn0003277  Length: 1887  Species: Drosophila melanogaster
Sequence 2: NP_001190573.1  Gene: NRPC1 / 836126  AraportID: AT5G60040  Length: 1391  Species: Arabidopsis thaliana


Alignment Length: 1547
Identity: 478/1547 (30%)
Similarity: 741/1547 (47%)
Gaps: 262/1547 (16%)
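
The summary percentages can be reproduced from the raw counts. The sketch below is only an illustration: the function name `percent_floor` is hypothetical, and the use of truncation (rather than rounding) is an inference from the reported numbers, not documented DIOPT behavior. For example, 478/1547 is about 30.9%, which the page reports as 30%.

```python
def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated toward zero (assumed convention)."""
    return (100 * count) // total


# Counts copied from the alignment summary above.
alignment_length = 1547
stats = {"identity": 478, "similarity": 741, "gaps": 262}

# Express each count as a percentage of the alignment length.
percentages = {name: percent_floor(n, alignment_length) for name, n in stats.items()}

for name, pct in percentages.items():
    print(f"{name}: {stats[name]}/{alignment_length} ({pct}%)")
```

Under that assumption the computed values match the 30%, 47%, and 16% shown in the summary.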


- Residues shown in green have known domain annotations, which are detailed under Known Domains below.


  Fly    10 PLRQVKRVQFGILSPDEIRRMSVTEGGVQFAETM--------EGGRPKLGGLMDPRQGVIDRTSR 66
            ||: :|.:.|.:||..|:         ::.||..        ...:|...||:|||.|..::.|.
plant    20 PLK-IKSINFSVLSDLEV---------MKAAEVQVWNIGLYDHSFKPYENGLLDPRMGPPNKKSI 74

  Fly    67 CQTCAGNMTECPGHFGHIDLAKPVFHIGFITKTIKILRCVC----------FYCSKMLVSPHNPK 121
            |.||.||...||||:|::.|..||:::|:....:.||:|:|          ..||.||:   :.|
plant    75 CTTCEGNFQNCPGHYGYLKLDLPVYNVGYFNFILDILKCICKVTELADYVSLRCSNMLL---DEK 136

  Fly   122 IKEIVMKSRGQPRKRLAYVYDLCKGKT-ICEGGEDMDLTKENQQPDPNKKPGHGGCGHYQPSIRR 185
            :.|..::....||.......:|.|... .|.       |..:|:....||     ||:....:::
plant   137 LYEDHLRKMRNPRMEPLKKTELAKAVVKKCS-------TMASQRIITCKK-----CGYLNGMVKK 189

  Fly   186 TGLDLTAEWKHQNE-----DSQEKKIVVSAER-----------------VWEILKHITDEECFIL 228
            ..........|...     :..|.|..:|..:                 |..:.|.::|::|.:|
plant   190 IAAQFGIGISHDRSKIHGGEIDECKSAISHTKQSTAAINPLTYVLDPNLVLGLFKRMSDKDCELL 254

  Fly   229 GMDPKYA--RPDWMIVTVLPVPPLAVRPAVVMFGAAKNQDDLTHKLSDIIKANNELRKNEASGAA 291
                 |.  ||:.:|:|.:.||||::||:|::.|...|::|||.:|..||..|..|.|..:...:
plant   255 -----YIAYRPENLIITCMLVPPLSIRPSVMIGGIQSNENDLTARLKQIILGNASLHKILSQPTS 314

  Fly   292 AHVIQENIKMLQFHVATLVDNDMPGMPRAMQKSGKPLKAIKARLKGKEGRIRGNLMGKRVDFSAR 356
            :....:....:|..||..:::::.|...  |....||..|..|||||.||.|.||.||||:|:.|
plant   315 SPKNMQVWDTVQIEVARYINSEVRGCQN--QPEEHPLSGILQRLKGKGGRFRANLSGKRVEFTGR 377

  Fly   357 TVITPDPNLRIDQVGVPRSIAQNLTFPELVTPFNIDRMQELVRRGNSQYPGAKYIVRDNGERIDL 421
            |||:|||||:|.:||:|..:||.|||||.|:..||:::::.||.|.::||||:.:...:|....|
plant   378 TVISPDPNLKITEVGIPILMAQILTFPECVSRHNIEKLRQCVRNGPNKYPGARNVRYPDGSSRTL 442

  Fly   422 --RFHPKSSDLHLQCGYKVERHLRDDDLVIFNRQPTLHKMSMMGHRVKVLPWSTFRMNLSCTSPY 484
              .:..:.:| .|..|..|:|||::.|:|:|||||:||:||:|.||.:::||.|.|.|.|..:||
plant   443 VGDYRKRIAD-ELAIGCIVDRHLQEGDVVLFNRQPSLHRMSIMCHRARIMPWRTLRFNESVCNPY 506

  Fly   485 NADFDGDEMNLHVPQSMETRAEVENIHITPRQIITPQANKPVMGIVQDTLTAVRKMTKRDVFITR 549
            ||||||||||:||||:.|.|.|...:......:.||:..:.::...||.||:...:|::|.|..|
plant   507 NADFDGDEMNMHVPQTEEARTEAITLMGVQNNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDR 571

  Fly   550 EQVMNLLMFLPTW--DAKMPQPCILKPRPLWTGKQIFSLIIPGNVNMIRTHSTHPDEE------D 606
            .....:..::...  ...:|.|.||||..|||||||||:::..|.: ||.:.|...:|      :
plant   572 AAFSLICSYMGDGMDSIDLPTPTILKPIELWTGKQIFSVLLRPNAS-IRVYVTLNVKEKNFKKGE 635

  Fly   607 EGPYKWISPGDTKVMVEHGELIMGILCKKSL---------GTSAGSLLHICFLELGHDIAGRFYG 662
            .|..:.:...|..|...:.|||.|.|.|.:|         |...| |..|...:.....|.....
plant   636 HGFDETMCINDGWVYFRNSELISGQLGKATLALDIFPLGNGNKDG-LYSILLRDYNSHAAAVCMN 699

  Fly   663 NIQTVINNWLLFEGHSIGIGDTIADPQTYNEIQQAIKKAKDDVINVIQKAHNMELEPTPGNTLRQ 727
            .:..:...|:...|.||||.|.....:...|.:.:|:...|.....|::.:...|:...|....:
plant   700 RLAKLSARWIGIHGFSIGIDDVQPGEELSKERKDSIQFGYDQCHRKIEEFNRGNLQLKAGLDGAK 764

  Fly   728 TFENKVNRILNDARDKTGGSAKKSLTEYNNLKAMVVSGSKGSNINISQVIACVGQQNVEGKRIPY 792
            :.|.::..|||..|:.||.:....|...|:...|...|||||.|||||::||||||.|.|.|.|.
plant   765 SLEAEITGILNTIREATGKACMSGLHWRNSPLIMSQCGSKGSPINISQMVACVGQQTVNGHRAPD 829

  Fly   793 GFRKRTLPHFIKDDYGPESRGFVENSYLAGLTPSEFYFHAMGGREGLIDTAVKTAETGYIQRRLI 857
            ||..|:||||.:....|.::|||.||:.:|||.:||:||.|||||||:|||||||.|||:.|||:
plant   830 GFIDRSLPHFPRMSKSPAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTASTGYMSRRLM 894

  Fly   858 KAMESVMVNYDGTVRNSVGQLIQLRYGEDGLCGELVEFQNMPTVKLSNKSFEKRFKFDWSNERLM 922
            ||:|.::|:||.||||:.|.::|..||:||:...|:|.::...:..:               ||.
plant   895 KALEDLLVHYDNTVRNASGCILQFTYGDDGMDPALMEGKDGAPLNFN---------------RLF 944

  Fly   923 KKVFTDDVIKEMT----------DSSEAIQELEAEWDRLVSDRDSLRQIFPNGESKVVLPCNLQR 977
            .||       :.|          .|.|..|:.|.|..|                           
plant   945 LKV-------QATCPPRSHHTYLSSEELSQKFEEELVR--------------------------- 975

  Fly   978 MIWNVQKIFHINKRLPTDLSPIRVIKGVKTLLERCVIVTGNDRISKQANENATLLFQCLIRSTLC 1042
                     |...|:.||..       ||:|.| .|.:.|                   ::|...
plant   976 ---------HDKSRVCTDAF-------VKSLRE-FVSLLG-------------------VKSASP 1004

  Fly  1043 TKYVSEEFRLSTEAFEWLVGEIETRFQQAQANPGEMVGALAAQSLGEPATQMTLNTFHFAGVSSK 1107
            .:.:.:...::.:..|..|.....|:::.:...|..:|.:.|||:|||.|||||.|||||||:|.
plant  1005 PQVLYKASGVTDKQLEVFVKICVFRYREKKIEAGTAIGTIGAQSIGEPGTQMTLKTFHFAGVASM 1069

  Fly  1108 NVTLGVPRLKEIINISKKPKAPSLTVFLTGGAARDAEKAKNVLCRLEHTTLRKVTANTAIYYDPD 1172
            |:|.||||:.||||.||....|.::..|..  ..:...|:.|..|:|.|||.:|..:..:.....
plant  1070 NITQGVPRINEIINASKNISTPVISAELEN--PLELTSARWVKGRIEKTTLGQVAESIEVLMTST 1132

  Fly  1173 PQRTVISEDQEFVNVYYEMPDFDPTRISPWLLR---IELDRKRMTDKKLTMEQIAEKINVGFGED 1234
            .....|..|.:.:       :.....|:||.::   ::..|.::.|                   
plant  1133 SASVRIILDNKII-------EEACLSITPWSVKNSILKTPRIKLND------------------- 1171

  Fly  1235 LNCIFNDDNADKLVLRIRIMNNEENKFQDEDEAVDKMEDDMFLRCIEANMLSDMTLQGIEAIGKV 1299
                 ||         ||:::..    .|....|||......|..:: |:|.::.:.||:.:   
plant  1172 -----ND---------IRVLDTG----LDITPVVDKSRAHFNLHNLK-NVLPNIIVNGIKTV--- 1214

  Fly  1300 YMHLPQTDSKKRIVITETGEFKAIGEWLLETDG---TSMMKVLSERDVDPIRTSSNDICEIFQVL 1361
                      :|:|:.|..: |.:.:.::....   |:::.|:....::...|:||::.|:.:.|
plant  1215 ----------ERVVVAEDMD-KMLAKLIIPCPRWACTNLLAVMGTPGINGRTTTSNNVVEVSKTL 1268

  Fly  1362 GIEAVRKSVEKEMNAVLQFYGLYVNYRHLALLCDVMTAKGHLMAITRHGINRQDTGALMRCSFEE 1426
            ||||.|.::..|:..|:..:|:.::.||:.||.||||.:|.::.|.|.||.:.|...||:.|||.
plant  1269 GIEAARTTIIDEIGTVMGNHGMSIDIRHMMLLADVMTYRGEVLGIQRTGIQKMDKSVLMQASFER 1333

  Fly  1427 TVDVLMDAAAHAETDPMRGVSENIIMGQLPKMGTGCFDLLL---DAEKCRFG 1475
            |.|.|..|||..:.|.:.||:|.:|||...|:|||...:|.   |..|.::|
plant  1334 TGDHLFSAAASGKVDNIEGVTECVIMGIPMKLGTGILKVLQRTDDLPKLKYG 1385
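
Each alignment block above has a markup line between the two sequences. A minimal sketch for tallying that line is given below. It assumes the common pairwise-alignment convention in which `|` marks an identical pair, `:` a strongly similar pair, `.` a weakly similar pair, and `-` in either sequence a gap; this page does not state the convention, so treat it as an assumption. The function name `tally_markup` is hypothetical, and the sample strings are the first 18 columns of the first block.

```python
def tally_markup(top: str, mid: str, bottom: str) -> dict:
    """Count column types in one alignment block (assumed markup convention)."""
    if len(top) != len(bottom):
        raise ValueError("aligned sequences must have equal length")
    # The markup line may have lost trailing spaces; pad it to full width.
    mid = mid.ljust(len(top))[: len(top)]
    counts = {"identical": 0, "strong": 0, "weak": 0, "gap": 0, "unmarked": 0}
    for a, m, b in zip(top, mid, bottom):
        if a == "-" or b == "-":
            counts["gap"] += 1
        elif m == "|":
            counts["identical"] += 1
        elif m == ":":
            counts["strong"] += 1
        elif m == ".":
            counts["weak"] += 1
        else:
            counts["unmarked"] += 1
    return counts


# First 18 columns of the first block (Fly residues 10 onward, plant 20 onward).
fly = "PLRQVKRVQFGILSPDEI"
markup = "||: :|.:.|.:||..|:"
plant = "PLK-IKSINFSVLSDLEV"

counts = tally_markup(fly, markup, plant)
print(counts)
```

Summing such tallies over all blocks would give totals to compare against the summary counts, although how the page combines the `:` and `.` columns into its similarity figure is not stated here.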

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain           Region       External ID  Identity
Polr2A  NP_511124.1     RNAP_II_RPB1_N   15..868      CDD:259848   315/914 (34%)
                        RNA_pol_Rpb1_6   890..1071    CDD:461511   27/190 (14%)
                        RNAP_II_Rpb1_C   1050..1468   CDD:132720   122/426 (29%)
                        Herpes_BLLF1     <1498..1862  CDD:282904
                        PTZ00449         <1707..1880  CDD:185628
NRPC1   NP_001190573.1  RNAP_III_RPC1_N  31..909      CDD:259847   316/911 (35%)
                        rpoC2            859..>1092   CDD:214368   103/317 (32%)
                        RNAP_III_Rpc1_C  1029..1369   CDD:132723   118/400 (30%)

Blue background indicates that the domain is not in the aligned region.
