DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Polr2A and NRPD1B

DIOPT Version :10

Sequence 1:NP_511124.1 Gene:Polr2A / 32100 FlyBaseID:FBgn0003277 Length:1887 Species:Drosophila melanogaster
Sequence 2:NP_181532.2 Gene:NRPD1B / 818591 AraportID:AT2G40030 Length:1976 Species:Arabidopsis thaliana


Alignment Length:1881 Identity:386/1881 - (20%)
Similarity:653/1881 - (34%) Gaps:587/1881 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly    13 QVKRVQFGILSPDEIRRMSVTEGGVQFAETMEG---GRPKLGGLMDPRQGVIDRTSRCQTC-AGN 73
            ::..:.|.:.|..||...|::|..:.....:..   |.|...|             :|::| |..
plant    13 EIVGITFALASHHEICIQSISESAINHPSQLTNAFLGLPLEFG-------------KCESCGATE 64

  Fly    74 MTECPGHFGHIDLAKPVFHIGFITKTIKILRCVCFYCSKMLVSPHNPKIKEIVMKSRGQPRKRLA 138
            ..:|.||||:|.|..|::|...:.:..::|..:|..|.|             :.|::|.......
plant    65 PDKCEGHFGYIQLPVPIYHPAHVNELKQMLSLLCLKCLK-------------IKKAKGTSGGLAD 116

  Fly   139 YVYDLCKGKTICEGGEDMDLTKENQQPD---------PNKKPGHGGCGHYQPSIRRTGLDLTAEW 194
            .:..:|     ||  |...::.:::..|         |::.....||.::   :.|.|....:::
plant   117 RLLGVC-----CE--EASQISIKDRASDGASYLELKLPSRSRLQPGCWNF---LERYGYRYGSDY 171

  Fly   195 KHQNEDSQEKKIVVSAERVWEILKHITDE---ECFILGMDPKYARPDWMIVTVLPVPP--LAVRP 254
            ...          :.|..|.|||:.|.:|   :....|..|:    :..|:..|||||  |:|..
plant   172 TRP----------LLAREVKEILRRIPEESRKKLTAKGHIPQ----EGYILEYLPVPPNCLSVPE 222

  Fly   255 AVVMFGAAKNQDDLTHKLSDIIK------------ANNELRKNEAS-------------GAAAHV 294
            |...| :..:.|....:|.|::|            .|.|..|.|||             |.|.  
plant   223 ASDGF-STMSVDPSRIELKDVLKKVIAIKSSRSGETNFESHKAEASEMFRVVDTYLQVRGTAK-- 284

  Fly   295 IQENIKMLQFHVATLVDNDMPGMPRAMQKSGKPLKAIKARLKGKEGRIRGNLMGKRVDFSARTVI 359
            ...||.| ::.|:.:.|:                    :..|....::|...:.|...||:|:||
plant   285 AARNIDM-RYGVSKISDS--------------------SSSKAWTEKMRTLFIRKGSGFSSRSVI 328

  Fly   360 TPDPNLRIDQVGVPRSIAQNLTFPELVTPFNIDRMQELV--------RRGNSQYPGAKYIVRDNG 416
            |.|....:::||:|..|||.:||.|.|:..|...:|:||        .:|::.|.     :||..
plant   329 TGDAYRHVNEVGIPIEIAQRITFEERVSVHNRGYLQKLVDDKLCLSYTQGSTTYS-----LRDGS 388

  Fly   417 ERIDLRFHPKSSDLHLQCGYKVERHLRDDDLVIFNRQPTLHKMSMMGHRVKVLPWSTFRMNLSCT 481
            :          ....|:.|..|.|.:.|.|:|..||.||.||.|:...||.|...:|.::|....
plant   389 K----------GHTELKPGQVVHRRVMDGDVVFINRPPTTHKHSLQALRVYVHEDNTVKINPLMC 443

  Fly   482 SPYNADFDGDEMNLHVPQSMETRAEVENIHITPRQIITPQANKPVMGIVQDTLTAVRKMTKRDVF 546
            ||.:||||||.::|..|||:..:|||..:....:|:::....:.::.:..|:|.::|.|.:| ||
plant   444 SPLSADFDGDCVHLFYPQSLSAKAEVMELFSVEKQLLSSHTGQLILQMGSDSLLSLRVMLER-VF 507

  Fly   547 ITREQVMNLLMFLPTWDAKMPQPCILKPR---PLWTGKQIFSLIIPGNVNMIRTHSTHPDEEDEG 608
            :.:.....|.|:   ....:|.|.:.|..   |.||..||..|..|..::               
plant   508 LDKATAQQLAMY---GSLSLPPPALRKSSKSGPAWTVFQILQLAFPERLS--------------- 554

  Fly   609 PYKWISPGDTKVMVEHGELIMGILCKKSLGTSAGSLLHICFLELGHDIAGRFYGNIQTVINNWLL 673
                 ..|| :.:|:..:|:.......::|:....::...|||.|......|:.::|.::...|.
plant   555 -----CKGD-RFLVDGSDLLKFDFGVDAMGSIINEIVTSIFLEKGPKETLGFFDSLQPLLMESLF 613

  Fly   674 FEGHSIGIGDTIADPQTYNEIQQAIKKAKDDVINVIQKAHNM---ELEPTPGNTLRQTF------ 729
            .||.|:.:.|.            ::.:|..|||      ||:   |:.|.. :.||.::      
plant   614 AEGFSLSLEDL------------SMSRADMDVI------HNLIIREISPMV-SRLRLSYRDELQL 659

  Fly   730 ENKVNRILNDARDKTGGSAKKSLTEYNNLKAMVVSGSKGSNINISQVIACVGQQNVEGKRIPYGF 794
            ||.::::...|       |...|..| :::.::...|..:...:.|....:|.|..:.|:    |
plant   660 ENSIHKVKEVA-------ANFMLKSY-SIRNLIDIKSNSAITKLVQQTGFLGLQLSDKKK----F 712

  Fly   795 RKRTLPH----FIKDDYGPESR----GFVENSYLAGLTPSEFYFHAMGGREGLIDTAVKTAETGY 851
            ..:||..    |.|..||..|.    |.|:..:..||.|.|...|::..||.::.::...||.|.
plant   713 YTKTLVEDMAIFCKRKYGRISSSGDFGIVKGCFFHGLDPYEEMAHSIAAREVIVRSSRGLAEPGT 777

  Fly   852 IQRRLIKAMESVMVNYDGTVRNSV-GQLIQLRYGEDGLCGELVEFQN------MPTVKLSNKSFE 909
            :.:.|:..:..:::..||||||:. ..:||.:||.|...|....|:.      :....:||.:: 
plant   778 LFKNLMAVLRDIVITNDGTVRNTCSNSVIQFKYGVDSERGHQGLFEAGEPVGVLAATAMSNPAY- 841

  Fly   910 KRFKFDWSNERLMKKVFTDDVIKEMTDSSEAIQELEAEWDRLVSDRDSLRQIFPNGES-----KV 969
                                  |.:.|||                        ||..|     |.
plant   842 ----------------------KAVLDSS------------------------PNSNSSWELMKE 860

  Fly   970 VLPCNLQRMIWNVQKIFHINKRLPTDLSPIRVIKGVKTLLERCVIVTGND-RISKQ-ANENATLL 1032
            ||.|.:     |.|                      .|..:|.||:..|: ...|: ..|||.  
plant   861 VLLCKV-----NFQ----------------------NTTNDRRVILYLNECHCGKRFCQENAA-- 896

  Fly  1033 FQCLIRSTLCTKYVSEEFRLSTEAFEWLVGEIETRFQQAQANPGEMVGA---------------- 1081
              |.:|:.|      .:..|...|.|:||   |.|   .|....|:.|.                
plant   897 --CTVRNKL------NKVSLKDTAVEFLV---EYR---KQPTISEIFGIDSCLHGHIHLNKTLLQ 947

  Fly  1082 ---LAAQ-----------SLGEPATQMTLNTFHFAGVSSKNVTLGVPRLKEIINI-----SKKPK 1127
               ::.|           |||:...:...:.|       |..:|.|   .|..:.     ||...
plant   948 DWNISMQDIHQKCEDVINSLGQKKKKKATDDF-------KRTSLSV---SECCSFRDPCGSKGSD 1002

  Fly  1128 APSLTVFLTGGAARDAEKAKNVLCRLEHTTLRKVTANTAIYYDPDPQRTVISEDQEF--VNVYYE 1190
            .|.|| |.......|.|:..:|||...:..|.::               ||..|...  .|:.:.
plant  1003 MPCLT-FSYNATDPDLERTLDVLCNTVYPVLLEI---------------VIKGDSRICSANIIWN 1051

  Fly  1191 MPDFDPTRISPWLLRIELDRKRMTDKKLTMEQIAEKINVGFGEDLNCIFNDDNADKLVLRIRIMN 1255
            ..|     ::.|:......|:......:|:|:.|.|                             
plant  1052 SSD-----MTTWIRNRHASRRGEWVLDVTVEKSAVK----------------------------- 1082

  Fly  1256 NEENKFQDEDEAVDKMEDDMFLRCIEANMLSDMTLQGIEAIGKVYMHLPQTDSKKRIVITETGEF 1320
                                                             |:....|:||      
plant  1083 -------------------------------------------------QSGDAWRVVI------ 1092

  Fly  1321 KAIGEWLLETDGTSMMKVLSERDVDPIRTSSNDICEIFQVLGIEAVRKSVEKEMNAVLQFYGLYV 1385
                        .|.:.||  ..:|..|:....:.::.::||:....:...:.::|.::.....|
plant  1093 ------------DSCLSVL--HLIDTKRSIPYSVKQVQELLGLSCAFEQAVQRLSASVRMVSKGV 1143

  Fly  1386 NYRHLALLCDVMTAKGHLMAITRHGINRQDTGALMRCSFEETVDV-----LMDAAAHAETDPMRG 1445
            ...|:.||.:.||..|.::.....|.........::..|.|...:     ...||....||.:..
plant  1144 LKEHIILLANNMTCSGTMLGFNSGGYKALTRSLNIKAPFTEATLIAPRKCFEKAAEKCHTDSLST 1208

  Fly  1446 VSENIIMGQLPKMGTGC-FDLLLDAEKCRFGIE--------------IPNTLGNSMLGGAAMFIG 1495
            |..:...|:...:|||. |:||.:.::.  |::              |..|       .|..|:.
plant  1209 VVGSCSWGKRVDVGTGSQFELLWNQKET--GLDDKEETDVYSFLQMVISTT-------NADAFVS 1264

  Fly  1496 GGSTPSMTPPMTPWANCNTPRYFSPPGHVSAMTPGGPSFSPSA-------ASDASG----MSPSW 1549
            ..........|..||        ..|...||:  |.|.|..||       ....||    .|.||
plant  1265 SPGFDVTEEEMAEWA--------ESPERDSAL--GEPKFEDSADFQNLHDEGKPSGANWEKSSSW 1319

  Fly  1550 SPAHPGSS----PSSPGPSMSP--YFPASPSVSPSYSPTSPN-------YTASSPGGA------- 1594
            .....|.|    ..|.|...:|  .:..:.:|....:.:|.|       .:.|..|||       
plant  1320 DNGCSGGSEWGVSKSTGGEANPESNWEKTTNVEKEDAWSSWNTRKDAQESSKSDSGGAWGIKTKD 1384

  Fly  1595 -----SPNYSPS-SP-------NYSPTSPLYASPRYASTTPNFNPQSTGYSPSSSGYSPTSPVYS 1646
                 :||:..| :|       |..|||.::.....:..:.:.....|..:|::.| |..:.|:.
plant  1385 ADADTTPNWETSPAPKDSIVPENNEPTSDVWGHKSVSDKSWDKKNWGTESAPAAWG-STDAAVWG 1448

  Fly  1647 PTVQFQS--------------SPSFAGSGSNIYSPGNAYSPSSSNYSPNSPSYSPTSPSYS 1693
            .:.:..|              :.|..|||:.:..|.|   ..||....|..::..:..:.|
plant  1449 SSDKKNSETESDAAAWGSRDKNNSDVGSGAGVLGPWN---KKSSETESNGATWGSSDKTKS 1506

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Polr2ANP_511124.1 RNAP_II_RPB1_N 15..868 CDD:259848 213/923 (23%)
RNA_pol_Rpb1_6 890..1071 CDD:461511 36/193 (19%)
RNAP_II_Rpb1_C 1050..1468 CDD:132720 76/460 (17%)
Herpes_BLLF1 <1498..1862 CDD:282904 52/254 (20%)
PTZ00449 <1707..1880 CDD:185628
NRPD1BNP_181532.2 RNAP_IV_RPD1_N 24..798 CDD:259849 214/918 (23%)
RNAP_IV_NRPD1_C 825..1231 CDD:132724 102/624 (16%)
Ago_hook <1462..1582 CDD:463088 10/48 (21%)
DUF3223 1751..1827 CDD:431917
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.