DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Polr2A and NRPD1A

DIOPT Version :10

Sequence 1:NP_511124.1 Gene:Polr2A / 32100 FlyBaseID:FBgn0003277 Length:1887 Species:Drosophila melanogaster
Sequence 2:NP_176490.2 Gene:NRPD1A / 842605 AraportID:AT1G63020 Length:1453 Species:Arabidopsis thaliana


Alignment Length:1691 Identity:350/1691 - (20%)
Similarity:588/1691 - (34%) Gaps:548/1691 - (32%)


- Green bases have known domain annotations that are detailed below.


  Fly     8 KAPLRQVKRVQFGILSPDEIRRMSVTEGGVQFAETMEGGRPKLGGLMDPRQGVIDRTSRCQTCAG 72
            :.|:..:..:.|.|.:.::..:|||.|        :|....    :.|.|.|:.:..|.|:||..
plant     9 QVPVGTLTSIGFSISNNNDRDKMSVLE--------VEAPNQ----VTDSRLGLPNPDSVCRTCGS 61

  Fly    73 -NMTECPGHFGHIDLAKPVFHIGFITKTIKILRCVCFYCSKMLVSPHNPKIKEIVMK----SRGQ 132
             :...|.||||.|:.|..:.:..|:.:...:|..:|            |..|.|..|    :..|
plant    62 KDRKVCEGHFGVINFAYSIINPYFLKEVAALLNKIC------------PGCKYIRKKQFQITEDQ 114

  Fly   133 PRKRLAYVYDLCKGKTICEGGEDMDL---TKENQQPDPNKKPGHGGCGHYQPSIRRTGLDLTAEW 194
            |.:        |:..|:..|...|..   |||                    ..||:|:.:..  
plant   115 PER--------CRYCTLNTGYPLMKFRVTTKE--------------------VFRRSGIVVEV-- 149

  Fly   195 KHQNEDS----QEKKIVVSAERVWEILKHIT--DEEC---------------FILGMDPKYARPD 238
               ||:|    :::.::......|..|...:  ||.|               .:||:|.:..:.|
plant   150 ---NEESLMKLKKRGVLTLPPDYWSFLPQDSNIDESCLKPTRRIITHAQVYALLLGIDQRLIKKD 211

  Fly   239 WMI-----VTVLPVPPLAVRPAVVMF---GAAKNQDDLTHKLSDIIK-ANNELRKNEASGAAAHV 294
            ..:     :|..||.|...|...::.   ||....|:.|.....::. ..|.|   |.|......
plant   212 IPMFNSLGLTSFPVTPNGYRVTEIVHQFNGARLIFDERTRIYKKLVGFEGNTL---ELSSRVMEC 273

  Fly   295 IQENIKMLQFHVATLVDNDMPGMPRAMQKSGKPLKAIKARLKGKEGRIRGNLMGKRVDFSARTVI 359
            :|.: ::....|::..|:..|     .||.....|....|.      ::..|:|||.|.:.|||:
plant   274 MQYS-RLFSETVSSSKDSANP-----YQKKSDTPKLCGLRF------MKDVLLGKRSDHTFRTVV 326

  Fly   360 TPDPNLRIDQVGVPRSIAQNLTFPE---------LVT---PFNIDRMQELVRRGNSQYPGAKYIV 412
            ..||:|:::::|:|.|||:.|...|         |||   |..:|..:..||||           
plant   327 VGDPSLKLNEIGIPESIAKRLQVSEHLNQCNKERLVTSFVPTLLDNKEMHVRRG----------- 380

  Fly   413 RDNGERIDLRFHPKSSDLHLQCGYKVERHLRDDDLVIFNRQPTLHKMSMMGHRVKVLP-WSTFRM 476
             |....|.:.        .||.|.|:.|.|.|.|.|:.||.|::|:.|::...|::|| .|...:
plant   381 -DRLVAIQVN--------DLQTGDKIFRSLMDGDTVLMNRPPSIHQHSLIAMTVRILPTTSVVSL 436

  Fly   477 NLSCTSPYNADFDGDEMNLHVPQSMETRAEVENIHITPRQIITPQANKPVMGIVQDTLTAVRKM- 540
            |..|..|:..|||||.::.:||||::.:.|::.:....:|:|..|..:.::.:.||:|||...: 
plant   437 NPICCLPFRGDFDGDCLHGYVPQSIQAKVELDELVALDKQLINRQNGRNLLSLGQDSLTAAYLVN 501

  Fly   541 TKRDVFITREQVMNLLMFLPTWDAKMPQPCILK-----PRPLWTGKQIFSLIIPGNVNMIRTHST 600
            .:::.::.|.|:..|.|:.|   .::|.|.|:|     ..|.|||.|:|.::.|...:.     |
plant   502 VEKNCYLNRAQMQQLQMYCP---FQLPPPAIIKASPSSTEPQWTGMQLFGMLFPPGFDY-----T 558

  Fly   601 HPDEEDEGPYKWISPGDTKVMVEHGELIM------------GILCKKSLGTSAGSLLHICFLELG 653
            :|              ...|:|.:|||:.            |...::.|....|.:|.|.:    
plant   559 YP--------------LNNVVVSNGELLSFSEGSAWLRDGEGNFIERLLKHDKGKVLDIIY---- 605

  Fly   654 HDIAGRFYGNIQTVINNWLLFEGHSIGIGDTI--ADPQTYNEIQQAIKKAKDDVINVIQKAHNM- 715
                     :.|.:::.|||..|.|:.:.|..  :|.|:...:.:.|.....:...|..|...| 
plant   606 ---------SAQEMLSQWLLMRGLSVSLADLYLSSDLQSRKNLTEEISYGLREAEQVCNKQQLMV 661

  Fly   716 -------------ELEPTPGNTLRQTFENKVNRILN--------DARDKTGGSAKKSLTEYNNLK 759
                         :.|.:..:..|..:|.:.:..|:        ||.......|.:...:.|:..
plant   662 ESWRDFLAVNGEDKEEDSVSDLARFCYERQKSATLSELAVSAFKDAYRDVQALAYRYGDQSNSFL 726

  Fly   760 AMVVSGSKGSNINISQVIACVGQQNVEGKRIPYGF-RKRTLPHFI-----------KDDYGPES- 811
            .|..:||||:...:.|...|:|.|| ....:.:|| |:.|...:.           ||....|| 
plant   727 IMSKAGSKGNIGKLVQHSMCIGLQN-SAVSLSFGFPRELTCAAWNDPNSPLRGAKGKDSTTTESY 790

  Fly   812 --RGFVENSYLAGLTPSEFYFHAMGGRE----GLIDTAVKTAETGYIQRRLIKAMESVMVNYDGT 870
              .|.:|||:|.||.|.|.:.|::..|:    |..|.      .|.:.|||:..|..:...||||
plant   791 VPYGVIENSFLTGLNPLESFVHSVTSRDSSFSGNADL------PGTLSRRLMFFMRDIYAAYDGT 849

  Fly   871 VRNSVG-QLIQLRYGEDGLCGELVEFQNMPTVKLSNKSFEKRFKFDWSNERLMKKVFTDDVIKEM 934
            ||||.| ||:|..|..||           |                           .:|:..|.
plant   850 VRNSFGNQLVQFTYETDG-----------P---------------------------VEDITGEA 876

  Fly   935 TDSSEAIQELEAEWDRLVSDRD-SLRQIFPNGESKVVLPCNLQRMIWNVQKIFHINKRLPTDLSP 998
            ..|..|....||.:..|  |:. ||.:..|....|.||.|.                        
plant   877 LGSLSACALSEAAYSAL--DQPISLLETSPLLNLKNVLECG------------------------ 915

  Fly   999 IRVIKGVKTLLERCVIVTGNDRISKQANENATLLFQCLIRSTLCTKYVSEEFRLSTEAFEWLVGE 1063
                                   ||:.....|:           :.|:||........||:  |.
plant   916 -----------------------SKKGQREQTM-----------SLYLSEYLSKKKHGFEY--GS 944

  Fly  1064 IETRFQQAQANPGEMVGALAAQSLGEPATQMTLNTFHFAGVSSKNVTLGVPRLKEIINISKKPKA 1128
            :|                                                               
plant   945 LE--------------------------------------------------------------- 946

  Fly  1129 PSLTVFLTGGAARDAEKAKNVLCRLEHTTLRKVTANTAIYYDPDPQRTVISEDQEFVNVYYEMPD 1193
                             .||   .||..:..::.:.:.|.:.|.....|                
plant   947 -----------------IKN---HLEKLSFSEIVSTSMIIFSPSSNTKV---------------- 975

  Fly  1194 FDPTRISPWLLRIELDRKRMTDKKLTMEQIAEKINVGFG---------------EDLNCIFNDDN 1243
                .:|||:....:..|.:..|:|:.|.:...:|..:.               ::.|...:||.
plant   976 ----PLSPWVCHFHISEKVLKRKQLSAESVVSSLNEQYKSRNRELKLDIVDLDIQNTNHCSSDDQ 1036

  Fly  1244 A---DKLVLRIRIMNNEENKFQDEDEAVDKMEDDMFLRCIEANMLSDMTLQGIEAIGKVYMHLPQ 1305
            |   |.:.:.:.::...::...:.|.          :|.:....|.|..::|.:.|.||  ::..
plant  1037 AMKDDNVCITVTVVEASKHSVLELDA----------IRLVLIPFLLDSPVKGDQGIKKV--NILW 1089

  Fly  1306 TDSKKR-------------IVITETGEFKAIGEW--LLETDGTSMMKVLSERD-VDPIRTSSNDI 1354
            ||..|.             :.:|..|:......|  ||||       .|...| :|..|:..::|
plant  1090 TDRPKAPKRNGNHLAGELYLKVTMYGDRGKRNCWTALLET-------CLPIMDMIDWGRSHPDNI 1147

  Fly  1355 CEIFQVLGIEAVRKSVEKEMNAVLQFYGLYVNYRHLALLCDVMTAKGHLMAITRHGINRQ----D 1415
            .:...|.||:|.|......:.:.:...|..:...||.|:.|.::..|..:|:...|.::|    .
plant  1148 RQCCSVYGIDAGRSIFVANLESAVSDTGKEILREHLLLVADSLSVTGEFVALNAKGWSKQRQVES 1212

  Fly  1416 TGA-LMRCSFEETVDVLMDAAAHAETDPMRGVSENIIMGQLPKMGTG------------------ 1461
            |.| ..:..|.......:.||.....|.::|..:.:..|::|..|||                  
plant  1213 TPAPFTQACFSSPSQCFLKAAKEGVRDDLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFTTPV 1277

  Fly  1462 -CFDLLLDAEKCR---------------FGIEIPNTLGNSMLGGAAMFIGGGSTPSMTPPMTPWA 1510
             .:|||...:..|               ||:     |.::.|....:..|.|...|:...:..|.
plant  1278 DVYDLLSSTKTMRRTNSAPKSDKATVQPFGL-----LHSAFLKDIKVLDGKGIPMSLLRTIFTWK 1337

  Fly  1511 N 1511
            |
plant  1338 N 1338

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Polr2ANP_511124.1 RNAP_II_RPB1_N 15..868 CDD:259848 226/964 (23%)
RNA_pol_Rpb1_6 890..1071 CDD:461511 26/181 (14%)
RNAP_II_Rpb1_C 1050..1468 CDD:132720 76/475 (16%)
Herpes_BLLF1 <1498..1862 CDD:282904 3/14 (21%)
PTZ00449 <1707..1880 CDD:185628
NRPD1ANP_176490.2 RNAP_IV_RPD1_N 23..851 CDD:259849 228/961 (24%)
RNAP_IV_NRPD1_C 874..1266 CDD:132724 93/575 (16%)
DUF3223 1347..1419 CDD:431917
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.