DRSC/TRiP Functional Genomics Resources

Protein Alignment Polr1A and NRPC1

DIOPT Version: 10

Sequence 1: NP_523743.1  Gene: Polr1A / 36617  FlyBaseID: FBgn0019938  Length: 1642  Species: Drosophila melanogaster
Sequence 2: NP_001190573.1  Gene: NRPC1 / 836126  AraportID: AT5G60040  Length: 1391  Species: Arabidopsis thaliana


Alignment Length: 1706  Identity: 453/1706 (26%)
Similarity: 710/1706 (41%)  Gaps: 437/1706 (25%)
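
The percentages above are each count divided by the alignment length. The following is a minimal sketch of that arithmetic, using only the counts reported on this page; the variable and function names are illustrative and not part of DIOPT. The whole-number values shown on the page match these results with the fractional part dropped.

```python
# Counts copied from the summary lines of this report.
ALIGNMENT_LENGTH = 1706
counts = {"identity": 453, "similarity": 710, "gaps": 437}


def percent(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    return 100.0 * count / total


# Percentage of alignment columns in each category.
percentages = {name: percent(c, ALIGNMENT_LENGTH) for name, c in counts.items()}

for name, value in percentages.items():
    print(f"{name}: {counts[name]}/{ALIGNMENT_LENGTH} = {value:.2f}%")
```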


- Residues shown in green on the original page have known domain annotations, which are detailed below.


  Fly    16 LEFAVFTDQEIRKLSVVKVITGITFDALGHAIPGGLYDIRMGSYGRCMDPCGTCL-KLQDCPGHM 79
            :.|:|.:|.|:.|.:.|:|.....:|........||.|.|||...: ...|.||. ..|:||||.
plant    26 INFSVLSDLEVMKAAEVQVWNIGLYDHSFKPYENGLLDPRMGPPNK-KSICTTCEGNFQNCPGHY 89

  Fly    80 GHIELGTPVYNPFFIKFVQRLL-CIFCLHCYKLQMKDH---ECEIIMLQLRLIDAGYIIEAQELE 140
            |:::|..||||..:..|:..:| ||    |...::.|:   .|..::|..:|.:           
plant    90 GYLKLDLPVYNVGYFNFILDILKCI----CKVTELADYVSLRCSNMLLDEKLYE----------- 139

  Fly   141 LFKSEIVCQNTENLVAIKNGDMVHPHIAAMYKLLEKNEKNSSNSTKTSCSLRTAITHSALQRL-- 203
                       ::|..::|..|         :.|:|.|  .:.:....||.      .|.||:  
plant   140 -----------DHLRKMRNPRM---------EPLKKTE--LAKAVVKKCST------MASQRIIT 176

  Fly   204 GKKCRHCNKSMRFVRYMHRRLVFYVTLADIKERVGTGAETGGQNKVIFADECR---RYLRQIYAN 265
            .|||.:.|..::.:...     |.:.::..:.::     .||:     .|||:   .:.:|..|.
plant   177 CKKCGYLNGMVKKIAAQ-----FGIGISHDRSKI-----HGGE-----IDECKSAISHTKQSTAA 226

  Fly   266 YPELLKLLVP--VLGL------SNTDLTQGDRSPVDLFFMDTLPVTPPRA-RPLNMVGDMLKGNP 321
            ...|..:|.|  ||||      .:.:|......|.:|..  |..:.||.: ||..|:|       
plant   227 INPLTYVLDPNLVLGLFKRMSDKDCELLYIAYRPENLII--TCMLVPPLSIRPSVMIG------- 282

  Fly   322 QTDIYINIIENNHVLNVVLKYMKGGQEKLTEEAKAAYQTLKGETAHEKLYTAWLALQMSVDVLLD 386
                  .|..|.:.|...||.:..|...|       ::.|...|:..|....|..:|:.|...::
plant   283 ------GIQSNENDLTARLKQIILGNASL-------HKILSQPTSSPKNMQVWDTVQIEVARYIN 334

  Fly   387 VNMSREMKSGE-----GLKQIIEKKSGLIRSHMMGKRVNYAARTVITPDPNINVDEIGIPDIFAK 446
            ..:.......|     |:.|.::.|.|..|:::.||||.:..||||:||||:.:.|:|||.:.|:
plant   335 SEVRGCQNQPEEHPLSGILQRLKGKGGRFRANLSGKRVEFTGRTVISPDPNLKITEVGIPILMAQ 399

  Fly   447 KLSYPVPVTEWNVTELRKMVMNGPDVHPGANYIQDKNGFTTYIPADNASKRESLAKLLLSNPKDG 511
            .|::|..|:..|:.:||:.|.|||:.:|||..::..:|.:..:..|   .|:.:|..|...    
plant   400 ILTFPECVSRHNIEKLRQCVRNGPNKYPGARNVRYPDGSSRTLVGD---YRKRIADELAIG---- 457

  Fly   512 IKIVHRHVLNGDVLLLNRQPSLHKPSIMGHKARILHGEKTFRLHYSNCKAYNADFDGDEMNAHYP 576
             .||.||:..|||:|.|||||||:.|||.|:|||:.. :|.|.:.|.|..|||||||||||.|.|
plant   458 -CIVDRHLQEGDVVLFNRQPSLHRMSIMCHRARIMPW-RTLRFNESVCNPYNADFDGDEMNMHVP 520

  Fly   577 QSEVARAEAYNLVNVASNYLVPKDGTPLGGLIQDHVISGVKLSIRGRFFNREDYQQLVFQGLSQL 641
            |:|.||.||..|:.|.:|...||:|..|....||.:.|...::.:..|::|..: .|:...:...
plant   521 QTEEARTEAITLMGVQNNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDRAAF-SLICSYMGDG 584

  Fly   642 KKDIKLLPPTILKPAVLWSGKQILSTII-INIIPEGYERINLDSFAKIAGKNWNVSRPRPPICGT 705
            ...|.|..||||||..||:||||.|.:: .|.....|..:|      :..||:.          .
plant   585 MDSIDLPTPTILKPIELWTGKQIFSVLLRPNASIRVYVTLN------VKEKNFK----------K 633

  Fly   706 NPEGND----LSESQVQIRNGELLVGVLDKQQYGATTY--------GLIHCMYELYGGDVSTLLL 758
            ...|.|    :::..|..||.||:.|.|.|.......:        ||...:...|....:.:.:
plant   634 GEHGFDETMCINDGWVYFRNSELISGQLGKATLALDIFPLGNGNKDGLYSILLRDYNSHAAAVCM 698

  Fly   759 TAFTKVFTFFLQLEGFTLGVKDILVTDVADRKRRKIIR----ECR------NVGNSAVAAALE-- 811
            ....|:...::.:.||::|:.|:...:...::|:..|:    :|.      |.||..:.|.|:  
plant   699 NRLAKLSARWIGIHGFSIGIDDVQPGEELSKERKDSIQFGYDQCHRKIEEFNRGNLQLKAGLDGA 763

  Fly   812 --LEDEPPHDELVEKMEAAYVKDSKFRVLLDRKYKSLLDGYTNDI----NSTCLPRGLITKFPSN 870
              ||.|                               :.|..|.|    ...|: .||..:   |
plant   764 KSLEAE-------------------------------ITGILNTIREATGKACM-SGLHWR---N 793

  Fly   871 NLQLMVLSGAKGSMVNTMQISCLLGQIELEGKRPPLMISGKSLPSFTSFETSPKSGGFIDGRFMT 935
            :..:|...|:|||.:|..|:...:||..:.|.|.|.....:|||.|.....||.:.||:...|.:
plant   794 SPLIMSQCGSKGSPINISQMVACVGQQTVNGHRAPDGFIDRSLPHFPRMSKSPAAKGFVANSFYS 858

  Fly   936 GIQPQDFFFHCMAGREGLIDTAVKTSRSGYLQRCLIKHLEGLSVHYDLTVRDSDNSVVQFLYGED 1000
            |:...:||||.|.|||||:||||||:.:||:.|.|:|.||.|.||||.|||::...::||.||:|
plant   859 GLTATEFFFHTMGGREGLVDTAVKTASTGYMSRRLMKALEDLLVHYDNTVRNASGCILQFTYGDD 923

  Fly  1001 GLD--ILKSKFFND----KFCADFLTQNATAILRPAQLQL------MKDEEQLAKVQRHEKHIRS 1053
            |:|  :::.|   |    .|...||...||...|.....|      .|.||:|.   ||:|.   
plant   924 GMDPALMEGK---DGAPLNFNRLFLKVQATCPPRSHHTYLSSEELSQKFEEELV---RHDKS--- 979

  Fly  1054 WEKKKPAKLRAAFTHFSEELREEVEV-----KRPNEINSKTGRRRFDEGLLKLWKKADAEDKALY 1113
                     |.....|.:.|||.|.:     ..|.::               |:|.:...||.| 
plant   980 ---------RVCTDAFVKSLREFVSLLGVKSASPPQV---------------LYKASGVTDKQL- 1019

  Fly  1114 RKKYARCPDPTVAVYKQDLYYGSVSERTRKLITDYAKRKPALKETIADIMRVKTIKSLAAPGEPV 1178
             :.:.:     :.|::                                 .|.|.|::    |..:
plant  1020 -EVFVK-----ICVFR---------------------------------YREKKIEA----GTAI 1041

  Fly  1179 GLIAAQSIGEPSTQMTLNTFHFAGRGEMNVTLGIPRLREILMLASSNIKTPSMDIPIK-PGQQHQ 1242
            |.|.|||||||.|||||.||||||...||:|.|:||:.||:. ||.||.||.:...:: |.:...
plant  1042 GTIGAQSIGEPGTQMTLKTFHFAGVASMNITQGVPRINEIIN-ASKNISTPVISAELENPLELTS 1105

  Fly  1243 AEKLRINLNSVTLANLLEYVHVSTGLTLDPERSYEYDMRFQFLPREVYKEDYGVRPKHIIKYMHQ 1307
            |..::..:...||..:.|.:.|....|....|.        .|..::.:|               
plant  1106 ARWVKGRIEKTTLGQVAESIEVLMTSTSASVRI--------ILDNKIIEE--------------- 1147

  Fly  1308 TFFKQLIRAILKVSNASRTTKIVVIDDKKDADKEDDND---LDNGDE----VGRSKAKANDDDSS 1365
                    |.|.::..|....|:    |....|.:|||   ||.|.:    |.:|:|..|..:..
plant  1148 --------ACLSITPWSVKNSIL----KTPRIKLNDNDIRVLDTGLDITPVVDKSRAHFNLHNLK 1200

  Fly  1366 DDNDDDDATGVKLKQRKTDEKDYDDPDDVEELHDANDDDDEAEDEDDEEKGQDGNDNDGDDKAVE 1430
            :...:....|:|..:|....:|.                                     ||.:.
plant  1201 NVLPNIIVNGIKTVERVVVAEDM-------------------------------------DKMLA 1228

  Fly  1431 RLLSNDMVKAYTYDKENHLWCQVKLNLSVRYQKPDLTSIIRELAGKSVVHQVQHIKRAIIYKGTD 1495
            :|:               :.|.       |:...:|.::                          
plant  1229 KLI---------------IPCP-------RWACTNLLAV-------------------------- 1245

  Fly  1496 DDQLLKTDGINIGEMFQHNKILDLNRLYSNDIHAIARTYGIEAASQVIVKEVSNVFKVYGITVDR 1560
                :.|.||| |.....|.:::           :::|.|||||...|:.|:..|...:|:::|.
plant  1246 ----MGTPGIN-GRTTTSNNVVE-----------VSKTLGIEAARTTIIDEIGTVMGNHGMSIDI 1294

  Fly  1561 RHLSLIADYMTFDGTFQPLSRKGMEH-SSSPLQQMSFESSLQFLKSAAGFGRADELSSPSSRLMV 1624
            ||:.|:||.||:.|....:.|.|::. ..|.|.|.|||.:...|.|||..|:.|.:...:..:::
plant  1295 RHMMLLADVMTYRGEVLGIQRTGIQKMDKSVLMQASFERTGDHLFSAAASGKVDNIEGVTECVIM 1359

  Fly  1625 GLPVRNGTGAFELLTK 1640
            |:|::.|||..::|.:
plant  1360 GIPMKLGTGILKVLQR 1375
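
Each block of the alignment above has three rows: the fly sequence, a markup row, and the plant sequence, with `-` marking a gap. As a rough sketch of how identity and gap counts can be tallied per block, the snippet below processes the first block only (fly residues 16 to 79). The function name `block_stats` is hypothetical, and the reading of `|` as "identical pair" is an assumption that the code checks against the residues themselves rather than taking on trust.

```python
# First alignment block, copied from this page (coordinates stripped).
FLY = "LEFAVFTDQEIRKLSVVKVITGITFDALGHAIPGGLYDIRMGSYGRCMDPCGTCL-KLQDCPGHM"
MARKS = ":.|:|.:|.|:.|.:.|:|.....:|........||.|.|||...: ...|.||. ..|:||||."
PLANT = "INFSVLSDLEVMKAAEVQVWNIGLYDHSFKPYENGLLDPRMGPPNK-KSICTTCEGNFQNCPGHY"


def block_stats(top: str, marks: str, bottom: str) -> dict:
    """Tally columns, identical pairs, gap columns, and '|' marks in one block."""
    if not (len(top) == len(marks) == len(bottom)):
        raise ValueError("rows of an alignment block must have equal length")
    pairs = list(zip(top, bottom))
    return {
        "columns": len(pairs),
        # Identical residues in both rows (gap characters never count).
        "identities": sum(1 for a, b in pairs if a == b and a != "-"),
        # Columns where either row has a gap.
        "gaps": sum(1 for a, b in pairs if a == "-" or b == "-"),
        # Number of '|' symbols in the markup row (assumed to mark identities).
        "bars": marks.count("|"),
    }


stats = block_stats(FLY, MARKS, PLANT)
print(stats)
```

Summing these per-block tallies over every block would give totals comparable to the Identity and Gaps figures in the report header.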

Known Domains:


Indicated by green residues in the alignment on the original page.

Gene    Sequence        Domain            Region       External ID  Identity
Polr1A  NP_523743.1     RNAP_I_RPA1_N     18..982      CDD:259844   293/1018 (29%)
                        HMG-box_SF        1065..1119   CDD:438789   11/58 (19%)
                        RNAP_I_Rpa1_C     1166..1638   CDD:132722   117/480 (24%)
NRPC1   NP_001190573.1  RNAP_III_RPC1_N   31..909      CDD:259847   294/1019 (29%)
                        rpoC2             859..>1092   CDD:214368   105/310 (34%)
                        RNAP_III_Rpc1_C   1029..1369   CDD:132723   117/513 (23%)
