DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment RpI1 and Polr2a

DIOPT Version :9

Sequence 1:NP_523743.1 Gene:RpI1 / 36617 FlyBaseID:FBgn0019938 Length:1642 Species:Drosophila melanogaster
Sequence 2:XP_343923.3 Gene:Polr2a / 363633 RGDID:1587326 Length:1970 Species:Rattus norvegicus


Alignment Length:1741 Identity:499/1741 - (28%)
Similarity:761/1741 - (43%) Gaps:405/1741 - (23%)


- Green bases have known domain annotations that are detailed below.


  Fly    16 LEFAVFTDQEIRKLSVVKVITGITFDAL---GHAIPGGLYDIRMG---SYGRCMDPCGTCL-KLQ 73
            ::|.|.:..|::::||.:  .||.:...   |....|||.|.|.|   ..|||.    ||. .:.
  Rat    21 VQFGVLSPDELKRMSVTE--GGIKYPETTEGGRPKLGGLMDPRQGVIERTGRCQ----TCAGNMT 79

  Fly    74 DCPGHMGHIELGTPVYN-PFFIKFVQRLLCIFCLHCYKLQMKDHECEIIML--------QLRLID 129
            :||||.|||||..||:: .|.:|.::.|.|: |..|.||.:..:..:|..:        :.||  
  Rat    80 ECPGHFGHIELAKPVFHVGFLVKTMKVLRCV-CFFCSKLLVDSNNPKIKDILAKSKGQPKKRL-- 141

  Fly   130 AGYIIEAQELELFKSEIVCQNTENL-----VAIKNGDMVHPHIAAMYKLLEKNEKNSSNSTKTSC 189
                  ....:|.|.:.:|:..|.:     |....||                |..:.......|
  Rat   142 ------THVYDLCKGKNICEGGEEMDNKFGVEQPEGD----------------EDLTKEKGHGGC 184

  Fly   190 S-LRTAITHSALQRLGKKCRHCNKSMRFVRYMHRRLVFYVTLADIKERVGTGAETGGQNKVIFAD 253
            . .:..|..|.|: |..:.:|.|:..:    ..:.|:....:.:|.:|:..             :
  Rat   185 GRYQPRIRRSGLE-LYAEWKHVNEDSQ----EKKILLSPERVHEIFKRISD-------------E 231

  Fly   254 EC------RRYLRQIYANYPELLKLLVPVLGLSNTDLTQGDRSPVDLFFMDTLPVTPPRARPLNM 312
            ||      .||.|      ||.:.:.|                         |||.|...||   
  Rat   232 ECFVLGMEPRYAR------PEWMIVTV-------------------------LPVPPLSVRP--- 262

  Fly   313 VGDMLKGNP--QTDIYINIIENNHVLNVVLKYMKGGQEKLTEEAKAAYQTLKGETAHEKLYTAWL 375
             ..:::|:.  |.|:       .|.|..::|.  ..|.:..|:..||...:..:.   ||     
  Rat   263 -AVVMQGSARNQDDL-------THKLADIVKI--NNQLRRNEQNGAAAHVIAEDV---KL----- 309

  Fly   376 ALQMSVDVLLD---VNMSREM-KSG---EGLKQIIEKKSGLIRSHMMGKRVNYAARTVITPDPNI 433
             ||..|..::|   ..:.|.| |||   :.|||.::.|.|.:|.::|||||:::||||||||||:
  Rat   310 -LQFHVATMVDNELPGLPRAMQKSGRPLKSLKQRLKGKEGRVRGNLMGKRVDFSARTVITPDPNL 373

  Fly   434 NVDEIGIPDIFAKKLSYPVPVTEWNVTELRKMVMNGPDVHPGANYIQDKNGFTTYIPADNASKRE 498
            ::|::|:|...|..:::...||.:|:..|:::|..|...:|||.||...||       |....| 
  Rat   374 SIDQVGVPRSIAANMTFAEIVTPFNIDRLQELVRRGNSQYPGAKYIIRDNG-------DRIDLR- 430

  Fly   499 SLAKLLLSNPKD-----GIKIVHRHVLNGDVLLLNRQPSLHKPSIMGHKARILHGEKTFRLHYSN 558
                 ....|.|     |.| |.||:.:||:::.||||:|||.|:|||:.|||.. .||||:.|.
  Rat   431 -----FHPKPSDLHLQTGYK-VERHMCDGDIVIFNRQPTLHKMSMMGHRVRILPW-STFRLNLSV 488

  Fly   559 CKAYNADFDGDEMNAHYPQSEVARAEAYNLVNVASNYLVPKDGTPLGGLIQDHVISGVKLSIRGR 623
            ...|||||||||||.|.|||...|||...|..|....:.|:...|:.|::||.:.:..|.:.|..
  Rat   489 TTPYNADFDGDEMNLHLPQSLETRAEIQELAMVPRMIVTPQSNRPVMGIVQDTLTAVRKFTKRDV 553

  Fly   624 FFNREDYQQLVFQGLSQLKKDIKLLPPTILKPAVLWSGKQILSTII---INIIPEGYERINLDSF 685
            |..|.:...|: ..||..  |.|:..|.||||..||:||||.|.||   ||.|     |.:    
  Rat   554 FLERGEVMNLL-MFLSTW--DGKVPQPAILKPRPLWTGKQIFSLIIPGHINCI-----RTH---- 606

  Fly   686 AKIAGKNWNVSRPRPPICGTNPEGNDL--------SESQVQIRNGELLVGVLDKQQYGATTYGLI 742
                              .|:|:..|.        .:::|.:.||||::|:|.|:..|.:...|:
  Rat   607 ------------------STHPDDEDSGPYKHISPGDTKVVVENGELIMGILCKKSLGTSAGSLV 653

  Fly   743 HCMYELYGGDVSTLLLTAFTKVFTFFLQLEGFTLGVKDILVTDVADRKRRKIIRECRNVGNSAVA 807
            |..|...|.||:.|..:....|...:|.:||.|:|:.|    .:||   .|..::.:|....|..
  Rat   654 HISYLEMGHDVTRLFYSNIQTVINNWLLIEGHTIGIGD----SIAD---SKTYQDIQNTIKKAKQ 711

  Fly   808 AALELEDEPPHDELVEKMEAAYVKDSKFRVLLDRKYKSLLDGYTNDINSTCLPRGLITKFPSNNL 872
            ..:|:.::..::||..      ...:..|...:.:...:|:...:...|:.  :..::::  ||.
  Rat   712 DVIEVIEKAHNNELEP------TPGNTLRQTFENQVNRILNDARDKTGSSA--QKSLSEY--NNF 766

  Fly   873 QLMVLSGAKGSMVNTMQISCLLGQIELEGKRPPLMISGKSLPSFTSFETSPKSGGFIDGRFMTGI 937
            :.||:||||||.:|..|:..::||..:||||.|.....::||.|...:..|:|.||::..::.|:
  Rat   767 KSMVVSGAKGSKINISQVIAVVGQQNVEGKRIPFGFKHRTLPHFIKDDYGPESRGFVENSYLAGL 831

  Fly   938 QPQDFFFHCMAGREGLIDTAVKTSRSGYLQRCLIKHLEGLSVHYDLTVRDSDNSVVQFLYGEDG- 1001
            .|.:||||.|.||||||||||||:.:||:||.|||.:|.:.|.||.|||:|.|.|||..||||| 
  Rat   832 TPTEFFFHAMGGREGLIDTAVKTAETGYIQRRLIKSMESVMVKYDATVRNSINQVVQLRYGEDGL 896

  Fly  1002 ---------LDILK--SKFFNDKFCADFLTQNATAILRPAQLQLMKDEEQLAKVQRHEKHIRSWE 1055
                     |..||  :|.|..||..|:  .|..|:.|..|..|:||....|.:|.         
  Rat   897 AGESVEFQNLATLKPSNKAFEKKFRFDY--TNERALRRTLQEDLVKDVLSNAHIQN--------- 950

  Fly  1056 KKKPAKLRAAFTHFSEELREEVEVKR---------------------------------PNEINS 1087
                 :|...|    |.:||:.||.|                                 |::::.
  Rat   951 -----ELEREF----ERMREDREVLRVIFPTGDSKVVLPCNLLRMIWNAQKIFHINPRLPSDLHP 1006

  Fly  1088 KTGRRRFDEGLLKLWKK---ADAEDKALYRKKYARCPDPTVAVYKQDLYYGSVSERT---RKLIT 1146
                .:..||:.:|.||   .:.:|           |....|.....|.:......|   |::..
  Rat  1007 ----IKVVEGVKELSKKLVIVNGDD-----------PLSRQAQENATLLFNIHLRSTLCSRRMAE 1056

  Fly  1147 DYAKRKPALKETIADIMRVKTIKSLAAPGEPVGLIAAQSIGEPSTQMTLNTFHFAGRGEMNVTLG 1211
            ::.....|....:.:| ..|..:::|.|||.||.:||||:|||:|||||||||:||....|||||
  Rat  1057 EFRLSGEAFDWLLGEI-ESKFNQAIAHPGEMVGALAAQSLGEPATQMTLNTFHYAGVSAKNVTLG 1120

  Fly  1212 IPRLREILMLASSNIKTPSMDIPIKPGQQHQAEKLR---INLNSVTLANLLEYVHVSTGLTLDPE 1273
            :|||:|::.: |...||||:.:.:.......||:.:   ..|...||..    |..:|.:..||.
  Rat  1121 VPRLKELINI-SKKPKTPSLTVFLLGQSARDAERAKDILCRLEHTTLRK----VTANTAIYYDPN 1180

  Fly  1274 RSYEYDMRFQFLPRE-VYKEDYGVRPKHIIKYMHQTFFKQLIRAILKVSNASRTTKIVVIDDKKD 1337
                        |:. |..||    .:.:..|.....|        .|:..|.....|.:|.|..
  Rat  1181 ------------PQSTVVAED----QEWVNVYYEMPDF--------DVARISPWLLRVELDRKHM 1221

  Fly  1338 ADKEDDNDLDNGDEVGRSKAKANDDDSSDDNDDDDATGVKLKQRKTDEKDYDDPDDVEELHDAND 1402
            .|::    |.......:..|...||.:...|||:   ..||               |..:...|.
  Rat  1222 TDRK----LTMEQIAEKINAGFGDDLNCIFNDDN---AEKL---------------VLRIRIMNS 1264

  Fly  1403 DDDEAEDEDDEEKGQDGNDNDGDDKAVERLLSNDMVKAYTYDKENHLWCQVKLNLSVRYQKPDLT 1467
            |:::.:   :||:..|..|:|...:.:|..:..||.                           |.
  Rat  1265 DENKMQ---EEEEVVDKMDDDVFLRCIESNMLTDMT---------------------------LQ 1299

  Fly  1468 SIIRELAGKSVVH--QVQHIKRAIIYKGTDDDQ-------LLKTDGINIGEMFQHNKILDLNRLY 1523
            .|  |...|..:|  |..:.|:.||   |:|.:       :|:|||:::..:.. .|.:|..|..
  Rat  1300 GI--EQISKVYMHLPQTDNKKKIII---TEDGEFKALQEWILETDGVSLMRVLS-EKDVDPVRTT 1358

  Fly  1524 SNDIHAIARTYGIEAASQVIVKEVSNVFKVYGITVDRRHLSLIADYMTFDGTFQPLSRKGM-EHS 1587
            ||||..|....||||..:.:.:|:.:|....|..|:.|||:|:.|.||..|....::|.|: ...
  Rat  1359 SNDIVEIFTVLGIEAVRKALERELYHVISFDGSYVNYRHLALLCDTMTCRGHLMAITRHGVNRQD 1423

  Fly  1588 SSPLQQMSFESSLQFLKSAAGFGRADELSSPSSRLMVGLPVRNGTGAFELL 1638
            :.||.:.|||.::..|..||..|.:|.:...|..:|:|.....|||.|:||
  Rat  1424 TGPLMKCSFEETVDVLMEAAAHGESDPMKGVSENIMLGQLAPAGTGCFDLL 1474

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
RpI1NP_523743.1 RNA_pol_rpoA1 14..1009 CDD:274106 323/1057 (31%)
RNAP_I_RPA1_N 18..982 CDD:259844 305/1016 (30%)
RNA_pol_Rpb1_5 936..1593 CDD:282807 205/721 (28%)
HMG-box 1057..1119 CDD:294061 15/97 (15%)
RNAP_I_Rpa1_C 1166..1638 CDD:132722 138/485 (28%)
Polr2aXP_343923.3 RNAP_II_RPB1_N 19..876 CDD:259848 305/1018 (30%)
RNA_pol_Rpb1_6 896..1079 CDD:398590 40/218 (18%)
RNAP_II_Rpb1_C 1058..1476 CDD:132720 142/505 (28%)
CTD 1491..1610 CDD:215026
Herpes_BLLF1 <1854..>1959 CDD:282904
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 00.000 Not matched by this tool.
eggNOG 1 0.900 - - E1_COG0086
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D591636at2759
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 1 1.000 - -
TreeFam 00.000 Not matched by this tool.
43.820

Return to query results.
Submit another query.