DRSC/TRiP Functional Genomics Resources


Protein Alignment: prc and COL1A2

DIOPT Version: 10

Sequence 1: NP_524658.2  Gene: prc / 43930  FlyBaseID: FBgn0028573  Length: 1713  Species: Drosophila melanogaster
Sequence 2: NP_000080.2  Gene: COL1A2 / 1278  HGNCID: 2198  Length: 1366  Species: Homo sapiens


Alignment Length: 1493  Identity: 456/1493 (30%)
Similarity: 573/1493 (38%)  Gaps: 291/1493 (19%)
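
The summary figures above can be recomputed from their raw counts. The following is a minimal sketch, not part of DIOPT itself: the function name `parse_fractions` and the `SUMMARY` string are illustrative assumptions, and the sketch reports one decimal place rather than guessing how the page rounds its whole-number percentages.

```python
import re

# Summary line copied from this report; spacing after the colons is optional.
SUMMARY = (
    "Alignment Length: 1493 Identity: 456/1493 (30%) "
    "Similarity: 573/1493 (38%) Gaps: 291/1493 (19%)"
)

def parse_fractions(text):
    """Return {label: (count, total, percent)} for every 'Label: n/m' field."""
    stats = {}
    for label, count, total in re.findall(r"(\w+):\s*(\d+)/(\d+)", text):
        count, total = int(count), int(total)
        stats[label] = (count, total, 100.0 * count / total)
    return stats

stats = parse_fractions(SUMMARY)
for label, (count, total, percent) in stats.items():
    print(f"{label}: {count}/{total} = {percent:.1f}%")
```

All three fractions share the alignment length (1493 columns) as their denominator, so the percentages describe the full gapped alignment rather than either sequence on its own.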


- Green residues have known domain annotations, which are detailed below.


  Fly   163 GDQPGVGTQPGVGQPGYGSQPGIGGQTG-AGQPGYGSQPGIGGQTGAGQPGYGSQPVVGAQTG-- 224
            ||:   |.:...|.||...:.|..|.|| .|.||....||:||...|...|.|    ||...|  
Human    36 GDR---GPRGERGPPGPPGRDGEDGPTGPPGPPGPPGPPGLGGNFAAQYDGKG----VGLGPGPM 93

  Fly   225 -----TGQPGYGAQPG-VGTQTGAGQPGYGSQPGIGGQTGARQPGYVTQPGVGAQTG-IGQPGYG 282
                 .|.||....|| .|.|..||:||   :||..|..|||.|  ...||...:.| .|:||..
Human    94 GLMGPRGPPGAAGAPGPQGFQGPAGEPG---EPGQTGPAGARGP--AGPPGKAGEDGHPGKPGRP 153

  Fly   283 AQPGILGQTGA----GQPGYGSQPGIGGQTG----AGQPGYGTQPGVGAQTGTGQPGYGAQPGVG 339
            .:.|::|..||    |.||.....||.|..|    .||||   .|||..:.|.  ||....||  
Human   154 GERGVVGPQGARGFPGTPGLPGFKGIRGHNGLDGLKGQPG---APGVKGEPGA--PGENGTPG-- 211

  Fly   340 TQTGA-GQPGYGSQPGIGGQTGA-GQPG-YGTQPGVGAQTGAGQPGYGAQPG-------VGAQTG 394
             |||| |.||...:.|..|..|| |..| .|.....|....||.||:...||       ||....
Human   212 -QTGARGLPGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGP 275

  Fly   395 AGQPGYGSQPGIGGQTG-AGQPGYGSQPGIGGQTGARQPGYGSQPGVGAQTGA-GQPGYGAQPG- 456
            ||..|...:.|:.|.:| .|.||   .||..|.|||:  |....|||   .|| |.||....|| 
Human   276 AGPAGPRGEVGLPGLSGPVGPPG---NPGANGLTGAK--GAAGLPGV---AGAPGLPGPRGIPGP 332

  Fly   457 VGAQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPG-VGAQTGAGQPGYGAQPGVGAQTG-AGQPGY 519
            |||....|..|...:||..|  ..|:.|...:|| .|.|...|..|...:.|...:.| ||.|| 
Human   333 VGAAGATGARGLVGEPGPAG--SKGESGNKGEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPG- 394

  Fly   520 GSQPGIGGQTGA-GQPGYGTQPGVGAQTGTGQPGYGAQPGVGTQTGAGQPGYGSQPGIGGQTG-A 582
              .||:.|..|: |.||...:.||     .|.||         ..||..|.     |:.|..| |
Human   395 --PPGLRGSPGSRGLPGADGRAGV-----MGPPG---------SRGASGPA-----GVRGPNGDA 438

  Fly   583 GQPGYGTQPGVGAQTG-TGQPGYGAQPGVGTQTGAGQPGYGSQPGYGTQPG-VGAQTGTGQPGYG 645
            |:||   :||:....| .|.||     .:|.   ||:.|....||...:|| :|.....|:||..
Human   439 GRPG---EPGLMGPRGLPGSPG-----NIGP---AGKEGPVGLPGIDGRPGPIGPAGARGEPGNI 492

  Fly   646 AQPGVGGQTGAGQPGYGTQPGIGGQTG-AGQPGYGSQPGIGGQTGG-GQPGYGSQIGGQTGAGQP 708
            ..||..|.||        .||..|..| ||..|....||..|..|. |.||              
Human   493 GFPGPKGPTG--------DPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPG-------------- 535

  Fly   709 SYGSQPGVGAQNGAGQPGYGTQPVIGGQTGA-------------GQPGYGGQTGVGGSPGFLTQP 760
            ..|.|.|.|.|...|.||:...|...|..|.             |.||..|..|..|.||   :.
Human   536 PQGVQGGKGEQGPPGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPG---ES 597

  Fly   761 GIGGISGPIG--GKLG--GGQSEAAKPGYWAQPGIGGPSRYGSQPGIGDQTG-TGQSGYGGQPGI 820
            |..|.:||||  |..|  |......:||.....|..|||.....||.....| .|..|..|:||:
Human   598 GAAGPTGPIGSRGPSGPPGPDGNKGEPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEPGL 662

  Fly   821 SGQTGGGQPGYGGQATISGLPGYGTQPGIGALTAVPGGHYGYETQPGIGGQTGTNQPGFGGQPGI 885
            .|:.  |.||..|   ..|.||....||       |.|..|...:.|..|..|...|  .|.||.
Human   663 RGEI--GNPGRDG---ARGAPGAVGAPG-------PAGATGDRGEAGAAGPAGPAGP--RGSPGE 713

  Fly   886 GGQTGAGQPGYGFIGQPGIGGQTGTSGRQPGYGT--QPGIGGQT----AAGQPGYGSQTGVGGQI 944
            .|:.|...|. ||.|..|..||.|..|.:...|.  :.|:.|.|    |||..|.....|..|..
Human   714 RGEVGPAGPN-GFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSR 777

  Fly   945 G-AGQPGYGSQPGIGGQTGAGQPGYGAQPGFGGQPGYGNQPGVGGQTGAGQPGYGSQPGIG--GQ 1006
            | .|.||....||..|:||...|.     |..|.||   .||..|:.|...| .|.|..:|  |:
Human   778 GDGGPPGMTGFPGAAGRTGPPGPS-----GISGPPG---PPGPAGKEGLRGP-RGDQGPVGRTGE 833

  Fly  1007 TGA-GQPGYGAQPGFGGQLGYGNQPGVGGQTG-AGQPGYGSQPGVGGQTG-------AGQPGYGV 1062
            .|| |.||:..:.|..|:.|....||..|..| .|.||....||..|:.|       .|:||   
Human   834 VGAVGPPGFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPG--- 895

  Fly  1063 IP-GFGGQPGIGGQTAAGKPGYGGQPGIGGSPVYGTQQGTGGQSGISG--GQPGYGTQPGQTGAG 1124
             | |..|.||     |.|.||..|.||:.|:|      |..|:.|..|  |.||...|||.  .|
Human   896 -PLGIAGPPG-----ARGPPGAVGSPGVNGAP------GEAGRDGNPGNDGPPGRDGQPGH--KG 946

  Fly  1125 QPGYGSLPGT-GGQATAGQPG----YGPGSQPGIGGQTVGGHGGYGSQPGIGGAPVYGTQPGGGG 1184
            :.||   ||. |....||.||    .||..:.|..|:|       |....:|.|...|.: |..|
Human   947 ERGY---PGNIGPVGAAGAPGPHGPVGPAGKHGNRGET-------GPSGPVGPAGAVGPR-GPSG 1000

  Fly  1185 QTGVIG--GQPGQIGDRVGQPGYGTQTGQIGAPG-RYTDGSQTVPGAVGTGGVVA-AGTSG--AD 1243
            ..|:.|  |:||:.|.| |.||.....|..|.|| ....|.|..||:||..|... ||.||  ..
Human  1001 PQGIRGDKGEPGEKGPR-GLPGLKGHNGLQGLPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGK 1064

  Fly  1244 DAFSQAESSIGDG--------QASASAQGKKNGGTAKTQVSGTYSSGGTFSASAMTSDADRAASA 1300
            |..:....::|..        |..|...|.. |......|||   .|..|........||:..||
Human  1065 DGRTGHPGTVGPAGIRGPQGHQGPAGPPGPP-GPPGPPGVSG---GGYDFGYDGDFYRADQPRSA 1125

  Fly  1301 QVTGNADGAVSQSQGSGGPAQSQAQVQVAKDGGTKASSQSGGIIQQSQSEVHAN------DKGGL 1359
            ......|..|..:..|   ..:|.:..:..:|..|..:::...::.|..|..:.      ::|..
Human  1126 PSLRPKDYEVDATLKS---LNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCT 1187

  Fly  1360 ADA----QSSGPGQTSSQAQ-------IGFRPGQEANPI----AANGGGQAS-SSSGTHSSQSSS 1408
            .||    .....|:|..:||       ..:|..::...:    ..|.|.|.. :..|..|.:.::
Human  1188 MDAIKVYCDFSTGETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMAT 1252

  Fly  1409 Q------IHGTSSFGVSYHGAAQSASGTKEQVATYREANRELFNTISQFGNNANAVTDRADAVYS 1467
            |      :...:|..::||        .|..:|...|....|...:...|:|...:....::.::
Human  1253 QLAFMRLLANYASQNITYH--------CKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFT 1309

  Fly  1468 GPALTDESDRVPEAQLKSTKPKEEAEQVVS-KLDHPEPVQYDDEDEADPDEYDEDEYYEEKPV 1529
            ...|.|..         |.|..|..:.::. |.:.|..:.:.|....|....|::.:.:..||
Human  1310 YTVLVDGC---------SKKTNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPV 1363

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence     Domain         Region       External ID  Identity
prc     NP_524658.2  FhaB           110..1468    CDD:442443   444/1429 (31%)
                     Glutenin_hmw   148..727     CDD:367362   207/602 (34%)
COL1A2  NP_000080.2  gly_rich_SclB  <90..>327    CDD:468478   95/257 (37%)
                     gly_rich_SclB  <304..>524   CDD:468478   90/267 (34%)
                     gly_rich_SclB  <472..>691   CDD:468478   80/255 (31%)
                     SPT5           669..>896    CDD:444063   84/252 (33%)
                     gly_rich_SclB  <829..>1083  CDD:468478   96/282 (34%)
                     COLFI          1131..1365   CDD:460199   43/253 (17%)
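
The Region values give start and end residue positions, and several carry a `<` or `>` prefix. The following is a minimal sketch for reading them, assuming the prefixes follow the common NCBI convention of marking a domain hit that is incomplete at that end; the page itself does not state this. The function name `parse_region` is hypothetical.

```python
import re

# Optional '<' before the start and optional '>' before the end coordinate.
REGION_RE = re.compile(r"(<?)(\d+)\.\.(>?)(\d+)")

def parse_region(region):
    """Split a region such as '<90..>327' into
    (start, end, open_start, open_end)."""
    match = REGION_RE.fullmatch(region.strip())
    if match is None:
        raise ValueError(f"unrecognized region: {region!r}")
    lt, start, gt, end = match.groups()
    return int(start), int(end), lt == "<", gt == ">"

for region in ["110..1468", "<90..>327", "669..>896"]:
    start, end, open_start, open_end = parse_region(region)
    span = end - start + 1  # inclusive residue count
    print(f"{region}: {span} residues, "
          f"open at start={open_start}, open at end={open_end}")
```

The span computed here counts residues in one sequence, so it need not match the denominator in the Identity column, which appears to count alignment columns for that domain.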
