DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment vkg and Col3a1

DIOPT Version :9

Sequence 1:NP_001260071.1 Gene:vkg / 33726 FlyBaseID:FBgn0016075 Length:1940 Species:Drosophila melanogaster
Sequence 2:NP_034060.2 Gene:Col3a1 / 12825 MGIID:88453 Length:1464 Species:Mus musculus


Alignment Length:1777 Identity:608/1777 - (34%)
Similarity:720/1777 - (40%) Gaps:525/1777 - (29%)


- Green bases have known domain annotations that are detailed below.


  Fly    32 ICNTTLCD---------------CKGIKGRMGAPGPIGVP---GLEGPAGDIGPPGRAGPLGEKG 78
            :|:..:||               |..|..:...|.|: :|   |.:||.||.||||..|..|:.|
Mouse    62 LCDDIICDEEPLDCPNPEIPFGECCAICPQPSTPAPV-LPDGHGPQGPKGDPGPPGIPGRNGDPG 125

  Fly    79 DVGEYGEQGEKGHRG-------------------DI--GPKGEMGYPGIMGKSGEPGTPGPRGID 122
            ..|:.|..|..|..|                   |:  |..|..||||..|..|.||.||..|..
Mouse   126 LPGQPGLPGPPGSPGICESCPTGGQNYSPQFDSYDVKSGVGGMGGYPGPAGPPGPPGPPGSSGHP 190

  Fly   123 GCDGRPGMQGPSGAPGQNGVRGPPGKPGQQGPPGEAGEGGINSK-GTKGNRGETGQPGGVGPPGF 186
            |..|.||.|||.|.|||.|..||||.||..||.|.||:.|.:.: |..|.||..|.||..||.|.
Mouse   191 GSPGSPGYQGPPGEPGQAGPAGPPGPPGALGPAGPAGKDGESGRPGRPGERGLPGPPGIKGPAGM 255

  Fly   187 DGDRGSKGDTGYAGLTGEKGDPGLPGPKGDTGAVSELPYSLIGPPGAKGEPGDSLSGVLKPDDTL 251
            .|..|.||..|:.|..||||:.|.||.||:.|.                 |||:           
Mouse   256 PGFPGMKGHRGFDGRNGEKGETGAPGLKGENGL-----------------PGDN----------- 292

  Fly   252 KGYKGYVGLQGDEGPQGPTGEQGAVGRNGLPGARGEIGGPGERGKPGKDGEPGRFGDKGMKGAPG 316
                      |..||.||.|..|..||.|||||.|..|..|.|   |.||:|         |.||
Mouse   293 ----------GAPGPMGPRGAPGERGRPGLPGAAGARGNDGAR---GSDGQP---------GPPG 335

  Fly   317 WTGADGLDGSPGERGEDGFTGMPGVQGGAGPPGIYDPSLTKSLPGPIGSQGDIGPPGEQGPPGLP 381
            ..|..|..||||.:||.|..|.||..|..|..|         .|||.|..|..|||   ||||..
Mouse   336 PPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRG---------EPGPQGHAGAQGPP---GPPGNN 388

  Fly   382 GKPGRRGPIGLAGQSGDPGLNGSRGPPGRSERGEAGDYGFIGPPGPQGPPGEAGLPGRYGLHGEP 446
            |.||.:|.:|.||..|.|||.|:||||                    ||.|..|:||..|..|||
Mouse   389 GSPGGKGEMGPAGIPGAPGLIGARGPP--------------------GPAGTNGIPGTRGPSGEP 433

  Fly   447 GQNVVGPKGEPGLNGQPGLEGYRGDRGEVGLPGDKGLPGEGYNIVGPPGSQGPPGFRGLPGDDGY 511
            |:|  |.|||||.         ||:|||.|.|             |.||.:|..|..|.||:.|.
Mouse   434 GKN--GAKGEPGA---------RGERGEAGSP-------------GIPGPKGEDGKDGSPGEPGA 474

  Fly   512 NGLRGLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYPGSHGNRGAIGLTGPRGVQGLQGNPGRAG 576
            |||.|..||:            ||.|.||..|..|.||..|..|..|..||.|.:|:.|.|||.|
Mouse   475 NGLPGAAGER------------GPSGFRGPAGPNGIPGEKGPPGERGGPGPAGPRGVAGEPGRDG 527

  Fly   577 HKGLPGPAGIPGEPGKVGAAGPDGKAIEVGSLRKGEIGDTGDSGHRGDTGDDGEKGRDGSDGSKG 641
            ..|.||..|:||.||     ||                  |:.|..|..|..||.||.|..|..|
Mouse   528 TPGGPGIRGMPGSPG-----GP------------------GNDGKPGPPGSQGESGRPGPPGPSG 569

  Fly   642 ERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPGRNATTPKVYLIGEPGYDGIKGERGDDGDTGFK 706
            .||:.|..            |.||..|.|||||:|...      |.||..|:.|..|.:|:||.:
Mouse   570 PRGQPGVM------------GFPGPKGNDGAPGKNGER------GGPGGPGLPGPAGKNGETGPQ 616

  Fly   707 GVKGEPNPGQIYDNTGEPGEDGYTGPKGVKGAKGEQGAIGLRGEIGDRGPAGEVIPGPVGAKGYP 771
            |..|...|      .|:.|:.|..||:|::|..|..|..|..|:.|:.||.||     |||.|.|
Mouse   617 GPPGPTGP------AGDKGDSGPPGPQGLQGIPGTGGPPGENGKPGEPGPKGE-----VGAPGAP 670

  Fly   772 GPTGDYGQQGAPGLPGRDGEPGLDGGIGYKGQRGVPGQEVIQGEIGPPGRSGIKGFPGDVGAPGQ 836
            |..||.|..|..|.||..|.||..||      .|.||.|..:|..||||..|..|.||..|.||:
Mouse   671 GGKGDSGAPGERGPPGTAGIPGARGG------AGPPGPEGGKGPAGPPGPPGASGSPGLQGMPGE 729

  Fly   837 YGLAGRPGPKGVKGEQGPDGAVGQTGLPGNKGQRGDF-LVGPPGPKGQPGRNGRQAPHGAKGQKG 900
            .|..|.|||||.|||  |.|| |..|:||..|.||.. .:|||||.|||                
Mouse   730 RGGPGSPGPKGEKGE--PGGA-GADGVPGKDGPRGPAGPIGPPGPAGQP---------------- 775

  Fly   901 EVGSLGQNGQNGAKGSIGFSGRRGLLGNAGLQGLPGSPGIPGLPGMIGEIGERGEIGYNGRQGDI 965
                 |..|:.|:.|..|.:|.||..|..|..|.||..|.||.||..||.|.:||.|..|.:|:.
Mouse   776 -----GDKGEGGSPGLPGIAGPRGGPGERGEHGPPGPAGFPGAPGQNGEPGAKGERGAPGEKGEG 835

  Fly   966 GPRGPNGEFGPKGLSGDDGPDGYPGANGLPGRKGETGNPGFPGRPGAKGVAAYSGIKGDDGESGL 1030
            ||.||   .||.|.||..||   ||..|:.|.:|..|.||..|.||.:      |:.|..|.:|.
Mouse   836 GPPGP---AGPTGSSGPAGP---PGPQGVKGERGSPGGPGTAGFPGGR------GLPGPPGNNGN 888

  Fly  1031 TGPIGYPGAPGAKGQRGPVGDSQPALDGVAGRKGEVGSPGPNGLPGRHGLKGQRGDRGLPGQQGR 1095
            .||.|..||||..|..||.|:|              ||||..|:.|..|..||.|::|.||.||.
Mouse   889 PGPPGPSGAPGKDGPPGPAGNS--------------GSPGNPGIAGPKGDAGQPGEKGPPGAQGP 939

  Fly  1096 PGEPGAKGLGGYPGRNGINGLKGATGFPGPQGPKGPQ---GESGVVGLDGRNGQIGDQGPRGLIG 1157
            ||.||..|:.|.   .|..||.|..|.|||:|..|||   ||||..|..|.||:.|..||:||.|
Mouse   940 PGSPGPLGIAGL---TGARGLAGPPGMPGPRGSPGPQGIKGESGKPGASGHNGERGPPGPQGLPG 1001

  Fly  1158 EQGEQGEQGDEGEVGIPGRLENLRDRSFYRGFTGDQGLQGERGEQGDMGPIGFIGPPGAKGERGD 1222
            :.|..||         |||             .|:.|..|:.|..|.         ||.||:||:
Mouse  1002 QPGTAGE---------PGR-------------DGNPGSDGQPGRDGS---------PGGKGDRGE 1035

  Fly  1223 IGYAGQLGFDGAEGLKGFQGDQGPRGPPGITLPAEKGDEGVAGLDGRAGRPGHFGQKGAPGPPGE 1287
                                    .|.|                 |..|.|||      |||||.
Mouse  1036 ------------------------NGSP-----------------GAPGAPGH------PGPPGP 1053

  Fly  1288 NGPNGAIGHRGPQIQGPPGPQGDVGFPGAPGHNGRHGLIGPKGELGDMGRQGERGESGYAIVGRQ 1352
            .||:|..|.||.  .||.||.      ||||..|..|..||:|..||.|..||||.:|..     
Mouse  1054 VGPSGKSGDRGE--TGPAGPS------GAPGPAGARGAPGPQGPRGDKGETGERGSNGIK----- 1105

  Fly  1353 GDIGDIGFQGEPGWDGAKGEQGYPGLPGKNGRVGAPGPRGPTGDAGWGGIDGMDGLVGPKGQPGV 1417
               |..||.|.|      |..|.||..|..|.:|:|||.||.|.            |||.|.|  
Mouse  1106 ---GHRGFPGNP------GPPGSPGAAGHQGAIGSPGPAGPRGP------------VGPHGPP-- 1147

  Fly  1418 TYSYSMARPGDRGEPGLDGFQGEEGDGGAPGLIGFQGQRGAVGYRGDQGEVGYTGADGPQGQRGD 1482
                                 |::|..|.||.||..|.||..|.||.:|..|:.|..||.|..|.
Mouse  1148 ---------------------GKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGA 1191

  Fly  1483 KGYMGLTGAPGLRGLPGPQGEPAPAPPAPKSRGFIFARHSQSVHVPQCP----ANTNLLWEGY-S 1542
            .|.....||..:.|:.|           .||.||       |.:....|    .||..:.... |
Mouse  1192 PGPCCGGGAAAIAGVGG-----------EKSGGF-------SPYYGDDPMDFKINTEEIMSSLKS 1238

  Fly  1543 LSGNVAASRAVGQDLGQSGSCMMRFTTMPYMLCDITNVCHFAQNNDDSLWLSTAEPMPMTMTPIQ 1607
            ::|.:.:.      :...||     ...|...|.....|| .:......|:.          |.|
Mouse  1239 VNGQIESL------ISPDGS-----RKNPARNCRDLKFCH-PELKSGEYWVD----------PNQ 1281

  Fly  1608 G--RDLMKYISRCVVCETTTRIIALHSQSMSIP------DCPGGWEEMWTG--------YSY--- 1653
            |  .|.:|     |.|...|....:::..|::|      |.....:.:|.|        :||   
Mouse  1282 GCKMDAIK-----VFCNMETGETCINASPMTVPRKHWWTDSGAEKKHVWFGESMNGGFQFSYGPP 1341

  Fly  1654 ------------FMSTL-----DNV---------------GGVGQNLVSPGSCLEEFRAQPVIEC 1686
                        |:..|     .|:               |.|.::|...||...||:|:     
Mouse  1342 DLPEDVVDVQLAFLRLLSSRASQNITYHCKNSIAYMDQASGNVKKSLKLMGSNEGEFKAE----- 1401

  Fly  1687 HGHGRCNYYDALASFWLTVIEE 1708
             |:.:..|         ||:|:
Mouse  1402 -GNSKFTY---------TVLED 1413

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
vkgNP_001260071.1 Collagen 60..119 CDD:189968 29/79 (37%)
Collagen 102..152 CDD:189968 29/49 (59%)
Collagen 277..334 CDD:189968 26/56 (46%)
Collagen 534..593 CDD:189968 29/58 (50%)
Collagen 814..871 CDD:189968 30/56 (54%)
Collagen 957..1014 CDD:189968 25/56 (45%)
Collagen 990..1049 CDD:189968 23/58 (40%)
Collagen 1070..1128 CDD:189968 26/57 (46%)
C4 1515..1624 CDD:128421 22/115 (19%)
C4 1625..1737 CDD:128421 24/133 (18%)
Col3a1NP_034060.2 VWC 33..89 CDD:278520 4/26 (15%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 97..1195 550/1462 (38%)
Nonhelical region (N-terminal) 155..169 3/13 (23%)
Triple-helical region 170..1195 528/1388 (38%)
Collagen 233..292 CDD:189968 28/75 (37%)
Collagen 278..331 CDD:189968 30/93 (32%)
Collagen 353..422 CDD:189968 38/100 (38%)
Collagen 413..472 CDD:189968 36/102 (35%)
Collagen 449..502 CDD:189968 28/77 (36%)
Collagen 1016..1095 CDD:189968 43/142 (30%)
Collagen 1064..1135 CDD:189968 36/92 (39%)
COLFI 1231..1463 CDD:279718 40/225 (18%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 00.000 Not matched by this tool.
eggNOG 1 0.900 - - E33208_3BAFZ
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 00.000 Not matched by this tool.
Phylome 00.000 Not matched by this tool.
RoundUp 00.000 Not matched by this tool.
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 1 1.000 - -
TreeFam 00.000 Not matched by this tool.
21.900

Return to query results.
Submit another query.