DRSC/TRiP Functional Genomics Resources


Protein Alignment vkg and Col3a1

DIOPT Version: 9

Sequence 1: NP_001260071.1  Gene: vkg / 33726  FlyBaseID: FBgn0016075  Length: 1940  Species: Drosophila melanogaster
Sequence 2: NP_114474.1  Gene: Col3a1 / 84032  RGDID: 71029  Length: 1463  Species: Rattus norvegicus


Alignment Length: 1798  Identity: 620/1798 (34%)
Similarity: 737/1798 (40%)  Gaps: 518/1798 (28%)
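
The percentages in the summary above can be re-derived from the raw counts. The sketch below is only illustrative: the function name `percent_truncated` is hypothetical, and the use of truncation rather than rounding is an assumption inferred from these three values (737/1798 is roughly 40.99% but is reported as 40%), not a documented DIOPT rule.

```python
def percent_truncated(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage, truncated (not rounded)."""
    return 100 * count // total


# Counts copied from the alignment summary.
ALIGNMENT_LENGTH = 1798
summary = {"Identity": 620, "Similarity": 737, "Gaps": 518}

for label, count in summary.items():
    pct = percent_truncated(count, ALIGNMENT_LENGTH)
    print(f"{label}: {count}/{ALIGNMENT_LENGTH} ({pct}%)")
```

All three fractions share the alignment length (1798 columns) as denominator, not the length of either input sequence.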


- Residues shown in green have known domain annotations, which are detailed below.


  Fly    13 LGVVYLLGSLVSVTLADGKICNTTLCD-----CKGIKGRMGAPGPIGVPGLEGPAGD---IGP-P 68
            ||..||..|..|..:...:.|...:||     |..|   |....|:..|..|.|.|:   |.| |
  Rat    31 LGCNYLGQSYESRDVWKPEPCQICVCDSGSVLCDDI---MCDDEPLDCPNPEIPFGECCAICPQP 92

  Fly    69 GRAGPLGEKGDVGEYGEQGEKGHRGDIGPKGEMGYPGIMGKSGEPGTPGPRGIDGCDGRPGM--- 130
            ....|:...|:..:             ||||:.|.|||.|::|:||.||..|:.|..|.||:   
  Rat    93 STPAPVIPDGNRPQ-------------GPKGDPGPPGIPGRNGDPGLPGQPGLPGPPGSPGICES 144

  Fly   131 -------QGP--------SGAPGQNGVRGPPGKPGQQGPPGEAGEGGINSKGTKGNRGETGQPGG 180
                   ..|        ||..|..|..||.|.||..||||.:|..|  |.|:.|.:|..|:||.
  Rat   145 CPTGGQNYSPQFDSYDVKSGVGGMGGYPGPAGPPGPPGPPGSSGHPG--SPGSPGYQGPPGEPGQ 207

  Fly   181 VGPPGFDGDRGSKGDTGYAGLTGEKGDPGLPGPKGDTGAVSELPYSLIGPPGAKGEPGDSLSGVL 245
            .||.|..|..|:.|.:|.||..||.|.||.||.:|       ||    ||||.||..|       
  Rat   208 AGPAGPPGPPGAIGPSGPAGKDGESGRPGRPGERG-------LP----GPPGIKGPAG------- 254

  Fly   246 KPDDTLKGYKGYVGLQGDEGPQGPTGEQGAVGRNGLPGARGEIGGPGERGKPGKDGEPGRFGDKG 310
                 :.|:.|..|.:|.:|..|..||.||      ||.:||.|.||:.|.|         |..|
  Rat   255 -----IPGFPGMKGHRGFDGRNGEKGETGA------PGLKGENGLPGDNGAP---------GPMG 299

  Fly   311 MKGAPGWTGADGLDGSPGERGEDGFTGMPGVQGGAGPPGIYDPSLTKSLPGPIGSQGDIGP---P 372
            .:||||..|..||.|:.|.||.||..|..|..|..||||      |...||..|::|::||   |
  Rat   300 PRGAPGERGRPGLPGAAGARGNDGARGSDGQPGPPGPPG------TAGFPGSPGAKGEVGPAGSP 358

  Fly   373 GEQGPPGLPGKPGRRGPIGLAGQSGDPGLNGSRGPPGRSERGEAGDYGFIGPPGPQGPPGEAGLP 437
            |..|.||..|:||.:|..|..|..|.||.|||  |.|:.|.|.||..|..|..|.:||||.|   
  Rat   359 GSNGSPGQRGEPGPQGHAGAQGPPGPPGNNGS--PGGKGEMGPAGIPGAPGLLGARGPPGPA--- 418

  Fly   438 GRYGLHGEPGQNVVGPKGEPGLNGQPGLEGYRGDRGEVGLPGDKGLPGEGYNIVGPPGSQGPPGF 502
               |.:|.|||.  ||.||||.||..|..|.||:|||.|.|             |.||.:|..|.
  Rat   419 ---GANGAPGQR--GPSGEPGKNGAKGEPGARGERGEAGSP-------------GIPGPKGEDGK 465

  Fly   503 RGLPGDDGYNGLRGLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYPGSHGNRGAIGLTGPRGVQG 567
            .|.||:.|.||:.|.|||:|.         .|.|||.|..|..|..|..|.||..|..|||||  
  Rat   466 DGSPGEPGANGVPGNPGERGA---------PGFRGPAGPNGAPGEKGPAGERGGPGPAGPRGV-- 519

  Fly   568 LQGNPGRAGHKGLPGPAGIPGEPGKVGAAGPDGKAIEVGSLRKGEIGDTGDSGHRGDTGDDGEKG 632
             .|.|||.|..|.||..|:||.||     ||                  |:.|..|..|..||.|
  Rat   520 -AGEPGRDGTPGGPGIRGMPGSPG-----GP------------------GNDGKPGPPGSQGESG 560

  Fly   633 RDGSDGSKGERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPGRNATTPKVYLIGEPGYDGIKGER 697
            |.|..|..|.||:.|..            |.||..|.|||||:|...      |.||..|:.|..
  Rat   561 RPGPPGPSGPRGQPGVM------------GFPGPKGNDGAPGKNGER------GGPGGPGLPGPA 607

  Fly   698 GDDGDTGFKGVKGEPNPGQIYDNTGEPGEDGYTGPKGVKGAKGEQGAIGLRGEIGDRGPAGEVIP 762
            |.:|:|   |.:|.|.|      ||.||:.|..||.|.:|.   ||..|..|..|:.|..||  |
  Rat   608 GKNGET---GPQGPPGP------TGAPGDKGDAGPPGPQGL---QGIPGTSGPPGENGKPGE--P 658

  Fly   763 GPVGAKGYPGPTGDYGQQGAP---GLPGRDGEPGLDGGIGYKGQRGVPGQEVIQGEIGPPGRSGI 824
            ||.|..|.||..|..|..|||   |.||..|.|||.||      .|.||.|..:|..||||..|.
  Rat   659 GPKGEAGAPGVPGGKGDSGAPGERGPPGTAGTPGLRGG------AGPPGPEGGKGPAGPPGPPGT 717

  Fly   825 KGFPGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPGNKGQRGDF-LVGPPGPKGQPGRNG 888
            .|.||..|.||:.|..|.|||||.|||  |.|| |..|:||..|.||.. .:|||||.|||    
  Rat   718 SGPPGLQGMPGERGGPGSPGPKGEKGE--PGGA-GADGVPGKDGPRGPAGPIGPPGPAGQP---- 775

  Fly   889 RQAPHGAKGQKGEVGSLGQNGQNGAKGSIGFSGRRGLLGNAGLQGLPGSPGIPGLPGMIGEIGER 953
                             |..|:.||.|..|.:|.||..|..|..|.||..|.||.||..||.|.:
  Rat   776 -----------------GDKGEGGAPGLPGIAGPRGGPGERGEHGPPGPAGFPGAPGQNGEPGAK 823

  Fly   954 GEIGYNGRQGDIGPRGPNGEFGPKGLSGDDGPDGYPGANGLPGRKGETGNPGFPGRPGAKGVAAY 1018
            ||   .|..|:.|..||.|..||.|.||..||   ||..|:.|.:|..|.||..|.||.:     
  Rat   824 GE---RGAPGEKGEGGPPGAAGPPGGSGPAGP---PGPQGVKGERGSPGGPGAAGFPGGR----- 877

  Fly  1019 SGIKGDDGESGLTGPIGYPGAPGAKGQRGPVGDSQPALDGVAGRKGEVGSPGPNGLPGRHGLKGQ 1083
             |:.|..|.:|..||.|..||||..|..||.|:|              ||||..|:.|..|..||
  Rat   878 -GLPGPPGNNGNPGPPGPSGAPGKDGPPGPAGNS--------------GSPGNPGVAGPKGDAGQ 927

  Fly  1084 RGDRGLPGQQGRPGEPGAKGLGGYPGRNGINGLKGATGFPGPQGPKGPQ---GESGVVGLDGRNG 1145
            .|::|.||.||.||.||..|:.|.   .|..||.|..|.|||:|..|||   ||||..|..|.||
  Rat   928 PGEKGPPGAQGPPGSPGPLGIAGL---TGARGLAGPPGMPGPRGSPGPQGIKGESGKPGASGHNG 989

  Fly  1146 QIGDQGPRGLIGEQGEQGEQGDEGEVGIPGRLENLRDRSFYRGFTGDQGLQGERGEQGDMGPIGF 1210
            :.|..||:||.|:.|..||         |||             .|:.|..|:.|..|.      
  Rat   990 ERGPPGPQGLPGQPGTAGE---------PGR-------------DGNPGSDGQPGRDGS------ 1026

  Fly  1211 IGPPGAKGERGDIGYAGQLGFDGAEGLKGFQGDQGPRGPPGITLPAEKGDEGVAGLDGRAGRPGH 1275
               ||.||:||:                        .|.|                 |..|.|||
  Rat  1027 ---PGGKGDRGE------------------------NGSP-----------------GAPGAPGH 1047

  Fly  1276 FGQKGAPGPPGENGPNGAIGHRGPQIQGPPGPQGDVGFPGAPGHNGRHGLIGPKGELGDMGRQGE 1340
                  |||||..||:|..|.||.  .||.||.      ||||..|..|..||:|..||.|..||
  Rat  1048 ------PGPPGPVGPSGKNGDRGE--TGPAGPS------GAPGPAGARGAPGPQGPRGDKGETGE 1098

  Fly  1341 RGESGYAIVGRQGDIGDIGFQGEPGWDGAKGEQGYPGLPGKNGRVGAPGPRGPTGDAGWGGIDGM 1405
            ||.:|..        |..||.|.|      |..|.||..|..|.||:|||.||.|.         
  Rat  1099 RGSNGIK--------GHRGFPGNP------GPPGSPGAAGHQGAVGSPGPAGPRGP--------- 1140

  Fly  1406 DGLVGPKGQPGVTYSYSMARPGDRGEPGLDGFQGEEGDGGAPGLIGFQGQRGAVGYRGDQGEVGY 1470
               |||.|.|                       |::|..|.||.|                    
  Rat  1141 ---VGPHGPP-----------------------GKDGSSGHPGPI-------------------- 1159

  Fly  1471 TGADGPQGQRGDKGYMGLTGAPGLRGLPGPQGEPAP---------APPAPKSRGFIFARHSQSVH 1526
             |..||:|.||::|..|..|.||..|.|||.|.|.|         .....||.||       |.:
  Rat  1160 -GPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGGGAAIAGVGGEKSGGF-------SPY 1216

  Fly  1527 VPQCP----ANTNLLWEGY-SLSGNVAASRAVGQDLGQSGSCMMRFTTMPYMLCDITNVCHFAQN 1586
            ....|    .||..:.... |::|.:.:.      :...||     ...|...|.....|| .:.
  Rat  1217 YGDDPMDFKINTEEIMSSLKSVNGQIESL------ISPDGS-----RKNPARNCRDLKFCH-PEL 1269

  Fly  1587 NDDSLWLSTAEPMPMTMTPIQG--RDLMKYISRCVVCETTTRIIALHSQSMSIP------DCPGG 1643
            .....|:.          |.||  .|.:|     |.|...|....:::..|::|      |....
  Rat  1270 KSGEYWVD----------PNQGCKMDAIK-----VFCNMETGETCINASPMTVPRKHWWTDAGAE 1319

  Fly  1644 WEEMWTG--------YSY---------------FMSTL-----DNV---------------GGVG 1665
            .:.:|.|        :||               |:..|     .|:               |.|.
  Rat  1320 KKHVWFGESMNGGFQFSYGNPDLPEDVLDVQLAFLRLLSSRASQNITYHCKNSIAYMDQANGNVK 1384

  Fly  1666 QNLVSPGSCLEEFRAQPVIECHGHGRCNYYDALASFWLTVIEE 1708
            ::|...||...||:|:      |:.:..|         ||:|:
  Rat  1385 KSLKLMGSNEGEFKAE------GNSKFTY---------TVLED 1412
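
Identity and gap counts for any stretch of the alignment can be checked by comparing the Fly and Rat rows column by column. The sketch below uses the first 32 columns of the first block; `column_stats` is a hypothetical helper, and reading `|` on the middle row as "identical residue" is an assumption, since the page does not define the match-line symbols.

```python
def column_stats(seq_a: str, seq_b: str) -> dict:
    """Count aligned columns, identical residues, and gap columns."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned rows must have the same number of columns")
    pairs = list(zip(seq_a, seq_b))
    identical = sum(1 for a, b in pairs if a == b and a != "-")
    gaps = sum(1 for a, b in pairs if a == "-" or b == "-")
    return {"columns": len(pairs), "identical": identical, "gaps": gaps}


# First 32 columns of the first alignment block (Fly 13.., Rat 31..).
fly = "LGVVYLLGSLVSVTLADGKICNTTLCD-----"
rat = "LGCNYLGQSYESRDVWKPEPCQICVCDSGSVL"
match = "||..||..|..|..:...:.|...:||     "

stats = column_stats(fly, rat)
print(stats)
# Under the assumption above, the '|' count should equal the identical count.
print(match.count("|") == stats["identical"])
```

Summing these per-block counts over all blocks would give the numerators reported in the alignment summary, provided the rows are copied without losing any columns.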

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain                                             Region       External ID  Identity
vkg     NP_001260071.1  Collagen                                           60..119      CDD:189968   21/62 (34%)
                        Collagen                                           102..152     CDD:189968   25/67 (37%)
                        Collagen                                           277..334     CDD:189968   22/56 (39%)
                        Collagen                                           534..593     CDD:189968   30/58 (52%)
                        Collagen                                           814..871     CDD:189968   30/56 (54%)
                        Collagen                                           957..1014    CDD:189968   24/56 (43%)
                        Collagen                                           990..1049    CDD:189968   23/58 (40%)
                        Collagen                                           1070..1128   CDD:189968   26/57 (46%)
                        C4                                                 1515..1624   CDD:128421   22/115 (19%)
                        C4                                                 1625..1737   CDD:128421   24/133 (18%)
Col3a1  NP_114474.1     VWC                                                33..89       CDD:278520   15/58 (26%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  97..1195                  551/1461 (38%)
                        Nonhelical region (N-terminal)                     155..169                  3/13 (23%)
                        Triple-helical region                              170..1195                 526/1375 (38%)
                        Collagen                                           233..292     CDD:189968   32/87 (37%)
                        Collagen                                           278..331     CDD:189968   28/67 (42%)
                        Collagen                                           353..424     CDD:189968   35/78 (45%)
                        Collagen                                           413..472     CDD:189968   35/79 (44%)
                        Collagen                                           452..505     CDD:189968   26/74 (35%)
                        Collagen                                           629..686     CDD:189968   26/61 (43%)
                        Collagen                                           <1061..1103  CDD:189968   23/49 (47%)
                        COLFI                                              1230..1462   CDD:279718   40/225 (18%)


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result Score, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       1             1.000           278  1.000  Domainoid score  I1631
eggNOG          1             0.900           -  -  E33208_3BAFZ
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           -  -
TreeFam         0             0.000           Not matched by this tool.
Total           3             2.900
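
The final row of the tool table is consistent with column sums of the per-tool scores: a simple score of 3 and a weighted score of 2.900. A minimal sketch of that check follows. The variable names are hypothetical, only the three matching tools are listed because every other tool contributes 0 and 0.000, and treating the last row as a plain sum is an inference from the numbers rather than something the page states.

```python
# (tool, simple score, weighted score) for the tools that matched this pair.
matched_tools = [
    ("Domainoid", 1, 1.000),
    ("eggNOG", 1, 0.900),
    ("SwiftOrtho", 1, 1.000),
]

simple_total = sum(simple for _, simple, _ in matched_tools)
weighted_total = sum(weighted for _, _, weighted in matched_tools)

print(f"Simple score total:   {simple_total}")
print(f"Weighted score total: {weighted_total:.3f}")
```

On this reading, 3 of the 16 tools listed support the vkg / Col3a1 pairing.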
