DRSC/TRiP Functional Genomics Resources



Protein Alignment vkg and Col7a1

DIOPT Version: 9

Sequence 1: NP_001260071.1 Gene: vkg / 33726 FlyBaseID: FBgn0016075 Length: 1940 Species: Drosophila melanogaster
Sequence 2: NP_001100328.2 Gene: Col7a1 / 301012 RGDID: 1311417 Length: 2944 Species: Rattus norvegicus


Alignment Length: 1665 Identity: 668/1665 (40%)
Similarity: 802/1665 (48%) Gaps: 328/1665 (19%)
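
The percentages in the summary can be recomputed from the raw counts. The sketch below is only an illustration; the variable names are made up here and are not part of DIOPT. It also shows that the reported gap figure (19%) matches truncation of about 19.7% rather than rounding, which is an observation from these numbers and not a documented rule.

```python
# Recompute the summary percentages from the counts shown above.
# All names here are illustrative assumptions, not DIOPT identifiers.

def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage."""
    return 100.0 * part / whole


ALIGNMENT_LENGTH = 1665
counts = {"identity": 668, "similarity": 802, "gaps": 328}

# Exact percentages, then the truncated integer form the page seems to print.
exact = {name: percent(n, ALIGNMENT_LENGTH) for name, n in counts.items()}
truncated = {name: int(value) for name, value in exact.items()}

for name, n in counts.items():
    print(f"{name}: {n}/{ALIGNMENT_LENGTH} = {exact[name]:.2f}% "
          f"(truncated: {truncated[name]}%)")
```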


- Green residues have known domain annotations, which are detailed below.


  Fly    42 GIKGRMGAPGPIGVPGL------------EGPAGDIGPPGRAGPLGEKGDVGEYGEQGEKGHRGD 94
            |:.||.|||||.|.||.            |||.|..|.||..|..|.||..|..|.:|::|.||.
  Rat  1276 GLPGRTGAPGPQGAPGSTQAKGERGFPGPEGPPGSPGLPGVPGSPGVKGSPGWSGPRGDRGERGP 1340

  Fly    95 IGPKGEMGYPGIMGKSGEPGTPGPRGIDGCDGRPGMQGPSGAPGQNGVRGPPGKP---------- 149
            .|||||.|.||.:...|.||.||.:|..|..|.||..||.|.||.   |||||.|          
  Rat  1341 QGPKGEPGEPGQVIGGGRPGLPGKKGDPGPSGPPGPHGPLGDPGP---RGPPGLPGTSVKGDKGD 1402

  Fly   150 -GQQGPPGEAGEGGINSKGTKGNRGETGQPGGVGPPGFDGDRGSKGDTGYAGLTGEKGDPGLPGP 213
             |::||||..  .|.:.:|:.|..|..|.||..||||..|::|.|||.       |.|.|||||.
  Rat  1403 RGERGPPGPG--TGASEQGSPGLPGLPGSPGPQGPPGRTGEKGEKGDC-------EDGGPGLPGQ 1458

  Fly   214 KGDTGAVSELPYSLIGPPGAKGEPGDSLSGVLKPDDTLKGYKGYVGLQGDEGPQGPTGEQGAVGR 278
            .|       :|    |.||.:|.||  ::|. |.|..|.|..|..|.:|:.||.||.|.||..|.
  Rat  1459 PG-------VP----GEPGLRGAPG--VTGP-KGDRGLTGTPGEPGEKGERGPPGPVGPQGLPGA 1509

  Fly   279 NGLPGARGEIGGPGERGKPGKDGEPGRFGD----------KGMKGAPGWTGADGLDGSPGERGED 333
            .|.||..|..|.||..|:.|:.|||||.||          ||.||..|..|..|..|..||:|..
  Rat  1510 AGRPGVEGPEGPPGPPGRRGEKGEPGRPGDPALGPGGAGAKGEKGDAGLPGPRGASGIKGEQGAP 1574

  Fly   334 GFTGMPGVQGGAGPPGIYDPSLTKSLPGPIGSQGDIGPPGEQGPPGLPGKPGRRGPIGLAGQSGD 398
            |. .:||..|..|.||...|.   .|.|..|..||.|||||:|.||.||.||..||.|..|::|:
  Rat  1575 GL-ALPGDPGPKGDPGDRGPI---GLTGRAGPTGDSGPPGEKGEPGRPGSPGPVGPRGRDGEAGE 1635

  Fly   399 PGLNGSRGPPGRSERGEAGDYGFIGPPGPQGPPGEAGLPGRYGLHGEPGQNVVGPKGEPGLNGQP 463
            .|..|:.|.||..  |:||:.|..|.|||:||.||.      |..|:|        ||.|.||.|
  Rat  1636 KGDEGAPGEPGLP--GKAGERGLRGAPGPRGPVGEK------GNEGDP--------GEDGRNGTP 1684

  Fly   464 GLEGYRGDRGEVGLPG-------------DKGLPGEGYNIVGPPGSQGPPGFRGLPGDDGYNGLR 515
            |..|.:|||||.|.||             |||.||:    .||.|.:|.||..|..|:.|.:|||
  Rat  1685 GPSGPKGDRGEPGPPGLPGRLVDAALESRDKGEPGQ----EGPRGPKGDPGPPGASGERGIDGLR 1745

  Fly   516 GLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYPGSHGNR------GAIGLTGPRGVQGLQGNPGR 574
            |.||.:|         :.|.|||.|.:||.|.||..|..      ||.|..|..|..|..|:|||
  Rat  1746 GPPGPQG---------DPGVRGPAGDKGDRGSPGLDGRNGLDGKPGAPGPPGLHGASGKAGDPGR 1801

  Fly   575 AGHKGL---------PGPAGIPGEPGKVGAAGPDGKAIEVGSLRKGEIGDTGDSGHRGDTGDDGE 630
            .|..||         |||.|:||:||..|..|.:||        .||.||.|:.|.:|:.||.|.
  Rat  1802 DGLPGLRGEHGPPGPPGPPGVPGKPGDDGKPGLNGK--------NGEPGDPGEDGRKGEKGDSGV 1858

  Fly   631 KGRDGSDGSKGERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPGRNATTPKVYLIGEPGYDGIKG 695
            .||:|.||.|||||..|..|..|..|..|:.|.||: |..|.||  .|.||    |:.|..|.||
  Rat  1859 PGREGPDGPKGERGAPGNPGLQGPPGLPGQVGPPGQ-GFPGVPG--VTGPK----GDRGETGSKG 1916

  Fly   696 ERGDDGDTGFKGVKGE-PNP--------------GQIYDNTGE---------------PGEDGYT 730
            |:|..|:.|.:|..|. ||.              .:|.|..||               .|:.|..
  Rat  1917 EQGLPGERGLRGEPGSLPNAERFLETAGIKVSALREIVDTWGESSGSFLLVPERRQGPKGDPGDP 1981

  Fly   731 GPKGVKGAKGEQGAIGLRGEIGDRGPAGEVIPG-PVGAKGYPGPTGDYGQQGAPGLPGRDGEPGL 794
            ||.|.:|:.|..|..||:||.||.||.|.  || .:|.:|.|||:|..|:.|.||:||..|..|.
  Rat  1982 GPPGKEGSIGLPGERGLKGERGDPGPQGP--PGLALGERGPPGPSGLAGEPGKPGIPGLPGRAGA 2044

  Fly   795 DGGIGYKGQRGVPGQEVIQGEIGPPGRSGIKGFPGDVGAPGQY--------GLA---GRPGPKGV 848
            .|..|..|:||..|:   :||.|..||.|..|.||..|.||..        |.|   |.||.||.
  Rat  2045 AGEAGRPGERGERGE---KGERGEQGRDGHPGLPGPPGPPGPKVAIEELGPGPAREQGPPGLKGA 2106

  Fly   849 KGEQGPDGAVGQTGLPGNKGQRGDFLVGPPGPKGQPGRNGRQAPHGAKGQKGEVGSLGQNGQNGA 913
            |||.|.|      |:||.||.|     |.||.||..|..|::.|.|..|..||      .|.:|.
  Rat  2107 KGEPGSD------GVPGPKGDR-----GVPGIKGDAGEPGKRGPDGNPGLPGE------RGVSGP 2154

  Fly   914 KGSIGFSGRRGLLGNAGLQGLPGSPGIPGLPGMIGEIGERGEIGYNGRQGDIGP--RGPNGEFGP 976
            :|..|..|.||..|..|..   |.||.||.||:.|..|.:|..|..|..|:.||  ||..|..|.
  Rat  2155 EGKPGLQGPRGTPGPVGSH---GDPGPPGAPGLAGPAGPQGPSGLKGEPGETGPPGRGLPGPTGA 2216

  Fly   977 KGLSGDDGPDGY---PGANGLPGRKGETGNPGFPGRPGAK---------GVAAYSGIKGDDGESG 1029
            .||.|..||.|.   .|:.||||:.||||.||.|||.|:.         ||....|:.|..|..|
  Rat  2217 VGLPGPPGPSGLVGPQGSPGLPGQVGETGKPGPPGRDGSSGKDGERGGPGVPGLPGLPGPVGPKG 2281

  Fly  1030 LTGPIGYP-----GAPGAKGQRGPVGDSQPALDGVAGRKGEVGSPGPNGLPGRHGLKGQRGDRGL 1089
            ..||:|.|     |.|||||::|..||...||.|..|.||:.|.|||.|..|..|..|:.||.|.
  Rat  2282 EPGPVGAPGQVVVGPPGAKGEKGAPGDLAGALLGEPGAKGDRGLPGPRGEKGEAGHAGEPGDPGE 2346

  Fly  1090 PGQQGRP------GEPGAKGLGGYPGRNGINGLK------------GATGFPGPQGPK------G 1130
            .||:|.|      ||||. |:.|.||.:|..|:|            |..||||..||:      |
  Rat  2347 DGQKGAPGLKGLKGEPGI-GVQGPPGPSGPPGMKGDLGPPGAPGAPGIVGFPGQPGPRGETGQPG 2410

  Fly  1131 PQGESGVVGLDGRN---GQIGDQGPRGLIGEQGEQGEQGDEGE--VGIPGRLENLRDRSFYRGFT 1190
            |.||.|:.|..||.   |.:|..||.|.:|..|..|.:||:|:  .|:||.          ||..
  Rat  2411 PVGERGLAGPPGREGAPGPLGPPGPPGSVGAPGASGFKGDKGDSGAGLPGP----------RGER 2465

  Fly  1191 GDQGLQGERGEQGDMGPIGFIGPPGAKGERGDIGYAGQLGFDGAEGLKGFQGD----QGPRGPPG 1251
            |:.||:||.|..|..||.|.:||||::|||      |:.|..||.||||.:||    :||.||.|
  Rat  2466 GEPGLRGEDGHPGQEGPRGLMGPPGSRGER------GEKGDPGAAGLKGDKGDSAVIEGPAGPRG 2524

  Fly  1252 ITLPAEKGDEGVAGLD------GRAGRPGHFGQKGAPGPPGENGPNGAIGHRGPQIQGPPGPQGD 1310
              ...:.|:.|..|:|      |.:|.||..|.||.||..|..|..|..|..||  :|.||..|.
  Rat  2525 --AKGDMGERGPRGIDGDQGPRGESGDPGDKGSKGEPGDKGSAGSTGVRGLTGP--KGEPGAAGI 2585

  Fly  1311 VGFPGAPGHNGRHGLIGPKGELGDMGRQGERGESGYAIV----GRQGDIGDIGFQGEPGWDGAKG 1371
            .|.|||||.:|..|..|.||::|..|.:|.:||.|....    |.:||.|:.||.|.||..|.||
  Rat  2586 PGEPGAPGKDGAPGFRGDKGDIGFTGPRGLKGERGVKGTCGRDGEKGDKGEAGFPGRPGLSGKKG 2650

  Fly  1372 EQGYPGLPGKNGRVGAPGPRGPTGDAGWGGIDGMDGLVGPKGQPGVTYSYSMARPGDRGEPGLDG 1436
            :.|.||:||::|..|..|..||.||.|:      ||..||||..|        ..|:||.||:.|
  Rat  2651 DMGDPGIPGQSGAPGKEGLIGPKGDRGF------DGQSGPKGDQG--------EKGERGPPGVGG 2701

  Fly  1437 FQGEEGDGGAPGLIGFQGQRGAVGYRGDQGEVGYTGADGPQGQRGDKGYMG--LTGAPGLRGLPG 1499
            |.|..|:.|:.            |..|..|.:|..|.:|.|||:|::|..|  :.||||..|.||
  Rat  2702 FPGPRGNDGSS------------GPPGPPGSIGPKGPEGLQGQKGERGPPGESVVGAPGAPGTPG 2754

  Fly  1500 PQGE---PAPAPPAPKS----------RGFIFARHSQSVH 1526
            .:||   |.||.|..:.          |||:  |...|.|
  Rat  2755 ERGEQGRPGPAGPRGEKGEAALTEDDIRGFV--RQEMSQH 2792

Known Domains:


Indicated by green bases in alignment.

Gene Sequence Domain Region External ID Identity
vkg NP_001260071.1 Collagen 60..119 CDD:189968 29/58 (50%)
Collagen 102..152 CDD:189968 25/60 (42%)
Collagen 277..334 CDD:189968 28/66 (42%)
Collagen 534..593 CDD:189968 32/73 (44%)
Collagen 814..871 CDD:189968 29/67 (43%)
Collagen 957..1014 CDD:189968 30/70 (43%)
Collagen 990..1049 CDD:189968 32/72 (44%)
Collagen 1070..1128 CDD:189968 29/75 (39%)
C4 1515..1624 CDD:128421 5/12 (42%)
C4 1625..1737 CDD:128421
Col7a1 NP_001100328.2 vWA_collagen_alphaI-XII-like 38..202 CDD:238759
fn3 236..318 CDD:394996
fn3 334..406 CDD:394996
fn3 427..488 CDD:394996
FN3 510..589 CDD:238020
FN3 599..681 CDD:238020
FN3 688..772 CDD:238020
fn3 778..856 CDD:394996
fn3 868..935 CDD:394996
FN3 959..1045 CDD:238020
VWA 1055..1223 CDD:395045
PRK07764 <1509..1714 CDD:236090 93/224 (42%)
Collagen 1580..1635 CDD:396114 27/57 (47%)
Collagen 1613..1668 CDD:396114 27/56 (48%)
Collagen 1839..1894 CDD:396114 27/54 (50%)
Collagen 1878..1932 CDD:396114 24/60 (40%)
PHA03169 2053..>2202 CDD:223003 66/171 (39%)
Collagen 2102..2158 CDD:396114 29/72 (40%)
Collagen 2315..2363 CDD:396114 21/47 (45%)
PRK12678 2403..>2636 CDD:237171 99/252 (39%)
Collagen 2457..2511 CDD:396114 30/69 (43%)
Collagen 2533..2589 CDD:396114 23/57 (40%)
Collagen 2611..2666 CDD:396114 23/54 (43%)
Collagen 2728..2774 CDD:396114 20/45 (44%)
KU 2877..2932 CDD:238057
Blue background indicates that the domain is not in the aligned region.
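
The domain rows follow a regular pattern: domain name, residue range, CDD identifier, and an optional identity fraction. A rough parsing sketch is given below. The function name and returned keys are hypothetical, and reading `<` and `>` as markers for a partial domain hit is an assumption. The parser expects only the domain fields, so the gene and sequence columns on the first row of each gene need to be stripped first.

```python
import re

# Pattern for rows such as "Collagen 60..119 CDD:189968 29/58 (50%)".
# '<' before the start or '>' before the end is kept as a partial flag.
DOMAIN_ROW = re.compile(
    r"^(?P<name>\S+)\s+"
    r"(?P<partial_start><)?(?P<start>\d+)\.\.(?P<partial_end>>)?(?P<end>\d+)\s+"
    r"CDD:(?P<cdd>\d+)"
    r"(?:\s+(?P<matches>\d+)/(?P<columns>\d+)\s+\((?P<pct>\d+)%\))?$"
)


def parse_domain_row(line: str) -> dict:
    """Parse one domain row into a dictionary (hypothetical helper)."""
    m = DOMAIN_ROW.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised domain row: {line!r}")
    identity = None
    if m["matches"] is not None:
        identity = (int(m["matches"]), int(m["columns"]))
    return {
        "name": m["name"],
        "start": int(m["start"]),
        "end": int(m["end"]),
        "partial": bool(m["partial_start"] or m["partial_end"]),
        "cdd": int(m["cdd"]),
        "identity": identity,  # None when the row has no identity column
    }


print(parse_domain_row("Collagen 60..119 CDD:189968 29/58 (50%)"))
print(parse_domain_row("PHA03169 2053..>2202 CDD:223003 66/171 (39%)"))
print(parse_domain_row("KU 2877..2932 CDD:238057"))
```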


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 0 0.000 Not matched by this tool.
Domainoid 0 0.000 Not matched by this tool.
eggNOG 1 0.900 - - E33208_3BAFZ
Hieranoid 0 0.000 Not matched by this tool.
Homologene 0 0.000 Not matched by this tool.
Inparanoid 0 0.000 Not matched by this tool.
OMA 0 0.000 Not matched by this tool.
OrthoDB 0 0.000 Not matched by this tool.
OrthoFinder 0 0.000 Not matched by this tool.
OrthoInspector 0 0.000 Not matched by this tool.
orthoMCL 0 0.000 Not matched by this tool.
Panther 0 0.000 Not matched by this tool.
Phylome 1 0.910 - -
SonicParanoid 0 0.000 Not matched by this tool.
SwiftOrtho 1 1.000 - -
TreeFam 0 0.000 Not matched by this tool.
Total 3 2.810
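
The last line of the table is consistent with a simple score total of 3 and a weighted score total of 2.810, that is, the sums over the three tools that reported a match. The short check below reproduces those totals. The dictionary is transcribed from the table, and `Decimal` is used only to avoid floating-point noise in the sum.

```python
from decimal import Decimal

# (simple score, weighted score) for the tools that reported a match.
# All other tools in the table score 0 and 0.000.
matched = {
    "eggNOG": (1, Decimal("0.900")),
    "Phylome": (1, Decimal("0.910")),
    "SwiftOrtho": (1, Decimal("1.000")),
}

simple_total = sum(simple for simple, _ in matched.values())
weighted_total = sum((weighted for _, weighted in matched.values()), Decimal("0"))

print(simple_total, weighted_total)
```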
