DRSC/TRiP Functional Genomics Resources


Protein Alignment vkg and Col27a1

DIOPT Version: 9

Sequence 1: NP_001260071.1  Gene: vkg / 33726  FlyBaseID: FBgn0016075  Length: 1940  Species: Drosophila melanogaster
Sequence 2: NP_942042.1  Gene: Col27a1 / 298101  RGDID: 735115  Length: 1855  Species: Rattus norvegicus


Alignment Length: 1365  Identity: 508/1365 (37%)
Similarity: 613/1365 (44%)  Gaps: 374/1365 (27%)
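
For readers who want to re-derive these summary counts from the alignment itself, here is a minimal Python sketch. It assumes the three-line block layout used in the alignment (a sequence line, a match line, a sequence line), with the species label and coordinates already stripped by the caller. It also assumes that `|` marks an identical pair, `:` a similar pair, and `-` a gap, and that similarity is counted as identical plus similar pairs. The page does not state these conventions, and the function name `tally_block` is hypothetical.

```python
def tally_block(seq_a: str, match: str, seq_b: str) -> dict:
    """Count identical, similar, and gapped columns in one alignment block.

    Assumed conventions (not stated on the page): '|' = identical pair,
    ':' = similar pair, '-' in either sequence = gap column.
    """
    if not (len(seq_a) == len(match) == len(seq_b)):
        raise ValueError("all three lines must have the same length")
    identical = match.count("|")
    # Similarity is taken here as identical pairs plus ':' pairs (an assumption).
    similar = identical + match.count(":")
    gaps = sum(1 for a, b in zip(seq_a, seq_b) if a == "-" or b == "-")
    return {
        "length": len(seq_a),
        "identity": identical,
        "similarity": similar,
        "gaps": gaps,
    }


# Toy block for illustration only; it is not taken from the page.
print(tally_block("GIKG-R", "|.|| :", "GSKGDK"))
```

If the assumed conventions hold, summing these counts over every block of the alignment should reproduce the figures reported in the summary.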


- Green bases have known domain annotations that are detailed below.


  Fly    42 GIKGRMGAPGPIGVPGLEGPAGDIGPPGRAGPLGEKGDVGEYGEQGEKGHRG------DIGPKGE 100
            |.||..|.|||.|:|||.|..|..||.|..||.|..|..|..|.:|:||..|      ..|.||.
  Rat   622 GSKGDCGLPGPPGLPGLPGSPGPRGPRGPPGPFGNPGLPGPPGAKGQKGDPGLSPGQAHDGAKGN 686

  Fly   101 MGYPGIMGKSGEPGTPGPRGIDGCDGRPGMQGPSGAPGQNGVRGPPGKPGQQGPPGEAG-EGGIN 164
            ||.||:      .|.|||.|..|..|.|   |.:|.||:.|..||.|.||.:|.||..| .|.:.
  Rat   687 MGLPGL------AGNPGPMGRKGHKGHP---GAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVG 742

  Fly   165 SKGTKGNRGETGQPGGVGPPGFDGDRGSKGDTGYAGLTGEKGDPGLPGPKGDTGAVSELPYSLIG 229
            ..|.||:||..|.||..|.||.||:||.   .|..|..||.|.||.||..|:.           |
  Rat   743 DPGPKGSRGYIGLPGLFGLPGSDGERGL---PGIPGKRGEMGRPGFPGDFGER-----------G 793

  Fly   230 PPGAKGEPGDSLSGVLKPDDTLKGYKGYVGLQGDEGPQGPTGEQGAVGRNGLPGARGE---IGGP 291
            |||..|.||:.         .|.|..|.:||.||.|..||.|..|..|..||.|..||   .|..
  Rat   794 PPGLDGNPGEI---------GLPGPPGVLGLLGDMGALGPVGYPGPKGMKGLMGGVGEPGLKGDK 849

  Fly   292 GERGKPGKDGEPGRFGDKGMKGAPGWTGADGLDGSPGERGEDGFTGMPGVQGGAGPPGIYDPSLT 356
            ||:|.||..|:||..||||..|.||:.||.|..|..|:.|:.|..|:||.               
  Rat   850 GEQGVPGVSGDPGFQGDKGSHGLPGFPGARGKPGPMGKAGDKGSLGLPGP--------------- 899

  Fly   357 KSLPGPIGSQGDIGPPGEQGPPGLPGKPGRRG-PIGLAGQSGDPGLNGSRGPPGRSERGEAGDYG 420
               |||.|..|||||||:.||.|:.||||.|| |                ||||:          
  Rat   900 ---PGPEGFPGDIGPPGDNGPEGMKGKPGARGLP----------------GPPGQ---------- 935

  Fly   421 FIGPPGPQGPPGEAGLPGRYGLHGEPGQNVVGPKGEPGLNGQPGLEGYRGDRGEVGLPGDKGLPG 485
             :||.|.:||.|..|:|   ||.|:||:     ||.|   |:|||:|.:|:      |||.|.| 
  Rat   936 -LGPEGDEGPMGPPGVP---GLEGQPGR-----KGFP---GRPGLDGSKGE------PGDPGRP- 981

  Fly   486 EGYNIVGPPGSQGPPGFRGLPGDDGYNGLRGLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYPGS 550
                  ||.|.||..||.||.|:.      |:.||||.||      ..||.|..|.:|..|:||:
  Rat   982 ------GPVGEQGLMGFVGLVGEP------GIVGEKGDRG------VMGPPGAPGPKGSMGHPGT 1028

  Fly   551 HGNRGAIGLTGPRGVQGLQGNPGRAGHKGLPGPAGIPGEPGKVGAAGPDGKAIEVGSLRKGEIGD 615
                       |.||    |:||..|..|.||..|:||                   :|      
  Rat  1029 -----------PGGV----GDPGEPGPWGPPGSRGLPG-------------------MR------ 1053

  Fly   616 TGDSGHRGDTGDDGEKGRDGSDGSKGERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPGRNATTP 680
             |..||||..|.||..|..||.|.||.            .|.:||.|:||:.|            
  Rat  1054 -GAKGHRGPRGPDGPAGEQGSKGLKGR------------VGPRGRPGQPGQQG------------ 1093

  Fly   681 KVYLIGEPGYDGIKGERGDDGDTGFKGVKGEPNPGQIYDNTGEPGEDGYTGPKGVKGAKGEQGAI 745
               ..||.|:.|.||..|..|.:|..|.||.|         |||            |::|.||.:
  Rat  1094 ---AAGERGHSGAKGFLGIPGPSGPPGAKGLP---------GEP------------GSQGPQGPV 1134

  Fly   746 GLRGEIGDRGPAGEVIPGPVGAKGYPGPTGDYGQQGAPGLPGRDGEPGLDGGIGYKGQRGVPGQE 810
            |..||:|.:||     ||.||..|.||.:|..|..|..|.||..         |..||||.||  
  Rat  1135 GPPGEMGPKGP-----PGAVGEPGLPGDSGMKGDLGPLGPPGEQ---------GLIGQRGEPG-- 1183

  Fly   811 VIQGEIGPPGRSGIKGFPGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPGNKGQRGDFLV 875
             ::|::||.|..|:||..||            |||.|..||:|.:|..|:.|||           
  Rat  1184 -LEGDLGPVGPDGLKGDRGD------------PGPDGEHGEKGQEGLKGEEGLP----------- 1224

  Fly   876 GPPGPKGQPGRNGRQAPHGAKGQKGEVGSLGQNGQNGAKGSIGFSGRRGLLGNAGLQGLPGSPGI 940
            ||||..            |.:|::|:.||.|:.||.||||:.|:.|:   ||..|:.|.||.||.
  Rat  1225 GPPGIT------------GVRGREGKPGSQGEKGQRGAKGAKGYQGQ---LGEMGIPGDPGPPGT 1274

  Fly   941 P------------GLPGMIGEIGERGEIGYNGRQGDIGPRGPNGEFGPKGLSGDD----GPDGYP 989
            |            |.||.:|..||.|..||||.:|..||.||.|..|.||..|:|    |..|.|
  Rat  1275 PGPKGSRGTLGPMGAPGRMGAQGEPGLAGYNGHKGITGPLGPPGPKGEKGEQGEDGKTEGAPGPP 1339

  Fly   990 GANGLPGRKGETGNPGFPGRPGAKGVAAYSGIKGDDGESGLTGPIGYPGAPGAKGQRGPVGDSQP 1054
            |..|..|.:|:.|.||.||.||.:||....|..|..|:.|..||.|.||..|:||:.||.|  :|
  Rat  1340 GERGPVGDRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKG--KP 1402

  Fly  1055 ALDGVAGRKGEVGSPGPNGLPGRHGLKGQRGDRGLPGQQGRPGEPGAKGLGGYPGRNGINGLKGA 1119
               |.||..|..|:.|..||||..|:.|::|..|:.||.|.||.         .||.|..|.:|.
  Rat  1403 ---GKAGASGRRGTQGLQGLPGPRGVVGRQGPEGMAGQDGNPGR---------DGRPGYQGEQGN 1455

  Fly  1120 TGFPGPQGPKGPQGESGVVGLDGRNGQIGDQGPRGLIGEQGEQGEQGDEGEVGIPGRLENLRDRS 1184
            .|.|||.||.|.:|..||.||.|..|..|.:|..||.|:.|..|::|.||..|:||.        
  Rat  1456 DGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTGLPGN-------- 1512

  Fly  1185 FYRGFTGDQGLQGERGEQGDMGPIGFIGPPGAKGERGDIGYAGQLGFDGAEGLKGFQGDQGPRGP 1249
                 .|:.|.:|:.|:.|:||..|..|..|.||..|||               ||:|.||||||
  Rat  1513 -----QGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDI---------------GFKGIQGPRGP 1557

  Fly  1250 PGITLPAEKGDEGVAGLDGRAGRPGHFGQKGAPGPPGENGPNGAIGHRGPQIQGPPGPQGDVGFP 1314
            ||:     .|.||:      .|.||..|..|.|||.|:.|..|..|.:||  :|||||:|..|.|
  Rat  1558 PGL-----MGKEGI------IGPPGMLGPSGLPGPKGDRGSRGDWGLQGP--RGPPGPRGRPGPP 1609

  Fly  1315 GAPGHNGRHGLIGPKGELGDMGRQGERGESGYAIVGRQGDIGDIGFQGEPGWDGAKG----EQGY 1375
            |.|.|                           .:..:|.|: :..||   .|..|.|    ||||
  Rat  1610 GPPWH---------------------------PVQFQQDDL-EAAFQ---TWMDAHGAVRLEQGY 1643


Known Domains:


Indicated by green bases in alignment.

Gene     Sequence        Domain                                             Region      External ID  Identity
vkg      NP_001260071.1  Collagen                                           60..119     CDD:189968   25/64 (39%)
                         Collagen                                           102..152    CDD:189968   21/49 (43%)
                         Collagen                                           277..334    CDD:189968   28/59 (47%)
                         Collagen                                           534..593    CDD:189968   21/58 (36%)
                         Collagen                                           814..871    CDD:189968   21/56 (38%)
                         Collagen                                           957..1014   CDD:189968   29/60 (48%)
                         Collagen                                           990..1049   CDD:189968   26/58 (45%)
                         Collagen                                           1070..1128  CDD:189968   23/57 (40%)
                         C4                                                 1515..1624  CDD:128421
                         C4                                                 1625..1737  CDD:128421
Col27a1  NP_942042.1     LamG                                               52..230     CDD:304605
                         Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  317..428
                         Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  511..580
                         Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  617..787                 78/176 (44%)
                         Triple-helical region                              619..1612                495/1301 (38%)
                         Collagen                                           688..746    CDD:189968   26/66 (39%)
                         Collagen                                           718..777    CDD:189968   28/61 (46%)
                         Collagen                                           754..810    CDD:189968   29/78 (37%)
                         Collagen                                           826..884    CDD:189968   27/57 (47%)
                         Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  838..1617                394/1085 (36%)
                         Collagen                                           913..984    CDD:189968   43/121 (36%)
                         Collagen                                           1047..1105  CDD:189968   29/110 (26%)
                         Collagen                                           1139..1193  CDD:189968   27/70 (39%)
                         Collagen                                           1168..1227  CDD:189968   32/93 (34%)
                         Collagen                                           1201..1259  CDD:189968   31/92 (34%)
                         Collagen                                           1334..1391  CDD:189968   25/56 (45%)
                         Collagen                                           1376..1433  CDD:189968   27/61 (44%)
                         Collagen                                           1439..1498  CDD:189968   28/67 (42%)
                         Collagen                                           1472..1529  CDD:189968   24/69 (35%)
                         Collagen                                           1511..1570  CDD:189968   31/97 (32%)
                         COLFI                                              1656..1854  CDD:295304
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           2             1.910
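
The final row of the table reports a simple score of 2 and a weighted score of 1.910, which match the column sums over the two tools that report this pair. The short Python sketch below re-checks that arithmetic. The variable names are hypothetical, and treating the final row as a plain column sum is an assumption based only on the values shown here.

```python
from decimal import Decimal

# (tool, simple score, weighted score) for the tools that matched this pair,
# copied from the table; all other tools contribute 0 and 0.000.
matched = [
    ("Phylome", 1, Decimal("0.910")),
    ("SwiftOrtho", 1, Decimal("1.000")),
]

# Assumption: the totals are straightforward column sums.
simple_total = sum(simple for _, simple, _ in matched)
weighted_total = sum((weighted for _, _, weighted in matched), Decimal("0"))

print(simple_total, weighted_total)  # 2 1.910
```

`Decimal` is used rather than `float` so that the three-decimal weights add up exactly as printed on the page.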
