DRSC/TRiP Functional Genomics Resources




Protein Alignment vkg and Col1a1

DIOPT Version: 9

Sequence 1: NP_001260071.1  Gene: vkg / 33726  FlyBaseID: FBgn0016075  Length: 1940  Species: Drosophila melanogaster
Sequence 2: NP_445756.1  Gene: Col1a1 / 29393  RGDID: 61817  Length: 1453  Species: Rattus norvegicus


Alignment Length: 1497  Identity: 574/1497 (38%)
Similarity: 664/1497 (44%)  Gaps: 286/1497 (19%)
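
Each percentage in the summary is the corresponding count divided by the alignment length, rounded to a whole percent. A minimal Python sketch that reproduces the three figures from the counts reported above (the helper name `percent` is illustrative and not part of DIOPT):

```python
def percent(count: int, alignment_length: int) -> int:
    """Return count as a whole-number percentage of the alignment length."""
    return round(100 * count / alignment_length)


alignment_length = 1497  # number of columns in the alignment, gaps included

summary = {
    "identity": percent(574, alignment_length),
    "similarity": percent(664, alignment_length),
    "gaps": percent(286, alignment_length),
}
print(summary)
```

Note that the denominator is the alignment length (1497), not the length of either protein (1940 and 1453 residues).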


- Green residues have known domain annotations, which are detailed below.


  Fly    27 LADGKICNTTLCDCKGIKGRMGAPGPI----------GVPGLEGPAGDIGPPGRAGPLGEKGDVG 81
            :.||.:|...| ||...:.|.|...|.          .|.|:|||.||.||              
  Rat    60 VCDGVLCKEDL-DCPNPQKREGECCPFCPEEYVSPDAEVIGVEGPKGDPGP-------------- 109

  Fly    82 EYGEQGEKGHRGDIGPKGEMGYPGIMGKSGEPGTPGPRGIDGCDGR------------------P 128
                   :|.||.:||.|:.|.||..|..|.||.|||.|..|..|.                  |
  Rat   110 -------QGPRGPVGPPGQDGIPGQPGLPGPPGPPGPPGPPGLGGNFASQMSYGYDEKSAGVSVP 167

  Fly   129 GMQGPSGAPGQNGVRGPPGKPGQQGPPGEAGEGGINSKGTKGNRGETGQPGGVGPPGFDGDRGSK 193
            |..||||..|..|..|.||..|.||||||.||     .|..|..|..|.|   |||      |..
  Rat   168 GPMGPSGPRGLPGPPGAPGPQGFQGPPGEPGE-----PGASGPMGPRGPP---GPP------GKN 218

  Fly   194 GDTGYAGLTGEKGDPGLPGPKGDTGAVSELPYSLIGPPGAKGEPGDSLSGVLKPDDTLKGYKGYV 258
            ||.|.||..|..|:.|.|||:|..|.           ||..|.||            :||::|:.
  Rat   219 GDDGEAGKPGRPGERGPPGPQGARGL-----------PGTAGLPG------------MKGHRGFS 260

  Fly   259 GL---QGDEGPQGPTGEQGAVGRNGLPGARGEIGGPGERGKPGKDGEPGRFGDKGMKGAPGWTGA 320
            ||   :||.||.||.||.|:.|.||.||..|..|.|||||:||..|..|..|:.|..||.|..|.
  Rat   261 GLDGAKGDTGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGPPGSAGARGNDGAVGAAGPPGP 325

  Fly   321 DGLDGSPGERGEDGFTGMPGVQGGAGPPGIYDPSLTKSLPGPIGSQGDIGPPGEQGPPGLPGKPG 385
            .|..|.|      ||.|..|.:|.|||.               |::|..||.|.:|.||.||..|
  Rat   326 TGPTGPP------GFPGAAGAKGEAGPQ---------------GARGSEGPQGVRGEPGPPGPAG 369

  Fly   386 RRGPIGLAGQSGDPGLNGSRGPPGRSERGEAGDYGFIGPPGPQGPPGEAGLPGRYGLHGEPG--- 447
            ..||.|..|..|.||..|:.|.|     |.||..||.|..||.||.|.:|.||..|..||||   
  Rat   370 AAGPAGNPGADGQPGAKGANGAP-----GIAGAPGFPGARGPSGPQGPSGAPGPKGNSGEPGAPG 429

  Fly   448 -QNVVGPKGEP---GLNGQPGLEGYRGDRGEVGLPGDKGLPGEGYNIVGPPGSQGPPGFRGLPGD 508
             :...|.||||   |:.|.||..|..|.||..|.||..|||       ||||.:|.||.||.||.
  Rat   430 NKGDTGAKGEPGPAGVQGPPGPAGEEGKRGARGEPGPSGLP-------GPPGERGGPGSRGFPGA 487

  Fly   509 DGYNGLRGLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYPGSHGNRGAIGLTGPRGVQGLQGN-- 571
            ||..|.:|..||:|         :.||.||:|..|:.|.||..|..||.||||..|..|..|.  
  Rat   488 DGVAGPKGPAGERG---------SPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTG 543

  Fly   572 -PGRAGHKGLPGPAGIP---GEPGKVGAAGPDGKAIEVGSLRKGEIGDTGDSGHRGDTGDDGEKG 632
             ||.||..|.|||||.|   |:.|.:|..||.|.|.|.|  :.||.|..|..|..|..|.|||.|
  Rat   544 PPGPAGQDGRPGPAGPPGARGQAGVMGFPGPKGTAGEPG--KAGERGVPGPPGAVGPAGKDGEAG 606

  Fly   633 RDGSDGSKGERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPGRNATTPKVYLIGEPGYDGIKGER 697
            ..|:.|..|..||.|::|..|..|:||..|..|..|..|.||.......   :|.||..|.:|||
  Rat   607 AQGAPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGD---LGAPGPSGARGER 668

  Fly   698 GDDGDTGFKGVKGEPNPGQIYDNTGEPGEDGYTGPKGVKGAKGEQGAIGLRGEIGDRGPAGEVIP 762
            |..|:   :||:|.|.|.....|.|.||.||..|..|..||.|.|||.||:|..|:||.||  :|
  Rat   669 GFPGE---RGVQGPPGPAGPRGNNGAPGNDGAKGDTGAPGAPGSQGAPGLQGMPGERGAAG--LP 728

  Fly   763 GPVGAKGYPGPTGDYGQQGAPGLPGRDGEPGLDGGIGYKGQRGVPGQEVIQGEIGPPGRSGIKGF 827
            ||.|.:      ||.|.:||.|.||:||..||.|.||..|..|.||.:...|..||.|.:|.:|.
  Rat   729 GPKGDR------GDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGETGPSGPAGPTGARGA 787

  Fly   828 PGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPGNKGQRGDFLVGPPGPKGQPGRNGRQAP 892
            |||.|.||.      |||.|..|..|.||..|..|.||:.|.:||  .|||||.|..|      |
  Rat   788 PGDRGEPGP------PGPAGFAGPPGADGQPGAKGEPGDTGVKGD--AGPPGPAGPAG------P 838

  Fly   893 HGAKGQKGEVGSLGQNGQNGAKGSIGFSGRRGLLGNAGLQGLPGSPGIPGLPGMIGEIGERGEIG 957
            .|..|..|..|..|..|..|..|:.||.|..|.:|..|..|..|.||.||..|..|..|.|||.|
  Rat   839 PGPIGNVGAPGPKGSRGAAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPVGKEGGKGPRGETG 903

  Fly   958 YNGRQGDIGPRGPNGEFGPKGLSGDDGPDGYPGANGLPGRKGETGNPGFPGRPGAKGVAAYSGIK 1022
            ..||.|::||.||.|..|.||..|.|||.|.||.   ||.:|..|..|..|.||.:|...:.|:.
  Rat   904 PAGRPGEVGPPGPPGPAGEKGSPGADGPAGSPGT---PGPQGIAGQRGVVGLPGQRGERGFPGLP 965

  Fly  1023 GDDGESGLTGPIGYPGAPGAKGQRGPVGDSQPALDGVAGRKGEVGSPGPNGLPGRHGLKGQRGDR 1087
            |..||.|..||   .||.|.:|..||:|  .|.|.|..|..|..||||..|.|||.|..|.:|||
  Rat   966 GPSGEPGKQGP---SGASGERGPPGPMG--PPGLAGPPGESGREGSPGAEGSPGRDGAPGAKGDR 1025

  Fly  1088 GLPGQQGRPGEPGAKGLGGYPGRNGINGLKGATGFPGPQGPKGPQGESGVVGLDGRNGQIGDQGP 1152
            |..|..|.||.|||.|..|..|..|.||.:|.||..||.||.||.         |..|..|.|||
  Rat  1026 GETGPAGPPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPA---------GARGPAGPQGP 1081

  Fly  1153 RGLIGEQGEQGEQGDEGEVGIPGRLENLRDRSFYRGFTGDQGLQGERGEQGDMGPIGFIGPPGAK 1217
            |   |::||.|||||.|..|             :|||:|.||..|..|..|:.||.|..||    
  Rat  1082 R---GDKGETGEQGDRGIKG-------------HRGFSGLQGPPGSPGSPGEQGPSGASGP---- 1126

  Fly  1218 GERGDIGYAGQLGFDGAEGLKGFQGDQGPRGPPGITLPAEKGDEGVAGLDGRAGRPGHFGQKGAP 1282
                                      .||||||                 |.||.||..|..|.|
  Rat  1127 --------------------------AGPRGPP-----------------GSAGSPGKDGLNGLP 1148

  Fly  1283 GPPGENGPNGAIGHRGPQIQGPPGPQGDVGFPGAPGHNGRHG---LIGPKGELGDMGRQGERGES 1344
            ||.|..||.|..|..||  .|||||.|..|.||.|  :|.:.   |..|..|....|.:..|.:.
  Rat  1149 GPIGPPGPRGRTGDSGP--AGPPGPPGPPGPPGPP--SGGYDFSFLPQPPQEKSQDGGRYYRADD 1209

  Fly  1345 GYAIVGRQGDIGDIGFQGEPGWDGAKGEQGYPGLPG-------------KNGRVGAPGPRGPTGD 1396
            ...:..|..::...........:..:..:|....|.             |:|.......:|...|
  Rat  1210 ANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKSGEYWIDPNQGCNLD 1274

  Fly  1397 A--GWGGIDGMDGLVGPKGQPGVTYS--YSMARPGDR-----GEPGLDGFQGEEG-DGGAPGLIG 1451
            |  .:..::.....|.|. ||.|...  |....|.::     ||...||||.|.| :|..|..:.
  Rat  1275 AIKVYCNMETGQTCVFPT-QPSVPQKNWYISPNPKEKKHVWFGESMTDGFQFEYGSEGSDPADVA 1338

  Fly  1452 FQ 1453
            .|
  Rat  1339 IQ 1340
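
Each sequence row in the alignment carries a label, a start coordinate, the gapped sequence, and an end coordinate. The sketch below, in Python, parses such rows, checks that the number of ungapped residues matches the coordinates, and counts identical columns between two aligned rows. The function names and the regular expression are assumptions made for illustration; they describe the layout shown on this page, not an official DIOPT format.

```python
import re

# label, start coordinate, gapped sequence, end coordinate
ROW = re.compile(r"^\s*(\w+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def parse_row(line: str):
    """Split a sequence row into (label, start, gapped_sequence, end)."""
    m = ROW.match(line)
    if m is None:
        raise ValueError(f"not a sequence row: {line!r}")
    label, start, seq, end = m.groups()
    return label, int(start), seq, int(end)


def coordinates_consistent(line: str) -> bool:
    """True if the ungapped residue count equals end - start + 1."""
    _, start, seq, end = parse_row(line)
    return len(seq.replace("-", "")) == end - start + 1


def identical_columns(seq_a: str, seq_b: str) -> int:
    """Count columns where both rows show the same residue (gaps excluded)."""
    return sum(a == b and a != "-" for a, b in zip(seq_a, seq_b))


# The final block of the alignment above.
fly = "  Fly  1452 FQ 1453"
rat = "  Rat  1339 IQ 1340"

print(coordinates_consistent(fly), coordinates_consistent(rat))
print(identical_columns(parse_row(fly)[2], parse_row(rat)[2]))
```

Summing `identical_columns` over every block should give the identity count reported in the summary, provided the blocks are parsed without loss.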

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain                                                                     Region      External ID Identity
vkg     NP_001260071.1  Collagen                                                                   60..119     CDD:189968  21/58 (36%)
                        Collagen                                                                   102..152    CDD:189968  25/67 (37%)
                        Collagen                                                                   277..334    CDD:189968  25/56 (45%)
                        Collagen                                                                   534..593    CDD:189968  32/64 (50%)
                        Collagen                                                                   814..871    CDD:189968  25/56 (45%)
                        Collagen                                                                   957..1014   CDD:189968  27/56 (48%)
                        Collagen                                                                   990..1049   CDD:189968  22/58 (38%)
                        Collagen                                                                   1070..1128  CDD:189968  29/57 (51%)
                        C4                                                                         1515..1624  CDD:128421  -
                        C4                                                                         1625..1737  CDD:128421  -
Col1a1  NP_445756.1     VWC                                                                        31..86      CDD:214564  9/26 (35%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                          97..1206    -           538/1322 (41%)
                        Nonhelical region (N-terminal)                                             152..167    -           0/14 (0%)
                        Triple-helical region                                                      168..1181   -           503/1203 (42%)
                        Collagen                                                                   225..284    CDD:189968  29/81 (36%)
                        Collagen                                                                   264..316    CDD:189968  26/51 (51%)
                        Collagen                                                                   390..449    CDD:189968  28/63 (44%)
                        Collagen                                                                   486..545    CDD:189968  27/67 (40%)
                        Collagen                                                                   525..584    CDD:189968  29/60 (48%)
                        Collagen                                                                   657..714    CDD:189968  28/59 (47%)
                        Collagen                                                                   696..748    CDD:189968  28/59 (47%)
                        Cell attachment site. /evidence=ECO:0000255                                734..736    -           0/7 (0%)
                        Collagen                                                                   1068..1123  CDD:189968  30/79 (38%)
                        Cell attachment site. /evidence=ECO:0000255                                1082..1084  -           1/4 (25%)
                        Major antigenic determinant (of neutral salt-extracted rat skin collagen)  1176..1186  -           5/11 (45%)
                        Nonhelical region (C-terminal)                                             1182..1207  -           5/24 (21%)
                        COLFI                                                                      1219..1452  CDD:279718  25/123 (20%)
Blue background indicates that the domain is not in the aligned region.
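
The Region and Identity fields of the domain table follow two fixed patterns, `start..end` and `matches/columns (percent%)`. A short Python sketch for reading them, assuming exactly those patterns (both function names are hypothetical helpers, not part of DIOPT):

```python
import re


def region_length(region: str) -> int:
    """Length in residues of a 'start..end' region, inclusive of both ends."""
    start, end = (int(part) for part in region.split(".."))
    return end - start + 1


def identity_percent(entry: str) -> int:
    """Parse 'matches/columns (NN%)' and confirm NN is the rounded ratio."""
    m = re.fullmatch(r"(\d+)/(\d+) \((\d+)%\)", entry)
    if m is None:
        raise ValueError(f"unexpected identity entry: {entry!r}")
    matches, columns, reported = (int(g) for g in m.groups())
    computed = round(100 * matches / columns)
    if computed != reported:
        raise ValueError(f"{entry!r}: computed {computed}%, reported {reported}%")
    return computed


# First vkg Collagen domain from the table above.
print(region_length("60..119"), identity_percent("21/58 (36%)"))
```

The denominator of an Identity entry need not equal the region length: the first vkg Collagen domain spans 60 residues but is scored over 58 columns.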


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              BLAST Result Score  Score Type  Cluster ID
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           -                   -           E33208_3BAFZ
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           -                   -
TreeFam         0             0.000           Not matched by this tool.
Total           2             1.900
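
The last row of the table holds the column totals: a simple score of 2 and a weighted score of 1.900. Only eggNOG and SwiftOrtho matched this pair, so the totals can be reproduced from those two rows. A minimal Python sketch, using `Decimal` so that the three-decimal formatting of the table is preserved (the variable names are illustrative):

```python
from decimal import Decimal

# (tool, simple score, weighted score) for the tools that matched the pair;
# every other tool contributes 0 and 0.000.
matched = [
    ("eggNOG", 1, Decimal("0.900")),
    ("SwiftOrtho", 1, Decimal("1.000")),
]

simple_total = sum(simple for _, simple, _ in matched)
weighted_total = sum(weighted for _, _, weighted in matched)

print(simple_total, weighted_total)
```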
