DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment vkg and Col4a3

DIOPT Version :10

Sequence 1:NP_477190.1 Gene:vkg / 33726 FlyBaseID:FBgn0016075 Length:1940 Species:Drosophila melanogaster
Sequence 2:NP_001400917.1 Gene:Col4a3 / 363265 RGDID:71085 Length:1671 Species:Rattus norvegicus


Alignment Length:1860 Identity:761/1860 - (40%)
Similarity:899/1860 - (48%) Gaps:327/1860 - (17%)


- Green bases have known domain annotations that are detailed below.


  Fly     7 RHLSGLLGVVYLLGSLVSVTLADGKIC---NTTLC-DCKGIKGRMGAPGPIGVPGLEG-PAGDIG 66
            |.|:.||.::.||.:...| ::.|.:|   ...|| ..||.||.:|.|||.|.||.:| |    |
  Rat     8 RFLALLLPILLLLLAAPPV-VSKGCVCEGKGKCLCWGTKGEKGEIGFPGPPGFPGQKGFP----G 67

  Fly    67 PPGRAGPLGEKGDVGEYGEQGEKGHRGDIGPKGEMGYPGIMGKSGEPGTPGPRGIDGCDGRPGMQ 131
            |.|..||.|.||..|..|..|.||.||..|..|..|.||:.|..|.||.||..|:.||:|..|.|
  Rat    68 PEGLPGPQGPKGSPGLPGLTGPKGIRGITGLPGFAGPPGLPGIPGYPGPPGLAGLPGCNGSKGEQ 132

  Fly   132 G-PS--GAPGQNGVRGPPGKPGQQGPPGEAGEGGINSKGTKGNRGETGQPGGVGPPGFDGDRGSK 193
            | |.  |.||..|:.||.|..||:|.|.:..:|..|.||..|..|..|..|..|||||.|..|..
  Rat   133 GFPGIPGTPGYAGLPGPDGLKGQKGKPAQGEDGEFNGKGDPGPPGAPGFQGLPGPPGFPGPAGPP 197

  Fly   194 GDTGYAGLTGEKGDPGLPGPKGDTGAVSELPYSLIGPPGAKGEPGDSLSGVLKPDDTLKGYKGYV 258
            |..|:.|.      ||..||:|..|.:.:   |.||..|.||               :||..|..
  Rat   198 GPPGFFGF------PGAMGPRGPKGRMGD---STIGQEGEKG---------------VKGLTGPP 238

  Fly   259 GL-----------------QGDEGPQGPTGEQGAVGRNGLPG-----ARGEIGGPGERGKPGKDG 301
            ||                 ||.:|.:|..||.|..|.:|.||     .:|..|.||.||||||||
  Rat   239 GLPGPVIFTLRHPYRKSDFQGQKGDEGERGEPGPPGPSGPPGDSYGSEKGAPGEPGPRGKPGKDG 303

  Fly   302 EPGRFGDKGMKGAPGWTGADGLDGSPGERGEDGFTGMPGVQGGAGPPGI-------YDPSL---T 356
            .|         |.||..||.|..|.||.|||   .|:.|.:|..||||.       ||..|   .
  Rat   304 AP---------GFPGTEGAKGTRGFPGLRGE---AGIKGWKGDIGPPGFPGPTEECYDAHLQKGD 356

  Fly   357 KSLPGPIGSQGDIGPPGEQGPPGLPGKPGRRGPIGLAGQSGDPGLNGS---RGPPGRSERGEAGD 418
            |.:|||.|.:|..||.|..||||:||.||...| ||.|..|.|||.||   |||||....|..|.
  Rat   357 KGMPGPPGPKGVRGPRGPSGPPGVPGSPGPSRP-GLRGPVGWPGLKGSKGERGPPGIDTVGPPGS 420

  Fly   419 YGFIGPPGPQGPPGEAGLPGRY-------GLHGEPGQ-------NVVGPKGEPG--------LNG 461
            .|..|.|||.||||..|.||..       |.||.||.       .|.|||||||        :.|
  Rat   421 LGCPGSPGPPGPPGPPGRPGDTVFQPGPPGDHGAPGDIGPPGVPGVDGPKGEPGQPCTECHCIPG 485

  Fly   462 QPGLEGYRGDRGEVGLPGDKGLPGEGYNIVGPPGSQGPPGFRGLPGDDGYNGL------------ 514
            .||:.|..|..|..|:||.:|.||...| .|.||:.|.|||.|.|||.|:.||            
  Rat   486 PPGVPGVPGLDGVKGIPGGRGAPGVKGN-PGSPGNAGLPGFAGFPGDQGHPGLKGDKGDTPLPWG 549

  Fly   515 ----------RGLPGEKGLRGDDCPVCNAGPRGPRGQ------EGDTGYPGSHGNRGAIGLTGPR 563
                      |||||.||..|........|||||||:      :||.|.||:.|:.|..|..||.
  Rat   550 QVGDPGDPGHRGLPGRKGFDGSPGGPGAKGPRGPRGEPALSGRKGDQGPPGAPGSPGPPGPAGPA 614

  Fly   564 GVQGL--QGNPGRAGHKGLPGPAGIPGE-------------------PGKVGAAGPDGKAIEVGS 607
            |..|.  ||.||..|.:|:||..|.|||                   ||..|.|||.|    :..
  Rat   615 GPPGYGPQGEPGPKGAQGVPGALGPPGEAGLKGESSASIPVLGPPGPPGPPGQAGPRG----LPG 675

  Fly   608 LRKGEIGDTGDSGHRGDTGDDGEKG--RDGSDGSKGERGETGQRGDYGDAGYQGRDGEPGRDGRD 670
            | .|.:| |.|.||   .|.|||.|  ..|..|::|.:|:.|..|..|..||.|..|.||..|..
  Rat   676 L-PGPVG-TCDPGH---PGPDGEPGIPEVGFPGARGPKGDQGFPGTIGLPGYPGETGRPGYPGEM 735

  Fly   671 GAPGRNATTPKVYLIGEPGYDGIKGERGDDGDTGFKGVKGEPNPGQIYDNTGEPGEDGYTGPKGV 735
            |.||... .|.|...||||..|..||||:.|:.|..|:.|.|.|      .|.||:||:.||.|.
  Rat   736 GVPGAKG-EPSVGRPGEPGKPGFPGERGNSGENGDIGLPGLPGP------PGTPGKDGFDGPPGD 793

  Fly   736 KGAKGEQGAIGLRGEIGDRGPAGEVIPGPVGAKGYPGPTGDYGQQGAPGLPGRDGEPGLDG--GI 798
            .|..|..||         :||.|..||||.|.:|.||..|..||.|..|..|..|:||:.|  ..
  Rat   794 PGQSGPPGA---------KGPPGRCIPGPRGTQGLPGLNGLKGQPGRRGDTGPKGDPGIPGMDRS 849

  Fly   799 GYKGQRGVPGQEVIQGEIGPPGRSGIKGFPGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGL 863
            |..|:||.||...:.||:||||:   ||:||..|.|   ||.|..|..|:.|..|..|..|..|.
  Rat   850 GVPGERGPPGTPGLPGEMGPPGQ---KGYPGPPGFP---GLPGEKGEVGIMGYPGTTGLPGLPGK 908

  Fly   864 PGNKGQRGDFLVGPPGPKGQPGRNGRQAPHGAKGQKGEVGS------LGQNGQNGAKGSIGFSGR 922
            ||::||||:  :|.||.||:.||.|.:   |.:|:||:.|.      .|..|:.|.||.:|..|.
  Rat   909 PGSQGQRGN--LGIPGVKGERGRPGVK---GERGEKGKPGPPHAPHLKGDKGEPGLKGFVGNPGE 968

  Fly   923 RGLLGNAGLQGLPGSPGIPGLPGMIGEIGERGEIGYNGRQGDIGPRGPNGEFGPKGLSGDDGPDG 987
            :|..||.||.|..|..|:|||||.            .|.:||.|..|..|..||:||.|..|..|
  Rat   969 KGNRGNPGLPGPKGLEGVPGLPGS------------PGPRGDTGSSGDPGRPGPQGLPGSMGNMG 1021

  Fly   988 YPGANGLPGRKGETGNPGFPGRPGAKGVAAYSGIKGDDGESGLTGPIGYPGAPGAKGQRGPVGDS 1052
            .||..   ||||.:|.||..|||                        |.||.||.:|.:|..|.|
  Rat  1022 VPGPK---GRKGTSGFPGVAGRP------------------------GLPGIPGPQGDKGEPGYS 1059

  Fly  1053 QPALDGVAGRKGEVGSPGPNGLPGRHGLKGQRGDRGLPGQQGRPGEPGAKGLGGYPGRNGINGLK 1117
            :.|..|..|.||:.|.||..|..|..||.|..|..|..|..|.||.||:.|..|.||.:|.:|||
  Rat  1060 EGASPGPPGPKGDPGLPGDKGKKGERGLPGPPGHSGPAGPDGAPGSPGSPGHPGRPGPDGDSGLK 1124

  Fly  1118 GATGFPGPQGPKGPQGESGVVGLDGRNGQIGDQGPRGLIGEQGEQGEQGDEGEVGIPGRLENLRD 1182
            |..|||||.|..||.|..|:.||.|..|..||||..|:.|..||:||.|..|             
  Rat  1125 GQKGFPGPPGSTGPPGPPGLPGLPGPMGMRGDQGQDGIPGPPGEKGETGLLG------------- 1176

  Fly  1183 RSFYRGFTGDQGLQGERGEQGDMGPIGFIGPPGAKGERGDIGYAGQLGFDGAEGLKGFQGDQGPR 1247
                 ...|.:|..|..|.:||.|..|..|.||.||..||:         |.:|..|..|..||.
  Rat  1177 -----AHPGQKGSPGVPGVKGDRGVPGLSGLPGRKGTMGDV---------GPQGPPGTTGLPGPP 1227

  Fly  1248 GPPGITLPAEKGDEGVAGLDGRAGRPGHFGQKGAPGPPGENGPNGAIGHRGPQIQGPPGPQGDVG 1312
            |.||..:|..||:.|:.||.|..|.|      |.|||||..|               .|.:||.|
  Rat  1228 GLPGTIVPGPKGNRGLPGLRGNPGEP------GPPGPPGPVG---------------EGIKGDKG 1271

  Fly  1313 FPGAPGHNGRHGLIGPKGELGDMGRQGERGESGYAIVGRQGDIGDIGFQGEPGWDGAKGEQGYPG 1377
            |.|.||..|..|::      ||.|..|:.|..|  |.|..|..||.||   ||:.|.|||:|.||
  Rat  1272 FMGLPGSRGLPGMV------GDTGAPGQPGAPG--IPGLPGVRGDPGF---PGFPGVKGEKGNPG 1325

  Fly  1378 LPGKNGRVGAPGPRGPTGDAGWGGIDGMDGLVGPKGQPGVTYSYSM-ARPGDRGEPGLDGFQGEE 1441
            ..|..|..|..||:||.               ||:|:||.....|: ..||..|.||..|.:|:.
  Rat  1326 FLGSIGHPGPVGPKGPP---------------GPQGKPGTLKVISLPGSPGPPGAPGQPGVKGDP 1375

  Fly  1442 GDGGAPGLIGFQGQRGAVGYRGDQGEVGYTGADGPQGQRGDKGYMGLTGAPGLRGLPGPQGEPAP 1506
            |..|.||:.|..|.||..|..|..|..|..|..|.:|.:|::|..||.|.|||:|.||.:|.|| 
  Rat  1376 GPLGPPGIPGPCGPRGQPGKDGKPGAPGPPGVKGSKGSKGEQGPPGLDGLPGLKGKPGDRGTPA- 1439

  Fly  1507 APPAPKSRGFIFARHSQSVHVPQCPANTNLLWEGYSL---SGNVAASRAVGQDLGQSGSCMMRFT 1568
              ...:.|||||.||||:...|.||..|..|:.|:||   .||   ..|.|||||..|||:.|||
  Rat  1440 --NGTRMRGFIFTRHSQTTANPSCPEGTQPLYSGFSLLFVQGN---EHAHGQDLGTLGSCLQRFT 1499

  Fly  1569 TMPYMLCDITNVCHFAQNNDDSLWLSTAEPMPMTMTPIQGRDLMKYISRCVVCETTTRIIALHSQ 1633
            |||::.|::.|||:||..||.|.||||..||||.|.||.||.|..|:|||.|||.....||:|||
  Rat  1500 TMPFLFCNVDNVCNFASRNDYSYWLSTPAPMPMDMAPITGRALEPYVSRCTVCEGPAMAIAVHSQ 1564

  Fly  1634 SMSIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQNLVSPGSCLEEFRAQPVIECHGHGRCNYYDAL 1698
            :.:||.||.||..:|.|:|:.|.|.....|.||.|.|||||||||||.|.|||||.|.||||...
  Rat  1565 TTAIPPCPQGWVSLWKGFSFVMFTSAGSEGAGQALASPGSCLEEFRASPFIECHGRGTCNYYSNS 1629

  Fly  1699 ASFWLTVIEEQDQFVQPRQQTLKA-DFTSKISRCTVCRRR 1737
            .||||..:..:..|.:|...|:|| |....||||.||.::
  Rat  1630 YSFWLASLNPERMFRKPIPSTVKAGDLEKIISRCQVCMKK 1669

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
vkgNP_477190.1 gly_rich_SclB <63..>349 CDD:468478 121/310 (39%)
gly_rich_SclB <465..>712 CDD:468478 114/297 (38%)
gly_rich_SclB <637..>871 CDD:468478 98/235 (42%)
gly_rich_SclB <894..>1132 CDD:468478 96/243 (40%)
gly_rich_SclB <1078..>1342 CDD:468478 100/263 (38%)
gly_rich_SclB <1228..1515 CDD:468478 106/287 (37%)
C4 1515..1624 CDD:128421 63/111 (57%)
C4 1625..1737 CDD:128421 58/112 (52%)
Col4a3NP_001400917.1 gly_rich_SclB <62..>342 CDD:468478 125/319 (39%)
gly_rich_SclB <390..>594 CDD:468478 83/204 (41%)
gly_rich_SclB <495..>755 CDD:468478 103/270 (38%)
gly_rich_SclB <750..>1030 CDD:468478 132/320 (41%)
gly_rich_SclB <950..>1205 CDD:468478 120/311 (39%)
gly_rich_SclB <1080..>1336 CDD:468478 123/314 (39%)
C4 1447..1553 CDD:460201 60/108 (56%)
C4 1557..1668 CDD:460201 58/110 (53%)

Return to query results.
Submit another query.