DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment vkg and Col4a1

DIOPT Version :10

Sequence 1:NP_477190.1 Gene:vkg / 33726 FlyBaseID:FBgn0016075 Length:1940 Species:Drosophila melanogaster
Sequence 2:NP_001128481.1 Gene:Col4a1 / 290905 RGDID:1307148 Length:1669 Species:Rattus norvegicus


Alignment Length:1883 Identity:770/1883 - (40%)
Similarity:914/1883 - (48%) Gaps:382/1883 - (20%)


- Green bases have known domain annotations that are detailed below.


  Fly    16 VYLLGSLVSVTLAD------------GKICNTTLCDCKGIKGRM------GAPGPIGVPGLEGPA 62
            |:||..|.::.|.:            |..|..  |||.|:||:.      |..|.||.||::||.
  Rat     7 VWLLLPLAALLLHEERSRAAAKGDCGGSGCGK--CDCHGVKGQKGERGFPGLQGVIGFPGMQGPE 69

  Fly    63 GDIGPPGRAGPLGEKGDVGEYGEQGEKGHRGDIGPKGEMGYPGIMGKSGEPGTPGPRGIDGCDGR 127
            |..|||      |:|||.||.|..|.||.||..|..|..|.||:.|..|:.|.|||.||.||:|.
  Rat    70 GPHGPP------GQKGDAGEPGLPGTKGTRGPPGAAGYPGNPGLPGIPGQDGPPGPPGIPGCNGT 128

  Fly   128 PGMQGPSGAPGQNGVRGPPGKPGQQGPPGEAGE--GGINSKGTKGNRGETGQPGGVGPPGFDGDR 190
            .|.:||.|.||..|..|.||.||..|..|:.||  |.:.....||.||..|.||..|.||..|.:
  Rat   129 KGERGPLGPPGLPGFSGNPGPPGLPGMKGDPGEILGHVPGTLLKGERGFPGIPGAPGSPGLPGLQ 193

  Fly   191 GSKGDTGYAGLTGEKGDPGLPGPKGDTGAVSELPYSLIGPPGAKGEPGDSLSGVLKPDDTLKGYK 255
            |..|..|:.|..|..|.||.||.||..|:      |..||.|.|||.|            :.|..
  Rat   194 GPVGPPGFTGPPGPPGPPGPPGEKGQMGS------SFQGPKGDKGEQG------------VSGPP 240

  Fly   256 GYVGL-----QGDEGPQGPTGEQGAVGRNGLP--GARGEIGGPGERGKPGKDGE----------- 302
            |..|.     :||..|.|..|::|..|..|.|  |.:||.|.||.|||||||||           
  Rat   241 GVPGQAQVKEKGDFAPTGEKGQKGEPGFQGTPGYGEKGEPGKPGPRGKPGKDGEKGEKGSPGFPG 305

  Fly   303 -------PGRFGDKGMKGAPGWTGAD---------------GLDGSPGERGEDGFTGMPGVQGGA 345
                   |||.|.:|.||.||..|..               |..||||.|||.|..|.||..|..
  Rat   306 ESGYPGLPGRQGPQGEKGEPGLPGPPGTVVGTRPLGEKGERGYPGSPGLRGEPGPKGYPGSPGQP 370

  Fly   346 GPPGIYDPSLT-------------------KSLPGPIGSQGDIGPPGEQGPPG-----------L 380
            ||||...|..|                   .|||||.|..|..||||..||||           .
  Rat   371 GPPGFAVPGQTGAPGFPGERGEKGERGSPGVSLPGPSGRDGAPGPPGPPGPPGQPGHTNGIVECQ 435

  Fly   381 PGKPGRRGPIGLAGQSGDPGLNGSRGPPGRSERGEA----GDYGFIGPPGPQGPPGEAGLPGRYG 441
            ||.||.:||.|:.||   |||.|..|..|  ::||:    ...|..|||||||||||.|.||:.|
  Rat   436 PGPPGDQGPPGIPGQ---PGLTGEVGQKG--QKGESCLACDTEGLRGPPGPQGPPGEIGFPGQPG 495

  Fly   442 LHGEPG-------QNVVGPKGEPGLNGQPGLEGYRGDRGEV----GLPGDKGLPGEGYNIVGPPG 495
            ..|:.|       :.:.||:|.|||.||||.   :|:.||:    .|.||||.||.       ||
  Rat   496 AKGDRGLPGRDGLEGLPGPQGSPGLIGQPGA---KGEPGEIFFDMRLKGDKGDPGF-------PG 550

  Fly   496 SQGPPGFRGLPGDDGYNGLRGLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYPGSHGNRG----- 555
            ..|.||..|.||.||:   .||||.||..|      :.|.:|.||..|..|:|||.|:.|     
  Rat   551 QPGMPGRAGTPGRDGH---PGLPGPKGSPG------SIGLKGERGPPGGVGFPGSRGDIGPPGPP 606

  Fly   556 AIGLTGPRGVQGLQGNPGRAGHKGLPGPAGIPGEPGK-VGAAGPDGKAIEVGSLRKGEIGDTGDS 619
            .:|..||.|.:|..|.||..|..|||||   .||.|| |...||.|.|        |..|..|..
  Rat   607 GVGPIGPIGEKGQAGLPGGPGSPGLPGP---KGEAGKVVPLPGPPGAA--------GLPGSPGFP 660

  Fly   620 GHRGDTGDDGEKGRDGSDGSKGERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPGRNATTPKVYL 684
            |.:||.|..|..||.|:.|.||..|:.| .|..|..|.:|.||.||..||.|:|||         
  Rat   661 GPQGDRGFPGTPGRPGNPGEKGAVGQPG-IGFPGLPGPKGVDGLPGEIGRPGSPGR--------- 715

  Fly   685 IGEPGYDGIKGERGDDGDTGFKGVKGEPNPGQIYDNTGEPGEDGYTGPKGVKGAKGEQGAIGLRG 749
               ||::|:.|..|..|.      ||||       ..|.||..|..|..|:.|..||:|:||..|
  Rat   716 ---PGFNGLPGNPGPQGQ------KGEP-------GIGLPGLKGQPGLPGIPGTPGEKGSIGGPG 764

  Fly   750 EIGDRGPAGEVIPGPVGAKGYPGPTGDYGQQGAPGLPGRDGEPGLDGGIGYKGQRGVPGQEVIQG 814
            ..|::|..|.  ||..|.:|.|||.|..|..|.||:|          |||..|..|.||.:...|
  Rat   765 VPGEQGLTGP--PGLQGIRGDPGPPGVQGPAGPPGVP----------GIGPPGAMGPPGGQGPPG 817

  Fly   815 EIGPPGRSGIKGFPGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPGNKGQRGDFLVGPPG 879
            ..||||..|.||||   |.||    ...|||||.||.||..|..||:||||..||:     |.||
  Rat   818 SSGPPGVKGEKGFP---GFPG----LDMPGPKGDKGSQGLPGLTGQSGLPGLPGQQ-----GTPG 870

  Fly   880 PKGQPGRNGRQAPHGAKGQKGEVGSLGQNGQNGAKGSIGFSGRRGLLGNAGLQGLPGSPGIPGLP 944
            ..|.||..|.....|..||.|..|..|..|..|.||..||.|..|..|:.|.:|..|..|:||:|
  Rat   871 VPGFPGSKGEMGVMGTPGQPGSPGPAGAPGLPGEKGDHGFPGSSGPRGDPGFKGDKGDVGLPGMP 935

  Fly   945 G---------MIGEIGERGEIGYNGRQGDIGPRG-------PN--------GEFGPKGLSGDDGP 985
            |         |.|:.|::||.|..|..||.|.||       |.        |:.|||   ||.|.
  Rat   936 GSMEHVDMGSMKGQKGDQGEKGQIGPTGDKGSRGDPGTPGVPGKDGQAGHPGQPGPK---GDPGL 997

  Fly   986 DGYPGANGLPGRKGETGNPGFPGRPGAKGVAAY------------SGIKGDDGESGLTGPIGYPG 1038
            .|.||:.||||.||..|..|.||.||.|||...            .|.||:.|:|||.| ||.||
  Rat   998 SGIPGSPGLPGPKGSVGGMGLPGSPGEKGVPGIPGSQGVPGSPGEKGAKGEKGQSGLPG-IGIPG 1061

  Fly  1039 APGAKGQRGPVGDSQPALDGVAGRKGEVGSPGPNGLPGRHGLKGQRGDRGLPGQQGRPGEPGAKG 1103
            .||.||.:|..|     ..|..|.|||.||.|..|:||..|.:|..|:.|.||..|.|||.|.||
  Rat  1062 RPGDKGDQGLAG-----FPGSPGEKGEKGSAGTPGMPGSPGPRGSPGNIGHPGSPGLPGEKGDKG 1121

  Fly  1104 LGGYPGRNGINGLKGATGFPGPQGPKGPQGESGVVGLDGRNGQIGDQGPRGLIGE-----QGEQG 1163
            |   ||.:|:.|:||..|.||..||.||.|:.|..|.||..|..|::|.:||.|.     .|.:|
  Rat  1122 L---PGLDGVPGVKGEAGLPGTPGPTGPAGQKGEPGSDGIPGSAGEKGEQGLPGRGFPGFPGTKG 1183

  Fly  1164 EQGDEGEVGIPGRLENLRDRSFYRGFTGDQGLQGERGEQGDMGPIGFIGPPGAKGERGDIGYAGQ 1228
            ::|.:||||.|             |..|..|:.|.:|||      ||:||||.:         ||
  Rat  1184 DKGSKGEVGFP-------------GLAGSPGIPGAKGEQ------GFMGPPGPQ---------GQ 1220

  Fly  1229 LGFDGAEG--LKGFQGDQGPRGPPGITLPAEKGDEGVAGLDGRAGRPGHFGQKGAPGPPGENGPN 1291
            .|..|..|  ::|.:||:||:|.||:                    |||.|..|.||.||.:||.
  Rat  1221 PGLPGTPGHPVEGPKGDRGPQGQPGL--------------------PGHPGPMGPPGFPGIDGPK 1265

  Fly  1292 GAIGHRG-PQIQGPPGPQGDVGFPGAPGHNGRHGLIGPKGELGDMGRQGERGESGYAIVGRQGDI 1355
            |..|:.| |...|.|||:||.||.|.||..|..|:.|.||::|..|..|.:|:.|.         
  Rat  1266 GDKGNPGWPGAPGAPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGQKGL--------- 1321

  Fly  1356 GDIGFQGEPGWDGAKGEQGYPGLPGKNGRVGAPGPRGPTGDAGWGGIDGMDGLVGPKGQPGVTYS 1420
                    ||..|.||:||..|:||..|..|.|||.||     :..|.|..||.||:|.||:   
  Rat  1322 --------PGLQGVKGDQGDQGIPGPKGLQGPPGPPGP-----YDVIKGEPGLPGPEGPPGL--- 1370

  Fly  1421 YSMARPGDRGEPGLDGFQGEEGDGGAPGLIGFQGQRGAVGYRGDQGEVGYTGADGPQGQRGDKGY 1485
                       .||.|..|.:|..|..|.:|..|..|..|:.|..|:.|.||..||.|.|     
  Rat  1371 -----------KGLQGPPGPKGQQGVTGSVGLPGPPGVPGFDGAPGQKGETGPFGPPGPR----- 1419

  Fly  1486 MGLTGAPGLRGLPGPQGEPAPAPPAPKS--RGFIFARHSQSVHVPQCPANTNLLWEGYSL---SG 1545
             |..|.||..||||..|     ||...|  .||:..||||:...|.||..|.:|:.||||   .|
  Rat  1420 -GFPGPPGPDGLPGSMG-----PPGTPSVDHGFLVTRHSQTTDDPLCPPGTKILYHGYSLLYVQG 1478

  Fly  1546 NVAASRAVGQDLGQSGSCMMRFTTMPYMLCDITNVCHFAQNNDDSLWLSTAEPMPMTMTPIQGRD 1610
            |   .||.|||||.:|||:.:|:|||::.|:|.|||:||..||.|.||||.|||||:|.||.|.:
  Rat  1479 N---ERAHGQDLGTAGSCLRKFSTMPFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPISGDN 1540

  Fly  1611 LMKYISRCVVCETTTRIIALHSQSMSIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQNLVSPGSCL 1675
            :..:||||.|||....::|:|||::.||.||.||..:|.|||:.|.|.....|.||.|.||||||
  Rat  1541 IRPFISRCAVCEAPAMVMAVHSQTIQIPQCPNGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCL 1605

  Fly  1676 EEFRAQPVIECHGHGRCNYYDALASFWLTVIEEQDQFVQPRQQTLKA-DFTSKISRCTVCRRR 1737
            ||||:.|.|||||.|.||||....||||..||..:.|.:|...|||| :..:.:|||.||.||
  Rat  1606 EEFRSAPFIECHGRGTCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRR 1668

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
vkgNP_477190.1 gly_rich_SclB <63..>349 CDD:468478 133/327 (41%)
gly_rich_SclB <465..>712 CDD:468478 99/256 (39%)
gly_rich_SclB <637..>871 CDD:468478 96/233 (41%)
gly_rich_SclB <894..>1132 CDD:468478 116/273 (42%)
gly_rich_SclB <1078..>1342 CDD:468478 106/271 (39%)
gly_rich_SclB <1228..1515 CDD:468478 106/291 (36%)
C4 1515..1624 CDD:128421 61/111 (55%)
C4 1625..1737 CDD:128421 58/112 (52%)
Col4a1NP_001128481.1 gly_rich_SclB <42..>320 CDD:468478 123/301 (41%)
gly_rich_SclB <267..>577 CDD:468478 133/327 (41%)
gly_rich_SclB <506..>766 CDD:468478 124/315 (39%)
gly_rich_SclB <717..>992 CDD:468478 121/311 (39%)
gly_rich_SclB <948..>1163 CDD:468478 99/226 (44%)
gly_rich_SclB <1118..>1335 CDD:468478 105/284 (37%)
gly_rich_SclB <1229..>1443 CDD:468478 102/280 (36%)
C4 1446..1553 CDD:460201 59/109 (54%)
C4 1556..1667 CDD:460201 58/110 (53%)

Return to query results.
Submit another query.