DRSC/TRiP Functional Genomics Resources


Protein Alignment vkg and Col4a2

DIOPT Version: 9

Sequence 1: NP_001260071.1  Gene: vkg / 33726  FlyBaseID: FBgn0016075  Length: 1940  Species: Drosophila melanogaster
Sequence 2: XP_038951185.1  Gene: Col4a2 / 306628  RGDID: 1308085  Length: 1707  Species: Rattus norvegicus


Alignment Length: 1877
Identity: 751/1877 (40%)
Similarity: 908/1877 (48%)
Gaps: 347/1877 (18%)
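
Each percentage in the summary is the corresponding count divided by the alignment length. The Python sketch below reproduces that arithmetic from the counts reported above. The helper name `percent_of_alignment` is hypothetical and not part of DIOPT, and rounding to whole percents is an assumption that happens to match the reported values.

```python
def percent_of_alignment(count: int, alignment_length: int) -> float:
    """Return `count` as a percentage of the alignment length."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return 100.0 * count / alignment_length


# Counts copied from the alignment summary on this page.
ALIGNMENT_LENGTH = 1877
stats = {"identity": 751, "similarity": 908, "gaps": 347}

percentages = {
    name: percent_of_alignment(count, ALIGNMENT_LENGTH)
    for name, count in stats.items()
}

for name, pct in percentages.items():
    # One decimal place shown here; the page itself reports whole percents.
    print(f"{name}: {pct:.1f}%")
```

Rounded to whole numbers, these values agree with the 40%, 48% and 18% shown in the summary.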


- Residues highlighted in green on the original page have known domain annotations, which are detailed below.


  Fly    11 GLLGVVYLLGSLVSVTLADGKICNTTLCDCKGIKGRMGAPGPIGVPGLEGPAGDIGPPGRAGPLG 75
            |||....|.|...|.....|:.|:.. |.|...||..|.||.:|..|..||.|..|.||..   |
  Rat    25 GLLAQSVLGGVKKSDVPCGGRDCSGG-CQCFPEKGARGQPGEVGPQGYNGPPGLQGFPGLQ---G 85

  Fly    76 EKGDVGEYGEQGEKGHRGDIGPKGEMGYPGIMGKSGEPGTPGPRGIDGCDGRPGMQGPSGAPGQN 140
            .|||.||.|..|..|.:||:|.:|..|:||..|..|.||..||||..|.||..|.:|.:|..|.:
  Rat    86 RKGDKGERGAPGPTGPKGDVGARGVSGFPGADGIPGHPGQGGPRGRPGYDGCNGTRGDAGPQGPS 150

  Fly   141 GVRGPPGKPGQQGPPGEAGEGGINSKGTKGN-RGETGQPGGV---GPPGFDGDRGSKGDTGYAGL 201
            |..|.||.||.|||.|:.||....||..:.. |||.|:||.|   ||||..|.         .|.
  Rat   151 GTGGFPGLPGPQGPKGQKGEPYALSKEDRDKYRGEPGEPGSVGYQGPPGRPGP---------IGQ 206

  Fly   202 TGEKGDPGLPGPKGDTGAVSELPYSLIGPPGAKGEPGDSLSGVLKPDDTLKGYKGYVGLQGDEG- 265
            .|..|.||.|||.              ||||.||:||:...|....    ||.||.||..|..| 
  Rat   207 MGPMGAPGRPGPP--------------GPPGPKGQPGNRGLGFYGE----KGEKGDVGQPGPNGI 253

  Fly   266 PQ-----GPT----------GEQGAVGRNGLPG--ARGE---IGGPGERGKPGKDGEPGRFGDKG 310
            |.     |||          ||:|:.|..|:||  .:||   :|.||.||.||.|||.|..|.||
  Rat   254 PSDITLIGPTPSTYHPDMYKGEKGSQGEPGIPGITLKGEEGIMGFPGTRGFPGLDGEKGVSGQKG 318

  Fly   311 MKGAPGWTGADGLDGSPGERGEDGFTGMPGVQGGAGPPG--IYD--PSLTKSLPGPIGSQGDIGP 371
            .:|..|:.|..|..|..|||||            .||||  .|.  |||.|      |::||.|.
  Rat   319 SRGLDGFQGPSGPRGPKGERGE------------LGPPGPPAYSPHPSLAK------GARGDPGF 365

  Fly   372 PGEQGPPGLPGKPGRRGPIGLAGQS-GDP----GLNGSRGPPGRS-ERGEAGDY----GFIGPPG 426
            .|..|.||..|:||..||:|..|.| ||.    ||.|..||.|.| |.|.:..|    |..|.||
  Rat   366 QGAHGEPGSRGEPGDPGPVGPPGLSIGDEDSKRGLPGEMGPKGFSGEPGPSAYYPGPPGADGKPG 430

  Fly   427 PQGPPGEAGLPG----RYGLHGEPGQ-NVVGPKGEPGLNGQPGLEGYRGD--RGEV-----GLPG 479
            |||.||.||.||    .:||.|..|: ...||.|.||..||.|.:|..||  .|:|     ||||
  Rat   431 PQGLPGPAGPPGPDGFLFGLKGSEGRVGYPGPSGFPGGRGQKGWKGEAGDCQCGQVIPGLPGLPG 495

  Fly   480 DKGLPGEGYNIVGPPGSQGPPGFRGLPGDDGYN---GLRGLPGEKGLRGDDCPVCNAGPRGPRGQ 541
            .||.||..... |..|.||.||..|:||..|:.   |:.|.||.||::||...:..      :|:
  Rat   496 PKGFPGVNGEF-GKKGDQGDPGLHGIPGFPGFKGAPGIAGAPGPKGVKGDSRTITT------KGE 553

  Fly   542 EGDTGYPGSHGNRGAIGLTGPRGVQGLQGNPGRAGH--KGLPGPAGIPGEPGKVGAAGPDGKAIE 604
            .|..|.||.||.:|..|:.|..|:.|..|.||..|.  ||.||.||:||.||..|.         
  Rat   554 RGQPGIPGVHGMKGDDGVPGRDGLDGFPGLPGPPGDGIKGPPGDAGLPGTPGTKGF--------- 609

  Fly   605 VGSLRKGEIGDTGDSGHRGDTGDDGEKGRDGSDGSKGERGETGQRGDYGDAGYQGRDGEPGRDGR 669
                 .||:|..|                .|..|.|||||..      ||||..|..|.||..|.
  Rat   610 -----PGEVGPPG----------------QGLPGPKGERGFP------GDAGLPGPPGFPGPPGL 647

  Fly   670 DGAPGRNATTPKVYLIGEPGYDGIKGERGDDGDTGFK-----GVKGEPNPGQIYDNTGEPGEDGY 729
            .|.||:                       .|.|||.|     |.:....||.:....|.||:.|.
  Rat   648 PGTPGQ-----------------------ADCDTGVKRPIGGGQQVVIQPGCVEGPAGSPGQPGP 689

  Fly   730 TGPKGVKGAKGEQGAIGLRGEIGDRGPAGEVIPGPVGAKGYPGP---TGDYGQQGAPGLPGRDGE 791
            .||.|.||.:|..|..|..||.|.:|     .||..|.:|:|||   .|..|.:|||||||.||.
  Rat   690 PGPTGAKGIRGIPGFPGASGEQGLKG-----FPGDPGREGFPGPPGFMGPRGSKGAPGLPGPDGP 749

  Fly   792 P---GLDGGIGYKGQRGVPGQEVI------QGEIGPPGRSGIKGFPGDVGAPGQYGLAGRPGPKG 847
            |   ||.|..|..|.||:|| ||:      :|:.|.||:.|:|||||::||||..|..|.||..|
  Rat   750 PGPIGLPGPAGPPGDRGIPG-EVLGAQPGARGDAGLPGQPGLKGFPGEIGAPGFRGSQGMPGMPG 813

  Fly   848 VKGEQGPDGAVGQTGLPGNKGQRG----------------DFLVGPPGPKGQPGRNGRQAPHGAK 896
            :||:.|..|..||.||.|..||.|                ..|.|.||.:|:||..|...|.|.|
  Rat   814 LKGQPGFPGPSGQPGLSGPPGQHGFPGAPGREGPLGLPGSPGLGGLPGDRGEPGEPGEPGPVGMK 878

  Fly   897 GQKGEVGSLGQNGQNGAKGSIGFSGRRGLLGNAGLQGLPGSPGI------------PGLPGMIGE 949
            |..|:.|..|.:|:.|..||.||.|..|:.|..|.:|..||||:            ||.||:.||
  Rat   879 GVSGDRGDAGVSGERGHPGSPGFKGMAGMPGIPGQKGDRGSPGMDGFQGMLGLKGRPGFPGIKGE 943

  Fly   950 IGERGEIGYNGRQGDIGPRGPNGEFGP--------------KGLSGDDGP---DGY---PGANGL 994
            .|..|..|..|..|:.|.:|..|:.||              ||..||:||   .||   .|..|:
  Rat   944 AGFFGVPGLKGLPGEPGVKGNRGDRGPPGPPPLILPGMKDIKGEKGDEGPMGLKGYLGLKGIQGM 1008

  Fly   995 PGRKGETGNPGFPGRP----GAKGVAAYSGIKGDDGESGLTGPIGYPGAPGAKGQRGPVGDSQPA 1055
            ||..|.:|.||.||||    ||||.....|..|..|..|::||.|..|.||..|.||..|  .|.
  Rat  1009 PGVPGLSGIPGLPGRPGFIKGAKGDIGVPGTPGLPGFPGVSGPPGITGFPGFTGSRGEKG--TPG 1071

  Fly  1056 LDGVAGRKGEVGSPGPNG----LPGRHGLKGQRGDRGLPGQQGRPGEPGAKGLGGYPGRNGI--- 1113
            :.||.|..|..|..|..|    |||..||||:||..|:||.:|..||.||:|..|:||..|:   
  Rat  1072 VAGVFGETGPTGDFGDIGDTVDLPGSPGLKGERGVTGIPGLKGLFGEKGAEGDVGFPGITGMAGA 1136

  Fly  1114 ---NGLKGATGFPGPQGPKGPQGESGVVGLDGRNGQIGDQGPRGLIGEQGEQGEQGDEGEVGIPG 1175
               .||||.|||||..|.:|||||.|.:|:.|..   ||.|..|:.|..|..|.:|..|..|:||
  Rat  1137 QGSPGLKGQTGFPGLTGLQGPQGEPGRIGIPGDK---GDFGWPGVPGRPGIPGIRGISGLHGLPG 1198

  Fly  1176 RLENLRDRSFYRGFT--------GDQGLQGERGEQGDMGPIGFI-GPPGAKGERGDIGYAGQLGF 1231
            .          :||.        ||.|..|..|::||.|....: ||.||.|::|:.|..|:.|.
  Rat  1199 T----------KGFPGSPGVDAHGDPGFPGPTGDRGDRGEANTLPGPVGAPGQKGEQGIPGERGP 1253

  Fly  1232 DGAEGLKGFQGDQGPRGPPGITLPAEKGDEGVAGLDGRAGRPGHFGQKGAPGPPGENGPNGAIGH 1296
            .|:.||:||         |||:.|:     .::||.|..|.||.||.:|..||||..|||...|.
  Rat  1254 VGSPGLQGF---------PGISPPS-----NISGLPGDVGAPGIFGLQGYQGPPGPPGPNALPGI 1304

  Fly  1297 RGPQ----IQGPPGPQGDVGFPGAPGHNGRHGLIGPKGELGDMGRQGERGESGYAIVGRQGDIGD 1357
            :|.:    ..|.||.:|.||.||..|..|.|||.|.||..|:.|..|..|.||..     ||.|.
  Rat  1305 KGDEGSSGAAGFPGEKGWVGDPGPQGQPGVHGLPGEKGPKGEQGFMGNTGPSGAV-----GDRGP 1364

  Fly  1358 IGFQGEPGWDGAKGEQGYPGLPGKNGRVGA-PGPRGPTGDAGWGGIDGMDGLVGPKGQPGVTYSY 1421
            .|.:|:.|:.||.|..|.||:||...::.. ||..||.|..   |:.|..|.:||:|.||     
  Rat  1365 KGPKGDQGFPGAPGSMGSPGIPGIPQKIAVQPGTMGPQGRR---GLPGALGEMGPQGPPG----- 1421

  Fly  1422 SMARPGDRGEPGLDGFQGEEGDGGAPGLIGFQGQRGAVGYRGDQGEVGYTGADGPQGQRGDKGYM 1486
               .||.||.||..|.||..|....|            |:|||||.:|:   .||.||.|:.   
  Rat  1422 ---DPGFRGAPGKAGPQGRGGVSAVP------------GFRGDQGPMGH---QGPIGQEGEP--- 1465

  Fly  1487 GLTGAPGLRGLPGPQGEPAPAPPAPKSRGFIFARHSQSVHVPQCPANTNLLWEGYSLSGNVAASR 1551
            |..|:|||.|:||          ...|.|::..:|||:...|.||...|.||.||||.......:
  Rat  1466 GRPGSPGLPGMPG----------RSVSIGYLLVKHSQTDQEPMCPVGMNKLWSGYSLLYFEGQEK 1520

  Fly  1552 AVGQDLGQSGSCMMRFTTMPYMLCDITNVCHFAQNNDDSLWLSTAEPMPMTMTPIQGRDLMKYIS 1616
            |..||||.:|||:.||:|||::.|:..:||::|..||.|.||||..|:|  |.|:...::..|||
  Rat  1521 AHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASRNDKSYWLSTTAPLP--MMPVAEEEIKPYIS 1583

  Fly  1617 RCVVCETTTRIIALHSQSMSIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQNLVSPGSCLEEFRAQ 1681
            ||.|||.....||:|||.:|||.||.||..:|.|||:.|.|.....|.||:|||||||||:|||.
  Rat  1584 RCSVCEAPAVAIAVHSQDVSIPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRAT 1648

  Fly  1682 PVIECH-GHGRCNYYDALASFWLTVIEEQDQFVQPRQQTLKAD-FTSKISRCTVCRR 1736
            |.|||: |.|.|:|:....|||||.|.||:....|...||||. ..:.||||.||.:
  Rat  1649 PFIECNGGRGTCHYFANKYSFWLTTIPEQNFQSTPSADTLKAGLIRTHISRCQVCMK 1705
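
Each alignment block above consists of a fly row, a match line, and a rat row. In the match line, `|` sits under positions where the two residues are identical. The page does not define `:` and `.`, so the sketch below only tallies the symbols and does not interpret them. The function name `tally_match_line` is hypothetical, and the example string is the match line of the first block, copied by hand.

```python
from collections import Counter


def tally_match_line(match_line: str) -> dict:
    """Count the symbols that appear in one alignment match line."""
    counts = Counter(match_line)
    return {
        "|": counts["|"],      # identical residues
        ":": counts[":"],      # meaning not defined on this page
        ".": counts["."],      # meaning not defined on this page
        "blank": counts[" "],  # no symbol (mismatch or gap)
    }


# Match line of the first block (fly residues 11-75), without the leading indent.
MATCH_LINE = "|||....|.|...|.....|:.|:.. |.|...||..|.||.:|..|..||.|..|.||..   |"

first_block = tally_match_line(MATCH_LINE)
print(first_block)
```

When applying this to the whole alignment, slice each match line at the same column offset as its sequence rows, since trailing blanks may have been trimmed from the text.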

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain    Region       External ID  Identity
vkg     NP_001260071.1  Collagen  60..119      CDD:189968   27/58 (47%)
                        Collagen  102..152     CDD:189968   24/49 (49%)
                        Collagen  277..334     CDD:189968   30/61 (49%)
                        Collagen  534..593     CDD:189968   26/60 (43%)
                        Collagen  814..871     CDD:189968   30/56 (54%)
                        Collagen  957..1014    CDD:189968   30/80 (38%)
                        Collagen  990..1049    CDD:189968   29/62 (47%)
                        Collagen  1070..1128   CDD:189968   33/67 (49%)
                        C4        1515..1624   CDD:128421   50/108 (46%)
                        C4        1625..1737   CDD:128421   60/114 (53%)
Col4a2  XP_038951185.1  Collagen  70..121      CDD:396114   24/53 (45%)
                        Collagen  291..347     CDD:396114   30/67 (45%)
                        Collagen  488..538     CDD:396114   21/50 (42%)
                        Collagen  777..833     CDD:396114   28/55 (51%)
                        Collagen  903..959     CDD:396114   20/55 (36%)
                        Collagen  1135..1189   CDD:396114   25/56 (45%)
                        Med15     1212..>1492  CDD:312941   127/337 (38%)
                        Collagen  1327..1383   CDD:396114   25/60 (42%)
                        C4        1485..1589   CDD:396133   47/105 (45%)
                        C4        1595..1704   CDD:396133   59/108 (55%)


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         1             1.010           - - D41315at33208
OrthoFinder     1             1.000           - - FOG0000443
OrthoInspector  1             1.000           - - otm45195
orthoMCL        1             0.900           - - OOG6_100768
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           6             5.820
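
Six tools match this pair, and their weighted scores sum to 5.820. That is consistent with reading the final line of the table as a simple total of 6 and a weighted total of 5.820. The sketch below checks the sum with Python's `decimal` module, using only the scores listed above. The variable names are illustrative, and treating the total as a plain sum is an inference from the numbers, not something the page states.

```python
from decimal import Decimal

# Weighted scores of the tools that report a match, copied from the table.
weighted_scores = {
    "OrthoDB": Decimal("1.010"),
    "OrthoFinder": Decimal("1.000"),
    "OrthoInspector": Decimal("1.000"),
    "orthoMCL": Decimal("0.900"),
    "Phylome": Decimal("0.910"),
    "SwiftOrtho": Decimal("1.000"),
}

# Every matching tool has a simple score of 1, so the simple total is the count.
simple_total = len(weighted_scores)

# Decimal avoids binary floating-point rounding and keeps three decimal places.
weighted_total = sum(weighted_scores.values(), Decimal("0"))

print(simple_total, weighted_total)
```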
