DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment vkg and Col4a6

DIOPT Version :9

Sequence 1:NP_001260071.1 Gene:vkg / 33726 FlyBaseID:FBgn0016075 Length:1940 Species:Drosophila melanogaster
Sequence 2:XP_038956452.1 Gene:Col4a6 / 363458 RGDID:1589724 Length:1690 Species:Rattus norvegicus


Alignment Length:1806 Identity:733/1806 - (40%)
Similarity:889/1806 - (49%) Gaps:248/1806 - (13%)


- Green bases have known domain annotations that are detailed below.


  Fly    30 GKICNTTLCDCKGIKGRMGAPGPIGVPGLEGPAGDIGPPGRAGPLGEKGDVGEYGEQGEKGHRGD 94
            |:.|:.. |.|...||..|..||||..|..||.|..||.|.:|..||:|..|..|..|.||.:|.
  Rat    32 GQDCSGA-CACSPEKGARGHTGPIGTQGPAGPEGFAGPTGLSGLKGERGSPGRLGPYGPKGDKGP 95

  Fly    95 IGPKGEMGYPGIMGKSGEPGTPGPRGIDGC------------DGRPGMQGPSGAPGQNGVRGPPG 147
            ||..|.:|..||.|..|:||..||.|:|||            ||.||:.||.|.||..|.:|.|.
  Rat    96 IGVPGFVGISGIPGHPGQPGPRGPPGLDGCNGTQGAVGFPGTDGYPGVLGPPGLPGHKGAKGEPA 160

  Fly   148 K-----PGQQGPPGEAGEGGINSKGTKGNRGETGQPGGVGPPGFDGDRGSKG--------DTGYA 199
            .     .|.:|.||..|..||.  |..|:.|..|..|.:|||||.|..|..|        ..|:.
  Rat   161 SFQGSIIGMKGDPGLPGLHGIT--GPSGSPGSPGDAGPIGPPGFQGPPGPPGLPGPDGNMGLGFQ 223

  Fly   200 GLTGEKGDPGLPGPKGDTGAVSELPYSLI--GPPGAKGEPGDSLSGVLKPDDTLKGYKGYVGLQG 262
            |..|.|||.|||||.|...:..||.:...  |..|:|||||.            .|:.|..||.|
  Rat   224 GEKGIKGDVGLPGPAGPPPSTGELEFMGFPKGEKGSKGEPGP------------PGFPGRSGLPG 276

  Fly   263 DEGPQ-GPTGEQGAVGRNGLPGARGEIGGPGERGKPGKDGEPGRFGDKGMKGAPGWTGADGLDGS 326
              .|: |..||:|..|..||||.||.:|..|.:|.|||.|:.|..|..|..|.|   |..|..||
  Rat   277 --VPELGSIGEKGERGILGLPGPRGPVGSEGIQGYPGKQGKKGTSGFPGTNGFP---GIKGEKGS 336

  Fly   327 PGERGEDGFTGMPG--VQGGAGPPGIYDPSLTKSLPGPIGSQGDIGPPGEQGPPGLPGKPGRRGP 389
            .|.||.|.||...|  :.|..|.||:         ||..|.:||.|..|::||.|..|.|...|.
  Rat   337 IGVRGPDSFTDAEGTVISGFPGDPGV---------PGLPGLRGDEGIQGQRGPAGTAGLPSLTGL 392

  Fly   390 IGLAGQSGDPGLNGSRGPPGRSERGEAGDYGFIGPPGPQGPPGEAGLPGRYGLHG-----EPGQN 449
            .|..|..|.|||.|.||..||:..||||..|.:|.||..|.||.:|.|||..:.|     |||  
  Rat   393 PGALGPQGSPGLKGDRGNSGRTTFGEAGLPGRVGLPGLPGLPGPSGPPGRTFVSGPLLSIEPG-- 455

  Fly   450 VVGPKGEPGLNGQPGLEGYRGDRGEVGLPGDKGLPGEGYNIVGPPGSQGPPGFRGLPGDDGYNGL 514
            :.|.:||.||.|..|::|.:||.|.....|  |.|    ||       ||.|..||||..|..||
  Rat   456 LPGLQGEQGLKGHQGIKGVKGDSGFCACEG--GAP----NI-------GPHGESGLPGIQGPIGL 507

  Fly   515 RGLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYPGSHGNRGAIGLTGPRGVQGLQGNPGRAGHKG 579
            ||:   ||.|||.         |.||..|.||.||..|.||..||.|.:|...:......||.||
  Rat   508 RGI---KGTRGDP---------GSRGASGPTGTPGLFGPRGQTGLKGKKGEPTVSRGSKMAGDKG 560

  Fly   580 LPGPAGIP---GEPGKVGAAGPDGKAIEVGSLRKGEIGDTGDSGHRGDTGDDGEKGRDGSDGSKG 641
            .|||.|||   |.|||.|..|..|..           |..||.|    :|..||:|..|..|.||
  Rat   561 DPGPQGIPGLAGAPGKDGIPGLPGFP-----------GTQGDDG----SGFPGERGLPGLPGEKG 610

  Fly   642 ERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPGRNATTPKVYLIGEPGYDGIKGERGDDGDTGFK 706
            ..|.||.||          .|.||..|..|.|            |:.|.||:.|::      |.:
  Rat   611 HDGPTGPRG----------IGLPGLPGPRGLP------------GDKGVDGLPGQQ------GLR 647

  Fly   707 GVKGEPNPGQI---YDNTGEPGEDGYTGPKGVKGAKGEQGAIGLRGEIGDRGPAGEV-IPGPVGA 767
            |.||...|..|   |..:|.||..|:.|.||.:|..|..|..|..|..|:.|..|.: :|   |.
  Rat   648 GAKGVTLPCIIPGSYGPSGFPGAPGFPGSKGARGLPGIPGKPGTHGSKGEPGSPGLIHLP---GF 709

  Fly   768 KGYPGPTGDYGQQGAPGLPGRDGEPGLDGGIGYKGQRGVPG-----QEVIQGEIGPPGRSGIKGF 827
            .|:||..|:.|..|.|||.|:.|.||..|..|..|.:||.|     :....||.|..|..|.|||
  Rat   710 PGFPGARGEKGLPGFPGLLGKHGYPGKAGSPGVPGSKGVAGDIFGAENGASGEQGLQGLPGDKGF 774

  Fly   828 PGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPGNKGQRGDFLV----GPPGPKGQPGRNG 888
            |||.|.||..||:|:.|..|.|||:|..|..|..|.||..|....|.:    |.||..|.||.:|
  Rat   775 PGDSGLPGPKGLSGKSGMLGPKGERGNPGTSGPPGQPGPSGSTDPFGIKGTSGLPGAPGLPGISG 839

  Fly   889 RQAPHGAKGQKGEVGSLGQNGQNGAKGSIGFSGRRGLLGNAGLQGLPGSPGIPGLPGMIGEIGER 953
            ..   |.|||:|::|..|..|:.|..|..|..|.:||.|..|..||.|..|:||:||..||.|..
  Rat   840 HP---GKKGQRGDIGHPGSTGKRGLPGIKGLPGPQGLAGFLGSPGLSGVTGLPGIPGQKGEKGSS 901

  Fly   954 GEIGYNGRQGDIGPRGPNGEFGPKGLSGDDGPDGYPGANGLPGRKGETGNPG------------- 1005
            |.:|:.|..|..|..|.:   |.||.||..|..|.||..|.||.||:.|:||             
  Rat   902 GPVGFPGLPGLPGLPGAD---GLKGFSGSFGKVGQPGQAGTPGEKGDRGDPGPVGISSPRPPMLN 963

  Fly  1006 --FPGRPGAKGVAAYSGIKGDDGESGLTGPIGYPGAPGAKGQRGPVGDSQPALDGVAGRKGEVGS 1068
              |.|..|::|.|...|..|..|:.|..|..|.||||||.||...:             ||..||
  Rat   964 LWFKGEKGSQGSAGSDGFPGPRGDKGEPGIPGLPGAPGAPGQSNTI-------------KGLSGS 1015

  Fly  1069 PGPNGLPGRHGLKGQRGDRGLPGQQGRPGEPGAKGLGGYP---GRNGINGLKG----ATGFPGPQ 1126
            ||..|..||.||.|.:|..|:.|..|.||:.|::||.|.|   |..||.||||    ..|..|..
  Rat  1016 PGSPGSMGRRGLPGLKGSLGIAGFPGIPGKSGSQGLTGTPGPLGATGIPGLKGDQGPTLGISGSP 1080

  Fly  1127 GPKGPQGESGVVGLDGRNGQIGDQGPRGLIGEQGEQGEQGDEGEVGIPGRLENLRDRSFYRGFTG 1191
            ||||..||.|..|:.|::|.:||   ||..|.:|:.|:.|..|:.|.||.          .|..|
  Rat  1081 GPKGQPGELGFKGVKGKDGLVGD---RGYPGNKGDSGKVGSAGDPGFPGS----------PGLKG 1132

  Fly  1192 DQGLQGERGEQGDMGPIGFIGPPGAKGERGDIGYAGQLGFDGAEGLKGFQGDQGPRGPPGIT--- 1253
            ..|:.|:.|..|..|.:|.||.||..|..|..|:.   |..|..||.|..|.:|..|.||.:   
  Rat  1133 ISGMNGDPGFPGSSGHVGSIGRPGPSGLIGPKGFP---GLPGLHGLNGLPGTKGTHGTPGASITG 1194

  Fly  1254 ------LPAEKGDEGVAGL-DGRAGRPGHFGQKGAPGPPGENGPNGAIGHRG---PQ-IQGPPGP 1307
                  ||..||::|:.|: .|..|:.|..||||..|.||..||.|..|..|   |. |.|.|  
  Rat  1195 VPGPAGLPGPKGEKGMPGIVIGDPGKQGLRGQKGDQGSPGLQGPAGTPGASGISLPSVIAGQP-- 1257

  Fly  1308 QGDVGFPGAPGHNGRHGLIGPKGELGDMGRQGERGESGY-AIVGRQGDIGDIGFQGEPGWDGAKG 1371
             ||.|.||..|..||.||.||.|..|....||:.|:||: .|.|.||..|:   ||.||:.|..|
  Rat  1258 -GDPGQPGLDGERGRPGLPGPPGPPGPSSDQGDPGDSGFPGIPGLQGLKGN---QGLPGFSGLSG 1318

  Fly  1372 EQGYPGLPGKNGRVGAPGPRGPTGDAGWGGIDGM---DGLVGPKGQPGVTYSYSMAR---PGDRG 1430
            :.|..|:.|:.|.:|.||..||.||.|:.|:.|.   .|..||:|.||.| ..:.||   ||..|
  Rat  1319 DLGLKGMRGEPGLMGTPGKIGPPGDPGFPGMKGKAGPRGFSGPQGAPGHT-PIAEARHVPPGPLG 1382

  Fly  1431 EPGLDGFQGEEGDGGAPGLIGFQGQRGAVGYRGDQGEVGYTGADGPQGQRGDKGYMGLTGAPGLR 1495
            .||:||..|..||.|:.|.:|.||.:|..|..|..|.   :|..||.|..||.|..||.|.||..
  Rat  1383 LPGIDGIPGLTGDPGSQGSVGLQGSKGLPGIPGKDGP---SGLPGPSGILGDPGLPGLQGPPGFE 1444

  Fly  1496 GLPGPQGE-PAPAPPAPKSR-GFIFARHSQSVHVPQCPANTNLLWEGYSLSGNVAASRAVGQDLG 1558
            |.||.||. ..|..|....| |:...:||||.|||.||...:.||.||||.......:|..||||
  Rat  1445 GAPGNQGPIGQPGMPGHSVRVGYTLVKHSQSEHVPPCPIGMSQLWVGYSLLFVEGQEKAHNQDLG 1509

  Fly  1559 QSGSCMMRFTTMPYMLCDITNVCHFAQNNDDSLWLSTAEPMPMTMTPIQGRDLMKYISRCVVCET 1623
            .:|||:.||:|||::.|:|..|||:|:.||.|.||||..|:|  |.|:....:.:|||||.|||.
  Rat  1510 FAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIP--MMPVGETQIPQYISRCSVCEA 1572

  Fly  1624 TTRIIALHSQSMSIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQNLVSPGSCLEEFRAQPVIECHG 1688
            .::.||:|||.:::|.||.||..:|.|||:.|.|.....|.||:|||||||||:|||.|.|||.|
  Rat  1573 PSQAIAVHSQDITVPQCPLGWHSLWIGYSFLMHTAAGTEGGGQSLVSPGSCLEDFRANPFIECSG 1637

  Fly  1689 -HGRCNYYDALASFWLTVIEEQDQF-VQPRQQTLK-ADFTSKISRCTVCRR 1736
             .|.|:|:....|||||.:||:.|| .||..:.|| ....:::|||.||.:
  Rat  1638 ARGTCHYFANKYSFWLTTVEERGQFQEQPVSENLKTGQLHTRVSRCQVCMK 1688

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
vkgNP_001260071.1 Collagen 60..119 CDD:189968 27/58 (47%)
Collagen 102..152 CDD:189968 26/66 (39%)
Collagen 277..334 CDD:189968 26/56 (46%)
Collagen 534..593 CDD:189968 28/61 (46%)
Collagen 814..871 CDD:189968 29/56 (52%)
Collagen 957..1014 CDD:189968 25/71 (35%)
Collagen 990..1049 CDD:189968 28/73 (38%)
Collagen 1070..1128 CDD:189968 27/64 (42%)
C4 1515..1624 CDD:128421 54/108 (50%)
C4 1625..1737 CDD:128421 57/115 (50%)
Col4a6XP_038956452.1 Collagen 557..618 CDD:396114 31/75 (41%)
Collagen 595..651 CDD:396114 26/83 (31%)
Collagen 666..722 CDD:396114 22/58 (38%)
Collagen 696..750 CDD:396114 23/56 (41%)
Collagen 845..899 CDD:396114 24/53 (45%)
Collagen 1011..1065 CDD:396114 25/53 (47%)
Collagen 1403..1459 CDD:396114 25/58 (43%)
C4 1467..1571 CDD:396133 51/105 (49%)
C4 1575..1687 CDD:396133 56/111 (50%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 1 0.930 - - C166342144
Domainoid 00.000 Not matched by this tool.
eggNOG 00.000 Not matched by this tool.
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D41315at33208
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 1 1.000 - - otm45195
orthoMCL 1 0.900 - - OOG6_100768
Panther 1 1.100 - - O PTHR24023
Phylome 1 0.910 - -
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 1 1.000 - -
TreeFam 00.000 Not matched by this tool.
76.850

Return to query results.
Submit another query.