DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a3 and vkg

DIOPT Version :9

Sequence 1:NP_031760.2 Gene:Col4a3 / 12828 MGIID:104688 Length:1669 Species:Mus musculus
Sequence 2:NP_001260071.1 Gene:vkg / 33726 FlyBaseID:FBgn0016075 Length:1940 Species:Drosophila melanogaster


Alignment Length:1826 Identity:764/1826 - (41%)
Similarity:888/1826 - (48%) Gaps:256/1826 - (14%)


- Green bases have known domain annotations that are detailed below.


Mouse     7 PRFLVFL--LLTLLLLLAA--SPVASKGCVCKGKGQCLCAGTKGEKGEKGVPGSPGFPGQKGFPG 67
            ||.|..|  ||.::.||.:  |...:.|.:| ....|.|.|.||..|..|..|.||..|..|..|
  Fly     3 PRDLRHLSGLLGVVYLLGSLVSVTLADGKIC-NTTLCDCKGIKGRMGAPGPIGVPGLEGPAGDIG 66

Mouse    68 PEGLPGPQGPKGSPGLPGLTGPKGIRGITGLPGFAGPPGLPGLPGHPGPRGLAGLPGCNGSKGEQ 132
            |.|..||.|.||..|..|..|.||.||..|..|..|.||:.|..|.||..|..|:.||:|..|.|
  Fly    67 PPGRAGPLGEKGDVGEYGEQGEKGHRGDIGPKGEMGYPGIMGKSGEPGTPGPRGIDGCDGRPGMQ 131

Mouse   133 GFPGFPGTPGYAGLPGPDGLKGQKGEPAQGEDRGFNGKGDPGPPGVPGFQGFPGLPGFPGPAGPP 197
            | |.  |.||..|:.||.|..||:|.|.:..:.|.|.||..|..|..|..|..|.|||.|..|..
  Fly   132 G-PS--GAPGQNGVRGPPGKPGQQGPPGEAGEGGINSKGTKGNRGETGQPGGVGPPGFDGDRGSK 193

Mouse   198 GPPGFFGL------PGAMGPRGPKGHMGD---SVIGQKGERGMKG--LTGPPGPPGTVIFTLTQP 251
            |..|:.||      ||..||:|..|.:.:   |:||..|.:|..|  |:|...|..|:       
  Fly   194 GDTGYAGLTGEKGDPGLPGPKGDTGAVSELPYSLIGPPGAKGEPGDSLSGVLKPDDTL------- 251

Mouse   252 YNKSDFKGEKGDEGERGEPGPPGPSGPPG----DSYGSEKGAPGEPGPRGKPGKDGAP------- 305
                  ||.||..|.:|:.||.||:|..|    :.....:|..|.||.||||||||.|       
  Fly   252 ------KGYKGYVGLQGDEGPQGPTGEQGAVGRNGLPGARGEIGGPGERGKPGKDGEPGRFGDKG 310

Mouse   306 --GFPGTEGAKGNRGFPGLRGE---AGIKGRKGDIGPPGFPGPTEYYDAYLEKGERGMPGLPGPK 365
              |.||..||.|..|.||.|||   .|:.|.:|..||||.      ||..|.|      .||||.
  Fly   311 MKGAPGWTGADGLDGSPGERGEDGFTGMPGVQGGAGPPGI------YDPSLTK------SLPGPI 363

Mouse   366 GAR---GPQGPSGPPGVPGSPGLSRP-GLRGPIGWPGLKGSKGERGPPGKDTVGPPGPLGCPGSP 426
            |::   ||.|..||||:||.||...| ||.|..|.|||.||   |||||:...|..|..|..|.|
  Fly   364 GSQGDIGPPGEQGPPGLPGKPGRRGPIGLAGQSGDPGLNGS---RGPPGRSERGEAGDYGFIGPP 425

Mouse   427 GPPGPPGPPGCPGDIVFKCSPGEHGMPGDTGPPGVPGLDGPKGEPGSPCTECHCFPGPPGVPGFP 491
            ||.||||..|.||..      |.||.||.       .:.|||||||        ..|.||:.|:.
  Fly   426 GPQGPPGEAGLPGRY------GLHGEPGQ-------NVVGPKGEPG--------LNGQPGLEGYR 469

Mouse   492 GLDGIKGIPGGRGVPGLKGN-PGSPGSAGLPGFAGFPGDQGHPGLK---GDKG----DTPLPWGQ 548
            |..|..|:||.:|:||...| .|.|||.|.|||.|.|||.|:.||:   |:||    |.|     
  Fly   470 GDRGEVGLPGDKGLPGEGYNIVGPPGSQGPPGFRGLPGDDGYNGLRGLPGEKGLRGDDCP----- 529

Mouse   549 VGNPGDPGLRGLPGRKGFDGTPGGPGAKGPPGPQGEPALS---GRKGDQGPPGPPGFPGPPGPAG 610
            |.|.|..|.||..|..|:.|:.|..||.|..||:|...|.   ||.|.:|.|||.|.||.||..|
  Fly   530 VCNAGPRGPRGQEGDTGYPGSHGNRGAIGLTGPRGVQGLQGNPGRAGHKGLPGPAGIPGEPGKVG 594

Mouse   611 PAGPPG---------------------YGPQGEPGPKGAQGVPGVLGPPGEAGLKGEPSTSTPDL 654
            .|||.|                     .|..|:.|.||..|..|..|..||.|.:|:..    |.
  Fly   595 AAGPDGKAIEVGSLRKGEIGDTGDSGHRGDTGDDGEKGRDGSDGSKGERGETGQRGDYG----DA 655

Mouse   655 GPPGPPGPPGQAGPRGLPGL------------PGPVG-------KCDPGLPGPDGEP-------G 693
            |..|..|.||:.|..|.||.            ||..|       ..|.|..|..|||       .
  Fly   656 GYQGRDGEPGRDGRDGAPGRNATTPKVYLIGEPGYDGIKGERGDDGDTGFKGVKGEPNPGQIYDN 720

Mouse   694 IPEAGCPGPPGPKGNQGFPGTKGSPGCPGEMGKPGRPGE--PGIPGAKGEPS-VGRPGKPGKPGF 755
            ..|.|..|..||||.:|..|.:|:.|..||:|..|..||  ||..||||.|. .|..|:.|.||.
  Fly   721 TGEPGEDGYTGPKGVKGAKGEQGAIGLRGEIGDRGPAGEVIPGPVGAKGYPGPTGDYGQQGAPGL 785

Mouse   756 PGERGNAGENGDIGLPGLPGLP---------GTPGRGGLDGPPGDPGQPGSPGAKGSPG-RCIPG 810
            ||..|..|.:|.||..|..|:|         |.|||.|:.|.|||.|.||..|..|.|| :.:.|
  Fly   786 PGRDGEPGLDGGIGYKGQRGVPGQEVIQGEIGPPGRSGIKGFPGDVGAPGQYGLAGRPGPKGVKG 850

Mouse   811 PRGTQGLPGLNGLKGQPGRRGD-----TGPKGDPGIPGMDRSGVPGDPGPPGTPGCPGEMGPPGQ 870
            .:|..|..|..||.|..|:|||     .||||.||     |:|.....|..|..|..|.:|..||
  Fly   851 EQGPDGAVGQTGLPGNKGQRGDFLVGPPGPKGQPG-----RNGRQAPHGAKGQKGEVGSLGQNGQ 910

Mouse   871 KGYPGAPGFPGPPGEKGEVGMMGYPGTTGPPGLPGKPGSQGQRGSLGIPGMKGEKGRPGAKGERG 935
            .|..|:.||.|..|..|..|:.|.||:.|.|||||..|..|:||.:|..|.:|:.|..|..||.|
  Fly   911 NGAKGSIGFSGRRGLLGNAGLQGLPGSPGIPGLPGMIGEIGERGEIGYNGRQGDIGPRGPNGEFG 975

Mouse   936 EKGKPGPSQTTLLKGDKGEPGLKGFVGNPGEKGNRGNPGLPGPKGLEGLPGLPGPPGPRGDTGSR 1000
            .||..|..      |..|.||..|..|..||.||.|.||.||.||:....|:.|..|..|.||..
  Fly   976 PKGLSGDD------GPDGYPGANGLPGRKGETGNPGFPGRPGAKGVAAYSGIKGDDGESGLTGPI 1034

Mouse  1001 GNPGRPGPHGMPGSMGIM--GVPGPKGRKGTSGLPGLAGRPGLTGIHGPQGDKGEPGYSEGARPG 1063
            |.||.||..|..|.:|..  .:.|..||||..|.||..|.||..|:.|.:||:|.||  :..|||
  Fly  1035 GYPGAPGAKGQRGPVGDSQPALDGVAGRKGEVGSPGPNGLPGRHGLKGQRGDRGLPG--QQGRPG 1097

Mouse  1064 PPGPKGDPGLPGD---KGKKGERGVPGPPGQSGPAGPDGAPGSPGSPGHPGKPGPAGDLGLKGQK 1125
            .||.||..|.||.   .|.||..|.|||.|..||.|..|..|..|..|..|..||.|.:|.:|::
  Fly  1098 EPGAKGLGGYPGRNGINGLKGATGFPGPQGPKGPQGESGVVGLDGRNGQIGDQGPRGLIGEQGEQ 1162

Mouse  1126 GFPGPPGSTGPPGP-------------PGLPGLPGPMGMRGDQGRDGIPGPPGEKGETGLLG--- 1174
            |..|..|..|.||.             .|..||.|..|.:||.|..|..||||.|||.|.:|   
  Fly  1163 GEQGDEGEVGIPGRLENLRDRSFYRGFTGDQGLQGERGEQGDMGPIGFIGPPGAKGERGDIGYAG 1227

Mouse  1175 --------------AYPGPKGSPGV--PGAKGDRGVPGLSGLPGRKGVMGDVGPQGPPGTAGLPG 1223
                          ...||:|.||:  |..|||.||.||.|..||.|..|..|..||||..|..|
  Fly  1228 QLGFDGAEGLKGFQGDQGPRGPPGITLPAEKGDEGVAGLDGRAGRPGHFGQKGAPGPPGENGPNG 1292

Mouse  1224 PPGLPGAII---PGPKGDRGLPGLRGNPGEPGPPGPPGPIGKGIKGDKGFMGPPGPKG--LPGTV 1283
            ..|..|..|   |||:||.|.||..|:.|..|..||     ||..||.|..|..|..|  :.|..
  Fly  1293 AIGHRGPQIQGPPGPQGDVGFPGAPGHNGRHGLIGP-----KGELGDMGRQGERGESGYAIVGRQ 1352

Mouse  1284 GDMGPPGFPGAPGTPGLPGVRGDPGFPGFPGIKGEKGNPGFLGPIGHP---GPVGPKGPPGPRGK 1345
            ||:|..||   .|.||..|.:|:.|:||.||..|..|.||..||.|..   |..|..|..||:|:
  Fly  1353 GDIGDIGF---QGEPGWDGAKGEQGYPGLPGKNGRVGAPGPRGPTGDAGWGGIDGMDGLVGPKGQ 1414

Mouse  1346 PGTLKVISL--PGSPGPPGVPGQPGMKGD---PGPLGLPGIPGPCGPRGKPGKDGKPGTPGPAGT 1405
            ||.....|:  ||..|.||:.|..|.:||   ||.:|..|..|..|.||..|:.|..|..||.|.
  Fly  1415 PGVTYSYSMARPGDRGEPGLDGFQGEEGDGGAPGLIGFQGQRGAVGYRGDQGEVGYTGADGPQGQ 1479

Mouse  1406 KGNKGLKGQQGPPGLDGLPGLKGNPGDRGTPA-TGTRMRGFIFTRHSQTTAIPSCPEGTQPLYSG 1469
            :|:||..|..|.|||.||||.:|.|    .|| ...:.|||||.||||:..:|.||..|..|:.|
  Fly  1480 RGDKGYMGLTGAPGLRGLPGPQGEP----APAPPAPKSRGFIFARHSQSVHVPQCPANTNLLWEG 1540

Mouse  1470 FSLLFVQGN---KRAHGQDLGTLGSCLQRFTTMPFLFCNINNVCNFASRNDYSYWLSTPALMPMD 1531
            :||   .||   .||.|||||..|||:.||||||::.|:|.|||:||..||.|.||||...|||.
  Fly  1541 YSL---SGNVAASRAVGQDLGQSGSCMMRFTTMPYMLCDITNVCHFAQNNDDSLWLSTAEPMPMT 1602

Mouse  1532 MAPISGRALEPYISRCTVCEGPAMAIAVHSQTTAIPPCPQDWVSLWKGFSFIMFTSAGSEGAGQA 1596
            |.||.||.|..|||||.|||.....||:|||:.:||.||..|..:|.|:|:.|.|.....|.||.
  Fly  1603 MTPIQGRDLMKYISRCVVCETTTRIIALHSQSMSIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQN 1667

Mouse  1597 LASPGSCLEEFRASPFIECHGRGTCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGDLEKIISRC 1661
            |.|||||||||||.|.|||||.|.||||....||||..:..:..|.:|...|:|| |....||||
  Fly  1668 LVSPGSCLEEFRAQPVIECHGHGRCNYYDALASFWLTVIEEQDQFVQPRQQTLKA-DFTSKISRC 1731

Mouse  1662 QVCMKK 1667
            .||.::
  Fly  1732 TVCRRR 1737

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a3NP_031760.2 7S domain. /evidence=ECO:0000250|UniProtKB:Q01955 29..42 3/12 (25%)
Collagen 42..94 CDD:189968 23/51 (45%)
Triple-helical region 43..1436 626/1549 (40%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 44..473 191/459 (42%)
Collagen 288..343 CDD:189968 32/66 (48%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 500..1439 428/1065 (40%)
Collagen 712..765 CDD:189968 25/55 (45%)
Cell attachment site. /evidence=ECO:0000255 830..832 1/1 (100%)
Cell attachment site. /evidence=ECO:0000255 994..996 0/1 (0%)
Collagen 998..1055 CDD:189968 25/58 (43%)
Cell attachment site. /evidence=ECO:0000255 1152..1154 0/1 (0%)
Cell attachment site. /evidence=ECO:0000255 1304..1306 0/1 (0%)
Collagen 1377..1436 CDD:189968 26/58 (45%)
Epitope recognized by Goodpasture antibodies. /evidence=ECO:0000250|UniProtKB:Q01955 1425..1443 5/18 (28%)
C4 1445..1551 CDD:279721 62/108 (57%)
Required for the anti-angiogenic activity of tumstatin. /evidence=ECO:0000250|UniProtKB:Q01955 1478..1556 48/80 (60%)
C4 1555..1665 CDD:279721 56/109 (51%)
Required for the anti-tumor cell activity of tumstatin. /evidence=ECO:0000250|UniProtKB:Q01955 1609..1627 12/17 (71%)
vkgNP_001260071.1 Collagen 60..119 CDD:189968 27/58 (47%)
Collagen 102..152 CDD:189968 24/52 (46%)
Collagen 277..334 CDD:189968 25/56 (45%)
Collagen 534..593 CDD:189968 27/58 (47%)
Collagen 814..871 CDD:189968 24/56 (43%)
Collagen 957..1014 CDD:189968 26/62 (42%)
Collagen 990..1049 CDD:189968 27/58 (47%)
Collagen 1070..1128 CDD:189968 28/59 (47%)
C4 1515..1624 CDD:128421 65/111 (59%)
C4 1625..1737 CDD:128421 57/112 (51%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 1 1.000 131 1.000 Domainoid score I5118
eggNOG 00.000 Not matched by this tool.
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D41315at33208
OrthoFinder 1 1.000 - - FOG0000443
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 1 0.900 - - OOG6_100768
Panther 1 1.100 - - O PTHR24023
Phylome 1 0.910 - -
RoundUp 00.000 Not matched by this tool.
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 1 1.000 - -
TreeFam 1 0.960 - -
87.880

Return to query results.
Submit another query.