DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment COL4A3 and vkg

DIOPT Version :9

Sequence 1:NP_000082.2 Gene:COL4A3 / 1285 HGNCID:2204 Length:1670 Species:Homo sapiens
Sequence 2:NP_001260071.1 Gene:vkg / 33726 FlyBaseID:FBgn0016075 Length:1940 Species:Drosophila melanogaster


Alignment Length:1858 Identity:738/1858 - (39%)
Similarity:886/1858 - (47%) Gaps:339/1858 - (18%)


- Green bases have known domain annotations that are detailed below.


Human    12 LLLPLLLVLLAAAPAASKGCVCKDKGQCFCDGAKGEKGEKGFPGP---PGSPGQKGFTGPEGLPG 73
            ||..|:.|.||      .|.:|   ....|| .||.||..|.|||   ||..|..|..||.|..|
  Fly    18 LLGSLVSVTLA------DGKIC---NTTLCD-CKGIKGRMGAPGPIGVPGLEGPAGDIGPPGRAG 72

Human    74 PQGPKGFPGLPGLTGSKGVRGISGLPGFSGSPGLPGTPGNTGPYGLVGVPGCSGSKGEQGFPGLP 138
            |.|.||..|..|..|.||.||..|..|..|.||:.|..|..|..|..|:.||.|..|.||..|.|
  Fly    73 PLGEKGDVGEYGEQGEKGHRGDIGPKGEMGYPGIMGKSGEPGTPGPRGIDGCDGRPGMQGPSGAP 137

Human   139 GTLGYPGIPGAAGLKGQKGAPAKEEDIELDAKGDPGLPGAPGPQGLPGPPGFPGPVGPPGPPGFF 203
            |..|..|.||.   .||:|.|.:..:..:::||..|..|..|..|..|||||.|..|..|..|:.
  Fly   138 GQNGVRGPPGK---PGQQGPPGEAGEGGINSKGTKGNRGETGQPGGVGPPGFDGDRGSKGDTGYA 199

Human   204 GF------PGAMGPRGPKGHMGERVIGHKGERGVKGLTGPPGPPGTVIVTLTG---PDNRTDLKG 259
            |.      ||..||:|..|.:.|...         .|.||||..|....:|:|   ||:  .|||
  Fly   200 GLTGEKGDPGLPGPKGDTGAVSELPY---------SLIGPPGAKGEPGDSLSGVLKPDD--TLKG 253

Human   260 EKGDKGAMGEPGPPGP---------SGLPGESYGSEKGAPGDPGLQGKPGKDGVPGFPGSEGVKG 315
            .||..|..|:.||.||         :||||     .:|..|.||.:|||||||.||..|.:|:||
  Fly   254 YKGYVGLQGDEGPQGPTGEQGAVGRNGLPG-----ARGEIGGPGERGKPGKDGEPGRFGDKGMKG 313

Human   316 NRGFPGLMGEDGIKGQKGD---IGPPGFR---GPTEYYDTYQEKGDEGTPGPPGPRGARGPQGPS 374
            ..|:.|..|.||..|::|:   .|.||.:   ||...||....|   ..|||.|.:|..||.|..
  Fly   314 APGWTGADGLDGSPGERGEDGFTGMPGVQGGAGPPGIYDPSLTK---SLPGPIGSQGDIGPPGEQ 375

Human   375 GPPGVPGSPGSSRP-GLRGAPGWPGLKGSKGERGRPGKDAMGTPGSPGCAGSPGLPGSPGPPGPP 438
            ||||:||.||...| ||.|..|.|||.||   ||.||:...|..|..|..         ||||| 
  Fly   376 GPPGLPGKPGRRGPIGLAGQSGDPGLNGS---RGPPGRSERGEAGDYGFI---------GPPGP- 427

Human   439 GDIVFRKGPPGDHGLPGYLGSPGIPG--VDGPKGEPGLLCTQCPYIPGPPGLPGLPGLHGVKGIP 501
                  :||||:.||||..|..|.||  |.||||||||                       .|.|
  Fly   428 ------QGPPGEAGLPGRYGLHGEPGQNVVGPKGEPGL-----------------------NGQP 463

Human   502 GRQGAAGLKGSPGSPGNTGLPG----FPGFPGAQGDPGLKGEKGETLQPEGQVGVPGDPGLRGQP 562
            |.:|..|.:|..|.||:.||||    ..|.||:|                      |.||.||.|
  Fly   464 GLEGYRGDRGEVGLPGDKGLPGEGYNIVGPPGSQ----------------------GPPGFRGLP 506

Human   563 GRKGLDGIPGTPGVKGLPGPKGELALSGEKGDQGPPGDPGSPGSPGPAGPAGPPGYGPQGEPGLQ 627
            |..|.:|:.|.||.|||.|....:..:|.:|.:|..||.|.|||           :|.:|..||.
  Fly   507 GDDGYNGLRGLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYPGS-----------HGNRGAIGLT 560

Human   628 GTQGVPGAPGPPGEAGPRGELSVSTPVPGPPGPPGPPGHPGPQGPPG--------IPGSLGKCGD 684
            |.:||.|..|.||.||.:|       :|||.|.||.||..|..||.|        ..|.:|..||
  Fly   561 GPRGVQGLQGNPGRAGHKG-------LPGPAGIPGEPGKVGAAGPDGKAIEVGSLRKGEIGDTGD 618

Human   685 PGLPGPDGEPGIPG-IGFPGPPGPKGDQGFPGTKGSLGCPGKMGEPGLPGKPGLPGAKG-EPAVA 747
            .|..|..|:.|..| .|..|..|.:|:.|..|..|..|..|:.||||..|:.|.||... .|.|.
  Fly   619 SGHRGDTGDDGEKGRDGSDGSKGERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPGRNATTPKVY 683

Human   748 MPGGPGTPGFPGERGNSGEHGEIGLPGLPGLP-------GTPGNEGLDGPRGDPGQPGPP----- 800
            :.|.||..|..||||:.|:.|..|:.|.|. |       |.||.:|..||:|..|..|..     
  Fly   684 LIGEPGYDGIKGERGDDGDTGFKGVKGEPN-PGQIYDNTGEPGEDGYTGPKGVKGAKGEQGAIGL 747

Human   801 ----GEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRRGKTGPKGDPGIPGLDRSGFPGETGSPGIP 861
                |::||.|..|.||.||:|.||..|..||||..|..|..|:||:.|  ..|:.|:.|.||..
  Fly   748 RGEIGDRGPAGEVIPGPVGAKGYPGPTGDYGQQGAPGLPGRDGEPGLDG--GIGYKGQRGVPGQE 810

Human   862 GHQGEMGPLGQRGYPGNPGILGPPGE------------------DGVIGMMGFPG--------AI 900
            ..|||:||.|:.|..|.||.:|.||:                  ||.:|..|.||        .:
  Fly   811 VIQGEIGPPGRSGIKGFPGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPGNKGQRGDFLV 875

Human   901 GPPGPPGNPGTPGQRGSPGIPGVKGQRGTPGAKGEQGDKGNPGPSEISHVIGDKGEPGLKGFAGN 965
            |||||.|.||..|::...|..|.||:.|:.|..|:.|.||:         ||..|..||.|.||.
  Fly   876 GPPGPKGQPGRNGRQAPHGAKGQKGEVGSLGQNGQNGAKGS---------IGFSGRRGLLGNAGL 931

Human   966 PGEKGNRGVPGMPGL---------KGLKGLPGPAGPPGPRGDLGSTGNPGEPGLRGIPGSMGNMG 1021
            .|..|:.|:||:||:         .|..|..|..||.||.|:.|..|..|:.|..|.||:   .|
  Fly   932 QGLPGSPGIPGLPGMIGEIGERGEIGYNGRQGDIGPRGPNGEFGPKGLSGDDGPDGYPGA---NG 993

Human  1022 MPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGT-RPGPP------GPTGD--PGLPG 1077
            :||.||:.|..|||||.|..|:....|::||.||.|.:... .||.|      ||.||  |.|.|
  Fly   994 LPGRKGETGNPGFPGRPGAKGVAAYSGIKGDDGESGLTGPIGYPGAPGAKGQRGPVGDSQPALDG 1058

Human  1078 DMGKKGEMGQPGP---PGH---------LGPAGPEGAPGSPGSPGLPGKPGPHGDLGFKGIKGLL 1130
            ..|:|||:|.|||   ||.         .|..|.:|.||.||:.||.|.||.:|..|.||..|..
  Fly  1059 VAGRKGEVGSPGPNGLPGRHGLKGQRGDRGLPGQQGRPGEPGAKGLGGYPGRNGINGLKGATGFP 1123

Human  1131 GPPGIRGPPGLPGFPGSPGPMGIRGDQGRDGIPGPAGEKGETG------------------LLRA 1177
            ||.|.:||.|..|..|..|..|..||||..|:.|..||:||.|                  ..|.
  Fly  1124 GPQGPKGPQGESGVVGLDGRNGQIGDQGPRGLIGEQGEQGEQGDEGEVGIPGRLENLRDRSFYRG 1188

Human  1178 PPGPRGNPGAQGAKGDRGAPGFPGLPGRKGAMGDA---------GPRGPTGIEGFPGPPGLPGAI 1233
            ..|.:|..|.:|.:||.|..||.|.||.||..||.         |..|..|.:|..||.|.||..
  Fly  1189 FTGDQGLQGERGEQGDMGPIGFIGPPGAKGERGDIGYAGQLGFDGAEGLKGFQGDQGPRGPPGIT 1253

Human  1234 IPGQTGNRGPPGSRGSPGAP------GPPGPPGSHVIGIKGDKGSMGHPGP--KGPPGTAGDMGP 1290
            :|.:.|:.|..|..|..|.|      |.|||||.:     |..|::||.||  :||||..||:|.
  Fly  1254 LPAEKGDEGVAGLDGRAGRPGHFGQKGAPGPPGEN-----GPNGAIGHRGPQIQGPPGPQGDVGF 1313

Human  1291 PGRLGAPGTPGLPGPR--------------------------GDPGFQGFP---GVKGEKGNPGF 1326
            ||..|..|..||.||:                          ||.||||.|   |.|||:|.||.
  Fly  1314 PGAPGHNGRHGLIGPKGELGDMGRQGERGESGYAIVGRQGDIGDIGFQGEPGWDGAKGEQGYPGL 1378

Human  1327 LGS---IGPPGPIGPKGPPGVRGDPGTLKIISLPGSPG--------PPGTPGEPGM---QGEPGP 1377
            .|.   :|.|||.||.|..|..|..|...::...|.||        .||..||||:   |||.|.
  Fly  1379 PGKNGRVGAPGPRGPTGDAGWGGIDGMDGLVGPKGQPGVTYSYSMARPGDRGEPGLDGFQGEEGD 1443

Human  1378 PGPPGNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPAGSDGLPGLKGKRGDSGSPA--- 1439
            .|.||.:|..|.||..|..|..|..|..|..|.:|.:|:.|..|..|.|||:|..|..|.||   
  Fly  1444 GGAPGLIGFQGQRGAVGYRGDQGEVGYTGADGPQGQRGDKGYMGLTGAPGLRGLPGPQGEPAPAP 1508

Human  1440 -TWTTRGFVFTRHSQTTAIPSCPEGTVPLYSGFSFLFVQGN---QRAHGQDLGTLGSCLQRFTTM 1500
             ...:|||:|.||||:..:|.||..|..|:.|:|   :.||   .||.|||||..|||:.|||||
  Fly  1509 PAPKSRGFIFARHSQSVHVPQCPANTNLLWEGYS---LSGNVAASRAVGQDLGQSGSCMMRFTTM 1570

Human  1501 PFLFCNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEGPAIAIAVHSQTT 1565
            |::.|::.:||:||..||.|.||||...|||.|.||.||.|..|||||.|||.....||:|||:.
  Fly  1571 PYMLCDITNVCHFAQNNDDSLWLSTAEPMPMTMTPIQGRDLMKYISRCVVCETTTRIIALHSQSM 1635

Human  1566 DIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRGTCNYYSNSYS 1630
            .||.||.||..:|.|:|:.|.|.....|.||.|.|||||||||||.|.:||||.|.||||....|
  Fly  1636 SIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQNLVSPGSCLEEFRAQPVIECHGHGRCNYYDALAS 1700

Human  1631 FWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMKK 1668
            |||..:..:..|.:|...|:||....| ||||.||.::
  Fly  1701 FWLTVIEEQDQFVQPRQQTLKADFTSK-ISRCTVCRRR 1737

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
COL4A3NP_000082.2 7S domain 29..42 2/12 (17%)
Triple-helical region 43..1438 607/1588 (38%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 49..78 15/31 (48%)
Collagen 67..124 CDD:189968 25/56 (45%)
Collagen 97..155 CDD:189968 23/57 (40%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 167..469 132/328 (40%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 502..1442 414/1107 (37%)
Cell attachment site. /evidence=ECO:0000255 791..793 0/1 (0%)
Collagen 830..885 CDD:189968 23/54 (43%)
Cell attachment site. /evidence=ECO:0000255 996..998 0/1 (0%)
Cell attachment site. /evidence=ECO:0000255 1154..1156 0/1 (0%)
Collagen 1164..1219 CDD:189968 25/81 (31%)
Cell attachment site. /evidence=ECO:0000255 1306..1308 0/27 (0%)
Cell attachment site. /evidence=ECO:0000255 1345..1347 0/1 (0%)
Collagen 1390..1438 CDD:189968 19/47 (40%)
Epitope recognized by Goodpasture antibodies 1427..1444 7/20 (35%)
Cell attachment site. /evidence=ECO:0000255 1432..1434 0/1 (0%)
C4 1446..1552 CDD:279721 58/108 (54%)
Required for the anti-angiogenic activity of tumstatin 1479..1557 46/80 (58%)
C4 1556..1666 CDD:279721 56/109 (51%)
Required for the anti-tumor cell activity of tumstatin 1610..1628 11/17 (65%)
vkgNP_001260071.1 Collagen 60..119 CDD:189968 26/58 (45%)
Collagen 102..152 CDD:189968 22/52 (42%)
Collagen 277..334 CDD:189968 28/61 (46%)
Collagen 534..593 CDD:189968 31/76 (41%)
Collagen 814..871 CDD:189968 18/56 (32%)
Collagen 957..1014 CDD:189968 27/59 (46%)
Collagen 990..1049 CDD:189968 24/61 (39%)
Collagen 1070..1128 CDD:189968 23/57 (40%)
C4 1515..1624 CDD:128421 61/111 (55%)
C4 1625..1737 CDD:128421 57/112 (51%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 1 1.000 133 1.000 Domainoid score I5070
eggNOG 00.000 Not matched by this tool.
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D41315at33208
OrthoFinder 1 1.000 - - FOG0000443
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 1 0.900 - - OOG6_100768
Panther 1 1.100 - - O PTHR24023
Phylome 1 0.910 - -
RoundUp 00.000 Not matched by this tool.
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 00.000 Not matched by this tool.
TreeFam 00.000 Not matched by this tool.
User_Submission 00.000 Not matched by this tool.
65.920

Return to query results.
Submit another query.