DRSC/TRiP Functional Genomics Resources


Protein Alignment vkg and COL4A2

DIOPT Version: 9

Sequence 1: NP_001260071.1  Gene: vkg / 33726  FlyBaseID: FBgn0016075  Length: 1940  Species: Drosophila melanogaster
Sequence 2: NP_001837.2  Gene: COL4A2 / 1284  HGNCID: 2203  Length: 1712  Species: Homo sapiens


Alignment Length: 1894  Identity: 764/1894 (40%)
Similarity: 929/1894 (49%)  Gaps: 347/1894 (18%)
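
The percentages in the summary follow directly from the reported counts divided by the alignment length. The sketch below reproduces them and shows how identical and gapped columns could be counted from a pair of aligned rows. The function names `percent` and `count_columns` are illustrative and not part of DIOPT; rounding to the nearest whole percent is an assumption that happens to match the displayed values. Similarity is not recomputed here because it depends on the substitution matrix used by the aligner, which this page does not state.

```python
def percent(count: int, length: int) -> int:
    """Whole-number percentage (assumed rounding to nearest integer)."""
    return round(100 * count / length)


def count_columns(row1: str, row2: str) -> dict:
    """Count identical and gapped columns in two aligned rows.

    Both rows must have the same length; '-' marks a gap.
    """
    if len(row1) != len(row2):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    gaps = 0
    for a, b in zip(row1, row2):
        if a == "-" or b == "-":
            gaps += 1          # column with a gap in either sequence
        elif a == b:
            identical += 1     # same residue in both sequences
    return {"length": len(row1), "identical": identical, "gaps": gaps}


# Counts as reported in the summary lines.
ALIGNMENT_LENGTH = 1894
print(percent(764, ALIGNMENT_LENGTH))  # identity   -> 40
print(percent(929, ALIGNMENT_LENGTH))  # similarity -> 49
print(percent(347, ALIGNMENT_LENGTH))  # gaps       -> 18

# Final alignment block (fly 1728-1736 vs. human 1702-1710).
print(count_columns("ISRCTVCRR", "ISRCQVCMK"))
# -> {'length': 9, 'identical': 6, 'gaps': 0}
```

The six identical columns in the final block correspond to the six `|` characters on its match line.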


- Residues shown in green on the original page have known domain annotations, which are detailed below.


  Fly     4 RDLRHLSG-------LLGVVYLLGSLVSVTLA---------DGKICNTTLCDCKGIKGRMGAPGP 52
            ||.|.::|       |||.| .:|.|....||         .|:.|:.. |.|...||..|.|||
Human     3 RDQRAVAGPALRRWLLLGTV-TVGFLAQSVLAGVKKFDVPCGGRDCSGG-CQCYPEKGGRGQPGP 65

  Fly    53 IGVPGLEGPAGDIGPPGRAGPLGEKGDVGEYGEQGEKGHRGDIGPKGEMGYPGIMGKSGEPGTPG 117
            :|..|..||.|..|.||..   |.|||.||.|..|..|.:||:|.:|..|:||..|..|.||..|
Human    66 VGPQGYNGPPGLQGFPGLQ---GRKGDKGERGAPGVTGPKGDVGARGVSGFPGADGIPGHPGQGG 127

  Fly   118 PR---GIDGCDGRPGMQGPSGAPGQNGVRGPP---GKPGQQGPP------------GEAGEGG-I 163
            ||   |.|||:|..|..||.|.||..|..|||   |..||:|.|            ||.||.| :
Human   128 PRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPKEERDRYRGEPGEPGLV 192

  Fly   164 NSKGTKGNRGETGQ---------PGGVGPPGFDGDRGSKGDTGYAGLTGEKGDPGLPGPKG---D 216
            ..:|..|..|..||         ||..||||..|.:|::| .|:.|:.|||||.|.|||.|   |
Human   193 GFQGPPGRPGHVGQMGPVGAPGRPGPPGPPGPKGQQGNRG-LGFYGVKGEKGDVGQPGPNGIPSD 256

  Fly   217 T-------GAVSELPYSLIGPPGAKGEPGDSLSGV-LKPDDTLKGY---KGYVGL---------- 260
            |       ..|:..|....|..|::||||  :.|: ||.::.:.|:   :||.||          
Human   257 TLHPIIAPTGVTFHPDQYKGEKGSEGEPG--IRGISLKGEEGIMGFPGLRGYPGLSGEKGSPGQK 319

  Fly   261 --------QGDEGPQGPTGEQGAVGRNGLP----------GARGEIGGPGERGKPGKDGEPGRFG 307
                    ||.:||:||.||.|..|..|||          ||||:.|.||.:|:||..|||   |
Human   320 GSRGLDGYQGPDGPRGPKGEAGDPGPPGLPAYSPHPSLAKGARGDPGFPGAQGEPGSQGEP---G 381

  Fly   308 DKGMKGAPGWTGADG--LDGSPGERGEDGFTGMPGVQGGAGPPGIYDPSLTKSLPGPIGSQGDIG 370
            |.|:.|.||.:..||  ..|.|||.|..||.|.||:      |.:|.        ||.|..|..|
Human   382 DPGLPGPPGLSIGDGDQRRGLPGEMGPKGFIGDPGI------PALYG--------GPPGPDGKRG 432

  Fly   371 PPGEQGPPGLPGKPGRRGPI----GLAGQSGDPGLNGS---RGPPG-RSERGEA----GDYGFIG 423
            ||   |||||||.||..|.:    |..|::|.|||.||   |||.| :.:.||.    ||....|
Human   433 PP---GPPGLPGPPGPDGFLFGLKGAKGRAGFPGLPGSPGARGPKGWKGDAGECRCTEGDEAIKG 494

  Fly   424 PPGPQGPPGEAGLPGRYGLHGEPGQNVVGPKGEPGLNGQPGLEGYRGDRGEVGLPGDKGLPGEGY 488
            .||..||.|.|      |::||||:.  |.:|:||.:|.||..|.:|..|.:|.||.||..|:..
Human   495 LPGLPGPKGFA------GINGEPGRK--GDRGDPGQHGLPGFPGLKGVPGNIGAPGPKGAKGDSR 551

  Fly   489 NIV-----GPPGSQGPPGFRGLPGDDGYNGLRGLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYP 548
            .|.     |.||..|.||.:|..|..|.:||.|.||..|..||          |.:|..||.|||
Human   552 TITTKGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGD----------GIKGPPGDPGYP 606

  Fly   549 GSHGNRGAIGLTGPRGVQGLQGNPGRAGHKGLPGPAGIPGEPGKVGAAGPDGKAIEVG---SLRK 610
            |..|.:|..|..||.|:    |.||..|.:|.||.||:||.||.:|..||.|...::.   .:::
Human   607 GIPGTKGTPGEMGPPGL----GLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQIDCDTDVKR 667

  Fly   611 GEIGDTGDSGHRGDTGDDGEKGRDGSDGSKGERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPGR 675
            ...||..::...|..|     |..|..|..|..|.||.:|..|..|:.|.||.||..|..|..||
Human   668 AVGGDRQEAIQPGCIG-----GPKGLPGLPGPPGPTGAKGLRGIPGFAGADGGPGPRGLPGDAGR 727

  Fly   676 NATTPKVYLIGEPGYDGIKGERGDDGDTGFKGVKGEPNPGQIYDNTGEPGEDGYTG-----PKGV 735
            .         |.||..|..|.||..|..|..|..|.|.|..:....|.|||.|..|     ..|.
Human   728 E---------GFPGPPGFIGPRGSKGAVGLPGPDGSPGPIGLPGPDGPPGERGLPGEVLGAQPGP 783

  Fly   736 KGAKGEQGAIGLRGEIGDRGPAGEVIPGPVGAKGYPGPTGDYGQQGAPGLPGRDGEPGLDGGIGY 800
            :|..|..|..||:|..|||||     ||..|::|.||..|..||   |||||..|:|||.|..|.
Human   784 RGDAGVPGQPGLKGLPGDRGP-----PGFRGSQGMPGMPGLKGQ---PGLPGPSGQPGLYGPPGL 840

  Fly   801 KGQRGVPGQEVIQGEIGPPGRSGIKGFPGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPG 865
            .|..|.||||   |.:|.||..|.:|.|||.|.||..|..|..|.||:.|::|..|..|:.|.||
Human   841 HGFPGAPGQE---GPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGEQGHPG 902

  Fly   866 NKGQRG-DFLVGPPGPK---GQPGRNGRQAPHGAKGQKGEVGSLGQNGQNGAKGSIGFSGRRGLL 926
            :.|.:| |.:.|.||.|   |.||.:|.|...|.||:.|..||.|:      .|..|..|.:||.
Human   903 SPGFKGIDGMPGTPGLKGDRGSPGMDGFQGMPGLKGRPGFPGSKGE------AGFFGIPGLKGLA 961

  Fly   927 GNAGLQGLPGSPGIPG-----LPGMIGEIGERGEIGYNGRQGDIGPRGPNGEFGPKGLSGDDGPD 986
            |..|.:|..|.||.||     ||||....||:         ||.||.|..|..|.||:.|..|..
Human   962 GEPGFKGSRGDPGPPGPPPVILPGMKDIKGEK---------GDEGPMGLKGYLGAKGIQGMPGIP 1017

  Fly   987 GYPGANGLPGR-------KGETGNPGFPGRPGAKGVAAYSGIKGDDGESGLTGPIGYPGAPGAKG 1044
            |..|..|||||       ||:.|.||.||.||..|||         |..|:|   |:||..|::|
Human  1018 GLSGIPGLPGRPGHIKGVKGDIGVPGIPGLPGFPGVA---------GPPGIT---GFPGFIGSRG 1070

  Fly  1045 QRGPVGDSQPALDGVAGRKGEVGSPGPNG-------LPGRHGLKGQRGDRGLPGQQGRPGEPGAK 1102
            .:|        ..|.||..||:|:.|..|       ||||.||||:||..|:||.:|..||.|.:
Human  1071 DKG--------APGRAGLYGEIGATGDFGDIGDTINLPGRPGLKGERGTTGIPGLKGFFGEKGTE 1127

  Fly  1103 GLGGYPGRNGIN------GLKGATGFPGPQGPKGPQGESGVVGLDGRNGQIGDQGPRGLIGEQGE 1161
            |..|:||..|:.      ||||.|||||..||.|.|||.|.:||.|..|..|..|..||.|..|.
Human  1128 GDIGFPGITGVTGVQGPPGLKGQTGFPGLTGPPGSQGELGRIGLPGGKGDDGWPGAPGLPGFPGL 1192

  Fly  1162 QGEQGDEGEVGIPGRLENLRDRSFYRGFT--------GDQGLQGERGEQGDMGPIGFI-GPPGAK 1217
            :|.:|..   |:||.          :||.        ||.|..|..||:||.|....: ||.|..
Human  1193 RGIRGLH---GLPGT----------KGFPGSPGSDIHGDPGFPGPPGERGDPGEANTLPGPVGVP 1244

  Fly  1218 GERGDIGYAGQLGFDGAEGLKGFQGDQGPRGPPGITLPAEKGDEGVAGLDGRAGRPGHFGQKGAP 1282
            |::||.|..|:.|..|:.||:||         ||||.|:     .::|..|..|.||.||.||..
Human  1245 GQKGDQGAPGERGPPGSPGLQGF---------PGITPPS-----NISGAPGDKGAPGIFGLKGYR 1295

  Fly  1283 GPPGENGPNGAIGHRGPQIQGPPGPQGDVGFPGAPGHNGRHGLI---GPKGELGDMGRQGERGES 1344
            ||||..| :.|:          ||.:||.|.|||||..|..|..   ||:|..|..|..||:|..
Human  1296 GPPGPPG-SAAL----------PGSKGDTGNPGAPGTPGTKGWAGDSGPQGRPGVFGLPGEKGPR 1349

  Fly  1345 GYAIVGRQGDIGDIGFQGEPGWDGAKGEQGYPGLPGKNGRVGAPGPRG-PTGDAGWGGIDGMDGL 1408
                 |.||.:|:.|..|..|..|.||.:|.||.||..|.|||||..| |.      .|....|.
Human  1350 -----GEQGFMGNTGPTGAVGDRGPKGPKGDPGFPGAPGTVGAPGIAGIPQ------KIAVQPGT 1403

  Fly  1409 VGPKGQPGVTYSYSMARPGDRGEPGLDGFQGEEGDGGAPGLIGFQGQRGAV----GYRGDQGEVG 1469
            |||:|:.|        .||..||.|..|..||.|..||||..|.|| ||.|    |:|||:|.:|
Human  1404 VGPQGRRG--------PPGAPGEMGPQGPPGEPGFRGAPGKAGPQG-RGGVSAVPGFRGDEGPIG 1459

  Fly  1470 YTGADGPQGQRGDKGYMGLTGAPGLRGLPGPQGEPAPAPPAPKSRGFIFARHSQSVHVPQCPANT 1534
            :      ||..|.:|..|..|:|||.|:||          ...|.|::..:|||:...|.||...
Human  1460 H------QGPIGQEGAPGRPGSPGLPGMPG----------RSVSIGYLLVKHSQTDQEPMCPVGM 1508

  Fly  1535 NLLWEGYSLSGNVAASRAVGQDLGQSGSCMMRFTTMPYMLCDITNVCHFAQNNDDSLWLSTAEPM 1599
            |.||.||||.......:|..||||.:|||:.||:|||::.|:..:||::|..||.|.||||..|:
Human  1509 NKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASRNDKSYWLSTTAPL 1573

  Fly  1600 PMTMTPIQGRDLMKYISRCVVCETTTRIIALHSQSMSIPDCPGGWEEMWTGYSYFMSTLDNVGGV 1664
            |  |.|:...::..|||||.|||.....||:|||.:|||.||.||..:|.|||:.|.|.....|.
Human  1574 P--MMPVAEDEIKPYISRCSVCEAPAIAIAVHSQDVSIPHCPAGWRSLWIGYSFLMHTAAGDEGG 1636

  Fly  1665 GQNLVSPGSCLEEFRAQPVIECH-GHGRCNYYDALASFWLTVIEEQDQFVQPRQQTLKAD-FTSK 1727
            ||:|||||||||:|||.|.|||: |.|.|:||....|||||.|.||.....|...||||. ..:.
Human  1637 GQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIPEQSFQGSPSADTLKAGLIRTH 1701

  Fly  1728 ISRCTVCRR 1736
            ||||.||.:
Human  1702 ISRCQVCMK 1710

Known Domains:


Indicated by green residues in the alignment on the original page.

Gene    Sequence        Domain                                             Region      External ID  Identity
vkg     NP_001260071.1  Collagen                                           60..119     CDD:189968   27/58 (47%)
                        Collagen                                           102..152    CDD:189968   27/55 (49%)
                        Collagen                                           277..334    CDD:189968   30/68 (44%)
                        Collagen                                           534..593    CDD:189968   27/58 (47%)
                        Collagen                                           814..871    CDD:189968   25/56 (45%)
                        Collagen                                           957..1014   CDD:189968   27/63 (43%)
                        Collagen                                           990..1049   CDD:189968   27/65 (42%)
                        Collagen                                           1070..1128  CDD:189968   33/70 (47%)
                        C4                                                 1515..1624  CDD:128421   50/108 (46%)
                        C4                                                 1625..1737  CDD:128421   61/114 (54%)
COL4A2  NP_001837.2     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  60..237                  75/180 (42%)
                        Collagen                                           67..121     CDD:189968   25/56 (45%)
                        Triple-helical region                              184..1484                579/1478 (39%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  271..448                 77/198 (39%)
                        Collagen                                           293..346    CDD:189968   15/52 (29%)
                        Collagen                                           493..548    CDD:189968   27/62 (44%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  507..640                 60/148 (41%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  690..906                 98/235 (42%)
                        Collagen                                           791..849    CDD:189968   32/65 (49%)
                        Collagen                                           821..880    CDD:189968   33/64 (52%)
                        Collagen                                           857..916    CDD:189968   25/58 (43%)
                        Collagen                                           896..950    CDD:189968   23/59 (39%)
                        Collagen                                           1033..1088  CDD:189968   27/74 (36%)
                        Collagen                                           1107..1165  CDD:189968   28/57 (49%)
                        Collagen                                           1155..1212  CDD:189968   25/69 (36%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  1157..1480               151/386 (39%)
                        Collagen                                           1341..1394  CDD:189968   26/57 (46%)
                        C4                                                 1490..1594  CDD:279721   47/105 (45%)
                        C4                                                 1598..1709  CDD:279721   60/110 (55%)


Information from Original Tools:


Tool              Simple Score  Weighted Score  Original Tool Information
                                                BLAST Result Score  Score Type  Cluster ID
Compara           0             0.000           Not matched by this tool.
Domainoid         0             0.000           Not matched by this tool.
eggNOG            1             0.900           -                   -           E33208_3BAFZ
Hieranoid         0             0.000           Not matched by this tool.
Homologene        0             0.000           Not matched by this tool.
Inparanoid        0             0.000           Not matched by this tool.
Isobase           0             0.000           Not matched by this tool.
OMA               0             0.000           Not matched by this tool.
OrthoDB           1             1.010           -                   -           D41315at33208
OrthoFinder       1             1.000           -                   -           FOG0000443
OrthoInspector    0             0.000           Not matched by this tool.
orthoMCL          1             0.900           -                   -           OOG6_100768
Panther           0             0.000           Not matched by this tool.
Phylome           1             0.910           -                   -
RoundUp           0             0.000           Not matched by this tool.
SonicParanoid     0             0.000           Not matched by this tool.
SwiftOrtho        0             0.000           Not matched by this tool.
TreeFam           0             0.000           Not matched by this tool.
User_Submission   0             0.000           Not matched by this tool.
Total             5             4.720
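
The final figures in the table (5 and 4.720) are consistent with summing the per-tool simple and weighted scores of the five tools that matched this pair. The sketch below checks that arithmetic. It assumes the totals are plain sums; how DIOPT derives each tool's weight is not shown on this page. The names `MATCHED_TOOLS` and `score_totals` are illustrative only.

```python
from decimal import Decimal

# Weighted scores of the tools that reported this ortholog pair,
# copied from the table above. Each matched tool has simple score 1.
MATCHED_TOOLS = {
    "eggNOG": Decimal("0.900"),
    "OrthoDB": Decimal("1.010"),
    "OrthoFinder": Decimal("1.000"),
    "orthoMCL": Decimal("0.900"),
    "Phylome": Decimal("0.910"),
}


def score_totals(weights: dict) -> tuple:
    """Return (simple total, weighted total), assuming plain sums."""
    simple_total = len(weights)  # one point per matching tool
    weighted_total = sum(weights.values(), Decimal("0"))
    return simple_total, weighted_total


simple, weighted = score_totals(MATCHED_TOOLS)
print(simple, weighted)  # 5 4.720
```

`Decimal` is used instead of `float` so the three-decimal values add up exactly, without binary rounding noise.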
