DRSC/TRiP Functional Genomics Resources

Protein Alignment COL4A4 and vkg

DIOPT Version: 9

Sequence 1: NP_000083.3  Gene: COL4A4 / 1286  HGNCID: 2206  Length: 1690  Species: Homo sapiens
Sequence 2: NP_001260071.1  Gene: vkg / 33726  FlyBaseID: FBgn0016075  Length: 1940  Species: Drosophila melanogaster


Alignment Length: 1837  Identity: 748/1837 (40%)
Similarity: 879/1837 (47%)  Gaps: 294/1837 (16%)
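
The percentages in the summary can be reproduced from the raw counts. The following is a minimal Python sketch; the function name `percent_of_alignment` is illustrative, and the use of truncation rather than rounding is an inference from the numbers shown (748/1837 is about 40.7% but is reported as 40%), not documented DIOPT behavior.

```python
def percent_of_alignment(count: int, alignment_length: int) -> int:
    """Return count as a whole-number percentage of alignment_length.

    Assumption: the value is truncated (floor), which matches the
    figures reported on this page.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return count * 100 // alignment_length


ALIGNMENT_LENGTH = 1837
# Counts copied from the alignment summary.
stats = {"Identity": 748, "Similarity": 879, "Gaps": 294}

for name, count in stats.items():
    pct = percent_of_alignment(count, ALIGNMENT_LENGTH)
    print(f"{name}: {count}/{ALIGNMENT_LENGTH} ({pct}%)")
```

With the counts above this yields 40%, 47%, and 16%, matching the summary.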


- Green residues have known domain annotations that are detailed below.


Human    30 LFSVQYVYGSGKKYIGPCGGRDC--SVCHCVPEKGSRGPPGP---PGPQGPIGPLGAPGPIGLSG 89
            |..|.|:.|| ...:....|:.|  ::|.|...||..|.|||   ||.:||.|.:|.||..|..|
  Fly    12 LLGVVYLLGS-LVSVTLADGKICNTTLCDCKGIKGRMGAPGPIGVPGLEGPAGDIGPPGRAGPLG 75

Human    90 EKGMRGDRGPPGAAGDKGDKGPTGVPGFPGLDGIPGHPGPPGPRGKPGMSGHNGSRGDPGFPGGR 154
            |||..|:.|..|..|.:||.||.|..|:||:.|..|.||.|||||..|..|..|.:|..|.||..
  Fly    76 EKGDVGEYGEQGEKGHRGDIGPKGEMGYPGIMGKSGEPGTPGPRGIDGCDGRPGMQGPSGAPGQN 140

Human   155 GALGPGGPLGH---PGEKGEKG-NSVFILGAVKGIQGDRGD---------PGLPGLPGSWGAGGP 206
            |..||.|..|.   |||.||.| ||       ||.:|:||:         ||..|..||.|..|.
  Fly   141 GVRGPPGKPGQQGPPGEAGEGGINS-------KGTKGNRGETGQPGGVGPPGFDGDRGSKGDTGY 198

Human   207 AGPTGYPGEPGLVGPPGQPGR-----------PGLKGNPGVGVKGQM-------GDPGEVGQQGS 253
            ||.||..|:|||.||.|..|.           ||.||.||..:.|.:       |..|.||.||.
  Fly   199 AGLTGEKGDPGLPGPKGDTGAVSELPYSLIGPPGAKGEPGDSLSGVLKPDDTLKGYKGYVGLQGD 263

Human   254 PGPTLLVEPPDFCLYKGEKGIKGIPGMVGLPGPPGRKGESGIGAKGEKGIP---GFPGPRGDPGS 315
            .||               :|..|..|.||..|.||.:||  ||..||:|.|   |.||..||.|.
  Fly   264 EGP---------------QGPTGEQGAVGRNGLPGARGE--IGGPGERGKPGKDGEPGRFGDKGM 311

Human   316 YGSPGF---------PGLKGELGLVGDPGLFGLIGPKG--DP----------GNRGHPGPPGVLV 359
            .|:||:         ||.:||.|..|.||:.|..||.|  ||          |::|..||||.  
  Fly   312 KGAPGWTGADGLDGSPGERGEDGFTGMPGVQGGAGPPGIYDPSLTKSLPGPIGSQGDIGPPGE-- 374

Human   360 TPPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLG--------RPGEA-CAGMIGPPGPQGFPGL 415
                  :||||.||.|||.|..|..|..|.|||.|        ..||| ..|.||||||||.||.
  Fly   375 ------QGPPGLPGKPGRRGPIGLAGQSGDPGLNGSRGPPGRSERGEAGDYGFIGPPGPQGPPGE 433

Human   416 PGLPGEAGIPGRPD----SAPGKPGKPGSPGLP---------GAPGLQGLPGSSVIYCSVGNPGP 467
            .||||..|:.|.|.    ...|:||..|.|||.         |.||.:||||..  |..||.||.
  Fly   434 AGLPGRYGLHGEPGQNVVGPKGEPGLNGQPGLEGYRGDRGEVGLPGDKGLPGEG--YNIVGPPGS 496

Human   468 QGIKGKVGPPGG------RGPKGEKGNEG----LCACEP-GPMGPPGPPGLPGRQGSKGDLGL-- 519
            ||..|..|.||.      ||..||||..|    :|...| ||.|..|..|.||..|::|.:||  
  Fly   497 QGPPGFRGLPGDDGYNGLRGLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYPGSHGNRGAIGLTG 561

Human   520 ----------PGWLGTKGDPGPPGAEGPPGLPGKHGASGPPG--------NKGAKGDMVVSRVKG 566
                      ||..|.||.|||.|.   ||.|||.||:||.|        .||..||...|   |
  Fly   562 PRGVQGLQGNPGRAGHKGLPGPAGI---PGEPGKVGAAGPDGKAIEVGSLRKGEIGDTGDS---G 620

Human   567 HKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGPPGDHEDATPGGKGFPGPLGPPGKAGPVGPPG 631
            |:|:.|.|      |:.|..|.||..||:|:.|..||:.||     |:.|..|.||:.|..|.||
  Fly   621 HRGDTGDD------GEKGRDGSDGSKGERGETGQRGDYGDA-----GYQGRDGEPGRDGRDGAPG 674

Human   632 LGFPGPP----GERGHPGVPGHPGVRGPDGLKGQKGD----TISCNVTYPGRHGPPGFDGPPGPK 688
            .....|.    ||.|:.|:.|..|..|..|.||.||:    .|..|.   |..|..|:.||.|.|
  Fly   675 RNATTPKVYLIGEPGYDGIKGERGDDGDTGFKGVKGEPNPGQIYDNT---GEPGEDGYTGPKGVK 736

Human   689 GFPGPQGAPGLSGSDGHKGR-----PGTPGTAEIPGPPGFRGDMGDPGFGGEKGSSPVGPPGPPG 748
            |..|.|||.||.|..|.:|.     ||..|....|||.|..|..|.||..|..|.     ||..|
  Fly   737 GAKGEQGAIGLRGEIGDRGPAGEVIPGPVGAKGYPGPTGDYGQQGAPGLPGRDGE-----PGLDG 796

Human   749 SPGVNGQKGIPGDPAF-GHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFLGLKGPKGRE 812
            ..|..||:|:||.... |.:||||:.|:.|.|      ||.|.||..|.||.||..|:||.:|.:
  Fly   797 GIGYKGQRGVPGQEVIQGEIGPPGRSGIKGFP------GDVGAPGQYGLAGRPGPKGVKGEQGPD 855

Human   813 GHAGFPGVPGPPGHSCE--RGAPGIPGQPGLPGYPGSPGAPGGKGQPGDVGPPGPAGMKGLPGLP 875
            |..|..|:||..|...:  .|.||..||||..|.....||.|.||:.|.:|..|..|.||..|..
  Fly   856 GAVGQTGLPGNKGQRGDFLVGPPGPKGQPGRNGRQAPHGAKGQKGEVGSLGQNGQNGAKGSIGFS 920

Human   876 GRPGAHGPPGLPGIPGPFGDDGLPGPPGPKGPRGLPGFPGFPGERGKPGAEGCPGAKGEPGEKGM 940
            ||.|..|..||.|:||..|..||||..|..|.||..|:.|..|:.|..|..|..|.||..|:.|.
  Fly   921 GRRGLLGNAGLQGLPGSPGIPGLPGMIGEIGERGEIGYNGRQGDIGPRGPNGEFGPKGLSGDDGP 985

Human   941 SGLPGDRGLRGAKGAIGPPGDEGEMAIISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEP- 1004
            .|.||..||.|.||..|.|            |.||.||..|...:.|.:||.|..|:.|..|.| 
  Fly   986 DGYPGANGLPGRKGETGNP------------GFPGRPGAKGVAAYSGIKGDDGESGLTGPIGYPG 1038

Human  1005 -----GRYGPPGFHR----GEPGEKGQPGPPGPPGPPGSTGLRGFIGFPGLPGDQGEPGSPGPPG 1060
                 |:.||.|..:    |..|.||:.|.|||.|.||..||:|..|..||||.||.||.||..|
  Fly  1039 APGAKGQRGPVGDSQPALDGVAGRKGEVGSPGPNGLPGRHGLKGQRGDRGLPGQQGRPGEPGAKG 1103

Human  1061 FSGIDGARGPKGNKGDPASHF----GPPGPKGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPG 1121
            ..|..|..|..|.||  |:.|    ||.||:||.|..|..|..|..|:||..|:.|.:|..|..|
  Fly  1104 LGGYPGRNGINGLKG--ATGFPGPQGPKGPQGESGVVGLDGRNGQIGDQGPRGLIGEQGEQGEQG 1166

Human  1122 PPGSSGPPG----------CPGDHGMPGLRGQPGEMGDPGPRGLQGDPGIPGPPGIKGPSGSPGL 1176
            ..|..|.||          ..|..|..||:|:.||.||.||.|..|.||..|..|..|.:|..|.
  Fly  1167 DEGEVGIPGRLENLRDRSFYRGFTGDQGLQGERGEQGDMGPIGFIGPPGAKGERGDIGYAGQLGF 1231

Human  1177 NGLHGLKGQKGTKGASGLHDVGPPGPVGIPGLKGERGDPGSPGI----SPPGPRGKKGPPGPPGS 1237
            :|..||||.:|        |.||.||.||. |..|:||.|..|:    ..||..|:||.|||||.
  Fly  1232 DGAEGLKGFQG--------DQGPRGPPGIT-LPAEKGDEGVAGLDGRAGRPGHFGQKGAPGPPGE 1287

Human  1238 SGPPGPAGATGRAPKDIPDPGPPGDQGPPGPDGPRGAPGPPGLPGSVDLL--RGEPGDCGLPGPP 1300
            :||.|..|..|           |..||||||.|..|.||.||..|...|:  :||.||.|..|..
  Fly  1288 NGPNGAIGHRG-----------PQIQGPPGPQGDVGFPGAPGHNGRHGLIGPKGELGDMGRQGER 1341

Human  1301 GP--------PGPPGPPGYKGFPGCDGKDGQKGPVGFPGPQGPHGFPGPPGEKGLPGPPGRKGPT 1357
            |.        .|..|..|::|.||.||..|::|..|.||..|..|.|||.|..|..|..|..|..
  Fly  1342 GESGYAIVGRQGDIGDIGFQGEPGWDGAKGEQGYPGLPGKNGRVGAPGPRGPTGDAGWGGIDGMD 1406

Human  1358 GLPGPRGEPG--------PPADVDDCPRIPGLPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLD 1414
            ||.||:|:||        .|.|       .|.||..|.:|.||..|.||:.|..|.  :|..|..
  Fly  1407 GLVGPKGQPGVTYSYSMARPGD-------RGEPGLDGFQGEEGDGGAPGLIGFQGQ--RGAVGYR 1462

Human  1415 GRRGVDGVPGSPGPPGRKGDTGEDGYPGGP---GPPGPIGDPGPKGFGPGYLGGFLLVLHSQTDQ 1476
            |.:|..|..|:.||.|::||.|..|..|.|   |.|||.|:|.|....| ...||:...|||:..
  Fly  1463 GDQGEVGYTGADGPQGQRGDKGYMGLTGAPGLRGLPGPQGEPAPAPPAP-KSRGFIFARHSQSVH 1526

Human  1477 EPTCPLGMPRLWTGYSLLYLEGQEKAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSY 1541
            .|.||.....||.||||.......:|..||||.:|||:..|:|:|:..|:|..|||:||.||.|.
  Fly  1527 VPQCPANTNLLWEGYSLSGNVAASRAVGQDLGQSGSCMMRFTTMPYMLCDITNVCHFAQNNDDSL 1591

Human  1542 WLASAAPLP--MMPLSEEAIRPYVSRCAVCEAPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMH 1604
            ||::|.|:|  |.|:....:..|:|||.|||...:.:|:|||..|||.||..|..:|.|||:.|.
  Fly  1592 WLSTAEPMPMTMTPIQGRDLMKYISRCVVCETTTRIIALHSQSMSIPDCPGGWEEMWTGYSYFMS 1656

Human  1605 TGAGDQGGGQALMSPGSCLEDFRAAPFLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDT 1669
            |.....|.||.|:|||||||:|||.|.:||.| .|.|:::....|||||.::...||.. |...|
  Fly  1657 TLDNVGGVGQNLVSPGSCLEEFRAQPVIECHG-HGRCNYYDALASFWLTVIEEQDQFVQ-PRQQT 1719

Human  1670 LKESQAQRQKISRCQVC 1686
            ||....  .|||||.||
  Fly  1720 LKADFT--SKISRCTVC 1734
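
Each block of the alignment above consists of three rows: the human sequence, a markup row, and the fly sequence. The sketch below tallies one such block in Python. It rests on assumptions that this page does not state: that `|` marks an identical pair and `:` a similar pair (the convention used by EMBOSS-style pairwise output), that `-` marks a gap, and that leading whitespace on the markup row has been preserved so that the three rows share a starting column. The names `ROW` and `tally_block` are illustrative.

```python
import re

# A sequence row looks like "Human    30 LFSVQ... 89":
# label, start position, aligned residues, end position.
ROW = re.compile(r"^\s*(\S+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def tally_block(top: str, markup: str, bottom: str) -> dict:
    """Count markup symbols and gapped columns for one three-row block."""
    m_top, m_bot = ROW.match(top), ROW.match(bottom)
    if m_top is None or m_bot is None:
        raise ValueError("top and bottom must be sequence rows")
    seq_top, seq_bot = m_top.group(3), m_bot.group(3)
    if len(seq_top) != len(seq_bot):
        raise ValueError("aligned rows must have equal length")

    # The markup row starts in the same column as the residues of the top row.
    start = m_top.start(3)
    # Pad in case trailing spaces were stripped from the markup row.
    symbols = markup[start:start + len(seq_top)].ljust(len(seq_top))

    return {
        "columns": len(seq_top),
        "identical": symbols.count("|"),
        "similar": symbols.count(":"),
        "gapped": sum(1 for a, b in zip(seq_top, seq_bot) if "-" in (a, b)),
    }
```

Summing these tallies over every block gives per-alignment counts for the displayed region. Whether they reproduce the reported identity, similarity, and gap totals depends on the assumptions above and on whether the page shows the full alignment.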

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain                                             Region      External ID  Identity
COL4A4  NP_000083.3     7S domain                                          39..64                   7/26 (27%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  61..173                  57/117 (49%)
                        Triple-helical region                              65..1459                 633/1569 (40%)
                        Cell attachment site. /evidence=ECO:0000255        94..96                   0/1 (0%)
                        Cell attachment site. /evidence=ECO:0000255        145..147                 0/1 (0%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  187..258                 37/97 (38%)
                        Cell attachment site. /evidence=ECO:0000255        189..191                 1/1 (100%)
                        Collagen                                           296..355    CDD:189968   31/82 (38%)
                        Cell attachment site. /evidence=ECO:0000255        310..312                 0/1 (0%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  369..390                 11/20 (55%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  405..451                 26/58 (45%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  469..1457                433/1083 (40%)
                        Cell attachment site. /evidence=ECO:0000255        724..726                 0/1 (0%)
                        Cell attachment site. /evidence=ECO:0000255        785..787                 0/1 (0%)
                        Collagen                                           829..880    CDD:189968   22/52 (42%)
                        Collagen                                           914..964    CDD:189968   20/49 (41%)
                        Cell attachment site. /evidence=ECO:0000255        989..991                 0/1 (0%)
                        Cell attachment site. /evidence=ECO:0000255        1212..1214               0/1 (0%)
                        C4                                                 1466..1570  CDD:279721   46/105 (44%)
                        C4                                                 1574..1686  CDD:279721   52/111 (47%)
vkg     NP_001260071.1  Collagen                                           60..119     CDD:189968   29/58 (50%)
                        Collagen                                           102..152    CDD:189968   24/49 (49%)
                        Collagen                                           277..334    CDD:189968   25/58 (43%)
                        Collagen                                           534..593    CDD:189968   25/61 (41%)
                        Collagen                                           814..871    CDD:189968   28/62 (45%)
                        Collagen                                           957..1014   CDD:189968   26/68 (38%)
                        Collagen                                           990..1049   CDD:189968   24/70 (34%)
                        Collagen                                           1070..1128  CDD:189968   28/59 (47%)
                        C4                                                 1515..1624  CDD:128421   49/108 (45%)
                        C4                                                 1625..1737  CDD:128421   54/114 (47%)


Information from Original Tools:


                                                Original Tool Information
Tool              Simple Score  Weighted Score  BLAST Result  Score  Score Type  Cluster ID
Compara           1             0.930           -             -                  C165148265
Domainoid         0             0.000           Not matched by this tool.
eggNOG            1             0.900           -             -                  E33208_3BAFZ
Hieranoid         0             0.000           Not matched by this tool.
Homologene        0             0.000           Not matched by this tool.
Inparanoid        0             0.000           Not matched by this tool.
Isobase           0             0.000           Not matched by this tool.
OMA               0             0.000           Not matched by this tool.
OrthoDB           1             1.010           -             -                  D41315at33208
OrthoFinder       1             1.000           -             -                  FOG0000443
OrthoInspector    0             0.000           Not matched by this tool.
orthoMCL          1             0.900           -             -                  OOG6_100768
Panther           1             1.100           -             -      O           PTHR24023
Phylome           1             0.910           -             -
RoundUp           0             0.000           Not matched by this tool.
SonicParanoid     0             0.000           Not matched by this tool.
SwiftOrtho        0             0.000           Not matched by this tool.
TreeFam           0             0.000           Not matched by this tool.
User_Submission   0             0.000           Not matched by this tool.
Total             7             6.750
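
The final row of the table is the sum of the per-tool scores: 7 for the simple score and 6.750 for the weighted score. A short Python sketch that checks this from the rows of matching tools; the dictionary name `matched` is illustrative, and the values are copied from the table.

```python
# (simple score, weighted score) for each tool that reported a match.
# Tools listed as "Not matched by this tool." contribute 0 and 0.000.
matched = {
    "Compara": (1, 0.930),
    "eggNOG": (1, 0.900),
    "OrthoDB": (1, 1.010),
    "OrthoFinder": (1, 1.000),
    "orthoMCL": (1, 0.900),
    "Panther": (1, 1.100),
    "Phylome": (1, 0.910),
}

simple_total = sum(simple for simple, _ in matched.values())
# Round to three decimals to absorb floating-point error in the sum.
weighted_total = round(sum(weighted for _, weighted in matched.values()), 3)

print(simple_total, f"{weighted_total:.3f}")  # prints: 7 6.750
```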
