DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment vkg and Col4a4

DIOPT Version :9

Sequence 1:NP_001260071.1 Gene:vkg / 33726 FlyBaseID:FBgn0016075 Length:1940 Species:Drosophila melanogaster
Sequence 2:NP_031761.1 Gene:Col4a4 / 12829 MGIID:104687 Length:1682 Species:Mus musculus


Alignment Length:1824 Identity:739/1824 - (40%)
Similarity:885/1824 - (48%) Gaps:307/1824 - (16%)


- Green bases have known domain annotations that are detailed below.


  Fly    34 NTTLCDCKGIKGRMGAPGPIGVPGLEGPAGDIGPPGRAGPLGEKGDVGEYGEQGEKGHRGDIGPK 98
            |.::|.|...||..|.|||:|..|..||.|.:||.|..|..||:||.|..|..|||   ||.||.
Mouse    43 NCSVCQCFPEKGSRGHPGPLGPQGPIGPLGPLGPIGIPGEKGERGDSGSPGPPGEK---GDKGPT 104

  Fly    99 GEMGYPGIMGKSGEPGTPGPRGIDGCDGRPGMQGPSGAPGQNGVRGPPGKPGQQGPPGEAGEGGI 163
            |..|:||:.|..|.||.|||||..|.||..|.:|..|.||:.|..||.|.|||.|..||.|....
Mouse   105 GVPGFPGVDGVPGHPGPPGPRGKPGVDGYNGSRGDPGYPGERGAPGPGGPPGQPGENGEKGRSVY 169

  Fly   164 NSKGTKGNRGETGQPGGVGPPGFDGDRGSKGDTGYAGLTGEKGDPGLPGPKGDTGAVSELPYSLI 228
            .:.|.||.:|:.|.|   ||||..|.||::|.   .|..|..|.|||.||.|..|:         
Mouse   170 ITGGVKGIQGDRGDP---GPPGLPGSRGAQGS---PGPMGHAGAPGLAGPIGHPGS--------- 219

  Fly   229 GPPGAKGEPGDSLSGVLKPDDTLKGYKGYVGLQGDEGPQGPT------------GEQGAVGRNGL 281
              ||.||.|...          |||.:|..|..|..||.|||            ||:|.   .|:
Mouse   220 --PGLKGNPATG----------LKGQRGEPGEVGQRGPPGPTLLVQPPDLSIYKGEKGV---KGM 269

  Fly   282 PGARGEIGGPGERGKPGKDGEPGRFGDKGMKGAPGWTGADGLDGSPGERGEDGFTGMPGVQGGAG 346
            ||..|..|.||.:|.||       .|.||.||.||:.|.   .|.||..|..||.|..|:||.||
Mouse   270 PGMIGPPGPPGRKGAPG-------VGIKGEKGIPGFPGP---RGEPGSHGPPGFPGFKGIQGAAG 324

  Fly   347 PPGIYDPSLTKSLPGPIGSQGDIGPPGEQGPPG--------LPGKPGRRGPIGLAGQSGDPGLNG 403
            .||::         |.:|.:||:|..|..||||        |.|.||..||.|..|:.||.||.|
Mouse   325 EPGLF---------GFLGPKGDLGDRGYPGPPGILLTPAPPLKGVPGDPGPPGYYGEIGDVGLPG 380

  Fly   404 SRGPPGRSERGEAGDYGFIGPPGPQGPPGEAGLPGRYGLHGEPGQNVVGPKGEPGLNGQPGLEGY 468
            ..|||||.  ||... |.:|||||.|.||..|.||..|:   ||:....| |:||..|.|||.  
Mouse   381 PPGPPGRP--GETCP-GMMGPPGPPGVPGPPGFPGEAGV---PGRLDCAP-GKPGKPGLPGLP-- 436

  Fly   469 RGDRGEVGLPGDKGLPGEGYNIVGPPGSQGPPGFRGLPGDDGYNGLRGLPGEKGLRGDDCPVCNA 533
                   |.||.:|.||... |...||..||.|.:|..|..|..|.:|..|.|||    | .|..
Mouse   437 -------GAPGPEGPPGSDV-IYCRPGCPGPMGEKGKVGPPGRRGAKGAKGNKGL----C-TCPP 488

  Fly   534 GPRGPRGQEGDTGYPGSHGNRGAIGLTGPRGVQGLQGNPGRAGHKGLPGPAGIPGEPGKVGAAGP 598
            ||.||.|..|.   ||..|::|.:||.   |..|.:|:||:.|.:|.|||      ||:.||.||
Mouse   489 GPMGPPGPPGP---PGRQGSKGDLGLP---GWHGEKGDPGQPGAEGPPGP------PGRPGAMGP 541

  Fly   599 DGKAIEVGSLRKGEI-GDTGDSGHRGDTGDDGEKGRDGSDGSKGERGETGQRGDYGDA-----GY 657
            .|...|.|.:....: |..|:.|..|..|..|..|:||.||..||||:.|.|||:.||     |.
Mouse   542 PGHKGEKGDMVISRVKGQKGERGLDGPPGFPGPHGQDGGDGRPGERGDPGPRGDHKDAAPGERGL 606

  Fly   658 QGRDGEPGRDGRDGAPGRNATTP--KVYLIGEP------GYDGIKGERGD--------------- 699
            .|..|.|||.|.:|.||.....|  :..|.|||      |:||.||::||               
Mouse   607 PGLPGPPGRTGPEGPPGLGFPGPPGQRGLPGEPGRPGTRGFDGTKGQKGDSILCNVSYPGKPGLP 671

  Fly   700 --DGDTGFKGVKGEPNPGQIYDNTGEPG---EDGYTGPKGVKGAKGEQGAIGLRGEIGDRGPAGE 759
              ||..|.||..|.|         |.||   .||..|.:|..|..|..|..|.||::||.|..||
Mouse   672 GLDGPPGLKGFPGPP---------GAPGMRCPDGQKGQRGKPGMSGIPGPPGFRGDMGDPGIKGE 727

  Fly   760 VIPGPVGAKGYPGPTGDYGQQGAP-----GLPGRDGEPGLDGGIGYKGQRGVPGQEVIQGEIGPP 819
            ....|:|..|.||..|..||:|.|     |.||..||.||.|..|.|||:|.||   ..|..|||
Mouse   728 KGTSPIGPPGPPGSPGKDGQKGIPGDPAFGDPGPPGERGLPGAPGMKGQKGHPG---CPGAGGPP 789

  Fly   820 GRSGIKGFPGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPGNKGQRGDFLVGPPGPKGQP 884
            |..|..|..|..|..|..|..|.||..|...|:|..|..||.||||..|.     .|.||.||||
Mouse   790 GIPGSPGLKGPKGREGSRGFPGIPGSPGHSCERGAPGIPGQPGLPGTPGD-----PGAPGWKGQP 849

  Fly   885 GRNGRQAPHGAKGQKGEVGSLGQNGQNGAKGSIGFSGRRGLLGNAGLQGLPGSPGIPGL------ 943
            |..|...|.|.||..|..|..|.:|..|..|..|.:|..||.|..||:||||.||.||.      
Mouse   850 GDMGPSGPAGMKGLPGLPGLPGADGLRGPPGIPGPNGEDGLPGLPGLKGLPGLPGFPGFPGERGK 914

  Fly   944 PGMIGEIGERGEI---GYNGRQGDIGPRGPNGEFGPKGLSGDDGPDGYPGANGLPGRKGETGNPG 1005
            ||..||.|.:||:   |:.|.:||:|.||..|:   :||.||.| :......|.||..|..|:.|
Mouse   915 PGPDGEPGRKGEVGEKGWPGLKGDLGERGAKGD---RGLPGDAG-EAVTSRKGEPGDAGPPGDGG 975

  Fly  1006 FPGRPGAKGVAAYSGIKGDDGESGL-------------TGPIGYPGAPGAKGQRGPVGDSQPALD 1057
            |.|..|.||.:...|.:||.|..||             .||.|.||.||:.|.||.:|     ..
Mouse   976 FSGERGDKGSSGMRGGRGDPGRDGLPGLHRGQPGIDGPPGPPGPPGPPGSPGLRGVIG-----FP 1035

  Fly  1058 GVAGRKGEVGSPGPNGLPGRHGLKGQRGDRGLPGQQ-GRPGEPGAKGLGGYPGRNGINGLKGATG 1121
            |..|.:|:.|||||.|.||..|.:|.:|.:|.|..| |.||..|..|..||.||.|:.|.|   |
Mouse  1036 GFPGDQGDPGSPGPPGFPGDDGARGPKGYKGDPASQCGPPGPKGEPGSPGYQGRTGVPGEK---G 1097

  Fly  1122 FPGPQGPKGP---------------QGESGVVGLDGRNGQIGDQGPRGLIGEQGEQGEQGDEGEV 1171
            |||.:||:||               .|:.|:.||.|..|::||.||||..|:.|..|..|.:|.:
Mouse  1098 FPGDEGPRGPPGRPGQPGSFGPPGCPGDPGMPGLKGHPGEVGDPGPRGDAGDFGRPGPAGVKGPL 1162

  Fly  1172 GIPGRLENLRDRSFYRGFTGDQGLQGERGEQG-----DMGPIGFIGPPGAKGERGDIGYAGQLGF 1231
            |.|             |..|..||:||:|.:|     :|||.|.:|.||.|||:||.|..| :..
Mouse  1163 GSP-------------GLNGLHGLKGEKGTKGASGLLEMGPPGPMGMPGQKGEKGDPGSPG-ISP 1213

  Fly  1232 DGAEGLKGFQGDQGPRGPPGITLPAEKGDEGVAGLDGRAGRPGHFGQKGAPGPPGENGPNGAIGH 1296
            .|..|.|||.|..|..|||           |.||..|||.:    |....|||||:.||.|..|.
Mouse  1214 PGLPGEKGFPGPPGRPGPP-----------GPAGAPGRAAK----GDIPDPGPPGDRGPPGPDGP 1263

  Fly  1297 RGPQIQGPP----------GPQGDVGFPGAPGHNGRHGLIGPKGELGDMGRQGERGESGYAIVGR 1351
            ||  :.|||          |..||.|.||.||..|..|..|.:|..|..|:.|::|..|..    
Mouse  1264 RG--VPGPPGSPGNVDLLKGDPGDCGLPGPPGSRGPPGPPGCQGPPGCDGKDGQKGPMGLP---- 1322

  Fly  1352 QGDIGDIGFQGEPGWDGAKGEQGYPGLPGKNGRVGAPGPR---GPTGDA-GWGGIDGMDGLVGPK 1412
                   |..|.||..||.||:|.||.||:.|.||.||.|   ||..|. ....|.|:.|:.||:
Mouse  1323 -------GLPGPPGLPGAPGEKGLPGPPGRKGPVGPPGCRGEPGPPADVDSCPRIPGLPGVPGPR 1380

  Fly  1413 GQPGVTYSYSMARPGDRGEPGLDGFQGEEGDGGAPGLIGFQGQRGAVGYRGDQGEVGYTGADGPQ 1477
            |..|     :|..||.||.|| .|.:||.|..|.                  :|:.|..|:.||.
Mouse  1381 GPEG-----AMGEPGRRGLPG-PGCKGEPGPDGR------------------RGQDGIPGSPGPP 1421

  Fly  1478 GQRGDKGYMGLTGAPGLRGLPGPQGEPAPAPPAPKS-RGFIFARHSQSVHVPQCPANTNLLWEGY 1541
            |::||.|..|..|||   |.|||.|:|.|....|.| .||:...|||:...|.||.....||.||
Mouse  1422 GRKGDTGEAGCPGAP---GPPGPTGDPGPKGFGPGSLSGFLLVLHSQTDQEPACPVGMPRLWTGY 1483

  Fly  1542 SLSGNVAASRAVGQDLGQSGSCMMRFTTMPYMLCDITNVCHFAQNNDDSLWLSTAEPMPMTMTPI 1606
            ||.......:|..||||.:|||:..|:|:|:..|:|..|||:||.||.|.|||:|.|:|  |.|:
Mouse  1484 SLLYMEGQEKAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLSSAAPLP--MMPL 1546

  Fly  1607 QGRDLMKYISRCVVCETTTRIIALHSQSMSIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQNLVSP 1671
            ...::..|||||.|||...:.:|:|||..|||.||..|..:|.|||:.|.|.....|.||.|:||
Mouse  1547 SEEEIRSYISRCAVCEAPAQAVAVHSQDQSIPPCPRTWRSLWIGYSFLMHTGAGDQGGGQALMSP 1611

  Fly  1672 GSCLEEFRAQPVIECHG-HGRCNYYDALASFWLTVIEEQDQFVQ-PRQQTLKADFTS--KISRCT 1732
            |||||:|||.|.:||.| .|.|:::....|||||.:....||.. |...|||.....  |||||.
Mouse  1612 GSCLEDFRAAPFVECQGRQGTCHFFANEYSFWLTTVNPDLQFASGPSPDTLKEVQAQRRKISRCQ 1676

  Fly  1733 VCRR 1736
            ||.:
Mouse  1677 VCMK 1680

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
vkgNP_001260071.1 Collagen 60..119 CDD:189968 30/58 (52%)
Collagen 102..152 CDD:189968 26/49 (53%)
Collagen 277..334 CDD:189968 22/56 (39%)
Collagen 534..593 CDD:189968 24/58 (41%)
Collagen 814..871 CDD:189968 25/56 (45%)
Collagen 957..1014 CDD:189968 22/56 (39%)
Collagen 990..1049 CDD:189968 27/71 (38%)
Collagen 1070..1128 CDD:189968 27/58 (47%)
C4 1515..1624 CDD:128421 51/108 (47%)
C4 1625..1737 CDD:128421 54/116 (47%)
Col4a4NP_031761.1 7S domain. /evidence=ECO:0000250|UniProtKB:P53420 31..56 5/12 (42%)
Collagen 54..111 CDD:189968 29/59 (49%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 56..255 98/228 (43%)
Triple-helical region. /evidence=ECO:0000250|UniProtKB:P53420 57..1451 627/1577 (40%)
Cell attachment site. /evidence=ECO:0000255 86..88 0/1 (0%)
Cell attachment site. /evidence=ECO:0000255 137..139 0/1 (0%)
Cell attachment site. /evidence=ECO:0000255 181..183 0/1 (0%)
Collagen 291..348 CDD:189968 28/68 (41%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 379..1453 472/1205 (39%)
Cell attachment site. /evidence=ECO:0000255 587..589 1/1 (100%)
Cell attachment site. /evidence=ECO:0000255 593..595 1/1 (100%)
Cell attachment site. /evidence=ECO:0000255 716..718 1/1 (100%)
Collagen 805..858 CDD:189968 25/57 (44%)
Collagen 889..948 CDD:189968 28/61 (46%)
Cell attachment site. /evidence=ECO:0000255 980..982 0/1 (0%)
Cell attachment site. /evidence=ECO:0000255 992..994 0/1 (0%)
Cell attachment site. /evidence=ECO:0000255 1144..1146 1/1 (100%)
Collagen 1338..1405 CDD:189968 31/72 (43%)
C4 1458..1562 CDD:279721 48/105 (46%)
C4 1566..1679 CDD:279721 53/112 (47%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 1 0.930 - - C167838355
Domainoid 00.000 Not matched by this tool.
eggNOG 1 0.900 - - E33208_3BAFZ
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 1 1.000 - - FOG0000443
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 1 0.900 - - OOG6_100768
Panther 1 1.100 - - O PTHR24023
Phylome 1 0.910 - -
RoundUp 00.000 Not matched by this tool.
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 00.000 Not matched by this tool.
TreeFam 00.000 Not matched by this tool.
65.740

Return to query results.
Submit another query.