DRSC/TRiP Functional Genomics Resources


Protein Alignment Col4a4 and vkg

DIOPT Version: 9

Sequence 1: NP_031761.1  Gene: Col4a4 / 12829  MGIID: 104687  Length: 1682  Species: Mus musculus
Sequence 2: NP_001260071.1  Gene: vkg / 33726  FlyBaseID: FBgn0016075  Length: 1940  Species: Drosophila melanogaster


Alignment Length: 1824  Identity: 739/1824 (40%)
Similarity: 885/1824 (48%)  Gaps: 307/1824 (16%)
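
The summary percentages can be reproduced from the raw counts. The following is a minimal Python sketch, assuming that the summary truncates to a whole percent; that assumption is inferred from the figures themselves, not from DIOPT documentation, and `percent_floor` is a hypothetical helper name.

```python
def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumed convention, see lead-in)."""
    return (100 * count) // total


ALIGNMENT_LENGTH = 1824  # alignment columns, from the summary

summary = {
    "identity": percent_floor(739, ALIGNMENT_LENGTH),
    "similarity": percent_floor(885, ALIGNMENT_LENGTH),
    "gaps": percent_floor(307, ALIGNMENT_LENGTH),
}
print(summary)
```

Rounding to the nearest whole percent would instead give 41%, 49%, and 17%, which is why truncation is assumed here.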


- Green residues have known domain annotations, which are detailed below.


Mouse    43 NCSVCQCFPEKGSRGHPGPLGPQGPIGPLGPLGPIGIPGEKGERGDSGSPGPPGEK---GDKGPT 104
            |.::|.|...||..|.|||:|..|..||.|.:||.|..|..||:||.|..|..|||   ||.||.
  Fly    34 NTTLCDCKGIKGRMGAPGPIGVPGLEGPAGDIGPPGRAGPLGEKGDVGEYGEQGEKGHRGDIGPK 98

Mouse   105 GVPGFPGVDGVPGHPGPPGPRGKPGVDGYNGSRGDPGYPGERGAPGPGGPPGQPGENGEKGRSVY 169
            |..|:||:.|..|.||.|||||..|.||..|.:|..|.||:.|..||.|.|||.|..||.|....
  Fly    99 GEMGYPGIMGKSGEPGTPGPRGIDGCDGRPGMQGPSGAPGQNGVRGPPGKPGQQGPPGEAGEGGI 163

Mouse   170 ITGGVKGIQGDRGDP---GPPGLPGSRGAQGS---PGPMGHAGAPGLAGPIGHPGS--------- 219
            .:.|.||.:|:.|.|   ||||..|.||::|.   .|..|..|.|||.||.|..|:         
  Fly   164 NSKGTKGNRGETGQPGGVGPPGFDGDRGSKGDTGYAGLTGEKGDPGLPGPKGDTGAVSELPYSLI 228

Mouse   220 --PGLKGNPATG----------LKGQRGEPGEVGQRGPPGPTLLVQPPDLSIYKGEKGV---KGM 269
              ||.||.|...          |||.:|..|..|..||.|||            ||:|.   .|:
  Fly   229 GPPGAKGEPGDSLSGVLKPDDTLKGYKGYVGLQGDEGPQGPT------------GEQGAVGRNGL 281

Mouse   270 PGMIGPPGPPGRKGAPG-------VGIKGEKGIPGFPGP---RGEPGSHGPPGFPGFKGIQGAAG 324
            ||..|..|.||.:|.||       .|.||.||.||:.|.   .|.||..|..||.|..|:||.||
  Fly   282 PGARGEIGGPGERGKPGKDGEPGRFGDKGMKGAPGWTGADGLDGSPGERGEDGFTGMPGVQGGAG 346

Mouse   325 EPGLF---------GFLGPKGDLGDRGYPGPPGILLTPAPPLKGVPGDPGPPGYYGEIGDVGLPG 380
            .||::         |.:|.:||:|..|..||||        |.|.||..||.|..|:.||.||.|
  Fly   347 PPGIYDPSLTKSLPGPIGSQGDIGPPGEQGPPG--------LPGKPGRRGPIGLAGQSGDPGLNG 403

Mouse   381 PPGPPGRP--GETCP-GMMGPPGPPGVPGPPGFPGEAGV---PGRLDCAP-GKPGKPGLPGLP-- 436
            ..|||||.  ||... |.:|||||.|.||..|.||..|:   ||:....| |:||..|.|||.  
  Fly   404 SRGPPGRSERGEAGDYGFIGPPGPQGPPGEAGLPGRYGLHGEPGQNVVGPKGEPGLNGQPGLEGY 468

Mouse   437 -------GAPGPEGPPGSDV-IYCRPGCPGPMGEKGKVGPPGRRGAKGAKGNKGL----C-TCPP 488
                   |.||.:|.||... |...||..||.|.:|..|..|..|.:|..|.|||    | .|..
  Fly   469 RGDRGEVGLPGDKGLPGEGYNIVGPPGSQGPPGFRGLPGDDGYNGLRGLPGEKGLRGDDCPVCNA 533

Mouse   489 GPMGPPGPPGP---PGRQGSKGDLGLP---GWHGEKGDPGQPGAEGPPGP------PGRPGAMGP 541
            ||.||.|..|.   ||..|::|.:||.   |..|.:|:||:.|.:|.|||      ||:.||.||
  Fly   534 GPRGPRGQEGDTGYPGSHGNRGAIGLTGPRGVQGLQGNPGRAGHKGLPGPAGIPGEPGKVGAAGP 598

Mouse   542 PGHKGEKGDMVISRVKGQKGERGLDGPPGFPGPHGQDGGDGRPGERGDPGPRGDHKDAAPGERGL 606
            .|...|.|.:....: |..|:.|..|..|..|..|:||.||..||||:.|.|||:.||     |.
  Fly   599 DGKAIEVGSLRKGEI-GDTGDSGHRGDTGDDGEKGRDGSDGSKGERGETGQRGDYGDA-----GY 657

Mouse   607 PGLPGPPGRTGPEGPPGLGFPGPPGQRGLPGEPGRPGTRGFDGTKGQKGDSILCNVSYPGKPGLP 671
            .|..|.|||.|.:|.||.....|  :..|.|||      |:||.||::||               
  Fly   658 QGRDGEPGRDGRDGAPGRNATTP--KVYLIGEP------GYDGIKGERGD--------------- 699

Mouse   672 GLDGPPGLKGFPGPP---------GAPGMRCPDGQKGQRGKPGMSGIPGPPGFRGDMGDPGIKGE 727
              ||..|.||..|.|         |.||   .||..|.:|..|..|..|..|.||::||.|..||
  Fly   700 --DGDTGFKGVKGEPNPGQIYDNTGEPG---EDGYTGPKGVKGAKGEQGAIGLRGEIGDRGPAGE 759

Mouse   728 KGTSPIGPPGPPGSPGKDGQKGIPGDPAFGDPGPPGERGLPGAPGMKGQKGHPG---CPGAGGPP 789
            ....|:|..|.||..|..||:|.|     |.||..||.||.|..|.|||:|.||   ..|..|||
  Fly   760 VIPGPVGAKGYPGPTGDYGQQGAP-----GLPGRDGEPGLDGGIGYKGQRGVPGQEVIQGEIGPP 819

Mouse   790 GIPGSPGLKGPKGREGSRGFPGIPGSPGHSCERGAPGIPGQPGLPGTPGD-----PGAPGWKGQP 849
            |..|..|..|..|..|..|..|.||..|...|:|..|..||.||||..|.     .|.||.||||
  Fly   820 GRSGIKGFPGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPGNKGQRGDFLVGPPGPKGQP 884

Mouse   850 GDMGPSGPAGMKGLPGLPGLPGADGLRGPPGIPGPNGEDGLPGLPGLKGLPGLPGFPGFPGERGK 914
            |..|...|.|.||..|..|..|.:|..|..|..|.:|..||.|..||:||||.||.||.      
  Fly   885 GRNGRQAPHGAKGQKGEVGSLGQNGQNGAKGSIGFSGRRGLLGNAGLQGLPGSPGIPGL------ 943

Mouse   915 PGPDGEPGRKGEVGEKGWPGLKGDLGERGAKGD---RGLPGDAG-EAVTSRKGEPGDAGPPGDGG 975
            ||..||.|.:||:   |:.|.:||:|.||..|:   :||.||.| :......|.||..|..|:.|
  Fly   944 PGMIGEIGERGEI---GYNGRQGDIGPRGPNGEFGPKGLSGDDGPDGYPGANGLPGRKGETGNPG 1005

Mouse   976 FSGERGDKGSSGMRGGRGDPGRDGLPGLHRGQPGIDGPPGPPGPPGPPGSPGLRGVIG-----FP 1035
            |.|..|.||.:...|.:||.|..||             .||.|.||.||:.|.||.:|     ..
  Fly  1006 FPGRPGAKGVAAYSGIKGDDGESGL-------------TGPIGYPGAPGAKGQRGPVGDSQPALD 1057

Mouse  1036 GFPGDQGDPGSPGPPGFPGDDGARGPKGYKGDPASQCGPPGPKGEPGSPGYQGRTGVPGEK---G 1097
            |..|.:|:.|||||.|.||..|.:|.:|.:|.|..| |.||..|..|..||.||.|:.|.|   |
  Fly  1058 GVAGRKGEVGSPGPNGLPGRHGLKGQRGDRGLPGQQ-GRPGEPGAKGLGGYPGRNGINGLKGATG 1121

Mouse  1098 FPGDEGPRGPPGRPGQPGSFGPPGCPGDPGMPGLKGHPGEVGDPGPRGDAGDFGRPGPAGVKGPL 1162
            |||.:||:||               .|:.|:.||.|..|::||.||||..|:.|..|..|.:|.:
  Fly  1122 FPGPQGPKGP---------------QGESGVVGLDGRNGQIGDQGPRGLIGEQGEQGEQGDEGEV 1171

Mouse  1163 GSP-------------GLNGLHGLKGEKGTKGASGLLEMGPPGPMGMPGQKGEKGDPGSPG-ISP 1213
            |.|             |..|..||:||:|.:|     :|||.|.:|.||.|||:||.|..| :..
  Fly  1172 GIPGRLENLRDRSFYRGFTGDQGLQGERGEQG-----DMGPIGFIGPPGAKGERGDIGYAGQLGF 1231

Mouse  1214 PGLPGEKGFPGPPGRPGPP-----------GPAGAPGRAAK----GDIPDPGPPGDRGPPGPDGP 1263
            .|..|.|||.|..|..|||           |.||..|||.:    |....|||||:.||.|..|.
  Fly  1232 DGAEGLKGFQGDQGPRGPPGITLPAEKGDEGVAGLDGRAGRPGHFGQKGAPGPPGENGPNGAIGH 1296

Mouse  1264 RG--VPGPPGSPGNVDLLKGDPGDCGLPGPPGSRGPPGPPGCQGPPGCDGKDGQKGPMGLP---- 1322
            ||  :.|||          |..||.|.||.||..|..|..|.:|..|..|:.|::|..|..    
  Fly  1297 RGPQIQGPP----------GPQGDVGFPGAPGHNGRHGLIGPKGELGDMGRQGERGESGYAIVGR 1351

Mouse  1323 -------GLPGPPGLPGAPGEKGLPGPPGRKGPVGPPGCRGEPGPPADVDSCPRIPGLPGVPGPR 1380
                   |..|.||..||.||:|.||.||:.|.||.||.|   ||..|. ....|.|:.|:.||:
  Fly  1352 QGDIGDIGFQGEPGWDGAKGEQGYPGLPGKNGRVGAPGPR---GPTGDA-GWGGIDGMDGLVGPK 1412

Mouse  1381 GPEG-----AMGEPGRRGLPG-PGCKGEPGPDGR------------------RGQDGIPGSPGPP 1421
            |..|     :|..||.||.|| .|.:||.|..|.                  :|:.|..|:.||.
  Fly  1413 GQPGVTYSYSMARPGDRGEPGLDGFQGEEGDGGAPGLIGFQGQRGAVGYRGDQGEVGYTGADGPQ 1477

Mouse  1422 GRKGDTGEAGCPGAP---GPPGPTGDPGPKGFGPGSLSGFLLVLHSQTDQEPACPVGMPRLWTGY 1483
            |::||.|..|..|||   |.|||.|:|.|....|.| .||:...|||:...|.||.....||.||
  Fly  1478 GQRGDKGYMGLTGAPGLRGLPGPQGEPAPAPPAPKS-RGFIFARHSQSVHVPQCPANTNLLWEGY 1541

Mouse  1484 SLLYMEGQEKAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLSSAAPLP--MMPL 1546
            ||.......:|..||||.:|||:..|:|:|:..|:|..|||:||.||.|.|||:|.|:|  |.|:
  Fly  1542 SLSGNVAASRAVGQDLGQSGSCMMRFTTMPYMLCDITNVCHFAQNNDDSLWLSTAEPMPMTMTPI 1606

Mouse  1547 SEEEIRSYISRCAVCEAPAQAVAVHSQDQSIPPCPRTWRSLWIGYSFLMHTGAGDQGGGQALMSP 1611
            ...::..|||||.|||...:.:|:|||..|||.||..|..:|.|||:.|.|.....|.||.|:||
  Fly  1607 QGRDLMKYISRCVVCETTTRIIALHSQSMSIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQNLVSP 1671

Mouse  1612 GSCLEDFRAAPFVECQGRQGTCHFFANEYSFWLTTVNPDLQFASGPSPDTLKEVQAQRRKISRCQ 1676
            |||||:|||.|.:||.| .|.|:::....|||||.:....||.. |...|||.....  |||||.
  Fly  1672 GSCLEEFRAQPVIECHG-HGRCNYYDALASFWLTVIEEQDQFVQ-PRQQTLKADFTS--KISRCT 1732

Mouse  1677 VCMK 1680
            ||.:
  Fly  1733 VCRR 1736
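
Each sequence row in the alignment carries a start coordinate, the aligned residues (with `-` for gaps), and an end coordinate. The sketch below parses one such row and checks that the number of non-gap residues equals `end - start + 1`. The row layout is read off the alignment above, and `parse_row` and `coordinates_consistent` are hypothetical helper names, not part of DIOPT.

```python
import re

# Assumed row layout: label, start coordinate, aligned sequence, end coordinate.
ROW = re.compile(r"^\s*(\S+)\s+(\d+)\s+([A-Za-z-]+)\s+(\d+)\s*$")


def parse_row(line: str):
    """Split a sequence row into (label, start, aligned_sequence, end)."""
    match = ROW.match(line)
    if match is None:
        raise ValueError(f"not a sequence row: {line!r}")
    label, start, seq, end = match.groups()
    return label, int(start), seq, int(end)


def coordinates_consistent(line: str) -> bool:
    """True when the non-gap residue count matches the coordinate span."""
    _, start, seq, end = parse_row(line)
    residues = len(seq.replace("-", ""))
    return residues == end - start + 1


# First block of the alignment above.
mouse = "Mouse    43 NCSVCQCFPEKGSRGHPGPLGPQGPIGPLGPLGPIGIPGEKGERGDSGSPGPPGEK---GDKGPT 104"
fly = "  Fly    34 NTTLCDCKGIKGRMGAPGPIGVPGLEGPAGDIGPPGRAGPLGEKGDVGEYGEQGEKGHRGDIGPK 98"

print(coordinates_consistent(mouse), coordinates_consistent(fly))
```

The match line between the two sequences is not a sequence row and is deliberately rejected by `parse_row`.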

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain                                                         Region      External ID  Identity
Col4a4  NP_031761.1     7S domain. /evidence=ECO:0000250|UniProtKB:P53420              31..56                   5/12 (42%)
                        Collagen                                                       54..111     CDD:189968   29/59 (49%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite              56..255                  98/228 (43%)
                        Triple-helical region. /evidence=ECO:0000250|UniProtKB:P53420  57..1451                 627/1577 (40%)
                        Cell attachment site. /evidence=ECO:0000255                    86..88                   0/1 (0%)
                        Cell attachment site. /evidence=ECO:0000255                    137..139                 0/1 (0%)
                        Cell attachment site. /evidence=ECO:0000255                    181..183                 0/1 (0%)
                        Collagen                                                       291..348    CDD:189968   28/68 (41%)
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite              379..1453                472/1205 (39%)
                        Cell attachment site. /evidence=ECO:0000255                    587..589                 1/1 (100%)
                        Cell attachment site. /evidence=ECO:0000255                    593..595                 1/1 (100%)
                        Cell attachment site. /evidence=ECO:0000255                    716..718                 1/1 (100%)
                        Collagen                                                       805..858    CDD:189968   25/57 (44%)
                        Collagen                                                       889..948    CDD:189968   28/61 (46%)
                        Cell attachment site. /evidence=ECO:0000255                    980..982                 0/1 (0%)
                        Cell attachment site. /evidence=ECO:0000255                    992..994                 0/1 (0%)
                        Cell attachment site. /evidence=ECO:0000255                    1144..1146               1/1 (100%)
                        Collagen                                                       1338..1405  CDD:189968   31/72 (43%)
                        C4                                                             1458..1562  CDD:279721   48/105 (46%)
                        C4                                                             1566..1679  CDD:279721   53/112 (47%)
vkg     NP_001260071.1  Collagen                                                       60..119     CDD:189968   30/58 (52%)
                        Collagen                                                       102..152    CDD:189968   26/49 (53%)
                        Collagen                                                       277..334    CDD:189968   22/56 (39%)
                        Collagen                                                       534..593    CDD:189968   24/58 (41%)
                        Collagen                                                       814..871    CDD:189968   25/56 (45%)
                        Collagen                                                       957..1014   CDD:189968   22/56 (39%)
                        Collagen                                                       990..1049   CDD:189968   27/71 (38%)
                        Collagen                                                       1070..1128  CDD:189968   27/58 (47%)
                        C4                                                             1515..1624  CDD:128421   51/108 (47%)
                        C4                                                             1625..1737  CDD:128421   54/116 (47%)
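
Each Identity cell pairs a fraction with a whole-number percentage. The sketch below parses a cell and checks the percentage against the fraction, assuming round-half-up to the nearest whole percent; that convention is inferred from the sampled cells, not documented by DIOPT, and `percent_half_up` and `identity_cell_consistent` are hypothetical names.

```python
import re

# Cell layout taken from the Identity column, e.g. "29/59 (49%)".
IDENTITY = re.compile(r"(\d+)/(\d+) \((\d+)%\)")


def percent_half_up(count: int, total: int) -> int:
    """Nearest whole percent using integer arithmetic (ties round up)."""
    return (200 * count + total) // (2 * total)


def identity_cell_consistent(cell: str) -> bool:
    """True when the printed percentage matches the printed fraction."""
    match = IDENTITY.fullmatch(cell)
    if match is None:
        raise ValueError(f"unexpected identity cell: {cell!r}")
    count, total, shown = map(int, match.groups())
    return percent_half_up(count, total) == shown


# A sample of cells copied from the table above.
sample = [
    "5/12 (42%)",
    "29/59 (49%)",
    "98/228 (43%)",
    "627/1577 (40%)",
    "0/1 (0%)",
    "1/1 (100%)",
    "54/116 (47%)",
]
print(all(identity_cell_consistent(cell) for cell in sample))
```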


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              BLAST Result  Score  Score Type  Cluster ID
Compara         1             0.930           -             -                  C167838355
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           -             -                  E33208_3BAFZ
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     1             1.000           -             -                  FOG0000443
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        1             0.900           -             -                  OOG6_100768
Panther         1             1.100           -             -      O           PTHR24023
Phylome         1             0.910           -             -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
Total           6             5.740
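
The closing figure of the table is consistent with column totals: six tools report a match, and their weighted scores sum to 5.740. The sketch below recomputes both sums from the matched rows, using `Decimal` so the three-decimal scores add exactly. Reading the last row as totals is an inference from this arithmetic, not something the page states.

```python
from decimal import Decimal

# (tool, simple score, weighted score) for the tools that report a match above.
matched = [
    ("Compara", 1, Decimal("0.930")),
    ("eggNOG", 1, Decimal("0.900")),
    ("OrthoFinder", 1, Decimal("1.000")),
    ("orthoMCL", 1, Decimal("0.900")),
    ("Panther", 1, Decimal("1.100")),
    ("Phylome", 1, Decimal("0.910")),
]

# Unmatched tools contribute 0 and 0.000, so they can be left out of the sums.
simple_total = sum(simple for _, simple, _ in matched)
weighted_total = sum((weighted for _, _, weighted in matched), Decimal("0"))

print(simple_total, weighted_total)
```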
