DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment vkg and Col4a1

DIOPT Version :10

Sequence 1:NP_477190.1 Gene:vkg / 33726 FlyBaseID:FBgn0016075 Length:1940 Species:Drosophila melanogaster
Sequence 2:NP_034061.2 Gene:Col4a1 / 12826 MGIID:88454 Length:1669 Species:Mus musculus


Alignment Length:1844 Identity:763/1844 - (41%)
Similarity:908/1844 - (49%) Gaps:358/1844 - (19%)


- Green bases have known domain annotations that are detailed below.


  Fly    38 CDCKGIKGRM------GAPGPIGVPGLEGPAGDIGPPGRAGPLGEKGDVGEYGEQGEKGHRGDIG 96
            |||.|:||:.      |..|.||.||::||.|..|||      |:|||.||.|..|.||.||..|
Mouse    39 CDCHGVKGQKGERGLPGLQGVIGFPGMQGPEGPHGPP------GQKGDAGEPGLPGTKGTRGPPG 97

  Fly    97 PKGEMGYPGIMGKSGEPGTPGPRGIDGCDGR---------------------------------- 127
            ..|..|.||:.|..|:.|.|||.||.||:|.                                  
Mouse    98 AAGYPGNPGLPGIPGQDGPPGPPGIPGCNGTKGERGPLGPPGLPGFSGNPGPPGLPGMKGDPGEI 162

  Fly   128 ---------------------------PGMQGPSGAPGQNGVRGPPGKPGQQGPPGEAGEGGINS 165
                                       ||:|||.|.|   |..||||.||..|||||.|:.|.:.
Mouse   163 LGHVPGTLLKGERGFPGIPGMPGSPGLPGLQGPVGPP---GFTGPPGPPGPPGPPGEKGQMGSSF 224

  Fly   166 KGTKGNRGETGQPGGVGPPGF-----------DGDRGSKGDTGYAGLT--GEKGDPGLPGPKGDT 217
            :|.||::||.|..|..|.||.           .|::|.||:.|:.|:.  ||||:||..||:|..
Mouse   225 QGPKGDKGEQGVSGPPGVPGQAQVKEKGDFAPTGEKGQKGEPGFPGVPGYGEKGEPGKQGPRGKP 289

  Fly   218 GAVSELPYSLIGPPGAKGEPGDSLSGVLKPDDTLKGYKGYVGLQGDEGPQGPTGEQGAVGRNG-- 280
            |...|.     |..|:.|.||||               ||.||.|.:||||..||.|..|..|  
Mouse   290 GKDGEK-----GERGSPGIPGDS---------------GYPGLPGRQGPQGEKGEAGLPGPPGTV 334

  Fly   281 ---LP----GARGEIGGPGERGKPGKDGEPGRFGDKGMKG--APGWTGADGLDGSPGERGEDGFT 336
               :|    |.||..|.||.||:||..|.||..|..|..|  .||..||.|..|..||:|:.||.
Mouse   335 IGTMPLGEKGDRGYPGAPGLRGEPGPKGFPGTPGQPGPPGFPTPGQAGAPGFPGERGEKGDQGFP 399

  Fly   337 G--MPGVQGGAGPPGIYDPSLTKSLPGPIGSQGDI--------GPPGEQGPPGLPGKPGRRGPIG 391
            |  :||..|..|.||...|      |||.|..|..        ||||:|||||.||:||..|.:|
Mouse   400 GVSLPGPSGRDGAPGPPGP------PGPPGQPGHTNGIVECQPGPPGDQGPPGTPGQPGLTGEVG 458

  Fly   392 LAGQSGDPGL----NGSRGPPGRSERGEAGDYGFIGPPGPQGPPGEAGLPGRYGLHGEPGQNVVG 452
            ..||.|:..|    .|.|||||  .:|..|:.||   ||..|..|:.|||||.||.|.|     |
Mouse   459 QKGQKGESCLACDTEGLRGPPG--PQGPPGEIGF---PGQPGAKGDRGLPGRDGLEGLP-----G 513

  Fly   453 PKGEPGLNGQPGLEGYRGDRGEV----GLPGDKGLPGEGYNIVGPPGSQGPPGFRGLPGDDGYNG 513
            |:|.|||.||||.   :|:.||:    .|.||||.||.       ||..|.||..|.||.||:  
Mouse   514 PQGSPGLIGQPGA---KGEPGEIFFDMRLKGDKGDPGF-------PGQPGMPGRAGTPGRDGH-- 566

  Fly   514 LRGLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYPGSHGNRG-----AIGLTGPRGVQGLQGNPG 573
             .||||.||..|      :.|.:|.||..|..|:|||.|:.|     .:|..||.|.:|..|.||
Mouse   567 -PGLPGPKGSPG------SIGLKGERGPPGGVGFPGSRGDIGPPGPPGVGPIGPVGEKGQAGFPG 624

  Fly   574 RAGHKGLPGPAGIPGEPGK-VGAAGPDGKAIEVGSLRKGEIGDTGDSGHRGDTGDDGEKGRDGSD 637
            ..|..|||||   .||.|| |...||.|.|        |..|..|..|.:||.|..|..||.|..
Mouse   625 GPGSPGLPGP---KGEAGKVVPLPGPPGAA--------GLPGSPGFPGPQGDRGFPGTPGRPGIP 678

  Fly   638 GSKGERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPGRNATTPKVYLIGEPGYDGIKGERGDDGD 702
            |.||..|:.| .|..|..|.:|.||.||..||.|:|||....      |.||..|.:|::|:.| 
Mouse   679 GEKGAVGQPG-IGFPGLPGPKGVDGLPGEIGRPGSPGRPGFN------GLPGNPGPQGQKGEPG- 735

  Fly   703 TGFKGVKGEPN-PGQIYDNTGEPGEDGYTGPKGVKGAKGEQGAIGLRGEIGDRGPAGEVIPGPVG 766
            .|..|:||:|. ||    ..|.|||.|..|..||.|.:|..|..||:|..||.||.|  :.||.|
Mouse   736 IGLPGLKGQPGLPG----IPGTPGEKGSIGGPGVPGEQGLTGPPGLQGIRGDPGPPG--VQGPAG 794

  Fly   767 AKGYP--GPTGDYGQQGAPGLPGRDGEPGLDGGIGYKGQRG--VPGQEVIQGEIGPPGRSGIKGF 827
            ..|.|  ||.|..|..|..|.||..|.||:.|..|:.|..|  :||.:..:|..|.||.:|..|.
Mouse   795 PPGVPGIGPPGAMGPPGGQGPPGSSGPPGIKGEKGFPGFPGLDMPGPKGDKGSQGLPGLTGQSGL 859

  Fly   828 PGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPGNKGQRGDFLVGPPGPKGQPGRNGRQAP 892
            |   |.|||.|..|.||..|.|||.|..|..||.|.||..|..|  |.|..|..|.||.:|.:..
Mouse   860 P---GLPGQQGTPGVPGFPGSKGEMGVMGTPGQPGSPGPAGTPG--LPGEKGDHGLPGSSGPRGD 919

  Fly   893 HGAKGQKGEVGSLGQNG------QNGAKGSIGFSGRRGLLGNAGLQGLPGSPGIPGLPGMIGEIG 951
            .|.||.||:||..|..|      ....||..|..|.:|.:|..|.:|..|.||.||:||..|:.|
Mouse   920 PGFKGDKGDVGLPGMPGSMEHVDMGSMKGQKGDQGEKGQIGPTGDKGSRGDPGTPGVPGKDGQAG 984

  Fly   952 ERGEIGYNGRQGDIGPRGPNGEFGPKGLSGDDGPDGYPGANGLPGRKGETGNPGFPGRPGAKGVA 1016
            ..|:         .||:|..|..|..|..|..||.|..|..||||..||.|.||.||..|..|..
Mouse   985 HPGQ---------PGPKGDPGLSGTPGSPGLPGPKGSVGGMGLPGSPGEKGVPGIPGSQGVPGSP 1040

  Fly  1017 AYSGIKGDDGESGLTGPIGYPGAPGAKGQRGPVGDSQPALDGVAGRKGEVGSPGPNGLPGRHGLK 1081
            ...|.||:.|:|||.| ||.||.||.||.:|..|     ..|..|.|||.||.|..|:||..|.:
Mouse  1041 GEKGAKGEKGQSGLPG-IGIPGRPGDKGDQGLAG-----FPGSPGEKGEKGSAGTPGMPGSPGPR 1099

  Fly  1082 GQRGDRGLPGQQGRPGEPGAKGLGGYPGRNGINGLKGATGFPGPQGPKGPQGESGVVGLDGRNGQ 1146
            |..|:.|.||..|.|||.|.|||   ||.:|:.|:||..|.||..||.||.|:.|..|.||..|.
Mouse  1100 GSPGNIGHPGSPGLPGEKGDKGL---PGLDGVPGVKGEAGLPGTPGPTGPAGQKGEPGSDGIPGS 1161

  Fly  1147 IGDQGP-----RGLIGEQGEQGEQGDEGEVGIPGRLENLRDRSFYRGFTGDQGLQGERGEQGDMG 1206
            .|::|.     ||..|..|.:|::|.:||||.|             |..|..|:.|.:|||    
Mouse  1162 AGEKGEQGVPGRGFPGFPGSKGDKGSKGEVGFP-------------GLAGSPGIPGVKGEQ---- 1209

  Fly  1207 PIGFIGPPGAKGERGDIGYAGQLGFDGAEG--LKGFQGDQGPRGPPGITLPAEKGDEGVAGLDGR 1269
              ||:||||.:         ||.|..|..|  ::|.:||:||:|.||:                 
Mouse  1210 --GFMGPPGPQ---------GQPGLPGTPGHPVEGPKGDRGPQGQPGL----------------- 1246

  Fly  1270 AGRPGHFGQKGAPGPPGENGPNGAIGHRG-PQIQGPPGPQGDVGFPGAPGHNGRHGLIGPKGELG 1333
               |||.|..|.||.||.|||.|..|::| |...|.|||:||.||.|.||..|..|:.|.||::|
Mouse  1247 ---PGHPGPMGPPGFPGINGPKGDKGNQGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMG 1308

  Fly  1334 DMGRQGERGESGYAIVGRQGDIGDIGFQGEPGWDGAKGEQGYPGLPGK----NGRVGAPGPRGPT 1394
            ..|..|.:|:.|  :.|.||..||.|.||.|   |.||.||.||.||.    .|..|.|||.||.
Mouse  1309 LPGVPGFQGQKG--LPGLQGVKGDQGDQGVP---GPKGLQGPPGPPGPYDVIKGEPGLPGPEGPP 1368

  Fly  1395 GDAGWGGIDGMDGLVGPKGQPGVTYSYSMARPGDRGEPGLDGFQGEEGDGGAPGLIGFQGQRGAV 1459
                  |:.|:.|..|||||.|||.|..:  ||..|.||.||..|::|:                
Mouse  1369 ------GLKGLQGPPGPKGQQGVTGSVGL--PGPPGVPGFDGAPGQKGE---------------- 1409

  Fly  1460 GYRGDQGEVGYTGADGPQGQRGDKGYMGLTGAPGLRGLPGPQGEPAPAPPAPKS--RGFIFARHS 1522
                       ||..||.|.|      |..|.||..||||..|     ||...|  .||:..|||
Mouse  1410 -----------TGPFGPPGPR------GFPGPPGPDGLPGSMG-----PPGTPSVDHGFLVTRHS 1452

  Fly  1523 QSVHVPQCPANTNLLWEGYSL---SGNVAASRAVGQDLGQSGSCMMRFTTMPYMLCDITNVCHFA 1584
            |:...|.||..|.:|:.||||   .||   .||.|||||.:|||:.:|:|||::.|:|.|||:||
Mouse  1453 QTTDDPLCPPGTKILYHGYSLLYVQGN---ERAHGQDLGTAGSCLRKFSTMPFLFCNINNVCNFA 1514

  Fly  1585 QNNDDSLWLSTAEPMPMTMTPIQGRDLMKYISRCVVCETTTRIIALHSQSMSIPDCPGGWEEMWT 1649
            ..||.|.||||.|||||:|.||.|.::..:||||.|||....::|:|||::.||.||.||..:|.
Mouse  1515 SRNDYSYWLSTPEPMPMSMAPISGDNIRPFISRCAVCEAPAMVMAVHSQTIQIPQCPNGWSSLWI 1579

  Fly  1650 GYSYFMSTLDNVGGVGQNLVSPGSCLEEFRAQPVIECHGHGRCNYYDALASFWLTVIEEQDQFVQ 1714
            |||:.|.|.....|.||.|.||||||||||:.|.|||||.|.||||....||||..||..:.|.:
Mouse  1580 GYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRGTCNYYANAYSFWLATIERSEMFKK 1644

  Fly  1715 PRQQTLKA-DFTSKISRCTVCRRR 1737
            |...|||| :..:.:|||.||.||
Mouse  1645 PTPSTLKAGELRTHVSRCQVCMRR 1668

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
vkgNP_477190.1 gly_rich_SclB <63..>349 CDD:468478 134/372 (36%)
gly_rich_SclB <465..>712 CDD:468478 102/256 (40%)
gly_rich_SclB <637..>871 CDD:468478 101/238 (42%)
gly_rich_SclB <894..>1132 CDD:468478 106/243 (44%)
gly_rich_SclB <1078..>1342 CDD:468478 107/271 (39%)
gly_rich_SclB <1228..1515 CDD:468478 113/295 (38%)
C4 1515..1624 CDD:128421 61/111 (55%)
C4 1625..1737 CDD:128421 58/112 (52%)
Col4a1NP_034061.2 gly_rich_SclB <42..>331 CDD:468478 111/317 (35%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 47..1443 636/1606 (40%)
Triple-helical region 173..1440 595/1471 (40%)
gly_rich_SclB <273..>577 CDD:468478 151/352 (43%)
gly_rich_SclB <506..>766 CDD:468478 126/306 (41%)
gly_rich_SclB <717..>992 CDD:468478 118/301 (39%)
gly_rich_SclB <948..>1163 CDD:468478 103/232 (44%)
gly_rich_SclB <1103..>1335 CDD:468478 113/284 (40%)
gly_rich_SclB <1229..>1443 CDD:468478 109/284 (38%)
C4 1446..1553 CDD:460201 59/109 (54%)
C4 1556..1667 CDD:460201 58/110 (53%)

Return to query results.
Submit another query.