DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and col4a1

DIOPT Version :10

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:XP_694040.5 Gene:col4a1 / 554269 ZFINID:ZDB-GENE-081105-114 Length:1640 Species:Danio rerio


Alignment Length:1845 Identity:797/1845 - (43%)
Similarity:966/1845 - (52%) Gaps:342/1845 - (18%)


- Green bases have known domain annotations that are detailed below.


  Fly    51 IDDSYDIVDSAGVARGDLPPKNCTAGYAGCVPKCIAEKGNRGLPGPLGPTGLKGEMGFPGMEGPS 115
            ||:|        .|:|.....:|    .||  .|   .|.:|..|.:|..||.|.:|||||:   
Zfish    17 IDES--------TAKGGCSGSSC----GGC--DC---SGVKGAKGEVGLPGLMGLVGFPGMQ--- 61

  Fly   116 GDKGQKGDPGPYGQRGDKGERGSPGLHGQAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLEGL 180
               |.:|.||..|.:||:||.|:||:.|..|:||..|..|.||.||:.|.:|..|..||||..|.
Zfish    62 ---GHEGPPGSMGPKGDRGEIGAPGIKGSRGLPGQAGFPGRPGIPGLPGLEGAPGPQGIPGCNGT 123

  Fly   181 SGMPGPRGYAGQLGSKGEKGEPAKENGDYAKGEKGEPG--------WRGTAGLAGPQGFPGEKGE 237
            .|..|..|..|..|..|..|.|.      .||.||:||        .:|..||.|..||||:||.
Zfish   124 KGERGFDGIPGITGLLGPPGIPG------IKGGKGDPGGVIGLPIPLKGDRGLPGQHGFPGQKGN 182

  Fly   238 RGDSGPYGAKGPRGEHGLKGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTVMGPRGDMGQ 302
            .|..||.|..||              |||  ||.||..|:||||....      |.:|.:|:.||
Zfish   183 AGQQGPVGPPGP--------------YGP--PGRPGDPGQKGEPGDKL------TFIGEKGEKGQ 225

  Fly   303 K---GEPGLVGRKGE----------PGPEGDTGLDGQKGEKG--LPGGPGDRGRQGNFGPPGSTG 352
            :   |.||..||..|          |||.|:||..|.||:||  .|...|.:|..|..||.|..|
Zfish   226 RGLTGPPGPPGRSEEYEGPATTLYLPGPPGETGSTGDKGDKGSCAPHQHGIKGEPGPPGPIGKPG 290

  Fly   353 QKGDRGEPGLNGLPGNPGQKGEPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGP 417
            :.|..||.|..|.|||||..|..|..|..         ||||.|.|:||||||.   |..||.|.
Zfish   291 KDGQSGEKGEIGFPGNPGLDGFKGDKGER---------GPPGYGAGSPGPPGPP---GLPGAEGE 343

  Fly   418 QGLNGVDGLPGPQGYNGQKGGAGLPGRPGNEGPPGKKGEKGTAGLN--GPKGSIGPIGHPGPPGP 480
            .|..|..|.|||.|    :...|.||.||..|..|||||||..|.:  ||||:.|..|.||||||
Zfish   344 TGFQGEPGAPGPPG----RYIEGPPGPPGFPGEIGKKGEKGADGRSFPGPKGTDGQPGPPGPPGP 404

  Fly   481 --------EGQKGDAGLPG-YGIQGSKGDA----------GIPGYPGLKGSKGERGFKGNAGAPG 526
                    :|..|..|.|| .|..|.|||.          |.||.||.:|.||.:||.|.||..|
Zfish   405 INEECDVTKGAPGPPGPPGLQGEVGQKGDQGETCTECNAFGPPGLPGPQGPKGLQGFPGPAGIKG 469

  Fly   527 DSKLGRPGTP-GAAGAPGQKGDAGRPGTPGQKGDM----GIKGDVGGKCSSCRAGPKGDKGTSGL 586
            :..|..|..| |.||..|..|..|:||..|:.||:    |:|||.|      ..|..||:|..|:
Zfish   470 EKGLPGPSGPAGNAGFNGAPGLMGKPGAQGEPGDIFVAPGLKGDKG------LPGTTGDRGFPGV 528

  Fly   587 PGIPGKDGARGPPGERGYP------GERGHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPALCD 645
            .|:|||||..|.||.:|.|      |:||.||..|..|||||:|..|..|: |..|:||.     
Zfish   529 DGLPGKDGRPGYPGTKGEPAKFGIKGDRGPDGDFGVPGPPGERGPPGEPGI-GRPGQPGD----- 587

  Fly   646 LSLIEPLKGDKGYPGAPGAKGVQGFKGAEG----LP---GIPGPKGEFGFKGEKGLSGAPGNDGT 703
                   ||..|:|||||..|:||.||..|    ||   |.|||:||.|..|.:|..|.||..|.
Zfish   588 -------KGSAGFPGAPGRPGLQGAKGEAGKVISLPGPQGAPGPRGESGRPGLQGDRGFPGERGV 645

  Fly   704 PGRAGRDGYPGIPGQSIKGEPGFHGRDGAKGDKGSFGRSGEKGEPGSCALDEIKMPAK----GNK 764
            ||..|..|.||:||..:.|.||..|..|..|..|..|.|||.|:.|        :|.:    |.|
Zfish   646 PGFPGDKGDPGLPGIGLPGPPGPKGYSGIPGPSGIPGESGEPGQDG--------LPGRAGTPGQK 702

  Fly   765 GEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLNGPRGEKGNQGAVGVPGNPGK 829
            ||||: |:|||.|..|.||..|:.|.|||||..|.||.||..|..||:|.|||      ||.||.
Zfish   703 GEPGR-GLPGPKGTTGLPGVPGFPGEKGNTGMPGVPGTEGRTGPPGPQGLKGN------PGPPGV 760

  Fly   830 DGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADGSVGYPGDRGD 894
            .|..|.||    |...||||  .||||||||           || |||      |..|..||:|.
Zfish   761 HGALGPPG----PPGLGEPG--APGPMGPPG-----------GP-GPI------GQSGIKGDKGY 801

  Fly   895 AGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLVGMPGNKGDRG 959
            .||||:.                   :.|||               |:.||||:.|:||:||..|
Zfish   802 QGLPGLD-------------------MPGPP---------------GDRGSPGVPGLPGSKGLPG 832

  Fly   960 APGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGIKGDQ 1024
            .||..|..||      ||:|||.|..|:.|..|..|..||.||.|..|.:|.||.||:.|..||.
Zfish   833 QPGVPGKDGF------PGERGPKGEIGIMGMAGPPGFQGLVGNPGDPGQKGDPGMPGIRGDFGDL 891

  Fly  1025 GLAGAPGQQGLDGMPGEKGNQGFPGLDGPPG---------LPGDASEKGQKGEPGPSGLRGDTGP 1080
                           |.||.:|.|||.||||         :.||..:.|.:|:|||:|.:|..|.
Zfish   892 ---------------GTKGERGEPGLQGPPGNMSDVDMEHMKGDKGDIGDRGDPGPTGEKGFPGT 941

  Fly  1081 AGTPGWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGA 1145
            :|.||.||:.|:|        |.||:.||:|..||.|:.|..||.|:.|.:|..|.||..|..||
Zfish   942 SGDPGMPGKDGMP--------GSPGQPGDKGDPGIIGKPGSIGEPGDTGARGDMGYPGSMGPKGA 998

  Fly  1146 PGIPGAPGMDGLPGAAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERG 1210
            .|:.|:||..|..|..|:      .||:||.||||: |:||..|..|.:|..||.|.||.||::|
Zfish   999 KGVGGSPGHPGFKGTDGS------KGDKGDPGEPGI-GIPGPPGTKGNIGSPGFPGEPGEKGQKG 1056

  Fly  1211 IRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGI 1275
            ..|..|.|.|    :|.||.||..||.|..|:.||:|:   .|:.|:.|:.|..|.||..||.|.
Zfish  1057 TMGMTGTPGT----QGHKGDQGSIGYPGSPGKPGEKGV---GGLPGSPGEPGTPGRPGEPGLQGP 1114

  Fly  1276 PGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGLVGLPGP 1340
            ||..|:.|..|:.|.||.|  ||:|.||.|||       |.||..||.|..|..|.|||.|:||.
Zfish  1115 PGPHGEKGQSGQDGIPGFT--GERGEPGVPGR-------GLPGSRGEPGGKGDKGNPGLPGIPGS 1170

  Fly  1341 IGPAGSKGERGLAGSP---GQPGQDGFPG--APGLKGDTGPQGFKGERGLNGFE---------GQ 1391
            .|..|:||::||:|.|   |:||:.|.||  ..|.|||.|.||..||||..|.:         |.
Zfish  1171 PGIPGTKGDKGLSGLPGDQGRPGERGLPGQAMEGPKGDRGDQGQPGERGATGEQGPPGIPGSAGS 1235

  Fly  1392 KGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456
            ||:|||.|.|   |.||..|.|||.|:.|::|.      ||..|:.||||..|..|.||.||.||
Zfish  1236 KGEKGDVGFQ---GAPGSRGYKGDKGFIGVSGE------PGLPGYNGPKGEMGPQGVPGFPGTKG 1291

  Fly  1457 EPGMLPP---------PGPKGEPGQPGRNGP----KGEPGRP---GERGLIGIQGERGEKGERGL 1505
            |.|::..         ||.||..|.||..||    ||:||.|   |.:||:||||..|.||::|:
Zfish  1292 EQGVIGSKGEGGDRGFPGLKGSEGPPGPPGPHTYVKGDPGPPGPQGPQGLVGIQGFPGAKGQQGV 1356

  Fly  1506 I---------GETGNVGRPGPKGDRGE---PGERGYEGAIGLIGQKGEPGAPAPAALDYLTGILI 1558
            :         |..|..|.||.||:.|:   ||.|||.|..|..|..|..|.|.|:::|:  |.|:
Zfish  1357 MGLQGLKGSDGTPGYNGHPGAKGEPGQTGPPGPRGYPGPPGPDGIPGHIGPPGPSSMDH--GFLV 1419

  Fly  1559 TRHSQSETVPACSAGHTELWTGYSLLYVDGNDYAHNQDLGSPGSCVPRFSTLPVLSCGQNNVCNY 1623
            |||||:..||.|..|.|.::.|||||||.||:.:|.||||:.|||:.:||.:|.|.|..|||||:
Zfish  1420 TRHSQTIDVPHCPQGTTPIYDGYSLLYVQGNERSHGQDLGTAGSCLRKFSPMPFLFCNINNVCNF 1484

  Fly  1624 ASRNDKTFWLTTNAAIP--MMPVENIEIRQYISRCVVCEAPANVIAVHSQTIEVPDCPNGWEGLW 1686
            |||||.::||||...:|  |.||....|:.:||||.||||||.|||||||||::|:||:||..||
Zfish  1485 ASRNDYSYWLTTPEPMPMSMAPVTGESIKPFISRCAVCEAPAMVIAVHSQTIQIPNCPDGWASLW 1549

  Fly  1687 IGYSFLMHTAVGNGGGGQALQSPGSCLEDFRATPFIECNGAKGTCHFYETMTSFWMYNLESSQPF 1751
            |||||:|||:.|..|.||||.|||||||:||:.|||||:| :|||::|....|||:..:|.:..|
Zfish  1550 IGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHG-RGTCNYYANSYSFWLATIEDTDMF 1613

  Fly  1752 ERPQQQTIKAGERQSHVSRCQVCMK 1776
            .:|...|:|||..::|:||||||||
Zfish  1614 TKPVPATLKAGSLRTHISRCQVCMK 1638

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 gly_rich_SclB <107..>361 CDD:468478 110/276 (40%)
gly_rich_SclB <355..>642 CDD:468478 139/318 (44%)
gly_rich_SclB <543..820 CDD:468478 131/297 (44%)
gly_rich_SclB <727..>968 CDD:468478 96/244 (39%)
gly_rich_SclB <969..>1218 CDD:468478 105/257 (41%)
gly_rich_SclB <1186..>1420 CDD:468478 111/247 (45%)
gly_rich_SclB <1321..>1547 CDD:468478 116/267 (43%)
C4 1555..1662 CDD:128421 60/108 (56%)
C4 1663..1777 CDD:128421 69/114 (61%)
col4a1XP_694040.5 gly_rich_SclB <40..>211 CDD:468478 82/198 (41%)
gly_rich_SclB <296..>542 CDD:468478 116/267 (43%)
gly_rich_SclB <469..>737 CDD:468478 127/295 (43%)
gly_rich_SclB <700..>972 CDD:468478 148/365 (41%)
gly_rich_SclB <890..>1142 CDD:468478 119/290 (41%)
gly_rich_SclB <1056..>1290 CDD:468478 115/258 (45%)
C4 1417..1524 CDD:460201 58/106 (55%)
C4 1527..1638 CDD:460201 66/111 (59%)

Return to query results.
Submit another query.