DRSC/TRiP Functional Genomics Resources


Protein Alignment Col4a1 and si:ch211-196i2.1

DIOPT Version: 9

Sequence 1: NP_723044.1  Gene: Col4a1 / 33727  FlyBaseID: FBgn0000299  Length: 1779  Species: Drosophila melanogaster
Sequence 2: XP_009293536.1  Gene: si:ch211-196i2.1 / 564099  ZFINID: ZDB-GENE-091204-150  Length: 1813  Species: Danio rerio


Alignment Length: 1714  Identity: 588/1714 (34%)
Similarity: 720/1714 (42%)  Gaps: 457/1714 (26%)
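
The percentages in the summary can be recomputed from the raw counts. The sketch below is only an illustration, not DIOPT's own code: the helper name `percent_truncated` is hypothetical, and truncation is an assumption inferred from the numbers (457/1714 is about 26.7%, which rounding to nearest would report as 27%, while the page shows 26%).

```python
def percent_truncated(count: int, length: int) -> int:
    """Integer percentage of count/length, truncated toward zero (assumed convention)."""
    return (100 * count) // length


ALIGNMENT_LENGTH = 1714
# Counts copied from the alignment summary.
summary = {"Identity": 588, "Similarity": 720, "Gaps": 457}

for name, count in summary.items():
    pct = percent_truncated(count, ALIGNMENT_LENGTH)
    print(f"{name}: {count}/{ALIGNMENT_LENGTH} ({pct}%)")
```

Under this assumption the three values come out as 34%, 42% and 26%, matching the summary.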


- Green bases have known domain annotations that are detailed below.


  Fly    28 TAGTAGSIQDSVKHYNRN-----------EPKFPIDDSYDIVDSAGVARGDLPPKNCTAGYAGCV 81
            |:...|.::.:.||.:.|           .||.|:|.:...:|.:....|...|...........
Zfish   337 TSSRDGKLESTSKHLDENITIDKHKPGKPLPKKPVDTTIINLDVSEKPTGSTVPSREIHLMTTSS 401

  Fly    82 PKCIAEKGNRGL-----------PGPL---------GPTGLKGEMGFPGMEGPSGDKGQKGDPGP 126
            |    |.|...|           ||..         |.|..|.    |.:...||. |.:..|..
Zfish   402 P----EDGESNLLKEDFEVTTQAPGRTSWDNVSKGSGRTPSKS----PDIHAGSGG-GSEEHPDT 457

  Fly   127 YGQRGDKGE--RGSPG--LHGQAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLEGLSGMPGPR 187
            ....|.:|:  |||.|  ...|.|..|..||.|.||.|                  |.:|.|||:
Zfish   458 IVLTGRQGDIVRGSDGKMYRIQRGPMGPMGPPGEPGCP------------------GNAGYPGPK 504

  Fly   188 GYAGQLGSKGEKGEPAKENGDYAKGEKGEPG------WRGT-----------------AGLAGPQ 229
            |..|..|.:|..|.|.      ..|..|.||      ||.|                 ||....|
Zfish   505 GDKGAFGVRGRPGRPG------FLGSPGPPGLPSFYLWRNTPEDWAAFQQTSFFQLLLAGWPRAQ 563

  Fly   230 GFPGEKGERGDSGPYGAKGPRGEHGLKGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTVM 294
            |.||.:||.|..   ||:||                   ||.|   ||:|.|             
Zfish   564 GLPGPEGEMGKP---GAQGP-------------------PGEP---GERGPP------------- 590

  Fly   295 GPRGDMGQKGEPGLVGRKGEPGPEGDTGLDGQKGEKGLPGGPGDRGRQGNFGPPGSTGQKGDRGE 359
            |..||||.:|..|:|||.|..|.:|:.||||.:|..|:|      |.:|..|..|.:|..|::||
Zfish   591 GRMGDMGDRGPKGVVGRAGVYGRDGENGLDGPQGPSGIP------GPKGPLGYKGESGSHGEKGE 649

  Fly   360 PGLNGLPGNPGQKGEPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGPQGLNGVD 424
            .|:.|..|..|.||:||..|:                :|.||.||..||      |||||:.|::
Zfish   650 EGITGSEGPCGDKGKPGEKGS----------------KGAPGAPGAVGP------PGPQGIRGME 692

  Fly   425 GL---PGPQGYNGQKGGAGLPGRPGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPEGQKGD 486
            ||   |||.|.:|..|..||||.||..|..|..|.:|..|..|..|..|.:|.|||.||:|.:|.
Zfish   693 GLEGHPGPDGEDGMNGSPGLPGAPGAPGWTGLIGAQGANGSRGEPGPSGAVGLPGPQGPQGLEGQ 757

  Fly   487 AGLPGYGIQGSKGDAGIPGYPGLKGSKGERGFKGNAGAPGDSKLGRPGTPGAAGAPGQKGDAGRP 551
            .|.|     |.:|..|:.|..||.|.|||.|..|..|..||              ||.:|..|..
Zfish   758 IGPP-----GQRGPPGLSGREGLHGPKGEPGPVGPVGIRGD--------------PGFEGLKGVL 803

  Fly   552 GTPGQKGDMGIKGDVGGKCSSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGYPGERGHDGINGQ 616
            ||||.||..||||:.|........||||:||..|..|:.|..|.:||.|..|:|      ||.|.
Zfish   804 GTPGPKGFTGIKGNRGPDGDQGDIGPKGNKGARGAAGVQGIQGEQGPIGFPGFP------GIRGP 862

  Fly   617 TGPPGEKGEDGRTGLPGATGEPGKPALCDLSLIEPLKGDKGYPGAPGAKGVQGFKGAEGLPGIPG 681
            .||.|.:||||..|..|..|:                           :|..|.:|.|||.|.||
Zfish   863 PGPQGNQGEDGEAGPHGKVGK---------------------------EGSTGSRGPEGLSGKPG 900

  Fly   682 PKGEFGFKGEKGLSGAPGNDGTPGRAGRDGYPGIPGQSIKGEPGFHGRDGAKGDKGSFGRSGEKG 746
            |:|..|..|::|.:|.||.            ||:||.  .|..|..|:.|.:|.:|..|.:|.||
Zfish   901 PRGLKGRTGQRGHTGMPGP------------PGLPGP--PGPFGAEGKPGTQGLQGLVGNAGNKG 951

  Fly   747 EPGSCALDEIKMPAKGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLNGP 811
                         :.|.:|.||:.|..||||..|.||..         ||.|.||..|.||..|.
Zfish   952 -------------STGLQGIPGERGQDGPPGTMGPPGRE---------GPMGSPGPPGERGFGGK 994

  Fly   812 RGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPTGP 876
            .|.||.||.||:.|.||..|:||.||..|.   ||||                    |.||||||
Zfish   995 TGMKGPQGPVGMYGFPGTVGMRGRPGHTGD---RGEP--------------------GPRGPTGP 1036

  Fly   877 IGFPGADGSVGYPGDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGAKG 941
            ||..|..|.   ||..|:.||||..||.|..|.:|..||.||.|..|..||.|..|.||..|..|
Zfish  1037 IGKTGVPGP---PGPVGEKGLPGPQGRQGDPGHQGTDGPCGPPGRHGVSGVIGKPGPRGSIGPPG 1098

  Fly   942 EPGSPGLVGMPGNKGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPV 1006
            |||..|..|:||.:|:.|.||.||.||..|:.|:||..||.|:.|..|..|..|..|..|..|..
Zfish  1099 EPGPSGPPGIPGPQGENGVPGIDGEKGEKGMMGSPGDDGPLGLEGQQGLTGPAGLDGAPGRKGDK 1163

  Fly  1007 GGRGPPGAPGLMGIKGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPGP 1071
            |.|||.|..||.|.|||   .|..|.:|..|.||..||:|.   .||.|:.||...||:||:.||
Zfish  1164 GDRGPIGTSGLQGPKGD---IGPVGDKGPPGPPGLVGNEGD---SGPAGVAGDPGSKGEKGDIGP 1222

  Fly  1072 SGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQ 1136
            .|..|..||     |                  |:.|:||..||.|..|..|.:||.||.|..|.
Zfish  1223 PGFTGLEGP-----W------------------GDTGEQGEKGIKGAKGQIGMQGEPGLVGGVGH 1264

  Fly  1137 PGEKGSVGAPGIPGAPGMDGLPGAAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTG 1201
            ||.||:.|.||.||..|.||..|:.|..|..|.||:.|:.|..|::|..|::|:||..|.||..|
Zfish  1265 PGLKGAEGVPGHPGLVGPDGPAGSKGKTGPSGSPGEPGNPGPEGIAGPKGVRGDTGKGGTQGGVG 1329

  Fly  1202 APGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGP 1266
            ..||.|.                .|.||..|..|..||.|::||.||.||.|:      :|:|||
Zfish  1330 LVGPIGP----------------LGPKGLSGPEGEKGEPGDKGEPGLEGPDGL------QGVQGP 1372

  Fly  1267 PGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGE 1331
            ||..||.|                    .:||.|:.|.||..|.||.:|.||..|..|:.|..|:
Zfish  1373 PGLPGLEG--------------------SQGEAGMRGPPGLPGAQGGVGKPGPDGREGMKGEKGD 1417

  Fly  1332 PGLVGLPGPIGPAGSKGERGLAGSPGQPGQDGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKG 1396
            .|..|.||.:|..|.:|:||.|               ||:...||:|.|||         |||.|
Zfish  1418 QGKNGAPGKLGHIGRRGKRGKA---------------GLRATRGPRGEKGE---------KGDVG 1458

  Fly  1397 DRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGML 1461
            ::||.|..|..|:.|.:|:.      |::|..||.||:|..||.|..||||..|..|..|.||  
Zfish  1459 EKGLPGWGGSMGIQGPRGEP------GDEGVKGAEGEKGDQGPFGSPGRDGIKGSLGPLGPPG-- 1515

  Fly  1462 PPPGPKGEPGQPGRNGPKGEPGRPGERGLIGIQGERGEKGERGLIGETGNVGRPGPKGDRGEPGE 1526
             ||||||:.|..|.:||:|.||.||..||.|.:|.:|.:|.            |||.|.||.|  
Zfish  1516 -PPGPKGDQGNIGPSGPRGTPGIPGLPGLFGDKGLKGFQGP------------PGPPGTRGLP-- 1565

  Fly  1527 RGYEGAIGLIGQKGEPGAPAPAALDY-LTGILITRHSQSETVPACSAGHT--------------- 1575
                         |.||.|.|..:.. ||.|.|.....|...|..|...|               
Zfish  1566 -------------GPPGPPGPPGISVNLTLIQIKDLMYSSDKPNFSLIQTLLDSLYHDLQLLVDP 1617

  Fly  1576 -------------ELW-------TGYSLLYVDGND---------YAHNQDLGSPGSCVPRFSTLP 1611
                         |||       :|:  .|:|.|.         |.:..:.|:.....||.:.||
Zfish  1618 PDGTKQHPASTCLELWLCHPNYTSGF--YYIDPNQGSPLDALLAYCNFSESGAETCLHPRDAHLP 1680

  Fly  1612 VLSCGQNNVCNYASRNDKTFWLTT 1635
            .     ....|.:..|.|..||::
Zfish  1681 T-----RTWLNDSGNNSKFHWLSS 1699
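
The identity, similarity and gap counts can in principle be tallied from the three rows of each alignment block. The following is a minimal sketch, assuming the middle row uses `|` for identical residues, `:` for similar residues and `.` for other mismatches, and that `-` marks a gap. That reading is inferred from the final block (for example A/S and T/S sit under `:`, C/L under `.`) and is not taken from DIOPT documentation. The function name `tally_columns` is hypothetical, and the rows must already be stripped of their labels and coordinates.

```python
from collections import Counter


def tally_columns(top: str, match: str, bottom: str) -> Counter:
    """Classify each column of one alignment block.

    top/bottom are the two sequence rows, match is the middle row.
    All three must cover the same columns.
    """
    if not (len(top) == len(match) == len(bottom)):
        raise ValueError("rows must have the same length")
    counts = Counter()
    for a, m, b in zip(top, match, bottom):
        if a == "-" or b == "-":
            counts["gap"] += 1
        elif m == "|":
            counts["identical"] += 1
        elif m == ":":
            counts["similar"] += 1  # similar but not identical (assumed meaning)
        else:
            counts["mismatch"] += 1
    return counts


# Final block of the alignment above (Fly 1612..1635, Zfish 1681..1699).
fly = "VLSCGQNNVCNYASRNDKTFWLTT"
mid = ".     ....|.:..|.|..||::"
zfish = "T-----RTWLNDSGNNSKFHWLSS"

counts = tally_columns(fly, mid, zfish)
print(dict(counts))
```

For this block the tally is 5 identical, 3 similar, 11 mismatched and 5 gap columns. Summing such tallies over all blocks, with similarity taken as identical plus similar columns, would be one way to cross-check the summary figures.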

Known Domains:


Indicated by green bases in alignment.

Gene              Sequence        Domain    Region      External ID  Identity
Col4a1            NP_723044.1     Collagen  104..161    CDD:189968   19/60 (32%)
                                  Collagen  322..380    CDD:189968   23/57 (40%)
                                  Collagen  413..465    CDD:189968   25/54 (46%)
                                  Collagen  499..561    CDD:189968   24/61 (39%)
                                  Collagen  574..632    CDD:189968   26/57 (46%)
                                  Collagen  657..714    CDD:189968   17/56 (30%)
                                  Collagen  765..824    CDD:189968   27/58 (47%)
                                  Collagen  854..911    CDD:189968   22/56 (39%)
                                  Collagen  884..943    CDD:189968   27/58 (47%)
                                  Collagen  923..982    CDD:189968   29/58 (50%)
                                  Collagen  1028..1085  CDD:189968   24/56 (43%)
                                  Collagen  1229..1287  CDD:189968   22/57 (39%)
                                  Collagen  1318..1376  CDD:189968   17/57 (30%)
                                  Collagen  1399..1458  CDD:189968   24/58 (41%)
                                  Collagen  1477..1534  CDD:189968   20/56 (36%)
                                  C4        1555..1662  CDD:128421   24/125 (19%)
                                  C4        1663..1777  CDD:128421
si:ch211-196i2.1  XP_009293536.1  LamG      67..217     CDD:304605
                                  Collagen  573..631    CDD:189968   32/101 (32%)
                                  Collagen  618..677    CDD:189968   27/80 (34%)
                                  Collagen  663..720    CDD:189968   31/78 (40%)
                                  Collagen  699..757    CDD:189968   26/57 (46%)
                                  Collagen  819..877    CDD:189968   26/63 (41%)
                                  Collagen  855..907    CDD:189968   26/84 (31%)
                                  Collagen  1002..1076  CDD:189968   42/99 (42%)
                                  Collagen  1200..1257  CDD:189968   28/82 (34%)
                                  Collagen  1254..1312  CDD:189968   27/57 (47%)
                                  Collagen  1338..1396  CDD:189968   30/83 (36%)
                                  Collagen  1371..1425  CDD:189968   25/73 (34%)
                                  Collagen  1434..1491  CDD:189968   29/86 (34%)
                                  COLFI     1618..1813  CDD:295304   18/89 (20%)
Blue background indicates that the domain is not in the aligned region.
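
Each Identity cell in the domain table has the form `matches/span (percent%)`. The sketch below parses such a cell and recomputes the percentage. It is an illustration only: `parse_identity` and `percent_nearest` are hypothetical helper names, and rounding to the nearest integer is an assumption that happens to reproduce the values listed above (for example 26/57 is about 45.6% and is listed as 46%).

```python
import re

# Matches cells such as "19/60 (32%)".
IDENTITY_RE = re.compile(r"^(\d+)/(\d+) \((\d+)%\)$")


def parse_identity(cell: str) -> tuple[int, int, int]:
    """Return (matches, span, listed_percent) from an Identity cell."""
    m = IDENTITY_RE.match(cell.strip())
    if m is None:
        raise ValueError(f"unrecognised identity cell: {cell!r}")
    matches, span, listed = (int(g) for g in m.groups())
    return matches, span, listed


def percent_nearest(matches: int, span: int) -> int:
    """Integer percentage rounded half up, using integer arithmetic only."""
    return (200 * matches + span) // (2 * span)


# A few cells copied from the table.
for cell in ["19/60 (32%)", "26/57 (46%)", "24/125 (19%)", "32/101 (32%)"]:
    matches, span, listed = parse_identity(cell)
    print(cell, "recomputed:", percent_nearest(matches, span), "listed:", listed)
```

Rows without an Identity value, such as the second C4 domain and the LamG domain, have no cell to parse and would need to be skipped by the caller.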


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result Score / Score Type / Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
ZFIN            0             0.000           Not matched by this tool.
Total           1             0.910
