DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and col-135

DIOPT Version :10

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:NP_001255859.1 Gene:col-135 / 3565413 WormBaseID:WBGene00000708 Length:660 Species:Caenorhabditis elegans


Alignment Length:922 Identity:329/922 - (35%)
Similarity:392/922 - (42%) Gaps:326/922 - (35%)


- Green bases have known domain annotations that are detailed below.


  Fly   589 IPGKDGARGPPGERGYPGERGHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPALCDLSLIEPLK 653
            :||..|..||||..  .|:.|..|.||:.|.||:||:         ||:               |
 Worm    59 LPGPPGPPGPPGTG--TGKDGVPGTNGKDGTPGDKGD---------TGD---------------K 97

  Fly   654 GDKGYPGAPGAKGVQGFKGAEGLPGIPGPKGEFGFKGEKGLSGAPGNDGTPGRAGRDGYPGIPGQ 718
            ||||..||||.                            |:.|.||..|.||..|:||..|.||.
 Worm    98 GDKGDTGAPGV----------------------------GVKGDPGAQGPPGEKGKDGDKGAPGA 134

  Fly   719 SIKGEPGFHGRDGAKGDKGSFGRSGEKGEPGSCALDEIKMPAKGNKGEPGQTGMPGPPGEDGSPG 783
            .     |..|.||.||||||.|..||||.||.          ||:|||.|:.|..|.||..|..|
 Worm   135 K-----GSKGDDGKKGDKGSSGEKGEKGSPGQ----------KGDKGEKGEKGSNGSPGSKGDKG 184

  Fly   784 ERGYTGLKGNTGPQGPPGVEGPRGLN---GPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPR 845
            |:   |..||.||:|..|.:|.:|.:   |.:|:||:.||.|..||||..|..|..|..|..|.:
 Worm   185 EK---GADGNPGPKGDKGADGAKGADGTPGSKGDKGSDGAKGADGNPGSKGDTGDKGEKGSDGAK 246

  Fly   846 GEPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADGSVGYPGDRGDAGLPGVSGRPGIVGEK 910
            |:           .|.||..|.|||:|..|..|..||||:   ||.:||.|..|..|..|..|:|
 Worm   247 GD-----------KGANGTPGSKGDKGEKGADGAKGADGT---PGSKGDKGADGAKGADGTPGQK 297

  Fly   911 GDVGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLVGMPGNKGDRGAPGNDGPKGFAGVTGA 975
            ||                  .|.:|.||.||:.|:.|..|..|:|||:||   ||.|   |..|.
 Worm   298 GD------------------KGEKGLDGPKGDKGADGTPGAKGDKGDKGA---DGAK---GADGT 338

  Fly   976 PGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGIKGDQGLAGAPGQQGLDGMPG 1040
            ||.:|..|..|:.|.||||||      ||..|..|.|||      |||:|..|..|.:|.||.||
 Worm   339 PGSKGDKGEKGLDGPKGDKGA------DGAKGANGTPGA------KGDKGEKGTDGAKGADGTPG 391

  Fly  1041 EKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGPPG 1105
            .||::|..|||||                     :||.|..||||..|:|               
 Worm   392 AKGDKGEKGLDGP---------------------KGDKGADGTPGAKGDK--------------- 420

  Fly  1106 EKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGAPGAVGYP 1170
                                            ||||:.||.|..|.||..|..||.||.||.|.|
 Worm   421 --------------------------------GEKGADGAKGADGTPGSKGDKGADGAKGADGTP 453

  Fly  1171 GDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERG 1235
            |.:|||||.||.                     ||||::|..|.||       .:|||       
 Worm   454 GQKGDKGEKGLD---------------------GPKGDKGADGTPG-------AKGDK------- 483

  Fly  1236 YTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKG 1300
              ||||..|.:|..|..|..|.|||:||.||.|..|.:|.||:|||.|.:|:||.|         
 Worm   484 --GEKGADGAKGADGTPGSKGDKGDKGLDGPKGDKGADGTPGSKGDKGEKGDIGQP--------- 537

  Fly  1301 LPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGLVGLPGPIGPAGSKGERGLAGSPGQPGQDGFP 1365
                                                     |..|.|||:|..|.|||.||||.|
 Worm   538 -----------------------------------------GQKGDKGEKGQDGQPGQKGQDGQP 561

  Fly  1366 GAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGN---DGP 1427
                                    ||||||||:| ||..|..|..||||..|.||..|:   .|.
 Worm   562 ------------------------GQKGDKGDKG-QGEKGQDGQPGQKGQDGQPGQKGDKGEKGD 601

  Fly  1428 VGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGMLPPPGPKGEPGQPGRNGPKGEPGRPGERGLIG 1492
            :|.||::|..|.||:||:      |||||:.|.   ||.||:.||||:.|.|||.|.        
 Worm   602 IGQPGQKGDKGDKGQDGQ------PGQKGQDGQ---PGQKGQDGQPGQKGDKGEKGE-------- 649

  Fly  1493 IQGERGEKGERG 1504
             :||:||||::|
 Worm   650 -KGEKGEKGQKG 660

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 gly_rich_SclB <107..>361 CDD:468478
gly_rich_SclB <355..>642 CDD:468478 19/52 (37%)
gly_rich_SclB <543..820 CDD:468478 84/233 (36%)
gly_rich_SclB <727..>968 CDD:468478 98/243 (40%)
gly_rich_SclB <969..>1218 CDD:468478 84/248 (34%)
gly_rich_SclB <1186..>1420 CDD:468478 73/233 (31%)
gly_rich_SclB <1321..>1547 CDD:468478 72/187 (39%)
C4 1555..1662 CDD:128421
C4 1663..1777 CDD:128421
col-135NP_001255859.1 gly_rich_SclB <66..>256 CDD:468478 97/272 (36%)
gly_rich_SclB <184..>409 CDD:468478 114/298 (38%)
gly_rich_SclB <343..>567 CDD:468478 131/414 (32%)
Collagen 582..638 CDD:460189 29/64 (45%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.