DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and Col4a5

DIOPT Version :10

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:NP_001413925.1 Gene:Col4a5 / 363457 RGDID:1565499 Length:1691 Species:Rattus norvegicus


Alignment Length:1789 Identity:819/1789 - (45%)
Similarity:972/1789 - (54%) Gaps:212/1789 - (11%)


- Green bases have known domain annotations that are detailed below.


  Fly    73 CTAGYAGCVPKC-----IAEKGNRGLPGPLGPTGLKGEMGFPGMEGPSGDKGQKGDPGPYGQRGD 132
            |::|     .||     ..|||.||.||..|..||.   ||||.|||.|.:|||||.|..|..|.
  Rat    32 CSSG-----SKCDCSGIKGEKGERGFPGLEGHPGLP---GFPGPEGPPGPRGQKGDDGIPGPPGP 88

  Fly   133 KGERGSPGLHGQAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLEGLSGMPGPRGYAGQLGSKG 197
            ||.||.|||.|..|.||:.|..|:.||||..|..||:|..|..|..|..|.||.:|..|..|..|
  Rat    89 KGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQGIPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPG 153

  Fly   198 EKGEPAKENGDYAKGEKGEPGWRGTAGLAGPQGFPGEKGERGDSGPYGAKGPRGEHGLKGEKGAS 262
            .||||.........|.||..|:.|..|:.||   ||.||..|.|||                   
  Rat   154 MKGEPGSIIMSSLPGPKGNQGYPGPPGIQGP---PGPKGVSGPSGP------------------- 196

  Fly   263 CYGPMKPGAPGIKGEKGEPASSFPVKPTHTVMGPRGDM-----GQKGEPGLVGRKGEPGPEGDTG 322
                  ||.||:.|..|.|...          ||:|:|     |.|||.|..|.:|.|||.|...
  Rat   197 ------PGPPGLMGPPGPPGPP----------GPKGNMGLNFQGPKGEKGEQGLQGPPGPPGQIS 245

  Fly   323 -------LDGQKGEKGLPGGPGDRGRQGNFGPPGST-GQKGDRGEPGLNGLPGNPGQKGEPGRAG 379
                   ||.|||::|:||..|..|..|..||||.. |.||::||      ||.||::|:||:.|
  Rat   246 EQKRPIDLDFQKGDQGVPGDRGPPGPPGIPGPPGPPGGVKGEKGE------PGEPGKRGKPGKDG 304

  Fly   380 ATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGLPGPQGYNGQKGGAGLPGR 444
            .||:||:.|.||.| |..|.||..|.||.:|..|.|||.|.  |...||.....|.||..|.||.
  Rat   305 ETGQPGIPGLPGDP-GYPGEPGRHGEKGQKGDTGPPGPPGF--VIPTPGTGVTIGAKGNIGQPGL 366

  Fly   445 PGNEGPPGKKGEKGTAGLNGPKGS--IGPIGHPGPPGPEGQKGDAGLPGYGIQGSKGDAGIPGYP 507
            ||.:|..|..|.:|..|..||.|:  :||.|.||.||..|||||.|.||         ..|||.|
  Rat   367 PGEKGERGFPGIQGPPGFPGPPGTAVMGPPGPPGYPGERGQKGDEGPPG---------ICIPGSP 422

  Fly   508 GLKGSKGERGFKGNAGAPGDS--------KLGRPGTPGAAGAPGQKGDAGRPGTPGQKGDMGIKG 564
            ||.|..|..||.|..|.||..        |.|.||.||:            ||..|.:|:.|:||
  Rat   423 GLDGQPGAPGFPGPPGPPGPQLPPSDEICKAGPPGPPGS------------PGDQGLQGEQGVKG 475

  Fly   565 DVGGKCSSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGYPGERGHDGINGQTGP---PGEKGED 626
            |.|..|.:|..     .|.||.||.||..|..||||..|.|||:|..|..|.|||   ||..|..
  Rat   476 DKGDTCFNCIG-----TGISGPPGQPGLPGLPGPPGSLGIPGEKGEKGQAGTTGPKGLPGIPGPP 535

  Fly   627 GRTGLPGATGEPGKPALCDLSLIEPLKGDK---GYPGAPGAKGVQGFKGAEGLPGIPGPKGEFG- 687
            |..|.||:.|:||     |:..:..:||:|   |.|||||..|:.|..|.:||||:||||||.| 
  Rat   536 GAPGFPGSKGDPG-----DVLTLPGMKGEKGELGSPGAPGLPGLPGSPGKDGLPGLPGPKGEPGG 595

  Fly   688 --FKGEKGLSGAPGNDGTPGRAGRDGYPGIPGQSIKGEPGFHGRDGAKGDKGSFGRSGEKGEPGS 750
              ||||:   |.|||.|.||..|..|..|..|....|..|..|..|..|:.|..|..|.||:|| 
  Rat   596 ITFKGER---GPPGNPGLPGLPGNMGPTGPVGFGPPGPVGEKGIQGVAGNPGQPGLPGPKGDPG- 656

  Fly   751 CALDEIKMPAK-GNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLNGPRGE 814
               ..|..|.| |..|.||:.|..|.||:.|.||:.|..|:.|:.|..|.||: ||.|..||:|.
  Rat   657 ---QTISQPGKPGLPGNPGRDGEVGLPGDPGLPGQPGLPGIPGSKGEPGIPGI-GPPGPPGPKGF 717

  Fly   815 KGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQ---GEKGDRGPTGP 876
            .|..|..|.||.||:.|..|.||..|.|||:||||...|||.||||..|.:   |.|||||..||
  Rat   718 PGIPGPPGPPGAPGRMGPEGPPGLPGFPGPKGEPGFGLPGPPGPPGFPGFKGTPGPKGDRGLPGP 782

  Fly   877 IGFPGADGSVGYPGDRGDA---GLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDG 938
            .|.||..|..|.||.:|||   |.||..|.||:.| .|..||.||.|..||.|.||:.|:   .|
  Rat   783 PGPPGHTGLDGLPGPKGDAGPNGQPGPIGPPGLPG-IGLQGPPGPPGTPGPIGQPGLHGI---PG 843

  Fly   939 AKGEPGSPGLVGMPGNKGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGND 1003
            .||:||.||| .:||..|:||:||..|..|..|..|:||..|.||:||..|.||:.|..|..|..
  Rat   844 EKGDPGPPGL-DVPGLPGERGSPGLPGAPGLIGPPGSPGIPGRAGVPGFPGVKGEMGMMGPPGPP 907

  Fly  1004 GPVGGRGPPGAPGLMGIKGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPG--LPGDASEKGQK 1066
            ||:|..|..|||||   |||.|:.|.|   ||.|..||||.:|.|||.||||  .|.....||:|
  Rat   908 GPLGTPGRSGAPGL---KGDDGMQGQP---GLPGPVGEKGGKGEPGLPGPPGPMRPDFLGSKGEK 966

  Fly  1067 GEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQ 1131
            |:||..|:.|.:||.|..|.||:.|.|||:  |:.|.||..|.:|..|:.|..|:.         
  Rat   967 GDPGLPGIPGLSGPKGYQGLPGDPGQPGLS--GQPGSPGLPGPKGTPGLPGPPGLT--------- 1020

  Fly  1132 GVWGQPGEKGSVGAPGIPGAPGMD---GLPGAAGAPGAVGYPGDRGDKGEPGLS--GLPGLKGET 1191
               |.||.||::|..|.||..|:|   |.||..|.||:.|.||.:|.|||||:|  |||||.|..
  Rat  1021 ---GPPGPKGNIGDMGFPGPQGVDGPPGPPGGPGQPGSPGLPGQKGSKGEPGVSGIGLPGLPGPK 1082

  Fly  1192 GPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAG 1256
            |..||.|:.|.||.||..|..|.||||.:    .|.||..|..|:.|..|..|.:|:.||.|..|
  Rat  1083 GEPGLPGYPGNPGIKGSMGEAGLPGLPGS----PGAKGQPGLPGFPGTPGLPGPKGMNGPPGNPG 1143

  Fly  1257 AKGD------RGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIG 1315
            ..|:      .|..||||..|..|.||..|..||.|:.|.||....|..|.||.||.:|::|..|
  Rat  1144 LPGEPGPVGGGGRPGPPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGIPGPPGLPGLSGQKGDGG 1208

  Fly  1316 APGLIGERGLPGLAGEP---GLVGLPGPIGPAGS--------KGERGLAGSPGQPGQDGFPGAPG 1369
            ||||.|..||||..|||   |..|:|||.||.||        ||..|..|.||:||..||.|.||
  Rat  1209 APGLPGNPGLPGSKGEPGFQGFPGMPGPPGPPGSPGPSLEGPKGNPGPQGPPGRPGPPGFQGPPG 1273

  Fly  1370 LKGDTGPQ---GFKGERGLNGFEGQ------KGDKGDRGLQGPSGLPGLVGQKGDTGYPGL---N 1422
            .:|..||.   |.|||||..|..||      |||:|..||.|..|.|||.|.|||.|.||:   .
  Rat  1274 TEGPRGPPGNGGIKGERGNPGPPGQPGLPGLKGDQGPPGLPGNPGRPGLNGMKGDPGLPGVPGFP 1338

  Fly  1423 GNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGMLP-PPGPKGEPGQPGRNGPKGEPGRPG 1486
            |..||:|.||..|..|..|..|..|.|||||..|:..::. .|||.|.|||||.   ||.||.||
  Rat  1339 GMKGPIGVPGSTGPDGEPGLTGPPGPPGLPGPSGQSIVIKGDPGPPGIPGQPGL---KGLPGLPG 1400

  Fly  1487 ERGLIGIQGERGEKGERGLIGETGNVGRPGPKGDRGEPGERGYEGAIGLIGQKGEPGAPAPAALD 1551
            .:||.|..|..|:.|..||.|..|..||.|..|..|:||.||.:|..|..|.:|.||.|...::.
  Rat  1401 PQGLPGPIGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGSRGLDGPPGPDGLQGPPGPPGTTSVA 1465

  Fly  1552 YLTGILITRHSQSETVPACSAGHTELWTGYSLLYVDGNDYAHNQDLGSPGSCVPRFSTLPVLSCG 1616
            :  |.|||||||:...|.|..|...::.|:|||||.||..||.||||:.|||:.||||:|.:.|.
  Rat  1466 H--GFLITRHSQTTDAPQCPQGTVHIYDGFSLLYVQGNKRAHGQDLGTAGSCLRRFSTMPFMFCN 1528

  Fly  1617 QNNVCNYASRNDKTFWLTTNAAIP--MMPVENIEIRQYISRCVVCEAPANVIAVHSQTIEVPDCP 1679
            .|||||:|||||.::||:|...:|  |.|::...|:.:||||.||||||.|||||||||::|.||
  Rat  1529 INNVCNFASRNDYSYWLSTPEPMPMNMEPLKGQSIQPFISRCAVCEAPAVVIAVHSQTIQIPHCP 1593

  Fly  1680 NGWEGLWIGYSFLMHTAVGNGGGGQALQSPGSCLEDFRATPFIECNGAKGTCHFYETMTSFWMYN 1744
            .||:.|||||||:|||:.|..|.||||.|||||||:||:.|||||:| :|||::|....|||:..
  Rat  1594 QGWDSLWIGYSFMMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHG-RGTCNYYANSYSFWLAT 1657

  Fly  1745 LESSQPFERPQQQTIKAGERQSHVSRCQVCMKNS 1778
            ::.|..|.:||.:|:|||:.::.:||||||||.:
  Rat  1658 VDMSDMFSKPQSETLKAGDLRTRISRCQVCMKRT 1691

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 gly_rich_SclB <107..>361 CDD:468478 109/266 (41%)
gly_rich_SclB <355..>642 CDD:468478 129/299 (43%)
gly_rich_SclB <543..820 CDD:468478 122/286 (43%)
gly_rich_SclB <727..>968 CDD:468478 117/247 (47%)
gly_rich_SclB <969..>1218 CDD:468478 119/255 (47%)
gly_rich_SclB <1186..>1420 CDD:468478 119/259 (46%)
gly_rich_SclB <1321..>1547 CDD:468478 118/249 (47%)
C4 1555..1662 CDD:128421 58/108 (54%)
C4 1663..1777 CDD:128421 68/113 (60%)
Col4a5NP_001413925.1 None

Return to query results.
Submit another query.