DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and col4a2

DIOPT Version :10

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:XP_002933063.2 Gene:col4a2 / 100486079 XenbaseID:XB-GENE-5950101 Length:1722 Species:Xenopus tropicalis


Alignment Length:1871 Identity:773/1871 - (41%)
Similarity:966/1871 - (51%) Gaps:342/1871 - (18%)


- Green bases have known domain annotations that are detailed below.


  Fly    52 DDSYDIVDSAGVARGDLPPKNCTAGYAGCVPKCIAEKGNRGLPGPLGPTGLKGE---MGFPGMEG 113
            |.||         .|....::|:   .||  :|:.|||:||.|||||..|..|.   :|..|::|
 Frog    48 DKSY---------TGPCGGRDCS---QGC--QCLPEKGSRGQPGPLGGQGTSGPPGLIGIAGLQG 98

  Fly   114 PSGDKGQKGDPGPYGQRGDKGERGSPGLHGQAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLE 178
            ..||||::|.||..|..||||:.|..|..|..||||..|..|..|.|   |.|||:|..|..|:.
 Frog    99 RKGDKGERGFPGVTGPSGDKGQSGVTGFPGADGVPGHTGQGGPRGKP---GHDGCNGTQGDAGVG 160

  Fly   179 GLSGMPGPRGYAGQLGSKGEKGEP---AKENGDYAKGEKGEPGWR---------GTAGLAGPQGF 231
            |..|..||.|.:|..|:||:||||   :.|..:..:|:.||.|:.         ||||..||||:
 Frog   161 GSHGSRGPPGISGGFGAKGQKGEPFHVSIEVKNRLRGDPGEAGFNGIQGPRGSPGTAGFIGPQGY 225

  Fly   232 PGEKGERGDSGPYGAKGPRGEHGL--KGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTV- 293
               :|.||..||.|.:||:|..||  .||||.    |..||.||.:..:||.      :..|.: 
 Frog   226 ---RGPRGPPGPPGPQGPQGNRGLGYYGEKGE----PGDPGPPGEQPRQGER------EQKHKIE 277

  Fly   294 ------MGPRGDMGQKGEPGLVGRKGEPGPEGDTGLDGQKGEKGLPGGPGDRGRQGNFGPPGSTG 352
                  .|.:||.|:||.|   |.||:|      .:.|:|||:|:.|.||.||..||   .|::|
 Frog   278 ILLLQYKGAKGDQGEKGNP---GDKGQP------AVTGEKGEEGITGFPGQRGFPGN---DGNSG 330

  Fly   353 QKGDRGEPGLNGLPGNPGQKGEPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGP 417
            ..|:.|.||.||:||.||.||:.|..|.      |||.||.   ....|....||..|.:|.||.
 Frog   331 ASGEIGFPGANGVPGLPGTKGKKGEVGD------LGPQGPV---TFVNGQQKRKGYEGEIGFPGL 386

  Fly   418 QGLNGVDGLPGPQGYNGQKGGAGLPGR-PGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPE 481
            .|:.|.||.||..|:.||.|.   |.| ||..|..|.||::|..|..|..|.  |...|||||.:
 Frog   387 NGVYGDDGDPGDPGFPGQNGS---PSRFPGQTGDTGLKGQRGPKGYQGDTGL--PAFQPGPPGID 446

  Fly   482 ---GQKGDAGLPGY-----------GIQGSKGDAGIPGYPGLKGSKGERG----------FKGNA 522
               |::|..|.|||           |..|..|..|.||..||:|.|||:|          .:|.|
 Frog   447 GAPGKEGPPGQPGYPGGRIGMKGLPGFPGRDGQRGAPGSRGLRGIKGEQGECRCYEGDEAGRGPA 511

  Fly   523 GAPGDSKLGRPGTPGAAGAPGQKGDAGRPGTPGQKGDMGI---------KGDVGGKCSSCRAGPK 578
            |.||  .:|.||..|.||..|:.||.|..|.||..|.:|:         .|:.|.|..:...|.|
 Frog   512 GEPG--PVGFPGFQGHAGRKGEAGDRGVQGLPGAPGTVGLAGPAGFPGPAGEKGDKVFATEKGSK 574

  Fly   579 GDKGTSGLPGIPGKDGARGPPGERGYPGERGHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPAL 643
            |.:|.||.||:||:|         ||||..|.||..|..||.|    ||..||||:.|.||.|..
 Frog   575 GAQGDSGFPGVPGRD---------GYPGRNGRDGSPGFPGPSG----DGIKGLPGSHGFPGLPGN 626

  Fly   644 CDLSLIEPLKGDKGYPGAPGAKGVQGFKGAEGLPGIPGPKGEFGFKGEKGLSGAPGN-------- 700
            ..      .:|:.|:||       .||.|::||.|.||.:|..||.|..||||..|:        
 Frog   627 AG------ARGETGFPG-------PGFPGSKGLRGEPGNRGVSGFPGVPGLSGPQGDCYQETDEE 678

  Fly   701 -----DGT-------PGRAGRDGYPGIPG-QSIKGEPGF---HGRDGA---KGDKGSFGRSGEKG 746
                 :||       |..||..|.||:|| ..|:|:.||   ||.||.   ||.||..|.....|
 Frog   679 NEIGKEGTGCRLIAPPAPAGAPGPPGLPGYNGIRGQKGFPGPHGFDGVFGLKGIKGDRGTDSLPG 743

  Fly   747 EPGSCALDEIKMPAKGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQ--GPPGVEGPRGLN 809
            .||...       .:|::|..|..|:||.||..|.||.:|..|.||:.|.:  .|.|.:|..||.
 Frog   744 PPGFSG-------QRGDRGNAGLPGLPGVPGLSGKPGFKGVQGEKGSVGDRLGAPSGEQGNTGLP 801

  Fly   810 GPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPT 874
            |.:|.||.||..|:||..||.|..|.||..|..|..|.||:  ||..||..|             
 Frog   802 GLQGLKGTQGDQGIPGLQGKVGTPGTPGNKGIKGSPGAPGV--PGIPGPAAL------------- 851

  Fly   875 GPIGFPGADGSVGYPGDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGA 939
              .||||..|:.|.||.:|..|.||..|:.|..||||..||.|..|:.|..||.|..|||||.|.
 Frog   852 --FGFPGMPGNPGTPGPQGSMGPPGPPGQRGDEGEKGPPGPAGMKGLPGELGVGGFIGVRGRHGN 914

  Fly   940 KGEPGSPGLVGMPGNKGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDG 1004
            .|.||..|:.|:||.||.:|:||.:|.:|..|..|.|..||..|..|..|..|.||..|.:|..|
 Frog   915 PGIPGLVGMHGLPGVKGIKGSPGMEGFQGMKGAKGRPAWRGLKGPSGNIGFPGVKGGNGTSGTKG 979

  Fly  1005 PVGGRGPPGAP-----------------------GLMGIKGDQGLAGAPGQQGLDGMP------- 1039
            ..|.:||||.|                       |:.|:||.:|:.|.|||.||.|:|       
 Frog   980 DRGEQGPPGDPPKMPEMMLLTKGETGDQGVSGFKGISGLKGSKGMPGPPGQLGLPGLPGLRSFEF 1044

  Fly  1040 GEKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGP- 1103
            |:||..|.||:.|..|:.|:....|..|.||.:|.||..|.:|.||.|||:|..|..  |..|. 
 Frog  1045 GDKGETGTPGVIGNQGILGEVGPDGIIGFPGFTGPRGTPGISGFPGVPGERGKYGDI--GERGDT 1107

  Fly  1104 ---PGEKGDQGRS---GIDGRDGINGEK---GEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPG 1159
               |||:|.:|:.   |:.|:.||:|||   |::|.:|:.|..||.|..|..|.||..|:.|.||
 Frog  1108 IDLPGERGTKGQGGTPGLPGQRGIHGEKGTTGDEGFRGIEGVIGEHGQTGEKGFPGQQGLVGFPG 1172

  Fly  1160 AAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDI 1224
            :.|.||..|.||::|.      ||.|||.|:.|..|::|..|..|..|.:||.||||     .||
 Frog  1173 SQGFPGLPGVPGEKGS------SGFPGLPGQHGFPGIRGIAGLDGLPGTKGINGQPG-----ADI 1226

  Fly  1225 RGDKGSQGERGYTGEKGEQGER----GLTGPAGVAGAKGDRGLQ---GPPGASGLNGIPGAKGDI 1282
            .|.||.   :|.:|.||:.||.    |:.||||..||:||:||.   |.||..|:.|:||.....
 Frog  1227 IGLKGF---KGLSGGKGQPGEASTIVGMPGPAGSKGAEGDQGLPGLIGLPGTVGVRGLPGFSNKT 1288

  Fly  1283 GPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGLVGLPGPIGPAGSK 1347
            |..|::|.|     |..||||.||..||               |||..:||:.|:.|.:|..|:.
 Frog  1289 GLLGDVGPP-----GPPGLPGFPGPIGR---------------PGLPSKPGVKGVIGDLGLQGNY 1333

  Fly  1348 GERGLAGSPGQ---PGQDGFPGAPGLKGDTGPQGFKGERGLNGFEGQK---GDKGDRGLQGPSGL 1406
            |.:|:.|..|:   ||..|.|||.||||..|.|||.|..|..|.:|.|   ||.|:.||:||.|.
 Frog  1334 GAKGIPGDEGKVGVPGLAGIPGAKGLKGSAGFQGFVGVSGYRGDQGPKGLRGDIGEYGLKGPPGQ 1398

  Fly  1407 PGLVGQ----KGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGMLPPPGPK 1467
            ||.:.:    ..:.||||.:   |.:|..|.:|..|||   |..||||.|||.|:.|:       
 Frog  1399 PGPMSEPPLITAEQGYPGAH---GIIGHQGVQGDMGPK---GLVGTPGAPGQTGKEGL------- 1450

  Fly  1468 GEPGQPGRNGPKGEPGRPGERGLIGIQGERGEKGERGLIGETGNVGRPGPKGDRGEPGERGYEGA 1532
              ||.|      |.|..||.|                  ||.|.:||.||         .|::||
 Frog  1451 --PGVP------GWPAPPGSR------------------GEQGPMGRTGP---------NGFKGA 1480

  Fly  1533 IGLIGQKGEPGAPAPAALDYLTGILITRHSQSETVPACSAGHTELWTGYSLLYVDGNDYAHNQDL 1597
            .|:.|:.|.||.|..:.   ..|.|:.:|||::..|.|..|...|||||||||.:|.:.||||||
 Frog  1481 SGVPGRAGLPGMPGRSV---NIGYLLVKHSQTDEEPMCPVGMARLWTGYSLLYFEGQEKAHNQDL 1542

  Fly  1598 GSPGSCVPRFSTLPVLSCGQNNVCNYASRNDKTFWLTTNAAIPMMPVENIEIRQYISRCVVCEAP 1662
            |..|||:.||||:|.|.|...:||.||:||||::||:|.|.:|||||...|||.|||||.|||||
 Frog  1543 GLAGSCLQRFSTMPFLYCNPGDVCYYANRNDKSYWLSTTAPLPMMPVVEEEIRPYISRCSVCEAP 1607

  Fly  1663 ANVIAVHSQTIEVPDCPNGWEGLWIGYSFLMHTAVGNGGGGQALQSPGSCLEDFRATPFIECNGA 1727
            |..||||||.:.:|.||:||..||||||||||||.|:.||||:|.|||||||||||||||||||.
 Frog  1608 AVAIAVHSQDVSIPHCPDGWRSLWIGYSFLMHTAAGDEGGGQSLSSPGSCLEDFRATPFIECNGG 1672

  Fly  1728 KGTCHFYETMTSFWMYNLESSQPFE-RPQQQTIKAGERQSHVSRCQVCMKN 1777
            :||||::....|||:..::  :||: .|...|:|||..::|:|||||||||
 Frog  1673 RGTCHYFANKYSFWLTTID--EPFQSSPPADTLKAGLIRTHISRCQVCMKN 1721

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 gly_rich_SclB <107..>361 CDD:468478 109/274 (40%)
gly_rich_SclB <355..>642 CDD:468478 127/320 (40%)
gly_rich_SclB <543..820 CDD:468478 117/314 (37%)
gly_rich_SclB <727..>968 CDD:468478 103/245 (42%)
gly_rich_SclB <969..>1218 CDD:468478 112/288 (39%)
gly_rich_SclB <1186..>1420 CDD:468478 101/250 (40%)
gly_rich_SclB <1321..>1547 CDD:468478 87/235 (37%)
C4 1555..1662 CDD:128421 63/106 (59%)
C4 1663..1777 CDD:128421 71/114 (62%)
col4a2XP_002933063.2 gly_rich_SclB <95..>363 CDD:468478 123/301 (41%)
gly_rich_SclB <285..>515 CDD:468478 101/255 (40%)
gly_rich_SclB <502..>772 CDD:468478 111/304 (37%)
gly_rich_SclB <803..1050 CDD:468478 106/263 (40%)
gly_rich_SclB <965..>1223 CDD:468478 102/265 (38%)
gly_rich_SclB <1112..>1355 CDD:468478 109/276 (39%)
gly_rich_SclB <1312..>1495 CDD:468478 87/230 (38%)
C4 1501..1606 CDD:460201 61/104 (59%)
C4 1611..1720 CDD:460201 69/110 (63%)

Return to query results.
Submit another query.