DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and Col16a1

DIOPT Version :10

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:NP_001391281.1 Gene:Col16a1 / 107581 MGIID:1095396 Length:1598 Species:Mus musculus


Alignment Length:1576 Identity:594/1576 - (37%)
Similarity:716/1576 - (45%) Gaps:416/1576 - (26%)


- Green bases have known domain annotations that are detailed below.


  Fly    62 GVARGDLPP-------KNCTAGYAGCVPKCIAEKGNRGLPGPLGPTGLKGEMG-----------F 108
            |....:.||       .|.|.|.:|..    ..||.|||.||.||.|.||..|           .
Mouse   310 GTGVNECPPCAHSARESNVTLGPSGLK----GGKGERGLTGPSGPKGEKGARGNDCVRVSPDAPL 370

  Fly   109 PGMEGPSGDKGQKGD------PGPYGQRGDKGERGSPGLHGQAGVPGVQGPAGNPGAPGINGKDG 167
            ..:|||.|:||:.||      |||.||:|.|||:|..||   .|:||..|..|.||...:.|..|
Mouse   371 QCVEGPKGEKGESGDLGPPGLPGPTGQKGQKGEKGDGGL---KGLPGKPGRDGRPGEICVIGPKG 432

  Fly   168 CDGQDGIPGLEGLSGMPGPRGYAGQLGSKGEKGEPAKENGDYAKGEKGEPGWRGTAGLAGPQGFP 232
            ..|..|..|.|||:|.|||.|..|..|.    |.|         |..|:||        ||   |
Mouse   433 QKGDPGFVGPEGLAGEPGPPGLPGPPGI----GLP---------GTPGDPG--------GP---P 473

  Fly   233 GEKGERGDSGPYGAKGPRGEHGLKGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTVMGPR 297
            |.|||:|.||..|.:||.|:             |.|||.||.|||||:|....|..|        
Mouse   474 GPKGEKGSSGIPGKEGPGGK-------------PGKPGVPGTKGEKGDPCEVCPTLP-------- 517

  Fly   298 GDMGQKGEPGLVGRKGEPGPEGDTGLDGQKGEKGLPGGPGDRGRQGNFGPPGSTGQKGDRGEPGL 362
                 :|....||..|:|||:|:.| |.....:||                |:.|.|||||:||:
Mouse   518 -----EGSQNFVGLPGKPGPKGEPG-DPAPAWEGL----------------GTVGLKGDRGDPGI 560

  Fly   363 NGLPGNPGQKGEPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGLP 427
            .|:   .|:||||..:.::|.......|.|   |.|.||.||..      |.|||:||.      
Mouse   561 QGM---KGEKGEPCSSCSSGVGAQHLGPSP---GHGLPGLPGTS------GIPGPRGLK------ 607

  Fly   428 GPQGYNGQKGGAGLPGRPGNEGPPGKKGEKG------TAGLNGPKGSIGPIGHPGPPGPEGQKGD 486
            |.:|..|..|.||:||.||..||.|.||.||      ||......|.:..: |  .|||.|:||:
Mouse   608 GEKGSFGDTGPAGVPGSPGPVGPAGIKGAKGEPCEPCTALSELQDGDMRVV-H--LPGPAGEKGE 669

  Fly   487 AGLPGYGIQGSKGDAGIPGYPGLKGSKGERGFKGNAGAPGDSKLGRPGTPGAAGAPGQKGDAGRP 551
            .|.||:|:.|.:|.|         |.:|.:|.||:||.|||     |||||..|.||..|:    
Mouse   670 PGSPGFGLPGKQGKA---------GERGLKGQKGDAGNPGD-----PGTPGITGQPGISGE---- 716

  Fly   552 GTPGQKGDMGIKGDVGGKCSSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGYPGERGHDGINGQ 616
              ||.:|..|.||:.|..|::|.:........|||||.||..|..||.|. |:||:.|..|:.|.
Mouse   717 --PGIRGPAGPKGEKGDGCTACPSLQGALTDVSGLPGKPGPKGEPGPEGV-GHPGKPGQPGLPGV 778

  Fly   617 TGPPGEKGEDGRTGLPGATGEPGKPALCDLSLIEPLKGDKGYPGAPGAKGVQGFKGAEGLP---G 678
            .||||.||..|..|.||.                            ||:|.||..|.:|||   |
Mouse   779 QGPPGPKGTQGEPGPPGT----------------------------GAEGPQGEPGTQGLPGTQG 815

  Fly   679 IPGPKGEFGFKGEKGLSGAPGNDGTPGRAGRDGYPGIPGQSIKGEPGFHGRDGAKGDKGSFGRSG 743
            :|||:|..|..||||..|:||..|..|..|.      ||..:.|.|         |.|||   .|
Mouse   816 LPGPRGPPGSAGEKGAQGSPGPKGAIGPMGP------PGAGVSGPP---------GQKGS---RG 862

  Fly   744 EKGEPGSCALDEIKMPAKGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGL 808
            ||||||.|:..        ::|||..:||||.||.        :.|.....||||||||.||   
Mouse   863 EKGEPGECSCP--------SRGEPIFSGMPGAPGL--------WMGSSSQPGPQGPPGVPGP--- 908

  Fly   809 NGPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEK----- 868
                           ||.||..||:|:||.||.|   |:||::..       |..|..||     
Mouse   909 ---------------PGPPGMPGLQGVPGHNGLP---GQPGLTAE-------LGSLPIEKHLLKS 948

  Fly   869 --GDRGPTGPIGFPGADGSVGYP------GDRGDAGLPGVSG------------RPGIVGEKGDV 913
              ||          .|.|...:|      |::||.|:|||.|            ||.....:|  
Mouse   949 ICGD----------CAQGQTAHPAFLLEKGEKGDQGIPGVPGFDNCARCFIERERPRAEEARG-- 1001

  Fly   914 GPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLVGMPGNKGDRGAPGNDGPKGFAGVTGAPGK 978
                             |...|..|..|.||.||..||||.:|:.|.|         |:.|:||.
Mouse  1002 -----------------DNSEGEPGCSGSPGLPGPPGMPGQRGEEGPP---------GMRGSPGP 1040

  Fly   979 RGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGIKGDQGLAGAPGQQGLDGMPGEKG 1043
            .||.|.||..||                     .|:|||.|::|::||.|..|.:   |.||..|
Mouse  1041 PGPIGPPGFPGA---------------------VGSPGLPGLQGERGLTGLTGDK---GEPGPPG 1081

  Fly  1044 NQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGPPGEKG 1108
            ..|:||..|||||||   .||::|..||||.:|::||      ||.:||||  ..|.|||.||:|
Mouse  1082 QPGYPGAMGPPGLPG---IKGERGYTGPSGEKGESGP------PGSEGLPG--PQGPAGPRGERG 1135

  Fly  1109 DQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGAPGAVGYPGDR 1173
            .||.|         ||||:||.|            |.||.||.||..|.||.|||||..|...::
Mouse  1136 PQGSS---------GEKGDQGFQ------------GQPGFPGPPGPPGFPGKAGAPGPPGPQAEK 1179

  Fly  1174 GDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTG 1238
            |.:|..|.|||||..|..||.|:|      ||.|..|:.|:.|.|.    :|||.|..|..|..|
Mouse  1180 GSEGIRGPSGLPGSPGPPGPPGIQ------GPAGLDGLDGKDGKPG----LRGDPGPAGPPGLMG 1234

  Fly  1239 EKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPG 1303
            ..|.:|:   ||..|:.|.|||.|..||||:||..|..|..|.:||:|..|.||..  |.:|.||
Mouse  1235 PPGFKGK---TGHPGLPGPKGDCGKPGPPGSSGRPGAEGEPGAMGPQGRPGPPGHL--GPQGPPG 1294

  Fly  1304 RPGRNGRQ--GLIGAPGLIGERGLPGLAGEPGLVGLPGPIGPAGSKGERGLAGSPGQPGQDGFPG 1366
            :||..|..  ||.|..|:.|||||.||.|:||..|.|||.|..||.   |.||..|.||:.|..|
Mouse  1295 QPGPPGLSTVGLKGDRGVPGERGLAGLPGQPGTPGHPGPPGEPGSD---GAAGKEGPPGKQGLYG 1356

  Fly  1367 APGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAP 1431
            .||.|||.||.|.||:.|..|..|..|..|..|..||.|.||..|::|..|.||..||.|..|.|
Mouse  1357 PPGPKGDPGPAGQKGQAGEKGRSGMPGGPGKSGSMGPIGPPGPAGERGHPGSPGPAGNPGLPGLP 1421

  Fly  1432 GERG--------------------------FTG----PKGRDGRDGTPGLPGQKGEPGMLPPPGP 1466
            |..|                          :|.    |.......|.||.||:.|.||.   ||.
Mouse  1422 GSMGDMVNYDDIKRFIRQEIIKLFDERMAYYTSRMQFPMEVAAAPGRPGPPGKDGAPGR---PGA 1483

  Fly  1467 KGEPGQPGRNGPKGEPGRPGERGLIGIQGERGEKGERGLIGETGNVGRPGPKGDRGEPGERGYEG 1531
            .|.||.||:.|.:|..|.||.|||.|.:||:|:.|    :|..|..|.|||.|.:|.|| .|..|
Mouse  1484 PGSPGLPGQIGREGRQGLPGMRGLPGTKGEKGDIG----VGIAGENGLPGPPGPQGPPG-YGKMG 1543

  Fly  1532 AIGLIGQKGEPGAPAP 1547
            |.|.:||:|.||.|.|
Mouse  1544 ATGPMGQQGIPGIPGP 1559

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 gly_rich_SclB <107..>361 CDD:468478 96/270 (36%)
gly_rich_SclB <355..>642 CDD:468478 116/292 (40%)
gly_rich_SclB <543..820 CDD:468478 99/279 (35%)
gly_rich_SclB <727..>968 CDD:468478 82/265 (31%)
gly_rich_SclB <969..>1218 CDD:468478 103/248 (42%)
gly_rich_SclB <1186..>1420 CDD:468478 105/235 (45%)
gly_rich_SclB <1321..>1547 CDD:468478 108/255 (42%)
C4 1555..1662 CDD:128421
C4 1663..1777 CDD:128421
Col16a1NP_001391281.1 TSPN 50..231 CDD:214560
Nonhelical region 10 (NC10). /evidence=ECO:0000255 232..374 20/67 (30%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 324..547 105/296 (35%)
gly_rich_SclB <331..>541 CDD:468478 101/267 (38%)
Triple-helical region 9 (COL9) with 3 imperfections. /evidence=ECO:0000255 375..509 72/173 (42%)
gly_rich_SclB <503..>725 CDD:468478 106/292 (36%)
Nonhelical region 9 (NC9). /evidence=ECO:0000255 510..524 3/26 (12%)
Triple-helical region 8 (COL8) with 1 imperfection. /evidence=ECO:0000255 525..570 23/64 (36%)
Cell attachment site. /evidence=ECO:0000255 555..557 1/1 (100%)
Nonhelical region 8 (NC8). /evidence=ECO:0000255 571..586 1/14 (7%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 585..935 179/471 (38%)
Triple-helical region 7 (COL7) with 1 imperfection. /evidence=ECO:0000255 587..640 30/67 (45%)
Nonhelical region 7 (NC7). /evidence=ECO:0000255 641..661 4/22 (18%)
Triple-helical region 6 (COL6) with 1 imperfection. /evidence=ECO:0000255 662..732 38/89 (43%)
gly_rich_SclB <665..>868 CDD:468478 103/269 (38%)
Nonhelical region 6 (NC6). /evidence=ECO:0000255 733..747 2/13 (15%)
Triple-helical region 5 (COL5) with 3 imperfections. /evidence=ECO:0000255 748..870 66/168 (39%)
gly_rich_SclB <827..>1144 CDD:468478 159/460 (35%)
Nonhelical region 5 (NC5). /evidence=ECO:0000255 871..881 3/17 (18%)
Triple-helical region 4 (COL4) with 2 imperfections. /evidence=ECO:0000255 882..933 32/79 (41%)
Nonhelical region 4 (NC4). /evidence=ECO:0000255 934..967 9/49 (18%)
Triple-helical region 3 (COL3). /evidence=ECO:0000255 968..982 8/13 (62%)
Nonhelical region 3 (NC3). /evidence=ECO:0000255 983..1005 4/40 (10%)
Cell attachment site. /evidence=ECO:0000255 1000..1002 1/20 (5%)
gly_rich_SclB <1054..>1273 CDD:468478 118/266 (44%)
gly_rich_SclB <1210..>1509 CDD:468478 130/313 (42%)
Cell attachment site. /evidence=ECO:0000255 1221..1223 1/1 (100%)
Nonhelical region 2 (NC2). /evidence=ECO:0000255 1428..1466 2/37 (5%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1463..1541 38/85 (45%)
Triple-helical region 1 (COL1) with 2 imperfections. /evidence=ECO:0000255 1467..1572 49/101 (49%)
Nonhelical region 1 (NC1). /evidence=ECO:0000255 1573..1598
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.