DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and COL17A1

DIOPT Version :10

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:NP_000485.3 Gene:COL17A1 / 1308 HGNCID:2194 Length:1497 Species:Homo sapiens


Alignment Length:1184 Identity:367/1184 - (30%)
Similarity:444/1184 - (37%) Gaps:414/1184 - (34%)


- Green bases have known domain annotations that are detailed below.


  Fly    87 EKGN-RGLPGPLGPTGLKGEMGFPGMEGPSGDKGQKGDPGPYGQRGDKGERGSPGLHGQAGVPGV 150
            |.|| ||.|||                     ||..|.|||      ||:||.|      |.||:
Human   561 ENGNLRGSPGP---------------------KGDMGSPGP------KGDRGFP------GTPGI 592

  Fly   151 QGPAGNPGAPGINGKDGCDGQDGIPGLEGLSGMPGPRGYAGQLGSKGEKGEPAKENGDYAKGEKG 215
            .||.|:||..|..|:.|..|.   ||:||..|.   ||..|.:|.:||.|.|       ..||||
Human   593 PGPLGHPGPQGPKGQKGSVGD---PGMEGPMGQ---RGREGPMGPRGEAGPP-------GSGEKG 644

  Fly   216 EPGWRGTAGLAGPQGFPGEKGERGDSGPYGAKGPRGEHGLKGEKGASCYGPMKPGAPGIKGEKGE 280
            |.|..|..|..||.|.||..|.:|.||..|.:||.|..||:|.:|       :.|.||:||:|| 
Human   645 ERGAAGEPGPHGPPGVPGSVGPKGSSGSPGPQGPPGPVGLQGLRG-------EVGLPGVKGDKG- 701

  Fly   281 PASSFPVKPTHTVMGPRGDMGQKGEPGLVGRKGEPGPEGDTGLDGQKGEKGLPGGPGDRGRQGNF 345
                 |:.|.    ||:||.|:||..||.   |||            |.:||||..|:.|.:|..
Human   702 -----PMGPP----GPKGDQGEKGPRGLT---GEP------------GMRGLPGAVGEPGAKGAM 742

  Fly   346 GPPGSTGQKGDRGEPGLNGLPGNPGQKGEPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRG 410
            ||.|..|.:|.|||.||.|:   ||.:|.||.:|..|||||.||.||    :|.||.||..|.:|
Human   743 GPAGPDGHQGPRGEQGLTGM---PGIRGPPGPSGDPGKPGLTGPQGP----QGLPGTPGRPGIKG 800

  Fly   411 YVGAPG----PQGLNGVDGLPGPQGYNGQKGGAGLPGRPGNEGPPGKKGEKGTAGLNGPKGSIGP 471
            ..||||    .:| :.:..:|||.|..|..|..|.||.||..||.|..|.:....|.||.|..||
Human   801 EPGAPGKIVTSEG-SSMLTVPGPPGPPGAMGPPGPPGAPGPAGPAGLPGHQEVLNLQGPPGPPGP 864

  Fly   472 IGHPGP--PGPEGQKGDAGLPGYGIQGSKGDAGIPGYPGLKGSKGERGFKGNAGAPGDSKLGRPG 534
            .|.|||  |||.|.:|..|            .|:||.||..||     |..|      |:....|
Human   865 RGPPGPSIPGPPGPRGPPG------------EGLPGPPGPPGS-----FLSN------SETFLSG 906

  Fly   535 TPGAAGAPGQKGDAGRPGTPGQKGDMGIKGDVGGKCSSCRAGPKGDKGTSGLPGIPGKDGA---- 595
            .||..|.||.|||.|.|                        ||:|.:|..||||......:    
Human   907 PPGPPGPPGPKGDQGPP------------------------GPRGHQGEQGLPGFSTSGSSSFGL 947

  Fly   596 --RGPPGERGYPGERGHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPALCDLSLIEPLKGDKGY 658
              :|||               |..||.|.||:.|..|:|||.|.|..|:             :| 
Human   948 NLQGPP---------------GPPGPQGPKGDKGDPGVPGALGIPSGPS-------------EG- 983

  Fly   659 PGAPGAKGVQGFKGAEGLPGIPGPKGEFGFKGEK-------------------GLSGAPGNDGTP 704
                |:.......|..|.||.|||.|.....|::                   |:.|.||..|.|
Human   984 ----GSSSTMYVSGPPGPPGPPGPPGSISSSGQEIQQYISEYMQSDSIRSYLSGVQGPPGPPGPP 1044

  Fly   705 GRAGRDGYPGIPGQSIKGEPGFHG----------------------------------RDGAK-- 733
            |          |..:|.||...:.                                  ||..:  
Human  1045 G----------PVTTITGETFDYSELASHVVSYLRTSGYGVSLFSSSISSEDILAVLQRDDVRQY 1099

  Fly   734 ------GDKGSFGRSGEKGEPGSCALDEIKMPAK--GNKGEPG-QTGMPGPPGEDGSPGE----- 784
                  |.:|..|..|..|:....:||..::.::  ......| ..|:|||||..|.||.     
Human  1100 LRQYLMGPRGPPGPPGASGDGSLLSLDYAELSSRILSYMSSSGISIGLPGPPGPPGLPGTSYEEL 1164

  Fly   785 ----RG--YTGLKGNTGPQGPPGVEGPRGLNGPRGEKGNQGAVGVPGNPGKD------------- 830
                ||  :.|:.|..||.|||                     |:|||....             
Human  1165 LSLLRGSEFRGIVGPPGPPGPP---------------------GIPGNVWSSISVEDLSSYLHTA 1208

  Fly   831 GLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADGSVGYPGDRGDA 895
            ||..|||..|.|||        |||.||||::|                    ....|..:..| 
Human  1209 GLSFIPGPPGPPGP--------PGPRGPPGVSG--------------------ALATYAAENSD- 1244

  Fly   896 GLPGVSGRPGIVG-------EKGDVGPIGPAGVAGPPG---VPGIDGVRGRDGAKGEPGSPGLVG 950
                 |.|..::.       ....|||.||.|..||||   :...|....|..:.....|....|
Human  1245 -----SFRSELISYLTSPDVRSFIVGPPGPPGPQGPPGDSRLLSTDASHSRGSSSSSHSSSVRRG 1304

  Fly   951 MPGNKGDRGAPGNDGPKGFAGVTG-APGKRGPAGI---PGVSGAKGDKGATGLTGNDGPVGGRGP 1011
            ...:.......|..|..|..|..| |.|.|||.|.   ||        |..|.....|...|.| 
Human  1305 SSYSSSMSTGGGGAGSLGAGGAFGEAAGDRGPYGTDIGPG--------GGYGAAAEGGMYAGNG- 1360

  Fly  1012 PGAPGLMG--IKGD--------------------QGLA----GAPGQQGLDGMPG------EKGN 1044
                ||:|  ..||                    ||:|    |.|||.|..|.||      ...|
Human  1361 ----GLLGADFAGDLDYNELAVRVSESMQRQGLLQGMAYTVQGPPGQPGPQGPPGISKVFSAYSN 1421

  Fly  1045 ---------QGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHGR 1100
                     |.:..:.||||      :||:.|.|||   :||.||||.||.||..|     ..|.
Human  1422 VTADLMDFFQTYGAIQGPPG------QKGEMGTPGP---KGDRGPAGPPGHPGPPG-----PRGH 1472

  Fly  1101 AGPPGEKGDQGRSG 1114
            .|..|:||||..:|
Human  1473 KGEKGDKGDQVYAG 1486

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 gly_rich_SclB <107..>361 CDD:468478 98/253 (39%)
gly_rich_SclB <355..>642 CDD:468478 110/298 (37%)
gly_rich_SclB <543..820 CDD:468478 81/357 (23%)
gly_rich_SclB <727..>968 CDD:468478 69/319 (22%)
gly_rich_SclB <969..>1218 CDD:468478 62/191 (32%)
gly_rich_SclB <1186..>1420 CDD:468478
gly_rich_SclB <1321..>1547 CDD:468478
C4 1555..1662 CDD:128421
C4 1663..1777 CDD:128421
COL17A1NP_000485.3 Nonhelical region (NC16) 1..566 3/4 (75%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..154
Necessary for interaction with DST and for the recruitment of DST to hemidesmosome. /evidence=ECO:0000269|PubMed:12482924 145..230
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 167..186
gly_rich_SclB 508..>807 CDD:468478 134/330 (41%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 562..1011 222/614 (36%)
Triple-helical region 567..1482 360/1172 (31%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1209..1234 17/52 (33%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1261..1316 14/54 (26%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1434..1497 29/67 (43%)
Nonhelical region (NC1) 1483..1497 1/4 (25%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.