DRSC/TRiP Functional Genomics Resources


Protein Alignment Col4a1 and Col5a3

DIOPT Version: 9

Sequence 1: NP_723044.1 Gene: Col4a1 / 33727 FlyBaseID: FBgn0000299 Length: 1779 Species: Drosophila melanogaster
Sequence 2: NP_058615.1 Gene: Col5a3 / 53867 MGIID: 1858212 Length: 1739 Species: Mus musculus


Alignment Length: 1422
Identity: 568/1422 (39%)
Similarity: 665/1422 (46%)
Gaps: 308/1422 (21%)
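The summary figures above can be reproduced from an alignment with a few counts. The following is a minimal sketch, not DIOPT's own code. It assumes an EMBOSS-style match line in which `|` marks an identical pair and `:` marks a similar pair, and it treats a `-` in either sequence as a gap column. The function names `alignment_stats` and `percent` are hypothetical. The reported percentages (39%, 46%, 21%) agree with integer truncation of the ratios rather than rounding, so `percent` truncates.

```python
def percent(count, total):
    """Integer percentage, truncated (568/1422 gives 39, not 40)."""
    return count * 100 // total


def alignment_stats(seq1, match, seq2):
    """Count identity, similarity and gap columns in one aligned block.

    Assumption: '|' in the match line means identical residues and ':'
    means similar residues; '-' in either sequence is a gap column.
    """
    if not (len(seq1) == len(match) == len(seq2)):
        raise ValueError("aligned rows must have the same length")
    identity = match.count("|")
    # Similar positions include the identical ones.
    similarity = identity + match.count(":")
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")
    return {
        "length": len(match),
        "identity": identity,
        "similarity": similarity,
        "gaps": gaps,
    }


# Toy example (made-up five-column alignment, not taken from the report).
toy = alignment_stats("EKG-P", "|:| |", "EEGAP")
```

To process the full report, the per-block counts would be summed over all alignment blocks before calling `percent` on the totals.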


- Green bases have known domain annotations that are detailed below.


  Fly    87 EKGNRGLPGP-LGPTGLKGEMG-------FPGMEGPSGDKGQKGDPGPYGQRGDKGERGSPGLHG 143
            |:|..|  || :||.....|..       |||    :|:||.||:|... ::|.:.| |..|..|
Mouse   341 EEGEGG--GPTMGPKFRAAEQSLQTEFQIFPG----AGEKGAKGEPATV-EQGQQFE-GPAGAPG 397

  Fly   144 QAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLEGLSGMPG-----PRGYAGQLGSKGEKGEPA 203
            ..|:.|..||.|.||.||..|..|..|..||||::|..|:||     |..:|    |...||.|.
Mouse   398 PRGISGPSGPPGPPGFPGDRGLPGPAGLPGIPGIDGARGLPGTVIMMPFHFA----SSSMKGPPV 458

  Fly   204 KENGDYAK--------GEKGEPGWRGTAGLAGP---QGFPGEKGERGDSGPYGAKGPRGEHGLKG 257
            ......|:        ..||.||..|..|..||   .|:||.|||.|:.||.|.:|.:|..|..|
Mouse   459 SFQQAQAQAVLQQAQLSMKGPPGPVGLTGRPGPVGLPGYPGLKGELGEVGPQGPRGLQGPPGPPG 523

  Fly   258 EKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTVMGPRGDMGQKGEPGLVGRKGEPGPEGDTG 322
            .:|       |.|..|..|.:|.|..:          ||:||.|..|.|||.|.||:   .||.|
Mouse   524 REG-------KTGRAGADGARGLPGDT----------GPKGDRGFDGLPGLPGEKGQ---RGDFG 568

  Fly   323 LDGQKGEKGLPGGPGDRGRQGNFGPPGSTGQKGDRGEPGLNGLPGNPGQKGEPGRAGATGKPGLL 387
            ..||      ||.||:.|.:|..||||.|||.|:.|..||.|..|.||..|.||..|:.|.||..
Mouse   569 RVGQ------PGPPGEDGVKGLQGPPGPTGQAGEPGPRGLIGPRGLPGPLGRPGVTGSDGAPGAK 627

  Fly   388 GPPGPPGGGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGLPGPQGYNGQKGGAGLPGRPGNEGPPG 452
            |..|||    |.|||||.:|..|..|.|||||..|..|..||      .|..|:||.||:|||||
Mouse   628 GNVGPP----GEPGPPGQQGNHGSQGIPGPQGPIGTPGEKGP------PGNPGIPGVPGSEGPPG 682

  Fly   453 KKGEKGTA---GLNGPKGSIGPIGHPGPPGPEGQKGDAGLPGYGIQGSKGDAGIPGYPGLKGSKG 514
            ..|.:|..   |..||.||.||.|:||..|.:|..|:.||     ||.||:.|..|:||.||.:|
Mouse   683 HPGHEGPTGEKGAQGPPGSAGPRGYPGLRGVKGTSGNRGL-----QGEKGERGEDGFPGFKGDEG 742

  Fly   515 ERGFKGNAGAPGDSKLGRPGTPGAAGAPGQKGDAGRPGTPGQKGDMGIKGDVGGKCSSCRAGPKG 579
            .:|.:||           ||.||..|..|.:|..|..|.||.:|.               .|..|
Mouse   743 PKGDRGN-----------PGPPGPRGEDGPEGQKGPGGLPGDEGP---------------PGAAG 781

  Fly   580 DKGTSGLPGIPGKDGARGPPGERGYPGERGHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPALC 644
            :||..|:||:||..|..||.|..|:|         |..||.||||:.|:.|.||..||.|.|.  
Mouse   782 EKGKLGVPGLPGYPGRPGPKGSIGFP---------GPLGPLGEKGKRGKAGQPGEEGERGTPG-- 835

  Fly   645 DLSLIEPLKGDKGYPGAPGAKGVQGFKGAEGLPGIPGPKGEFGFKGEKGLSGAPGNDGTPGRAGR 709
                   .:||:|.|||.|..|.   ||..|..|.|||.||.|..|.:|..|.||..|.||..|:
Mouse   836 -------TRGDRGQPGATGQPGP---KGDVGQNGSPGPPGEKGLPGLQGPPGFPGPKGPPGPQGK 890

  Fly   710 DGYPGIPGQSIKGEPGFHGRDGAKGDKGSFGRSGEKGEPGSCALDEIKMPAKGNKGEPGQTGMPG 774
            ||..|.|||  :||.||.|..|..|..|..|..|:.|:.|..             ||.|..|.||
Mouse   891 DGISGHPGQ--RGELGFQGLTGPPGPAGVLGPQGKVGDVGPL-------------GERGPPGPPG 940

  Fly   775 PPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLNGPRGEKGNQGAVGVPGNPGKDGLRGIPGRN 839
            ||||.|.||..|..|.||..||.|..|.|||   .||||..|.|||   ||:||..||:|     
Mouse   941 PPGEQGLPGIEGREGAKGELGPLGSVGKEGP---PGPRGFPGPQGA---PGDPGPIGLKG----- 994

  Fly   840 GQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADGSVGYPGDRGDAGLPGVSGRP 904
                        ..||.||.|.||..||:|..||:|.||.||..|..|..|..|:.|.|      
Mouse   995 ------------DKGPPGPVGANGSPGERGPVGPSGGIGLPGQSGGQGPIGPAGEKGSP------ 1041

  Fly   905 GIVGEKGDVGPIGPAGVAGPPGVPGIDGV---RGRDGAKGEPGSPGLVGMPGNKGDRGAPGND-- 964
               ||:|..||.|..|:.||||:.|..|.   .|.:|.|||.|.||..|..|:|||.|.||..  
Mouse  1042 ---GERGTPGPTGKDGIPGPPGLQGPSGAAGPSGEEGDKGEVGMPGHKGSKGDKGDAGPPGPTGI 1103

  Fly   965 -GPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGIKGDQGLAG 1028
             ||.|.:|:.||.|.:|..|.||:.|.|||.|..|.      ||..||||..||.|..|::|..|
Mouse  1104 RGPAGHSGLPGADGAQGRRGPPGLFGQKGDDGVRGF------VGVIGPPGLQGLPGPPGEKGEVG 1162

  Fly  1029 APGQQGLDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLP 1093
            ..|..|..|.||.:|..|..|.:|||||||...:.|..||.|..|..||.||.|.||.||.|   
Mouse  1163 DVGSMGPHGAPGPRGPPGPSGSEGPPGLPGGVGQPGAVGEKGEPGDAGDAGPPGIPGIPGPK--- 1224

  Fly  1094 GLAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLP 1158
                    |..|||||.|.||..|..|..|..||.|.:|..|..|..|.:|.||.||.||:||:|
Mouse  1225 --------GEIGEKGDSGPSGAAGPPGKKGPPGEDGSKGNMGPTGLPGDLGPPGDPGVPGIDGIP 1281

  Fly  1159 GAAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPD 1223
            |..|..|.:|.||..|..||||..||||.:|..|.:                             
Mouse  1282 GEKGNAGDIGGPGPPGASGEPGARGLPGKRGSPGRM----------------------------- 1317

  Fly  1224 IRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEI 1288
                          |.:|.:||:         |||||.|..||||.:         |.||.||. 
Mouse  1318 --------------GPEGREGEK---------GAKGDAGPDGPPGRT---------GPIGARGP- 1349

  Fly  1289 GYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGLVGLPGPIGPAGSKGERGLA 1353
                            |||.|..||.|.||.:||   |||.|.|||:|.|||:||          
Mouse  1350 ----------------PGRIGPDGLPGIPGPVGE---PGLLGPPGLIGPPGPLGP---------- 1385

  Fly  1354 GSPGQPGQDGFPGAPGLKGDTGPQGFKGERGLNGF---EGQKGDKGDRGLQGPSGLPGLVGQKGD 1415
                       ||.||||||.||:|.||..||.|.   .|:.|:|||:||.|..|.|||.|.   
Mouse  1386 -----------PGLPGLKGDAGPKGEKGHIGLIGLIGPPGEAGEKGDQGLPGVQGPPGLQGD--- 1436

  Fly  1416 TGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGMLPPPGPKGEPGQ 1472
               |||.|..|.:|.||..|..||.|:.|..|:||..|.:|:||...||||.|.|.:
Mouse  1437 ---PGLPGPVGSLGHPGPPGVVGPLGQKGSKGSPGSLGPRGDPGPAGPPGPPGSPAE 1490

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence     Domain    Region      External ID  Identity
Col4a1  NP_723044.1  Collagen  104..161    CDD:189968   22/63 (35%)
                     Collagen  322..380    CDD:189968   27/57 (47%)
                     Collagen  413..465    CDD:189968   24/54 (44%)
                     Collagen  499..561    CDD:189968   23/61 (38%)
                     Collagen  574..632    CDD:189968   24/57 (42%)
                     Collagen  657..714    CDD:189968   27/56 (48%)
                     Collagen  765..824    CDD:189968   32/58 (55%)
                     Collagen  854..911    CDD:189968   25/56 (45%)
                     Collagen  884..943    CDD:189968   23/61 (38%)
                     Collagen  923..982    CDD:189968   30/64 (47%)
                     Collagen  1028..1085  CDD:189968   26/56 (46%)
                     Collagen  1229..1287  CDD:189968   18/57 (32%)
                     Collagen  1318..1376  CDD:189968   24/57 (42%)
                     Collagen  1399..1458  CDD:189968   26/58 (45%)
                     Collagen  1477..1534  CDD:189968
                     C4        1555..1662  CDD:128421
                     C4        1663..1777  CDD:128421
Col5a3  NP_058615.1  LamG      32..211     CDD:304605
                     Collagen  532..590    CDD:189968   30/76 (39%)
                     Collagen  568..626    CDD:189968   30/63 (48%)
                     Collagen  679..731    CDD:189968   25/56 (45%)
                     Collagen  769..827    CDD:189968   29/81 (36%)
                     Collagen  805..864    CDD:189968   31/79 (39%)
                     Collagen  1003..1060  CDD:189968   28/65 (43%)
                     Collagen  1057..1115  CDD:189968   26/57 (46%)
                     Collagen  1087..1147  CDD:189968   29/65 (45%)
                     Collagen  1312..1364  CDD:189968   29/129 (22%)
                     Collagen  1393..1443  CDD:189968   27/55 (49%)
                     COLFI     1510..1737  CDD:279718
Domains listed without an identity value (blue background on the original page) are not in the aligned region.


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           2             1.910
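The totals that close the table (simple score 2, weighted score 1.910) are consistent with summing the per-tool columns. A minimal sketch of that arithmetic follows, assuming the totals are plain column sums. The variable names are hypothetical, and only the two tools with non-zero scores are listed, since the remaining tools contribute 0 and 0.000.

```python
from decimal import Decimal

# (simple score, weighted score) for the tools that matched this pair.
# All other tools in the table report 0 and 0.000 and are omitted here.
matched_tools = {
    "Phylome": (1, Decimal("0.910")),
    "SwiftOrtho": (1, Decimal("1.000")),
}

# Decimal avoids binary floating-point noise in the three-decimal weights.
total_simple = sum(simple for simple, _ in matched_tools.values())
total_weighted = sum(
    (weighted for _, weighted in matched_tools.values()), Decimal("0")
)

print(total_simple, total_weighted)
```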
