DRSC/TRiP Functional Genomics Resources

Protein Alignment Col4a1 and Col8a1

DIOPT Version: 9

Sequence 1: NP_723044.1  Gene: Col4a1 / 33727  FlyBaseID: FBgn0000299  Length: 1779  Species: Drosophila melanogaster
Sequence 2: NP_031765.2  Gene: Col8a1 / 12837  MGIID: 88463  Length: 744  Species: Mus musculus


Alignment Length: 756  Identity: 296/756 (39%)
Similarity: 354/756 (46%)  Gaps: 201/756 (26%)
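
The percentages above can be recomputed from the raw counts. The following is a minimal sketch, not DIOPT's own code: the function name `percent` is hypothetical, and truncation (rather than rounding) is an assumption chosen only because it reproduces the three values shown here (rounding would give 47% and 27% for similarity and gaps).

```python
def percent(count: int, length: int) -> int:
    """Integer percentage of count over length, truncated (assumed, not confirmed)."""
    return count * 100 // length


alignment_length = 756
counts = {"Identity": 296, "Similarity": 354, "Gaps": 201}

for name, count in counts.items():
    # Prints each statistic in the same "count/length (NN%)" shape as the report.
    print(f"{name}: {count}/{alignment_length} ({percent(count, alignment_length)}%)")
```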


- Green residues have known domain annotations, which are detailed below.


  Fly   690 GEKGLSGAPGNDGTPGRAGRDGYPGIPGQSIKGEPGF--HGRDGA-KGDKGSFGRSGEKGEPGSC 751
            |:.|||  .|.:....:.|:: ||.:| |.:|..|..  .|::.. |..||....:..:||    
Mouse    64 GKDGLS--MGKEMPHMQYGKE-YPHLP-QYMKEIPPVPRMGKEVVPKKGKGEVPLASLRGE---- 120

  Fly   752 ALDEIKMPAKGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLNGPRGEKG 816
                     :|.:||||..|.|||||..|    .|..|:||..||||.||:              
Mouse   121 ---------QGPRGEPGPRGPPGPPGLPG----HGMPGIKGKPGPQGYPGI-------------- 158

  Fly   817 NQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPG 881
              |..|:||.|||.|..|:||..|:.||:||     .||||.|         |.:||.||.|.||
Mouse   159 --GKPGMPGMPGKPGAMGMPGAKGEIGPKGE-----IGPMGIP---------GPQGPPGPHGLPG 207

  Fly   882 ADGSVGYPGDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSP 946
                :|.||.      ||:.|:||..||:|..||.||.|:.||               |||.|  
Mouse   208 ----IGKPGG------PGLPGQPGAKGERGPKGPPGPPGLQGP---------------KGEKG-- 245

  Fly   947 GLVGMPGNKGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGP 1011
              .||||..|.:|.||..||.            ||.|:|||    |..|.||..|..||:|..||
Mouse   246 --FGMPGLPGLKGPPGMHGPP------------GPVGLPGV----GKPGVTGFPGPQGPLGKPGP 292

  Fly  1012 PGAPGLMGIKGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRG 1076
            ||.|      |.|||.|.||.||..||||    .|.||.||.||.||....||::          
Mouse   293 PGEP------GPQGLIGVPGVQGPPGMPG----VGKPGQDGIPGQPGFPGGKGEQ---------- 337

  Fly  1077 DTGPAGTPGWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKG 1141
                 |.||.||..||||:   |:.|.||.|||:|..|:               .||.|..||||
Mouse   338 -----GLPGLPGPPGLPGV---GKPGFPGPKGDRGIGGV---------------PGVLGPRGEKG 379

  Fly  1142 SVGAPGI---PGAPGMDGLPGAAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAP 1203
            .:||||:   ||.||:.|:||..|.|||:|:||.:|:.|..|..|.||.|||.   |||||.|.|
Mouse   380 PIGAPGMGGPPGEPGLPGIPGPMGPPGAIGFPGPKGEGGVVGPQGPPGPKGEP---GLQGFPGKP 441

  Fly  1204 GPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPG 1268
            |..||.|..|..|||       |..|.:||.|:.|..|..|..||.||.|..|..||:|||||| 
Mouse   442 GFLGEVGPPGMRGLP-------GPIGPKGEGGHKGLPGLPGVPGLLGPKGEPGIPGDQGLQGPP- 498

  Fly  1269 ASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPG 1333
                 ||||..|..||.|..|.||.  |||.||||.|                  |.||: |:||
Mouse   499 -----GIPGIVGPSGPIGPPGIPGP--KGEPGLPGPP------------------GFPGV-GKPG 537

  Fly  1334 LVGLPGPIGPAGSKGERGLAGSPGQPGQDGFPGAPGLKGDTGPQG-FKGERGLNGFEGQK----- 1392
            :.||.||.|..|:.|.:|..|.||.||..|.||.|.:.....||| :..:.|| |.:|.|     
Mouse   538 VAGLHGPPGKPGALGPQGQPGLPGPPGPPGPPGPPAVMPTPSPQGEYLPDMGL-GIDGVKPPHAY 601

  Fly  1393 -GDKGDRGLQGPS-GLPGLVGQKGDTGYPGLNGNDGPVGAP 1431
             |.||..|  ||: .:|....:. ...:|       |||||
Mouse   602 AGKKGKHG--GPAYEMPAFTAEL-TVPFP-------PVGAP 632
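
Each alignment block has three rows: the fly sequence, a match line, and the mouse sequence. The sketch below shows one way to tally columns from such a block. It assumes the common pairwise-alignment convention that `|` marks an identical pair, `:` a conservative substitution counted as similar, and `.` a weaker match that is not counted; the report itself does not state this convention, and the function name `alignment_stats` is hypothetical. The example input is the first ten columns of the first block above.

```python
def alignment_stats(seq1: str, match: str, seq2: str) -> dict:
    """Count identical, similar, and gapped columns in one alignment block."""
    if not (len(seq1) == len(match) == len(seq2)):
        raise ValueError("all three rows must have the same length")
    identity = match.count("|")
    # Assumption: similarity = identical columns plus ':' columns.
    similarity = identity + match.count(":")
    # A column is a gap if either sequence has '-' at that position.
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")
    return {
        "length": len(seq1),
        "identity": identity,
        "similarity": similarity,
        "gaps": gaps,
    }


fly = "GEKGLSGAPG"
match = "|:.|||  .|"
mouse = "GKDGLS--MG"
stats = alignment_stats(fly, match, mouse)
print(stats)
```

Summing these counts over all blocks would give totals comparable to the header statistics, provided the assumed symbol convention matches the one used to produce the report.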

Known Domains:


Domains are indicated by green residues in the alignment.

Gene    Sequence     Domain                                             Region      External ID Identity
Col4a1  NP_723044.1  Collagen                                           104..161    CDD:189968
                     Collagen                                           322..380    CDD:189968
                     Collagen                                           413..465    CDD:189968
                     Collagen                                           499..561    CDD:189968
                     Collagen                                           574..632    CDD:189968
                     Collagen                                           657..714    CDD:189968  7/23 (30%)
                     Collagen                                           765..824    CDD:189968  23/58 (40%)
                     Collagen                                           854..911    CDD:189968  23/56 (41%)
                     Collagen                                           884..943    CDD:189968  20/58 (34%)
                     Collagen                                           923..982    CDD:189968  17/58 (29%)
                     Collagen                                           1028..1085  CDD:189968  21/56 (38%)
                     Collagen                                           1229..1287  CDD:189968  28/57 (49%)
                     Collagen                                           1318..1376  CDD:189968  23/57 (40%)
                     Collagen                                           1399..1458  CDD:189968  10/34 (29%)
                     Collagen                                           1477..1534  CDD:189968
                     C4                                                 1555..1662  CDD:128421
                     C4                                                 1663..1777  CDD:128421
Col8a1  NP_031765.2  Nonhelical region (NC2)                            29..118                 16/57 (28%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  101..395                163/428 (38%)
                     Triple-helical region (COL1)                       119..572                258/624 (41%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  412..439                15/29 (52%)
                     Collagen                                           445..502    CDD:189968  31/69 (45%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  457..590                67/159 (42%)
                     Nonhelical region (NC1)                            573..744                21/71 (30%)
                     C1Q                                                609..744    CDD:128420  10/34 (29%)

Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           2             1.910
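
The final row of the table is consistent with a plain sum over the per-tool rows: two tools report a match, and their weighted scores add up to 1.910. A minimal sketch of that arithmetic follows. The dictionary `tool_scores` is a hypothetical structure holding only the two matching tools from the table (all other tools contribute 0 and 0.000), and summation is inferred from the numbers rather than taken from any DIOPT documentation.

```python
# (simple score, weighted score) for the tools that matched this pair.
tool_scores = {
    "Phylome": (1, 0.910),
    "SwiftOrtho": (1, 1.000),
}

simple_total = sum(simple for simple, _ in tool_scores.values())
weighted_total = sum(weighted for _, weighted in tool_scores.values())

# Format the weighted total to three decimals, as in the table.
print(f"Total: {simple_total} {weighted_total:.3f}")
```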
