DRSC/TRiP Functional Genomics Resources


Protein Alignment Col3a1 and Col4a1

DIOPT Version: 9

Sequence 1: NP_034060.2  Gene: Col3a1 / 12825  MGI ID: 88453             Length: 1464  Species: Mus musculus
Sequence 2: NP_723044.1  Gene: Col4a1 / 33727  FlyBase ID: FBgn0000299  Length: 1779  Species: Drosophila melanogaster


Alignment Length: 1685
Identity:   615/1685 (36%)
Similarity: 724/1685 (42%)
Gaps:       419/1685 (24%)
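
The percentages in the alignment summary can be reproduced from the raw counts. The sketch below assumes the percentages are truncated rather than rounded; this is an inference from the numbers (724/1685 is about 42.97% but is reported as 42%), not documented DIOPT behavior. The helper name `percent` is hypothetical.

```python
def percent(count: int, total: int) -> int:
    """Whole-number percentage, truncated toward zero (assumed convention)."""
    return count * 100 // total


# Counts taken from the alignment summary for Col3a1 vs. Col4a1.
alignment_length = 1685
stats = {"Identity": 615, "Similarity": 724, "Gaps": 419}

for name, count in stats.items():
    print(f"{name}: {count}/{alignment_length} ({percent(count, alignment_length)}%)")
```

With rounding instead of truncation, the similarity value would come out one point higher than the value reported here.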


- Green residues have known domain annotations, which are detailed below.


Mouse    55 VCDSGSVLCDDIICDEEPLDCPNPE---IPFGECCAICPQPSTPAPVLPDG-------------H 103
            :.||..|...|:    .|.:|....   :|  :|.|.......|.|:.|.|             .
  Fly    57 IVDSAGVARGDL----PPKNCTAGYAGCVP--KCIAEKGNRGLPGPLGPTGLKGEMGFPGMEGPS 115

Mouse   104 GPQGPKGDPGP---PGIPGRNGDPGLPGQ---PGLPGPPGSPGICESCPTGGQNYSPQFDSYDVK 162
            |.:|.||||||   .|..|..|.|||.||   ||:.||.|:||      ..|.|.....|..|..
  Fly   116 GDKGQKGDPGPYGQRGDKGERGSPGLHGQAGVPGVQGPAGNPG------APGINGKDGCDGQDGI 174

Mouse   163 SGVGGMGGYPGPAGPPGPPGPPGSSGHP---------GSPGSPGYQ------GPPGEPGQAGPAG 212
            .|:.|:.|.|||.|..|..|..|..|.|         |..|.||::      ||.|.||:.|..|
  Fly   175 PGLEGLSGMPGPRGYAGQLGSKGEKGEPAKENGDYAKGEKGEPGWRGTAGLAGPQGFPGEKGERG 239

Mouse   213 PPGPPGALGPAGPAGKDGESG-------RPGRP---GERGLP-------------GPPGIKGPAG 254
            ..||.||.||.|..|..||.|       :||.|   ||:|.|             ||.|..|..|
  Fly   240 DSGPYGAKGPRGEHGLKGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTVMGPRGDMGQKG 304

Mouse   255 MPGF------PGMKGHRGFDGRNGEKGETGAPGLKGENGLPGDNGAPGPMGPRGAPGERGRPGLP 313
            .||.      ||.:|..|.||:.||||..|.||.:|..|..|..|:.|..|.||.||..|.||.|
  Fly   305 EPGLVGRKGEPGPEGDTGLDGQKGEKGLPGGPGDRGRQGNFGPPGSTGQKGDRGEPGLNGLPGNP 369

Mouse   314 GAAGARGNDGARGSDGQPGPPGPPGTA-GFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAG 377
            |..|..|..||.|..|..|||||||.. |.||.||.||..|..|:||..|..|..|.|||||:.|
  Fly   370 GQKGEPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGLPGPQGYNG 434

Mouse   378 AQGP---PGPPGNNGSPGGKGEMGPAGIPGAPGLIGA---RGPPGPAGTN--------GIPGTRG 428
            .:|.   ||.|||.|.||.|||.|.||:.|..|.||.   .|||||.|..        ||.|::|
  Fly   435 QKGGAGLPGRPGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPEGQKGDAGLPGYGIQGSKG 499

Mouse   429 PSGEPGKNGAKGEPGARGERGEAGSP--------------GIPGPKGEDGKDGSPGEPGANGLPG 479
            .:|.||..|.||..|.||.:|.||:|              |.||.||:.|:.|:||:.|..|:.|
  Fly   500 DAGIPGYPGLKGSKGERGFKGNAGAPGDSKLGRPGTPGAAGAPGQKGDAGRPGTPGQKGDMGIKG 564

Mouse   480 AAGER------GPSGFRGPAGPNGIPGE---KGPPGERGGPGPAGPRGV------AGEPGRDGTP 529
            ..|.:      ||.|.:|.:|..||||:   :|||||||.||..|..|:      .||.|.||..
  Fly   565 DVGGKCSSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGYPGERGHDGINGQTGPPGEKGEDGRT 629

Mouse   530 GGPGIRGMPGSP---------------GGPGNDGKPGPPGSQGESGRPGPPGPSGPRGQPGVMGF 579
            |.||..|.||.|               |.||..|..|..|.:|..|.||.|||.|..|..|..|.
  Fly   630 GLPGATGEPGKPALCDLSLIEPLKGDKGYPGAPGAKGVQGFKGAEGLPGIPGPKGEFGFKGEKGL 694

Mouse   580 PGPKGNDGAPGK----------------------------------------------------- 591
            .|..||||.||:                                                     
  Fly   695 SGAPGNDGTPGRAGRDGYPGIPGQSIKGEPGFHGRDGAKGDKGSFGRSGEKGEPGSCALDEIKMP 759

Mouse   592 -NGERGGPGGPGLPGPAGKN------------GETGPQGPPGP------TGPAGDKGDSGP---- 633
             .|.:|.||..|:|||.|::            |.||||||||.      .||.|:||:.|.    
  Fly   760 AKGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLNGPRGEKGNQGAVGVP 824

Mouse   634 --PGPQGLQGIPGTGGPPGENGKPG--EPGPKGEVGAPGAPGGKGDSGA---------------P 679
              ||..||:||||..|.||..|:||  .|||.|..|..|..|.|||.|.               |
  Fly   825 GNPGKDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADGSVGYP 889

Mouse   680 GERGP---PGTAGIPGARGGAGPPGPEGGKGPAGPPGPPGAS------------GSPGLQGMP-- 727
            |:||.   ||.:|.||..|..|..||.|..|.|||||.||..            |||||.|||  
  Fly   890 GDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLVGMPGN 954

Mouse   728 -GERGGPGSPGPK---GEKGEPGGAGADGVP---------------GKDGPRGPAGPIGPPGPAG 773
             |:||.||:.|||   |..|.||..|..|:|               |.|||.|..||.|.||..|
  Fly   955 KGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMG 1019

Mouse   774 QPGDKGEGGSP---------------GLPGIAGPRGGPG---ERGEHGPP---------GPAGFP 811
            ..||:|..|:|               |.||:.||.|.||   |:|:.|.|         ||||.|
  Fly  1020 IKGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTP 1084

Mouse   812 GAPGQNGEPG--AKGERGAPGEKGEGGPPGPAGPTGSSGPAGPPGPQGV---KGERGSPGGPGTA 871
            |.||:.|.||  ..|..|.|||||:.|..|..|..|.:|..|..|.|||   .||:||.|.||..
  Fly  1085 GWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIP 1149

Mouse   872 GFPGGRGLPGP---------PGNNGNPGPPGPSGAP---GKDGP---------PGPAGNSGSPGN 915
            |.||..||||.         ||:.|:.|.||.||.|   |:.||         |||.|..|..|.
  Fly  1150 GAPGMDGLPGAAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQ 1214

Mouse   916 PG-------IAGPKGDAGQPGEKGPPGAQGPPGSPGPLGIAGLTGARGLAGPP------GMPGPR 967
            ||       |.|.||..|:.|..|..|.||..|..||.|:||..|.|||.|||      |:||.:
  Fly  1215 PGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAK 1279

Mouse   968 GSPGPQG--------IKGESGKPGASGHNGERGPPGPQGLPGQPGTAGE---PGRDGNPGSDGQP 1021
            |..||:|        ||||.|.||..|.||.      |||.|.||..||   ||..|.||..|.|
  Fly  1280 GDIGPRGEIGYPGVTIKGEKGLPGRPGRNGR------QGLIGAPGLIGERGLPGLAGEPGLVGLP 1338

Mouse  1022 GRDGSPGGKGDRGENGSPGAPGAPGHPGPP---GPVGP------------SGKSGDRGETGPAGP 1071
            |..|..|.||:||..||||.||..|.||.|   |..||            .|:.||:|:.|..||
  Fly  1339 GPIGPAGSKGERGLAGSPGQPGQDGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGP 1403

Mouse  1072 SGAPGPAGARGAPGPQGPRGDKGETGERGSNGIKGHRGFPGNPGPPGSPGAAGHQGAIGSPGPAG 1136
            ||.||..|.:|..|..|..|:.|..|..|..|..|.:|..|..|.||.||..|..|.:..|||.|
  Fly  1404 SGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGMLPPPGPKG 1468

Mouse  1137 PRGPVGPHGPPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGGGAA 1201
            ..|..|.:||.|:.|..|..|.||..|.||.:||||..|..|:.|:|||.|..|.||.....||.
  Fly  1469 EPGQPGRNGPKGEPGRPGERGLIGIQGERGEKGERGLIGETGNVGRPGPKGDRGEPGERGYEGAI 1533

Mouse  1202 AIAGVGGEKSGGFSPYYGDDPMDFKINTEEIMSSLKSVN-GQIE-----SLISPDGS-------- 1252
            .:.|..|| .|..:|...|......|.......::.:.: |..|     ||:..||:        
  Fly  1534 GLIGQKGE-PGAPAPAALDYLTGILITRHSQSETVPACSAGHTELWTGYSLLYVDGNDYAHNQDL 1597

Mouse  1253 ----------RKNPARNCRDLKFCHPELKSGE-YWVDPNQGCKMDAIK-----------VFCNME 1295
                      ...|..:|.....|:...::.: :|:..|....|..::           |.|...
  Fly  1598 GSPGSCVPRFSTLPVLSCGQNNVCNYASRNDKTFWLTTNAAIPMMPVENIEIRQYISRCVVCEAP 1662

Mouse  1296 TGETCINASPMTVPR-KHWWTDSGAEKKHVWFGESM--------NGGFQFSYGPPDLPED 1346
            .....:::..:.||. .:.|       :.:|.|.|.        .||.|....|....||
  Fly  1663 ANVIAVHSQTIEVPDCPNGW-------EGLWIGYSFLMHTAVGNGGGGQALQSPGSCLED 1715

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence     Domain                                             Region      External ID Identity
Col3a1  NP_034060.2  VWC                                                33..89      CDD:278520  9/36 (25%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  97..1195                570/1442 (40%)
                     Nonhelical region (N-terminal)                     155..169                4/13 (31%)
                     Triple-helical region                              170..1195               538/1350 (40%)
                     Collagen                                           233..292    CDD:189968  31/87 (36%)
                     Collagen                                           278..331    CDD:189968  24/52 (46%)
                     Collagen                                           353..422    CDD:189968  38/82 (46%)
                     Collagen                                           413..472    CDD:189968  32/80 (40%)
                     Collagen                                           449..502    CDD:189968  26/75 (35%)
                     Collagen                                           1016..1095  CDD:189968  39/93 (42%)
                     Collagen                                           1064..1135  CDD:189968  29/70 (41%)
                     COLFI                                              1231..1463  CDD:279718  26/161 (16%)
Col4a1  NP_723044.1  Collagen                                           104..161    CDD:189968  23/62 (37%)
                     Collagen                                           322..380    CDD:189968  27/57 (47%)
                     Collagen                                           413..465    CDD:189968  27/51 (53%)
                     Collagen                                           499..561    CDD:189968  24/61 (39%)
                     Collagen                                           574..632    CDD:189968  26/57 (46%)
                     Collagen                                           657..714    CDD:189968  24/56 (43%)
                     Collagen                                           765..824    CDD:189968  23/58 (40%)
                     Collagen                                           854..911    CDD:189968  20/56 (36%)
                     Collagen                                           884..943    CDD:189968  22/58 (38%)
                     Collagen                                           923..982    CDD:189968  27/58 (47%)
                     Collagen                                           1028..1085  CDD:189968  18/56 (32%)
                     Collagen                                           1229..1287  CDD:189968  26/57 (46%)
                     Collagen                                           1318..1376  CDD:189968  28/57 (49%)
                     Collagen                                           1399..1458  CDD:189968  25/58 (43%)
                     Collagen                                           1477..1534  CDD:189968  27/56 (48%)
                     C4                                                 1555..1662  CDD:128421  15/106 (14%)
                     C4                                                 1663..1777  CDD:128421  12/60 (20%)


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           - - E33208_3BAFZ
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         0             0.000           Not matched by this tool.
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           2             1.900
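
The final row is consistent with column totals over the per-tool scores: two tools report a match, and their weighted scores add up to 1.900. A minimal sketch of that check follows, assuming the total is a plain sum of the rows; the dictionary layout and variable names are illustrative, not part of DIOPT.

```python
# (simple score, weighted score) for the tools that reported a match.
# Every other tool in the table scores (0, 0.000) and is omitted here.
tool_scores = {
    "eggNOG": (1, 0.900),
    "SwiftOrtho": (1, 1.000),
}

simple_total = sum(simple for simple, _ in tool_scores.values())
weighted_total = sum(weighted for _, weighted in tool_scores.values())

print(f"Simple score total:   {simple_total}")
print(f"Weighted score total: {weighted_total:.3f}")
```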
