DRSC/TRiP Functional Genomics Resources


Protein Alignment Col4a1 and Col1a1

DIOPT Version: 9

Sequence 1: NP_723044.1  Gene: Col4a1 / 33727  FlyBaseID: FBgn0000299  Length: 1779  Species: Drosophila melanogaster
Sequence 2: NP_031768.2  Gene: Col1a1 / 12842  MGIID: 88467  Length: 1453  Species: Mus musculus


Alignment Length: 1395
Identity: 563/1395 (40%)
Similarity: 649/1395 (46%)
Gaps: 305/1395 (21%)
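The percentages in the summary can be reproduced from the raw counts. The short Python sketch below recomputes them. It assumes, based only on these three figures, that the displayed values are truncated to a whole percent rather than rounded (649/1395 is about 46.52%, shown as 46%). The function and variable names are illustrative and are not part of DIOPT.

```python
def percent(count: int, total: int) -> float:
    """Return count/total as a percentage."""
    return 100.0 * count / total


# Counts copied from the alignment summary.
alignment_length = 1395
stats = {"identity": 563, "similarity": 649, "gaps": 305}

percentages = {name: percent(n, alignment_length) for name, n in stats.items()}

for name, value in percentages.items():
    # int() truncates toward zero; this is an assumption about how the
    # page derives its whole-number percentages, not a documented rule.
    print(f"{name}: {value:.2f}% (truncated: {int(value)}%)")
```

Truncating each value gives 40, 46 and 21, which matches the figures reported in the summary.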


- Green residues have known domain annotations, which are detailed below.


  Fly    73 CTAGYAGCVPKCIAEKGNRGLPGPLGPTGLKGEMGFPGMEGPSGDKGQKGDPGPYGQRGDKGERG 137
            |...|.....:.:..:|.:|.|||.||   :|.:|.||.:|..|..|..|.|||      .|..|
Mouse    86 CPEEYVSPNSEDVGVEGPKGDPGPQGP---RGPVGPPGRDGIPGQPGLPGPPGP------PGPPG 141

  Fly   138 SPGLHG-------------QAG--VPGVQGPAGN---PGAPGINGKDGCDGQDGIPGLEGLSGMP 184
            .|||.|             .||  |||..||:|.   ||.||..|..|..|..|.||..|.||..
Mouse   142 PPGLGGNFASQMSYGYDEKSAGVSVPGPMGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGGSGPM 206

  Fly   185 GPRGYAGQLGSKGEKGEPAKENGDYAKGEKGEPGWRGTAGLAGPQGFPGEKGERGDSGPYGAK-- 247
            ||||..|..|..|:.||..|..   ..||:|.||.:|..||.|..|.||.||.||.||..|||  
Mouse   207 GPRGPPGPPGKNGDDGEAGKPG---RPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGD 268

  Fly   248 ----GPRGEHGLKGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTVMGPRGDMGQKGEPGL 308
                ||:||.|..||.||    |.:.|..|:.||:|.|             ||.|..|.:|..|.
Mouse   269 AGPAGPKGEPGSPGENGA----PGQMGPRGLPGERGRP-------------GPPGTAGARGNDGA 316

  Fly   309 VGRKGEPGPEGDTGLDGQKGEKGLPGGPGDRGRQGNFGPPGSTGQKGDRGEPGLNGLPGNPGQKG 373
            ||..|.|||.|.||..|..|..|..|..|.:|.:|:.||.|..|:.|..|..|..|..||||..|
Mouse   317 VGAAGPPGPTGPTGPPGFPGAVGAKGEAGPQGARGSEGPQGVRGEPGPPGPAGAAGPAGNPGADG 381

  Fly   374 EPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGLPGPQGYNGQKGG 438
            :||..||.|.||:.|.||.| |.||..||.||.||                  |||      ||.
Mouse   382 QPGAKGANGAPGIAGAPGFP-GARGPSGPQGPSGP------------------PGP------KGN 421

  Fly   439 AGLPGRPGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPEGQKGDAGLPGYGIQGSKGDAGI 503
            :|.||.|||:|..|.|||.|..|:.||         |||.|.||::|..|.|        |.:|:
Mouse   422 SGEPGAPGNKGDTGAKGEPGATGVQGP---------PGPAGEEGKRGARGEP--------GPSGL 469

  Fly   504 PGYPGLKGSKGERGFKGN---AGAPGDSKLGRPGTPGAAGAPGQKGDAGRPGTPGQKGDMGIKGD 565
            ||.||.:|..|.|||.|.   ||..|.|  |..|.||.||..|..|:|||||             
Mouse   470 PGPPGERGGPGSRGFPGADGVAGPKGPS--GERGAPGPAGPKGSPGEAGRPG------------- 519

  Fly   566 VGGKCSSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGYPGERGHDGINGQTGPPGEKGEDGRTG 630
                    .||..|.||.:|.||.||.||..||      ||..|.||..|..||||.:|:.|..|
Mouse   520 --------EAGLPGAKGLTGSPGSPGPDGKTGP------PGPAGQDGRPGPAGPPGARGQAGVMG 570

  Fly   631 LP---GATGEPGKPALCDLSLIEPLKGDKGYPGAPGAKGVQGFKGAEGLPGIPGPKGEFGFKGEK 692
            .|   |..|||||            .|::|.||.|||.|..|..|..|..|.|||.|..|.:||:
Mouse   571 FPGPKGTAGEPGK------------AGERGLPGPPGAVGPAGKDGEAGAQGAPGPAGPAGERGEQ 623

  Fly   693 GLSGAPGNDGTPGRAGRDGYPGIPG-QSIKGEPGFHGRDGAKGDKGSFGRSGEKGEPGSCALDEI 756
            |.:|:||..|.||.||..|..|.|| |.:.|:.|..|..||:|::|..|..|.:|.||       
Mouse   624 GPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQGPPG------- 681

  Fly   757 KMPA--KGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLNGPRGEKGNQG 819
              ||  :||.|.||..|..|..|..|:||.:|..||      ||.||..|..||.||:|::|:.|
Mouse   682 --PAGPRGNNGAPGNDGAKGDTGAPGAPGSQGAPGL------QGMPGERGAAGLPGPKGDRGDAG 738

  Fly   820 AVGVPGNPGKDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADG 884
            ..|..|:|||||.||:                 .||:||||..|..|:||:.||:||   ||..|
Mouse   739 PKGADGSPGKDGARGL-----------------TGPIGPPGPAGAPGDKGEAGPSGP---PGPTG 783

  Fly   885 SVGYPGDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLV 949
            :.|.|||||:|                  ||.||||.|||||..      |:.|||||||..|: 
Mouse   784 ARGAPGDRGEA------------------GPPGPAGFAGPPGAD------GQPGAKGEPGDTGV- 823

  Fly   950 GMPGNKGDRGAPGNDGPKGFAGV---TGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGP 1011
                 |||.|.||..||.|..|.   .||||.:||.|..|..||.|..||.|..|..||.|..||
Mouse   824 -----KGDAGPPGPAGPAGPPGPIGNVGAPGPKGPRGAAGPPGATGFPGAAGRVGPPGPSGNAGP 883

  Fly  1012 PGAPGLM------GIKGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPG 1070
            ||.||.:      |.:|:.|.||.||:.|..|.||..|.:|.||.|||.|.|         |.||
Mouse   884 PGPPGPVGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGSP---------GTPG 939

  Fly  1071 PSGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWG 1135
            |.|:.|..|..|.||..||:|.|||     .||.||.|.||.||..|..|..|..|..||.   |
Mouse   940 PQGIAGQRGVVGLPGQRGERGFPGL-----PGPSGEPGKQGPSGSSGERGPPGPMGPPGLA---G 996

  Fly  1136 QPGEKGSVGAPGIPGAPGMDGLPGAAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFT 1200
            .|||.|..|:||..|:||.|      |||||.|..|:.|..|.||..|.||..|..||.|..|..
Mouse   997 PPGESGREGSPGAEGSPGRD------GAPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNGDR 1055

  Fly  1201 GAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQG 1265
            |..||.|..|..|..|       .||..|.||.||..||.||||:|      |:.|.:|..||||
Mouse  1056 GETGPAGPAGPIGPAG-------ARGPAGPQGPRGDKGETGEQGDR------GIKGHRGFSGLQG 1107

  Fly  1266 PPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAG 1330
            |||:                                   ||..|.||..||.|..|.||.||.||
Mouse  1108 PPGS-----------------------------------PGSPGEQGPSGASGPAGPRGPPGSAG 1137

  Fly  1331 EP---GLVGLPGPIGPAGSKGERGLAGSPGQPGQDGFPGAPGLKGDTGPQGFK------GERGLN 1386
            .|   ||.||||||||.|.:|..|.:|..|.||..|.||.||  ..:|...|.      .|:..:
Mouse  1138 SPGKDGLNGLPGPIGPPGPRGRTGDSGPAGPPGPPGPPGPPG--PPSGGYDFSFLPQPPQEKSQD 1200

  Fly  1387 GFEGQKGDKG----DRGLQGPSGLPGLVGQ 1412
            |....:.|..    ||.|:..:.|..|..|
Mouse  1201 GGRYYRADDANVVRDRDLEVDTTLKSLSQQ 1230
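Each alignment block above consists of a Fly row, a match line, and a Mouse row, with start and end coordinates flanking the residues. The Python sketch below uses the final block as input. It parses the two sequence rows and counts identical and gapped columns by comparing residues directly. The helper names `parse_row` and `column_counts` are hypothetical, and the match-line symbols are deliberately not interpreted here.

```python
def parse_row(line: str) -> tuple[str, int, str, int]:
    """Split an alignment row into (label, start, residues, end)."""
    label, start, residues, end = line.split()
    return label, int(start), residues, int(end)


def column_counts(seq_a: str, seq_b: str) -> dict[str, int]:
    """Count identical and gapped columns between two aligned rows."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned rows must have the same length")
    pairs = list(zip(seq_a, seq_b))
    identical = sum(a == b and a != "-" for a, b in pairs)
    gaps = sum(a == "-" or b == "-" for a, b in pairs)
    return {"columns": len(pairs), "identical": identical, "gaps": gaps}


# Final block of the alignment, copied from the report.
fly_line = "  Fly  1387 GFEGQKGDKG----DRGLQGPSGLPGLVGQ 1412"
mouse_line = "Mouse  1201 GGRYYRADDANVVRDRDLEVDTTLKSLSQQ 1230"

_, fly_start, fly_seq, fly_end = parse_row(fly_line)
_, mouse_start, mouse_seq, mouse_end = parse_row(mouse_line)

counts = column_counts(fly_seq, mouse_seq)
print(counts)
```

As a sanity check, the number of non-gap residues in a row should equal `end - start + 1` for that row's coordinates.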

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence     Domain                                             Region      External ID Identity
Col4a1  NP_723044.1  Collagen                                           104..161    CDD:189968  27/74 (36%)
                     Collagen                                           322..380    CDD:189968  23/57 (40%)
                     Collagen                                           413..465    CDD:189968  18/51 (35%)
                     Collagen                                           499..561    CDD:189968  29/64 (45%)
                     Collagen                                           574..632    CDD:189968  27/57 (47%)
                     Collagen                                           657..714    CDD:189968  28/56 (50%)
                     Collagen                                           765..824    CDD:189968  25/58 (43%)
                     Collagen                                           854..911    CDD:189968  24/56 (43%)
                     Collagen                                           884..943    CDD:189968  24/58 (41%)
                     Collagen                                           923..982    CDD:189968  28/61 (46%)
                     Collagen                                           1028..1085  CDD:189968  24/56 (43%)
                     Collagen                                           1229..1287  CDD:189968  22/57 (39%)
                     Collagen                                           1318..1376  CDD:189968  31/60 (52%)
                     Collagen                                           1399..1458  CDD:189968  4/14 (29%)
                     Collagen                                           1477..1534  CDD:189968
                     C4                                                 1555..1662  CDD:128421
                     C4                                                 1663..1777  CDD:128421
Col1a1  NP_031768.2  VWC                                                31..86      CDD:214564  563/1395 (40%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  94..1210                555/1365 (41%)
                     Nonhelical region (N-terminal)                     152..167                2/14 (14%)
                     Triple-helical region                              168..1181               522/1253 (42%)
                     Collagen                                           225..284    CDD:189968  28/61 (46%)
                     Collagen                                           258..317    CDD:189968  28/75 (37%)
                     Collagen                                           486..545    CDD:189968  32/87 (37%)
                     Collagen                                           525..584    CDD:189968  32/76 (42%)
                     Collagen                                           657..714    CDD:189968  25/65 (38%)
                     Collagen                                           705..755    CDD:189968  27/72 (38%)
                     Cell attachment site. /evidence=ECO:0000255        734..736                0/1 (0%)
                     Collagen                                           <784..822   CDD:189968  26/61 (43%)
                     Collagen                                           1068..1123  CDD:189968  31/102 (30%)
                     Cell attachment site. /evidence=ECO:0000255        1082..1084              1/1 (100%)
                     Nonhelical region (C-terminal)                     1182..1207              4/24 (17%)
                     COLFI                                              1219..1452  CDD:279718  3/12 (25%)

Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           - - E33208_3BAFZ
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         0             0.000           Not matched by this tool.
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           2             1.900
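
The last row of the table appears to hold the column totals: two tools report a match, and their weighted scores sum to 1.900. The Python sketch below reproduces that arithmetic, using `decimal.Decimal` to avoid floating-point noise. The per-tool values are transcribed from the table. The dictionary layout and variable names are illustrative assumptions, not DIOPT's own data model.

```python
from decimal import Decimal

# (simple score, weighted score) per tool, transcribed from the table.
# Tools reported as "Not matched by this tool." carry 0 and 0.000.
unmatched = [
    "Compara", "Domainoid", "Hieranoid", "Homologene", "Inparanoid",
    "Isobase", "OMA", "OrthoDB", "OrthoFinder", "OrthoInspector",
    "orthoMCL", "Panther", "Phylome", "RoundUp", "SonicParanoid", "TreeFam",
]
scores = {tool: (0, Decimal("0.000")) for tool in unmatched}
scores["eggNOG"] = (1, Decimal("0.900"))
scores["SwiftOrtho"] = (1, Decimal("1.000"))

simple_total = sum(simple for simple, _ in scores.values())
weighted_total = sum((weighted for _, weighted in scores.values()), Decimal("0"))

print(f"tools: {len(scores)}, simple total: {simple_total}, "
      f"weighted total: {weighted_total}")
```

The simple score total is the count of tools that support the pair, and the weighted total is the sum of their weighted scores.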
