DRSC/TRiP Functional Genomics Resources


Protein Alignment Col4a1 and Col7a1

DIOPT Version: 9

Sequence 1: NP_723044.1  Gene: Col4a1 / 33727  FlyBaseID: FBgn0000299  Length: 1779  Species: Drosophila melanogaster
Sequence 2: NP_001100328.2  Gene: Col7a1 / 301012  RGDID: 1311417  Length: 2944  Species: Rattus norvegicus


Alignment Length: 1752  Identity: 728/1752 (41%)
Similarity: 867/1752 (49%)  Gaps: 359/1752 (20%)
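
The percentages in this summary can be reproduced from the raw counts. The following is a minimal Python sketch, not DIOPT's own code; the function name `percent_of_alignment` is hypothetical. It assumes the page truncates to a whole percent rather than rounding, which is consistent with 728/1752 (about 41.55%) being displayed as 41%.

```python
def percent_of_alignment(count: int, alignment_length: int) -> int:
    """Return count as a whole-number percentage of the alignment length.

    Assumption: the value is truncated (floor), not rounded.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return (100 * count) // alignment_length


ALIGNMENT_LENGTH = 1752  # alignment columns, gaps included

summary = {
    "identity": percent_of_alignment(728, ALIGNMENT_LENGTH),
    "similarity": percent_of_alignment(867, ALIGNMENT_LENGTH),
    "gaps": percent_of_alignment(359, ALIGNMENT_LENGTH),
}
print(summary)
```

All three values use the alignment length (1752 columns) as the denominator, not the length of either sequence.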


- Green residues have known domain annotations, which are detailed below.


  Fly    18 LVGADAQFWKTAGTAGSIQDSVKHYNRNEPKFPIDDSYDIVDSAGVARGDLPPKNCTAGYA---- 78
            |||||.:..:.....   .|.::.:      |.:|:..|: |.||   .||....|.|..|    
  Rat  1193 LVGADPEQLRLLAPG---MDPIQTF------FAVDNGLDL-DRAG---SDLAVALCQAAVAIQPQ 1244

  Fly    79 --GCVPKCIAEKGNRGLPGPLGPTGLKGEMGFPGMEGPSGDKGQKGDPGPYGQRGDKGERGSPG- 140
              .|...|              |.|.|||   ||:.||.|..|..|.||..|:.|..|.:|:|| 
  Rat  1245 LEPCAVPC--------------PKGQKGE---PGVTGPQGQAGPPGPPGLPGRTGAPGPQGAPGS 1292

  Fly   141 --LHGQAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLEGLSGMPGPRGYAGQLGSKGEKGEPA 203
              ..|:.|.||.:||.|:||.||:.|.         ||::|..|..||||..|:.|.:|.|||| 
  Rat  1293 TQAKGERGFPGPEGPPGSPGLPGVPGS---------PGVKGSPGWSGPRGDRGERGPQGPKGEP- 1347

  Fly   204 KENGDYAKGEKGEPGWRGTAGLAGPQGFPGEKGERGDSGPYGAKGPRGEHGLKGEKGASCYGPMK 268
                       ||||  ...| .|..|.||:||:.|.|||.|..||.|:.|.:|          .
  Rat  1348 -----------GEPG--QVIG-GGRPGLPGKKGDPGPSGPPGPHGPLGDPGPRG----------P 1388

  Fly   269 PGAPG--IKGEKGEPASSFPVKPTHTVMGPRGDMGQKGEPGLVGRKGEPGPEGDTGLDGQKGEK- 330
            ||.||  :||:||:.....|       .||.....::|.|||.|..|.|||:|..|..|:|||| 
  Rat  1389 PGLPGTSVKGDKGDRGERGP-------PGPGTGASEQGSPGLPGLPGSPGPQGPPGRTGEKGEKG 1446

  Fly   331 -------GLPGGPGDRGRQGNFGPPGSTGQKGDRGEPGLNGLPGNPGQKGEPGRAGATGKPGLLG 388
                   ||||.||..|..|..|.||.||.||||   ||.|.||.||:|||.|..|..|..||.|
  Rat  1447 DCEDGGPGLPGQPGVPGEPGLRGAPGVTGPKGDR---GLTGTPGEPGEKGERGPPGPVGPQGLPG 1508

  Fly   389 PPGPPG--GGRGTPGPPGPKGPRGYVGAP----------GPQGLNGVDGLPGPQGYNGQKGGAGL 441
            ..|.||  |..|.|||||.:|.:|..|.|          |.:|..|..|||||:|.:|.||..|.
  Rat  1509 AAGRPGVEGPEGPPGPPGRRGEKGEPGRPGDPALGPGGAGAKGEKGDAGLPGPRGASGIKGEQGA 1573

  Fly   442 PG--RPGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPEGQKGDAGLPG-------YGIQGS 497
            ||  .||:.||.|..|::|..||.   |..||.|..||||.:|:.|..|.||       .|..|.
  Rat  1574 PGLALPGDPGPKGDPGDRGPIGLT---GRAGPTGDSGPPGEKGEPGRPGSPGPVGPRGRDGEAGE 1635

  Fly   498 KGDAGIPGYPGLKGSKGERGF------------KGNAGAPGDSKLGRPGTPGAAGAPGQKGDAGR 550
            |||.|.||.|||.|..||||.            |||.|.||:.  ||.||||.:|..|.:|:.|.
  Rat  1636 KGDEGAPGEPGLPGKAGERGLRGAPGPRGPVGEKGNEGDPGED--GRNGTPGPSGPKGDRGEPGP 1698

  Fly   551 PGTPGQKGDMGI----KGDVGGKCSSCRAGPKGDKGTSGLPGIPGK---DGARGPPGERGYPGER 608
            ||.||:..|..:    ||:.|      :.||:|.||..|.||..|:   ||.|||||.:|.||.|
  Rat  1699 PGLPGRLVDAALESRDKGEPG------QEGPRGPKGDPGPPGASGERGIDGLRGPPGPQGDPGVR 1757

  Fly   609 GHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPALCDLS---------LIEPLKGDKGYPGAPGA 664
            |..|..|..|.|   |.|||.||.|..|.||.|.|...|         .:..|:|:.|.||.||.
  Rat  1758 GPAGDKGDRGSP---GLDGRNGLDGKPGAPGPPGLHGASGKAGDPGRDGLPGLRGEHGPPGPPGP 1819

  Fly   665 KGVQGFKGAEGLPGI------PGPKGEFGFKGEKGLSGAPGND---------GTPGRAGRDGYPG 714
            .||.|..|.:|.||:      ||..||.|.|||||.||.||.:         |.||..|..|.||
  Rat  1820 PGVPGKPGDDGKPGLNGKNGEPGDPGEDGRKGEKGDSGVPGREGPDGPKGERGAPGNPGLQGPPG 1884

  Fly   715 IPGQ---SIKGEPGFHGRDGAKGDKGSFGRSGE---------KGEPGSCALDE-------IKMPA 760
            :|||   ..:|.||..|..|.|||:|..|..||         :|||||....|       ||:.|
  Rat  1885 LPGQVGPPGQGFPGVPGVTGPKGDRGETGSKGEQGLPGERGLRGEPGSLPNAERFLETAGIKVSA 1949

  Fly   761 ----------------------KGNKGEPGQTGMPGPPGEDGS---PGERGYTGLKGNTGPQGPP 800
                                  :|.||:||.   |||||::||   |||||..|.:|:.||||||
  Rat  1950 LREIVDTWGESSGSFLLVPERRQGPKGDPGD---PGPPGKEGSIGLPGERGLKGERGDPGPQGPP 2011

  Fly   801 GVE-GPRGLNGPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGE------------PGI-- 850
            |:. |.||..||.|..|..|..|:||.||:.|..|..||.|:.|.|||            ||:  
  Rat  2012 GLALGERGPPGPSGLAGEPGKPGIPGLPGRAGAAGEAGRPGERGERGEKGERGEQGRDGHPGLPG 2076

  Fly   851 --SRPGP--------------MGPPGLNGLQGEKGD---RGPTGPIGFPGADGSVGYPGDRGDAG 896
              ..|||              .|||||.|.:||.|.   .||.|..|.||..|..|.||.||..|
  Rat  2077 PPGPPGPKVAIEELGPGPAREQGPPGLKGAKGEPGSDGVPGPKGDRGVPGIKGDAGEPGKRGPDG 2141

  Fly   897 LPGV---------SGRPGIVGEKGDVGPIGPAGVAGPPGVPGI---DGVRGRDGAKGEPGS---P 946
            .||:         .|:||:.|.:|..||:|..|..||||.||:   .|.:|..|.|||||.   |
  Rat  2142 NPGLPGERGVSGPEGKPGLQGPRGTPGPVGSHGDPGPPGAPGLAGPAGPQGPSGLKGEPGETGPP 2206

  Fly   947 GLVGMPGNKGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGP 1011
            |. |:||..|..|.||..||.|..|..|:||..|..   |.:|..|..|..|.:|.||..||.|.
  Rat  2207 GR-GLPGPTGAVGLPGPPGPSGLVGPQGSPGLPGQV---GETGKPGPPGRDGSSGKDGERGGPGV 2267

  Fly  1012 PGAPGL---MGIKGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSG 1073
            ||.|||   :|.||:.|..|||||. :.|.||.||.:|.|| |....|.|:...||.:|.|||.|
  Rat  2268 PGLPGLPGPVGPKGEPGPVGAPGQV-VVGPPGAKGEKGAPG-DLAGALLGEPGAKGDRGLPGPRG 2330

  Fly  1074 LRGDTGPAGTPGWPGE------------KGLPGLAVH---GRAGPPGEKGDQGRSGIDGRDGING 1123
            .:|:.|.||.||.|||            ||.||:.|.   |.:||||.|||.|..|..|..||.|
  Rat  2331 EKGEAGHAGEPGDPGEDGQKGAPGLKGLKGEPGIGVQGPPGPSGPPGMKGDLGPPGAPGAPGIVG 2395

  Fly  1124 EKGEQGLQGVWGQP---GEKGSVGAPGIPGAP---GMDGLPGAAGAPGAVGYPGDRGDKGEPGLS 1182
            ..|:.|.:|..|||   ||:|..|.||..|||   |..|.||:.|||||.|:.||:||.|    :
  Rat  2396 FPGQPGPRGETGQPGPVGERGLAGPPGREGAPGPLGPPGPPGSVGAPGASGFKGDKGDSG----A 2456

  Fly  1183 GLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERG 1247
            ||||.:||.|..||:|..|.||.:|.||:.|.||       .||::|.:|:.|..|.||::|:..
  Rat  2457 GLPGPRGERGEPGLRGEDGHPGQEGPRGLMGPPG-------SRGERGEKGDPGAAGLKGDKGDSA 2514

  Fly  1248 -LTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQ 1311
             :.||||..|||||.|.:||.|..         ||.|||||.|.|     |:||..|.||..   
  Rat  2515 VIEGPAGPRGAKGDMGERGPRGID---------GDQGPRGESGDP-----GDKGSKGEPGDK--- 2562

  Fly  1312 GLIGAPGLIGERGLPGLAGEPGLVGLPGPIGPAGSKGERGLAGSPGQPGQDGFPGAPGLKGD--- 1373
               |:.|..|.|||.|..||||..|:|               |.||.||:||.||..|.|||   
  Rat  2563 ---GSAGSTGVRGLTGPKGEPGAAGIP---------------GEPGAPGKDGAPGFRGDKGDIGF 2609

  Fly  1374 TGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTG 1438
            |||:|.|||||:.|..|:.|:|||:|..|..|.|||.|:|||.|.||:.|..   ||||:.|..|
  Rat  2610 TGPRGLKGERGVKGTCGRDGEKGDKGEAGFPGRPGLSGKKGDMGDPGIPGQS---GAPGKEGLIG 2671

  Fly  1439 PKGRDGRDGTPGLPGQKGEPGMLPPPGPKGEPGQPGRNGPKGEPGRPGERGLIGIQGERGEKGER 1503
            |||..|.||..|..|.:||.|...|||..|.||..|.:|..|.||.||..|..|.:|.:|:||||
  Rat  2672 PKGDRGFDGQSGPKGDQGEKGERGPPGVGGFPGPRGNDGSSGPPGPPGSIGPKGPEGLQGQKGER 2736

  Fly  1504 GLIGETGNVGRPGPKGDRGEPGERGYEGAIGLIGQKGEPGAPAPAAL--DYLTGILITRHSQ 1563
            |..||: .||.||..|..||.||:|..|..|..|:|||      |||  |.:.|.:....||
  Rat  2737 GPPGES-VVGAPGAPGTPGERGEQGRPGPAGPRGEKGE------AALTEDDIRGFVRQEMSQ 2791

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain                        Region       External ID  Identity
Col4a1  NP_723044.1     Collagen                      104..161     CDD:189968   26/59 (44%)
                        Collagen                      322..380     CDD:189968   34/65 (52%)
                        Collagen                      413..465     CDD:189968   26/63 (41%)
                        Collagen                      499..561     CDD:189968   34/73 (47%)
                        Collagen                      574..632     CDD:189968   30/60 (50%)
                        Collagen                      657..714     CDD:189968   31/71 (44%)
                        Collagen                      765..824     CDD:189968   33/62 (53%)
                        Collagen                      854..911     CDD:189968   30/82 (37%)
                        Collagen                      884..943     CDD:189968   29/70 (41%)
                        Collagen                      923..982     CDD:189968   31/64 (48%)
                        Collagen                      1028..1085   CDD:189968   27/56 (48%)
                        Collagen                      1229..1287   CDD:189968   25/58 (43%)
                        Collagen                      1318..1376   CDD:189968   26/60 (43%)
                        Collagen                      1399..1458   CDD:189968   29/58 (50%)
                        Collagen                      1477..1534   CDD:189968   28/56 (50%)
                        C4                            1555..1662   CDD:128421   3/9 (33%)
                        C4                            1663..1777   CDD:128421
Col7a1  NP_001100328.2  vWA_collagen_alphaI-XII-like  38..202      CDD:238759
                        fn3                           236..318     CDD:394996
                        fn3                           334..406     CDD:394996
                        fn3                           427..488     CDD:394996
                        FN3                           510..589     CDD:238020
                        FN3                           599..681     CDD:238020
                        FN3                           688..772     CDD:238020
                        fn3                           778..856     CDD:394996
                        fn3                           868..935     CDD:394996
                        FN3                           959..1045    CDD:238020
                        VWA                           1055..1223   CDD:395045   8/38 (21%)
                        PRK07764                      <1509..1714  CDD:236090   88/209 (42%)
                        Collagen                      1580..1635   CDD:396114   22/57 (39%)
                        Collagen                      1613..1668   CDD:396114   21/54 (39%)
                        Collagen                      1839..1894   CDD:396114   24/54 (44%)
                        Collagen                      1878..1932   CDD:396114   22/53 (42%)
                        PHA03169                      2053..>2202  CDD:223003   55/148 (37%)
                        Collagen                      2102..2158   CDD:396114   22/55 (40%)
                        Collagen                      2315..2363   CDD:396114   19/47 (40%)
                        PRK12678                      2403..>2636  CDD:237171   125/278 (45%)
                        Collagen                      2457..2511   CDD:396114   27/60 (45%)
                        Collagen                      2533..2589   CDD:396114   32/90 (36%)
                        Collagen                      2611..2666   CDD:396114   31/57 (54%)
                        Collagen                      2728..2774   CDD:396114   25/52 (48%)
                        KU                            2877..2932   CDD:238057
Blue background indicates that the domain is not in the aligned region.
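
For downstream analysis, each domain row can be split into its fields. The sketch below is one possible parser, not part of DIOPT; the names `DomainHit` and `parse_domain_row` are hypothetical. It assumes every row ends with a domain name, a `start..end` region, a `CDD:` identifier and an optional identity value, that `<` or `>` in the region presumably flags a partial domain hit, and that the identity fraction reads as identical positions over compared positions.

```python
import re
from typing import NamedTuple, Optional


class DomainHit(NamedTuple):
    name: str
    start: int
    end: int
    partial: bool              # True if "<" or ">" appears in the region
    external_id: str
    identical: Optional[int]   # None when the row has no Identity value
    compared: Optional[int]


# Matches the trailing "Domain Region ExternalID [Identity]" part of a row,
# so a leading gene/sequence prefix on the first row of a gene is ignored.
ROW_RE = re.compile(
    r"(?P<name>\S+)\s+"
    r"(?P<lt><)?(?P<start>\d+)\.\.(?P<gt>>)?(?P<end>\d+)\s+"
    r"(?P<ext>CDD:\d+)"
    r"(?:\s+(?P<identical>\d+)/(?P<compared>\d+)\s+\(\d+%\))?\s*$"
)


def parse_domain_row(row: str) -> DomainHit:
    """Parse one row of the Known Domains table into a DomainHit."""
    m = ROW_RE.search(row)
    if m is None:
        raise ValueError(f"unrecognised domain row: {row!r}")
    has_identity = m.group("identical") is not None
    return DomainHit(
        name=m.group("name"),
        start=int(m.group("start")),
        end=int(m.group("end")),
        partial=bool(m.group("lt") or m.group("gt")),
        external_id=m.group("ext"),
        identical=int(m.group("identical")) if has_identity else None,
        compared=int(m.group("compared")) if has_identity else None,
    )


hit = parse_domain_row("PHA03169 2053..>2202 CDD:223003 55/148 (37%)")
print(hit)
```

Rows without an identity value, such as the KU domain, come back with `identical` and `compared` set to `None`.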


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result, Score, Score Type, Cluster ID)
Compara         1             0.930           - - C166349626
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           - - E33208_3BAFZ
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           4             3.740
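
The last row of the table appears to hold the column totals: a simple score of 4 and a weighted score of 3.740. As a quick check, the sketch below sums the scores of the four tools that report a match. It is illustrative only; the variable names are hypothetical, and it uses `decimal.Decimal` so the result keeps three decimal places.

```python
from decimal import Decimal

# (tool, simple score, weighted score) for the tools that matched this pair
matched = [
    ("Compara", 1, Decimal("0.930")),
    ("eggNOG", 1, Decimal("0.900")),
    ("Phylome", 1, Decimal("0.910")),
    ("SwiftOrtho", 1, Decimal("1.000")),
]

simple_total = sum(simple for _, simple, _ in matched)
weighted_total = sum((weight for _, _, weight in matched), Decimal("0"))

print(simple_total, weighted_total)
```

The unmatched tools contribute 0 and 0.000, so they do not change either total.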
