DRSC/TRiP Functional Genomics Resources



Protein Alignment Col4a1 and Col19a1

DIOPT Version: 9

Sequence 1: NP_723044.1  Gene: Col4a1 / 33727  FlyBaseID: FBgn0000299  Length: 1779  Species: Drosophila melanogaster
Sequence 2: XP_006495705.1  Gene: Col19a1 / 12823  MGIID: 1095415  Length: 1162  Species: Mus musculus


Alignment Length: 1202
Identity: 431/1202 (35%)
Similarity: 521/1202 (43%)
Gaps: 356/1202 (29%)
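
The summary percentages can be reproduced from the raw counts. The following is a minimal Python sketch, not the tool's own code. It assumes the displayed percentages are truncated rather than rounded, which is an inference from 431/1202 (about 35.86%) being reported as 35%; the actual display rule is not stated on this page. The helper name `pct` is hypothetical.

```python
def pct(count: int, length: int) -> int:
    """Integer percentage, truncated (assumed display rule, see lead-in)."""
    return count * 100 // length


ALIGNMENT_LENGTH = 1202
counts = {"Identity": 431, "Similarity": 521, "Gaps": 356}

# Recompute each percentage from its count and the alignment length.
summary = {name: pct(n, ALIGNMENT_LENGTH) for name, n in counts.items()}

for name, n in counts.items():
    print(f"{name}: {n}/{ALIGNMENT_LENGTH} ({summary[name]}%)")
```

With truncation, all three recomputed values agree with the figures reported above.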


- Green residues have known domain annotations, which are detailed under Known Domains below.


  Fly    43 NRNEPKFPIDDSYDIVDSA------GVARGDLPPKNCTAGYAGCVPKCIAEKGNRGLPGPLGPTG 101
            |.:..|.|..|.:....|:      |.....||.|........|:|    .|...||||.|...|
Mouse   267 NLSPTKCPEQDDFGSTTSSWGTSNTGKMSSYLPGKQELKDTCQCIP----NKEEAGLPGTLRSIG 327

  Fly   102 LKGEMGFPGMEGPSGDKGQKGDPGPYGQRGDKGERGSPGLHGQAGVPGVQGPAGNPGAPGINGKD 166
            .||:.|.|      |:.|..|.||..||   |||:|..|:.|:.      |..|.|||.|.:|.|
Mouse   328 HKGDKGEP------GEHGLDGTPGLPGQ---KGEQGLEGIKGEI------GEKGEPGAKGDSGLD 377

  Fly   167 GCDGQDGIPGLEGLSGMPGPRGYAGQLGSKGEKGEPAKENGDYAKGEKGEPGWRGTAGLAGPQGF 231
            |.:||||:.|..|..|.|||:      |.||:.|.|            |.|...|:.|:.||||.
Mouse   378 GLNGQDGLKGDSGPQGPPGPK------GDKGDMGPP------------GPPALTGSIGIQGPQGP 424

  Fly   232 P---GEKGERGDSGPYGAKGPRGEHGLKGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTV 293
            |   |::|.||.:||.|..||                |..||.||::|                :
Mouse   425 PGKEGQRGRRGKTGPPGNPGP----------------PGPPGPPGLQG----------------L 457

  Fly   294 MGPRGDMGQKGEPGLVGRKGEPGPEGDTGLDGQKGEKGLPGGPGDRGRQGNFGPPG---STGQKG 355
            ..|.|....||    .|..|..||:      |:||:.||||.||..|.:|:.|.||   :.|:||
Mouse   458 QQPFGGYFNKG----TGEHGASGPK------GEKGDTGLPGFPGSVGPKGHKGEPGEPLTKGEKG 512

  Fly   356 DRGEPGLNGLPGNPGQKGEPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGPQGL 420
            |||||||.|..|..|:.|:||..|..|.|||.|..||.|.       .||:||.|.||.||..  
Mouse   513 DRGEPGLLGPQGIKGEPGDPGPPGLLGSPGLKGQQGPAGS-------MGPRGPPGDVGLPGEH-- 568

  Fly   421 NGVDGLPGPQGYNGQKGGAGLPGRPGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPEGQKG 485
                |:||.||..|:||..|  ||.|..|.||.||:.|..|::.| |..|..|:||.|||.|.||
Mouse   569 ----GIPGKQGVKGEKGDPG--GRLGPPGLPGLKGDAGPPGISLP-GKPGLDGNPGSPGPRGPKG 626

  Fly   486 DAGLPGYGIQGSKGD-----AGIPGYPGLKGSKGERGFKGNAGAPGDSKLGRPGTPGAAGAPGQK 545
            :.|||  |:.||.||     .||||..|.:|..||.|.:|..|.||  ..|.||.||..||||:.
Mouse   627 ERGLP--GLHGSPGDTGPPGVGIPGRTGSQGPAGEPGIQGPRGLPG--LPGTPGMPGNDGAPGKD 687

  Fly   546 GDAGRPGTPGQKGDMGIKGDVGGK----CSSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGYPG 606
            |..|.||.||....:.:.||:|..    |.:|:|      ...||..|.|.||:.|.||:     
Mouse   688 GKPGLPGPPGDPIALPLLGDIGALLKNFCGNCQA------NVPGLKSIKGDDGSTGEPGK----- 741

  Fly   607 ERGHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPALCDLSLIEPLKGDKGYPGAPGAKGVQGFK 671
               :|       |...||:                           .|.:|.||.||.:|.:|.|
Mouse   742 ---YD-------PAARKGD---------------------------VGPRGPPGFPGREGPKGSK 769

  Fly   672 GAEGLPGIPGPKGEFGFKGEKGLSGAPGNDGTPGRAGRDGYPGIPGQSIKGEPGFHGRDGAKGDK 736
            |..|.|||.|.||:.|.:|..|||||||..|.||..||.|:|              |..||||||
Mouse   770 GERGYPGIHGEKGDEGLQGIPGLSGAPGPTGPPGLTGRTGHP--------------GPTGAKGDK 820

  Fly   737 GSFGRSGEKGEPGSCALDEIKMPAKGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPG 801
            ||                         :|.||:   ||||                     ||||
Mouse   821 GS-------------------------EGPPGK---PGPP---------------------GPPG 836

  Fly   802 VEGPRGLNGPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPG-ISRPGPMGPPGLNGLQ 865
            |....| ||.......||.|.|||.||..         |.|||:|:|| :..||.||.|||.|..
Mouse   837 VPLNEG-NGMSSLYKIQGGVNVPGYPGPP---------GPPGPKGDPGPVGEPGAMGLPGLEGFP 891

  Fly   866 GEKGDRGPTGPIGFPGADGSVGYPGDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGI 930
            |.||||||.||            ||..|.:|.||..|.||:.||:|:.||||..|..||.|..|.
Mouse   892 GVKGDRGPAGP------------PGIAGISGKPGAPGPPGVPGEQGERGPIGDTGFPGPEGPSGK 944

  Fly   931 DGVRGRDGAKGEPGSPGLVGMPGNKGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKG 995
            .|:.|:|            |:||.:|..|.||:.|||         |:||..||||..|.:|::|
Mouse   945 PGINGKD------------GLPGAQGIMGKPGDRGPK---------GERGDQGIPGDRGPQGERG 988

  Fly   996 ATGLTGNDGPVGGRGPPGAPGLMGIKGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGLPGDA 1060
            ..||||..|.:|..||.|:                           ||:.|.||..||||.|   
Mouse   989 KPGLTGMKGAIGPVGPAGS---------------------------KGSTGPPGHQGPPGNP--- 1023

  Fly  1061 SEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGINGEK 1125
                                 |.||.|.:      ||               |..:.:..||.|.
Mouse  1024 ---------------------GIPGTPAD------AV---------------SFEEIKHYINQEV 1046

  Fly  1126 ---GEQGLQGVWGQ---PGEKGSVGAPGIPGAPGMDGLPGAAGAPGAVGYPGDRGDKGEP--GLS 1182
               .|:.:.....|   |....|..|.|.||.||.|||||..|.||..||.|.:|::|||  ||.
Mouse  1047 LRIFEERMAVFLSQLKLPAAMLSAQAHGRPGPPGKDGLPGPPGDPGPQGYRGQKGERGEPGIGLP 1111

  Fly  1183 GLPGLKGETGPVGLQGFTGAPGPKGERGIRGQ 1214
            |.|||.|.:. |||.|..|||||:|..|..|:
Mouse  1112 GSPGLPGSSA-VGLPGSPGAPGPQGPPGPSGR 1142
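
Column statistics for any block of the alignment can be recomputed directly from the two aligned rows, without relying on the symbols in the middle line. The sketch below is illustrative only: the function name `column_stats` is hypothetical, and it assumes `-` marks a gap and that both rows have been copied without the row labels and coordinates. It is applied to the final block (Fly 1183..1214, Mouse 1112..1142).

```python
def column_stats(seq_a: str, seq_b: str, gap: str = "-") -> dict:
    """Count identical and gapped columns in two aligned rows of equal length."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned rows must have equal length")
    # A column is identical when both rows carry the same residue (not a gap).
    identical = sum(a == b and a != gap for a, b in zip(seq_a, seq_b))
    # A column is gapped when either row carries the gap character.
    gaps = sum(a == gap or b == gap for a, b in zip(seq_a, seq_b))
    return {"length": len(seq_a), "identical": identical, "gaps": gaps}


# Final alignment block, copied from the rows above.
fly = "GLPGLKGETGPVGLQGFTGAPGPKGERGIRGQ"
mouse = "GSPGLPGSSA-VGLPGSPGAPGPQGPPGPSGR"

stats = column_stats(fly, mouse)
print(stats)
```

Summing these per-block counts over all blocks should give the identity and gap totals for the whole alignment, provided the rows are transcribed exactly.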

Known Domains:


Indicated by green residues in the alignment.

Gene     Sequence        Domain    Region      External ID  Identity
Col4a1   NP_723044.1     Collagen  104..161    CDD:189968   21/56 (38%)
                         Collagen  322..380    CDD:189968   30/60 (50%)
                         Collagen  413..465    CDD:189968   22/51 (43%)
                         Collagen  499..561    CDD:189968   30/66 (45%)
                         Collagen  574..632    CDD:189968   14/57 (25%)
                         Collagen  657..714    CDD:189968   31/56 (55%)
                         Collagen  765..824    CDD:189968   18/58 (31%)
                         Collagen  854..911    CDD:189968   27/56 (48%)
                         Collagen  884..943    CDD:189968   24/58 (41%)
                         Collagen  923..982    CDD:189968   20/58 (34%)
                         Collagen  1028..1085  CDD:189968   11/56 (20%)
                         Collagen  1229..1287  CDD:189968
                         Collagen  1318..1376  CDD:189968
                         Collagen  1399..1458  CDD:189968
                         Collagen  1477..1534  CDD:189968
                         C4        1555..1662  CDD:128421
                         C4        1663..1777  CDD:128421
Col19a1  XP_006495705.1  TSPN      76..255     CDD:214560
                         Collagen  330..387    CDD:189968   29/71 (41%)
                         Collagen  476..529    CDD:189968   29/58 (50%)
                         Collagen  545..602    CDD:189968   31/71 (44%)
                         Collagen  588..644    CDD:189968   28/58 (48%)
                         Collagen  629..685    CDD:189968   28/59 (47%)
                         Collagen  946..1005   CDD:189968   30/79 (38%)
Blue background indicates that the domain is not in the aligned region.
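
For downstream use, each domain row can be split into its fields. The following is a hedged Python sketch: the `Domain` type, the `ROW` pattern, and `parse_domain` are hypothetical names, and the pattern assumes rows of the form `Domain start..end CDD:id [identical/compared (pct%)]` as they appear in this table. Rows without an Identity value (domains outside the aligned region) yield `None` for the identity fields.

```python
import re
from typing import NamedTuple, Optional


class Domain(NamedTuple):
    name: str
    start: int
    end: int
    external_id: str
    identical: Optional[int]  # None when the row has no Identity value
    compared: Optional[int]


# Domain name, region "start..end", CDD identifier, optional "n/m (p%)".
ROW = re.compile(
    r"(?P<name>\S+)\s+(?P<start>\d+)\.\.(?P<end>\d+)\s+(?P<ext>CDD:\d+)"
    r"(?:\s+(?P<ident>\d+)/(?P<total>\d+)\s+\(\d+%\))?\s*$"
)


def parse_domain(row: str) -> Domain:
    """Parse one domain row; leading gene/sequence columns are ignored."""
    m = ROW.search(row)
    if m is None:
        raise ValueError(f"unrecognized domain row: {row!r}")
    ident = int(m["ident"]) if m["ident"] else None
    total = int(m["total"]) if m["total"] else None
    return Domain(m["name"], int(m["start"]), int(m["end"]), m["ext"], ident, total)


aligned = parse_domain("Collagen 104..161 CDD:189968 21/56 (38%)")
unaligned = parse_domain("C4 1555..1662 CDD:128421")
print(aligned)
print(unaligned)
```

Note that the per-domain percentages look rounded to the nearest integer (21/56 is 37.5%, shown as 38%), so recomputing them needs a different rule from the one that reproduces the overall summary line.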


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           2             1.910
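
The final row of the score table is consistent with a plain sum over the per-tool rows: two tools matched, and 0.910 + 1.000 = 1.910. The sketch below checks that arithmetic. It is an illustration under that assumption, not a description of how DIOPT computes its scores; the variable names are hypothetical, and `Decimal` is used only to keep the three-decimal formatting exact.

```python
from decimal import Decimal

# Tools that reported a match: (simple score, weighted score).
matched = {
    "Phylome": (1, Decimal("0.910")),
    "SwiftOrtho": (1, Decimal("1.000")),
}

# Tools listed as "Not matched by this tool." contribute zero to both sums.
unmatched = [
    "Compara", "Domainoid", "eggNOG", "Hieranoid", "Homologene",
    "Inparanoid", "Isobase", "OMA", "OrthoDB", "OrthoFinder",
    "OrthoInspector", "orthoMCL", "Panther", "RoundUp",
    "SonicParanoid", "TreeFam",
]

scores = dict(matched)
scores.update({tool: (0, Decimal("0.000")) for tool in unmatched})

total_simple = sum(simple for simple, _ in scores.values())
total_weighted = sum((w for _, w in scores.values()), Decimal("0.000"))

print(f"{len(scores)} tools: simple {total_simple}, weighted {total_weighted}")
```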
