DRSC/TRiP Functional Genomics Resources



Protein Alignment Col4a1 and Col2a1

DIOPT Version: 9

Sequence 1: NP_723044.1  Gene: Col4a1 / 33727  FlyBaseID: FBgn0000299  Length: 1779  Species: Drosophila melanogaster
Sequence 2: NP_112440.2  Gene: Col2a1 / 12824  MGIID: 88452  Length: 1487  Species: Mus musculus


Alignment Length: 1727
Identity: 632/1727 (36%)
Similarity: 751/1727 (43%)
Gaps: 411/1727 (23%)
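
The percentages in the summary above can be reproduced from the raw counts. The following is a minimal sketch, not DIOPT's own code: it assumes each percentage is the count divided by the alignment length and truncated to a whole number, which is inferred only from the reported values (for example, 632/1727 is about 36.6% but is shown as 36%). The function name `percent_truncated` is hypothetical.

```python
# Values copied from the alignment summary above.
ALIGNMENT_LENGTH = 1727
COUNTS = {"identity": 632, "similarity": 751, "gaps": 411}


def percent_truncated(count: int, total: int) -> int:
    """Whole-number percentage, truncated rather than rounded.

    Truncation is an assumption inferred from the reported figures,
    not documented DIOPT behavior.
    """
    return (100 * count) // total


summary = {name: percent_truncated(n, ALIGNMENT_LENGTH) for name, n in COUNTS.items()}
print(summary)  # {'identity': 36, 'similarity': 43, 'gaps': 23}
```

Integer arithmetic is used so the result does not depend on floating-point rounding.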


- Green residues have known domain annotations that are detailed below.


  Fly     8 LLYAAVIAGALVGADAQFWKTAGTAGSIQDSVKHYNRNEPKFPIDDSYDIVDSAGVARGDL---P 69
            ||.|||:  ...|.|||      .|||...:.:.|...:...|......:.|:..|...|:   .
Mouse    15 LLIAAVL--RCQGQDAQ------EAGSCLQNGQRYKDKDVWKPSSCRICVCDTGNVLCDDIICED 71

  Fly    70 PK--NCTAGYAGCVPKCIAEKGNRGLPGPLGPTGLKGEMG----FPGMEGPSGDKGQKGDPGPYG 128
            |.  |....:..|.|.|.|:.....  |.|||.|.|||.|    ..|..||.|.:|..|:.||.|
Mouse    72 PDCLNPEIPFGECCPICPADLATAS--GKLGPKGQKGEPGDIRDIIGPRGPPGPQGPAGEQGPRG 134

  Fly   129 QRGDKGERGSP---GLHGQAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLEGLSGMPGPRGYA 190
            .||||||:|:|   |..|:.|.||..||||.||.|            |.|||..       ..:|
Mouse   135 DRGDKGEKGAPGPRGRDGEPGTPGNPGPAGPPGPP------------GPPGLSA-------GNFA 180

  Fly   191 GQL-GSKGEKGEPAKENGDYAKGEKGEPGWRGTAGLAGPQGFPGEKGERGDSGPYGAKGPRGEHG 254
            .|: |...||.                    |.|.:...||..|..|.||..||.||.||:|..|
Mouse   181 AQMAGGYDEKA--------------------GGAQMGVMQGPMGPMGPRGPPGPAGAPGPQGFQG 225

  Fly   255 LKGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTVMGPRGDMGQKGEPGLVGRKGEPGPEG 319
            ..||          ||.||:.|.                |||||..|..|:||..|..|:||..|
Mouse   226 NPGE----------PGEPGVSGP----------------MGPRGPPGPAGKPGDDGEAGKPGKSG 264

  Fly   320 DTGLDGQKGEKGLPGGPGDRGRQGNFGPPGSTGQKGDRGEPGLNGLPGNPGQKGEPGRAGATGKP 384
            :.||.|.:|.:|.||.||..|.:|:.|.||..|.||:.|.||:.|..|:||:.|.||..|..|.|
Mouse   265 ERGLPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAGAPGVKGESGSPGENGSPGPMGPRGLP 329

  Fly   385 GLLGPPGPPG--GGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGLPGPQGYNGQKGGAGLPGRPGN 447
            |..|..||.|  |.||..|.|||.||                  |||.   |..||.|.|     
Mouse   330 GERGRTGPAGAAGARGNDGQPGPAGP------------------PGPV---GPAGGPGFP----- 368

  Fly   448 EGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPEGQKGDAGLPGY-GIQGSKGDAGIPGYPGLKG 511
             |.||.|||.|..|..||:|:.|..|.||.||..|..|.:|.||. ||.|:||.||.||..|..|
Mouse   369 -GAPGAKGEAGPTGARGPEGAQGSRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGIAGAPG 432

  Fly   512 SKGERGFKGNAGAPGDSKLGRPGTPGAAGAPGQKGDAGRPGTPGQKGDMGIKGDVGGKCSSCRAG 576
            ..|.|              |.||..||.|..|.||.||.||..|.|||.|.||:.|........|
Mouse   433 FPGPR--------------GPPGPQGATGPLGPKGQAGEPGIAGFKGDQGPKGETGPAGPQGAPG 483

  Fly   577 PKGDKGTSGLPGIPGKDGARGPPGERGYPGER---GHDGINGQTGPPGEKGEDGRTGLPGATGEP 638
            |.|::|..|..|.||..|..|||||||.||.|   |.||:.|..|.|||:|..|.||..||.|:|
Mouse   484 PAGEEGKRGARGEPGGAGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLTGPKGANGDP 548

  Fly   639 GKPALCDLSLIEPLKGDKGYPGAPGAKGVQGFKGAEGLPGIPGPKGEFGFKGEKGLSGAPGNDGT 703
            |:|               |.||.|||:|:      .|.||..||:|:.      |.|||||.||.
Mouse   549 GRP---------------GEPGLPGARGL------TGRPGDAGPQGKV------GPSGAPGEDGR 586

  Fly   704 PGRAGRDGYPGIPGQSIKGEPGFHGRDGAKGDKGSFGRSGEKGEPGSCALDEIKMPAKGNKGEPG 768
            ||..|..|        .:|:||..|..|.||..|..|::||||..|:..|       :|..|:.|
Mouse   587 PGPPGPQG--------ARGQPGVMGFPGPKGANGEPGKAGEKGLAGAPGL-------RGLPGKDG 636

  Fly   769 QTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLNGPRGEKGNQGAVGVPGNPGKDGLR 833
            :||..||||..|..||||..|..|.:|.||.||..||.|..|.:|::|..|..|.||..|..|.|
Mouse   637 ETGAAGPPGPSGPAGERGEQGAPGPSGFQGLPGPPGPPGEGGKQGDQGIPGEAGAPGLVGPRGER 701

  Fly   834 GIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADGSVGYPGDRGDAGLP 898
            |.||..|.||.:|     ..||.|.||..|..|.||..||.||.|..|..|..|.||:||.||:.
Mouse   702 GFPGERGSPGAQG-----LQGPRGLPGTPGTDGPKGAAGPDGPPGAQGPPGLQGMPGERGAAGIA 761

  Fly   899 GVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLVGMPGNKGDRGAPGN 963
            |..|..|.||||            ||.|.||.||.||..|..|.||.   .|..|.||:.|.|  
Mouse   762 GPKGDRGDVGEK------------GPEGAPGKDGGRGLTGPIGPPGP---AGANGEKGEVGPP-- 809

  Fly   964 DGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGIKGDQGLAG 1028
             ||.|..|..||||:||..|.|                  ||.|..|||||.|..|.|||||.| 
Mouse   810 -GPSGSTGARGAPGERGETGPP------------------GPAGFAGPPGADGQPGAKGDQGEA- 854

  Fly  1029 APGQQGLDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLP 1093
                       |:||:.|.||..||.|.||   .:|..|..||.|.||..||      ||..|.|
Mouse   855 -----------GQKGDAGAPGPQGPSGAPG---PQGPTGVTGPKGARGAQGP------PGATGFP 899

  Fly  1094 GLAVHGRAGPPGEKGDQGRSGID---GRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMD 1155
            |.|  ||.||||..|:.|.:|..   |:||..|.:|:.|..|..|.||.:|..||||..|.||.|
Mouse   900 GAA--GRVGPPGANGNPGPAGPPGPAGKDGPKGVRGDSGPPGRAGDPGLQGPAGAPGEKGEPGDD 962

  Fly  1156 GLPGAAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPAT 1220
            |..|..|.||..|..|.|      |:.||||.:||      :||.|.|||.||.|.:|.|     
Mouse   963 GPSGLDGPPGPQGLAGQR------GIVGLPGQRGE------RGFPGLPGPSGEPGKQGAP----- 1010

  Fly  1221 VPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPR 1285
                    |:.|:|   |..|..|..|||||||..|.:|..|..||||..|..|:   |||.|..
Mouse  1011 --------GASGDR---GPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGV---KGDRGET 1061

  Fly  1286 GEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGLVGLPGPIGPAGSKGER 1350
            |.:|.||.        ||.||.                              |||.||.|.:|:|
Mouse  1062 GALGAPGA--------PGPPGS------------------------------PGPAGPTGKQGDR 1088

  Fly  1351 GLAGSPGQPGQDGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGD 1415
            |.||:.|..|..|..||.|:.|..||:|.|||   :|.:|::|.||.||..|..||||..|..||
Mouse  1089 GEAGAQGPMGPSGPAGARGIAGPQGPRGDKGE---SGEQGERGLKGHRGFTGLQGLPGPPGPSGD 1150

  Fly  1416 TGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGMLPPPGPKGEPGQPGRNGPKG 1480
                  .|..||.|..|.||..||.|..|:||:.|:||..|      ||||:|..|:.|..||.|
Mouse  1151 ------QGASGPAGPSGPRGPPGPVGPSGKDGSNGIPGPIG------PPGPRGRSGETGPVGPPG 1203

  Fly  1481 EPGRPGERGLIGIQGERGEKGERGLIGETGNVGRPGPKGDRGEPG--ERGYEGAIGLIGQKGEPG 1543
            .||.||.                           |||.|    ||  ...:.|    :||: |.|
Mouse  1204 SPGPPGP---------------------------PGPPG----PGIDMSAFAG----LGQR-EKG 1232

  Fly  1544 APAPAALDYLTGILITRHSQSETVPACSAGHTELWTGYSLLYVDGNDYAHNQDLGSPGSCVPRFS 1608
               |..:.|:         :::...:....|.        :.||....:.|..:.|..|  |..|
Mouse  1233 ---PDPMQYM---------RADEADSTLRQHD--------VEVDATLKSLNNQIESIRS--PDGS 1275

  Fly  1609 TL-PVLSCGQNNVCNYASRNDKTFWLTTNAAIPMMPVENIEIRQYISRCVVCEAPANVIAVHSQT 1672
            .. |..:|....:| :.......:|:..|....:..::           |.|........|:...
Mouse  1276 RKNPARTCQDLKLC-HPEWKSGDYWIDPNQGCTLDAMK-----------VFCNMETGETCVYPNP 1328

  Fly  1673 IEVPDCPNGWEG-------LWIGYSFL--MHTAVGNG 1700
            ..||. .|.|..       :|.|.:..  .|.:.|:|
Mouse  1329 ATVPR-KNWWSSKSKEKKHIWFGETMNGGFHFSYGDG 1364

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence     Domain                                             Region      External ID  Identity
Col4a1  NP_723044.1  Collagen                                           104..161    CDD:189968   31/63 (49%)
                     Collagen                                           322..380    CDD:189968   27/57 (47%)
                     Collagen                                           413..465    CDD:189968   16/51 (31%)
                     Collagen                                           499..561    CDD:189968   26/61 (43%)
                     Collagen                                           574..632    CDD:189968   31/60 (52%)
                     Collagen                                           657..714    CDD:189968   25/56 (45%)
                     Collagen                                           765..824    CDD:189968   28/58 (48%)
                     Collagen                                           854..911    CDD:189968   29/56 (52%)
                     Collagen                                           884..943    CDD:189968   26/58 (45%)
                     Collagen                                           923..982    CDD:189968   29/58 (50%)
                     Collagen                                           1028..1085  CDD:189968   20/56 (36%)
                     Collagen                                           1229..1287  CDD:189968   26/57 (46%)
                     Collagen                                           1318..1376  CDD:189968   18/57 (32%)
                     Collagen                                           1399..1458  CDD:189968   26/58 (45%)
                     Collagen                                           1477..1534  CDD:189968   14/58 (24%)
                     C4                                                 1555..1662  CDD:128421   15/107 (14%)
                     C4                                                 1663..1777  CDD:128421   10/47 (21%)
Col2a1  NP_112440.2  VWC                                                34..88      CDD:278520   9/53 (17%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  96..179     -            42/103 (41%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  191..1237   -            536/1361 (39%)
                     Triple-helical region                              201..1214   -            523/1296 (40%)
                     Collagen                                           258..317    CDD:189968   28/58 (48%)
                     Collagen                                           375..425    CDD:189968   25/49 (51%)
                     Collagen                                           468..541    CDD:189968   34/72 (47%)
                     Collagen                                           546..617    CDD:189968   38/105 (36%)
                     Collagen                                           684..732    CDD:189968   22/52 (42%)
                     Collagen                                           801..860    CDD:189968   36/91 (40%)
                     Collagen                                           834..893    CDD:189968   34/79 (43%)
                     Collagen                                           933..992    CDD:189968   30/70 (43%)
                     Collagen                                           969..1046   CDD:189968   41/104 (39%)
                     Nonhelical region (C-terminal)                     1215..1241  -            10/46 (22%)
                     COLFI                                              1254..1486  CDD:279718   24/126 (19%)


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           - - E33208_3BAFZ
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           3             2.810
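
The last row of the table appears to hold the column totals: 3 tools report a match, and their weighted scores add up to 2.810. The sketch below checks that reading against the three matching rows (eggNOG, Phylome, SwiftOrtho). It is an illustration under that assumption, not a description of how DIOPT computes its scores. The variable names are hypothetical.

```python
from decimal import Decimal

# (simple score, weighted score) for the tools that report a match in the table.
# All other tools contribute 0 and 0.000.
matched = {
    "eggNOG": (1, Decimal("0.900")),
    "Phylome": (1, Decimal("0.910")),
    "SwiftOrtho": (1, Decimal("1.000")),
}

# Sum each column; Decimal keeps the three-decimal values exact.
simple_total = sum(simple for simple, _ in matched.values())
weighted_total = sum(weighted for _, weighted in matched.values())

print(simple_total, weighted_total)  # 3 2.810
```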
