DRSC/TRiP Functional Genomics Resources


Protein Alignment Col4a1 and col2a1

DIOPT Version: 9

Sequence 1: NP_723044.1  Gene: Col4a1 / 33727  FlyBaseID: FBgn0000299  Length: 1779  Species: Drosophila melanogaster
Sequence 2: NP_989220.1  Gene: col2a1 / 394828  XenbaseID: XB-GENE-6258353  Length: 1492  Species: Xenopus tropicalis


Alignment Length: 1736
Identity: 616/1736 (35%)
Similarity: 747/1736 (43%)
Gaps: 412/1736 (23%)
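
The percentages in the summary follow from the counts divided by the alignment length. Below is a minimal Python sketch using only the numbers reported above. The function name `percent` is hypothetical, and truncating to a whole number is an assumption inferred from the gaps figure (412/1736 is about 23.7%, reported as 23%), not documented DIOPT behavior.

```python
ALIGNMENT_LENGTH = 1736

# Counts as reported in the alignment summary.
counts = {"identity": 616, "similarity": 747, "gaps": 412}


def percent(count: int, length: int) -> int:
    """Whole-number percentage, truncated (an assumption; see the note above)."""
    return count * 100 // length


for name, count in counts.items():
    print(f"{name}: {count}/{ALIGNMENT_LENGTH} ({percent(count, ALIGNMENT_LENGTH)}%)")
```

Rounding to the nearest whole number would give the same identity and similarity values but 24% for gaps, which is why truncation is assumed here.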


- Green residues have known domain annotations, which are detailed below.


  Fly     8 LLYAAVIAGALVGADAQFWKTAGTAGSIQDSVKHYNRNEPKFPIDDSYDIVDSAGVARGDL---P 69
            :|:||.....|.....|..:.....||.....:.|:..:...|......:.|:..|...::   .
 Frog    11 VLFAATQVILLAVVRCQDEEDVLATGSCVQHGQRYSDKDVWKPEPCQICVCDTGNVLCDEIICED 75

  Fly    70 PKNCTAG---YAGCVPKCIAE-----------KGNRGLPGPLGPTGLKGEMGFPGMEGPSGDKGQ 120
            ||:|...   :..|.|.|..|           ||.:|.||.:  ..:.|..|.||.:||||::|.
 Frog    76 PKDCPNAEIPFGECCPICPTEQSSTSSGQGVLKGQKGEPGDI--KDVVGPKGPPGPQGPSGEQGP 138

  Fly   121 KGDPGPYGQRGDKGERGSP---GLHGQAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLEGLSG 182
            :||      ||||||:|:|   |..|:.|.||..||.|.||.|            |.|||.|   
 Frog   139 RGD------RGDKGEKGAPGPRGRDGEPGTPGNPGPVGPPGPP------------GPPGLGG--- 182

  Fly   183 MPGPRGYAGQL-GSKGEKGEPAKENGDYAKGEKGEPGWRGTAGLAGPQGFPGEKGERGDSGPYGA 246
                 .:|.|: |...||.                    |.|.:...||..|..|.||..||.||
 Frog   183 -----NFAAQMTGGFDEKA--------------------GGAQMGVMQGPMGPMGPRGPPGPTGA 222

  Fly   247 KGPRGEHGLKGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTVMGPRGDMGQKGEPGLVGR 311
            .||:|..|..||          ||.||..|.                |||||..|..|:||..|.
 Frog   223 PGPQGFQGNPGE----------PGEPGAGGP----------------MGPRGPPGPAGKPGDDGE 261

  Fly   312 KGEPGPEGDTGLDGQKGEKGLPGGPGDRGRQGNFGPPGSTGQKGDRGEPGLNGLPGNPGQKGEPG 376
            .|:||..|:.|..|.:|.:|.|            |.||..|.||.||.|||:|..|..|..|..|
 Frog   262 AGKPGKSGERGPPGPQGARGFP------------GTPGLPGVKGHRGYPGLDGSKGEAGAAGAKG 314

  Fly   377 RAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGLPGPQGYNGQKGGAGL 441
            ..||||:.|..||.||    ||.|      |.||..||.|..|..|.||||||.|..|..|.||.
 Frog   315 EGGATGEAGSPGPMGP----RGLP------GERGRPGASGAAGARGNDGLPGPAGPPGPVGPAGA 369

  Fly   442 PGRPGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPEGQKGDAGLPGY-GIQGSKGDAGIPG 505
            ||.|   |.||.|||.|..|..||:|:.||.|..|.||..|..|.:|.||. ||.|:||.:|.||
 Frog   370 PGFP---GAPGSKGEAGPTGARGPEGAQGPRGESGTPGSPGPAGASGNPGTDGIPGAKGSSGAPG 431

  Fly   506 YPGLKGSKGERGFKGNAGAPGDSKLGRPGTPGAAGAPGQKGDAGRPGTPGQKGDMGIKGDVGGKC 570
            ..|..|..|.|              |.||..||.|..|.||..|.||..|.||:.|.||::|...
 Frog   432 IAGAPGFPGPR--------------GPPGPQGATGPLGPKGQTGDPGVAGFKGEHGPKGEIGSAG 482

  Fly   571 SSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGYPGER---GHDGINGQTGPPGEKGEDGRTGLP 632
            .....||.|::|..|..|.||..|..|||||||.||.|   |.||:.|..|.|||:|..|..|..
 Frog   483 PQGAPGPAGEEGKRGARGEPGAAGPLGPPGERGAPGNRGFPGQDGLAGPKGAPGERGVPGLGGPK 547

  Fly   633 GATGEPGKPALCDLSLIEP-LKGDKGYPGAPGAKGVQ------GFKGAEGLPGIPGPKGEFGFKG 690
            ||.|:||:|.       || |.|.:|..|.||..|.|      |..|.:|.||.|||:|..|..|
 Frog   548 GANGDPGRPG-------EPGLPGARGLTGRPGDAGPQGKVGPSGASGEDGRPGPPGPQGARGQPG 605

  Fly   691 EKGLSGAPGNDGTPGRAGRDGYPGIPGQSIKGEPGFHGRDGAKGDKGSFGRSGEKGEPGSCALDE 755
            ..|..|..|.:|.||:||..|..|.||  ::|.||..|..||:|..|..|.:||:||        
 Frog   606 VMGFPGPKGANGEPGKAGEKGLLGAPG--LRGLPGKDGETGAQGPNGPAGPAGERGE-------- 660

  Fly   756 IKMPAKGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLNGPRGEKGNQGA 820
                    :|.||.:|..|.||..|||||      .|..|.||.||..|..||.|||||:|..|.
 Frog   661 --------QGPPGPSGFQGLPGPPGSPGE------GGKPGDQGVPGEAGAPGLVGPRGERGFPGE 711

  Fly   821 VGVPGNPGKDGLRGIPGRNGQPGPRGEPGISRP-GPMGPPGLNGLQGEKGDRGPTGPIGFPGADG 884
            .|..|..|..|.||:||..|..||:|..|.|.| |..|||||.|:.||:|..|.:||        
 Frog   712 RGSSGPQGLQGPRGLPGTPGTDGPKGATGPSGPNGAQGPPGLQGMPGERGAAGISGP-------- 768

  Fly   885 SVGYPGDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLV 949
                .|||||.|..|..|.||..|.:|..|||||.|.:||                         
 Frog   769 ----KGDRGDTGEKGPEGAPGKDGSRGLTGPIGPPGPSGP------------------------- 804

  Fly   950 GMPGNKGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGA 1014
                 .|::|..|..||.|..|..||||.||..|.|                  ||.|..|||||
 Frog   805 -----NGEKGESGPSGPAGIVGARGAPGDRGETGPP------------------GPAGFAGPPGA 846

  Fly  1015 PGLMGIKGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTG 1079
            .|..|:|||||.:            |:||:.|.||..||.|.||   .:|..|..||.|.||..|
 Frog   847 DGQAGLKGDQGES------------GQKGDAGAPGPQGPSGAPG---PQGPTGVNGPKGARGAQG 896

  Fly  1080 PAGTPGWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVG 1144
            |      ||..|.||.|  ||.||||..|:.|.||..|..|..|.||.:|..|..|:.|:.|..|
 Frog   897 P------PGATGFPGAA--GRVGPPGPNGNPGPSGAPGSAGKEGPKGARGDAGPTGRAGDPGLQG 953

  Fly  1145 APGIPGAPGMDGLPGAAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGER 1209
            ..|:||..|..|..|.:|..|.   ||.:|..|:.|:.||||.:||      :||.|.|||.||.
 Frog   954 PAGVPGEKGESGEDGPSGPDGP---PGPQGLSGQRGIVGLPGQRGE------RGFPGLPGPSGEP 1009

  Fly  1210 GIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNG 1274
            |.:|.|             ||.|:|   |..|..|..|||||||..|.:|:.|..||||..|..|
 Frog  1010 GKQGGP-------------GSAGDR---GPPGPVGPPGLTGPAGEPGREGNAGSDGPPGRDGATG 1058

  Fly  1275 IPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGLVGLPG 1339
            |.|.:|:.||                             :|||            |.||..|.||
 Frog  1059 IKGDRGETGP-----------------------------LGAP------------GAPGAPGAPG 1082

  Fly  1340 PIGPAGSKGERGLAGSPGQPGQDGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPS 1404
            |:||.|.:|:||.:|..|..|..|..||.||.|..||:|.|||.|..|..||||.:|..||||..
 Frog  1083 PVGPTGKQGDRGESGPQGPLGPSGPAGARGLPGPQGPRGDKGEAGEAGERGQKGHRGFTGLQGLP 1147

  Fly  1405 GLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGMLPPPGPKGE 1469
            |.||..|.:|.:         ||.|..|.||..||.|..|:||:.||||..|      ||||:|.
 Frog  1148 GPPGTAGDQGAS---------GPAGPGGPRGPPGPVGPSGKDGSNGLPGPIG------PPGPRGR 1197

  Fly  1470 PGQPGRNGPKGEPGRPGERGLIG-----------IQGERGEKGERGLIGETGNVGRP-------- 1515
            .|:.|..||.|:||.||..|..|           .|.|:|....|.:..:..:...|        
 Frog  1198 GGETGPAGPPGQPGPPGPPGPPGPGIDMSAFAGLSQPEKGPDPMRYMRADQASSSVPQRDVDVEA 1262

  Fly  1516 -------------GPKGDRGEPGERGYEGAIGLIGQKGEPGAPAPAALDY-------LTGILITR 1560
                         .|.|.:..|.....:  :.|...:.:.|       ||       .|...|..
 Frog  1263 TLKSLNNQIESIRSPDGTKKNPARTCRD--LKLCHPEWKSG-------DYWIDPNQGCTVDAIKV 1318

  Fly  1561 HSQSETVPAC--------------SAGHTE---LWTGYSLLYVDGNDYAHNQDLGSPGSCVPRFS 1608
            ....||...|              ||...|   :|.|.::  ..|..:::..|..:|.:...:.:
 Frog  1319 FCNMETGETCVYPNPSKIPKKNWWSAKGKEKKHIWFGETI--NGGFQFSYGDDSSAPNTANIQMT 1381

  Fly  1609 TLPVLSCGQNNVCNYASRNDKTFWLTTNA----AIPMMPVENIEIR 1650
            .|.:||........|..:|...|....:.    |:.:....::|||
 Frog  1382 FLRLLSTDATQNITYHCKNSIAFMDEASGNLKKAVLLQGSNDVEIR 1427
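
Per-block identity and gap counts can be recovered by comparing the two aligned rows column by column. Below is a minimal Python sketch applied to the second block above (Fly 70-120, Frog 76-138). The function name `column_stats` is hypothetical and is not part of DIOPT.

```python
# Aligned rows copied from the block covering Fly 70-120 / Frog 76-138.
FLY = "PKNCTAG---YAGCVPKCIAE-----------KGNRGLPGPLGPTGLKGEMGFPGMEGPSGDKGQ"
FROG = "PKDCPNAEIPFGECCPICPTEQSSTSSGQGVLKGQKGEPGDI--KDVVGPKGPPGPQGPSGEQGP"


def column_stats(a: str, b: str) -> dict:
    """Count identical and gapped columns in two equal-length aligned rows."""
    if len(a) != len(b):
        raise ValueError("aligned rows must have the same length")
    # A column is identical when both rows carry the same residue (not a gap).
    identical = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    # A column is gapped when either row has a '-' at that position.
    gaps = sum(1 for x, y in zip(a, b) if x == "-" or y == "-")
    return {"columns": len(a), "identical": identical, "gaps": gaps}


stats = column_stats(FLY, FROG)
print(stats)
```

For this block the identical columns line up with the `|` marks in the match line between the two rows.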

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence     Domain                                              Region       External ID  Identity
Col4a1  NP_723044.1  Collagen                                            104..161     CDD:189968   29/59 (49%)
                     Collagen                                            322..380     CDD:189968   21/57 (37%)
                     Collagen                                            413..465     CDD:189968   27/51 (53%)
                     Collagen                                            499..561     CDD:189968   23/61 (38%)
                     Collagen                                            574..632     CDD:189968   30/60 (50%)
                     Collagen                                            657..714     CDD:189968   26/62 (42%)
                     Collagen                                            765..824     CDD:189968   29/58 (50%)
                     Collagen                                            854..911     CDD:189968   24/56 (43%)
                     Collagen                                            884..943     CDD:189968   20/58 (34%)
                     Collagen                                            923..982     CDD:189968   15/58 (26%)
                     Collagen                                            1028..1085   CDD:189968   20/56 (36%)
                     Collagen                                            1229..1287   CDD:189968   28/57 (49%)
                     Collagen                                            1318..1376   CDD:189968   22/57 (39%)
                     Collagen                                            1399..1458   CDD:189968   26/58 (45%)
                     Collagen                                            1477..1534   CDD:189968   17/88 (19%)
                     C4                                                  1555..1662   CDD:128421   22/117 (19%)
                     C4                                                  1663..1777   CDD:128421   -
col2a1  NP_989220.1  VWC                                                 38..93       CDD:214564   9/54 (17%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   98..1255     -            567/1464 (39%)
                     Triple-helical region                               206..1219    -            514/1272 (40%)
                     Collagen                                            263..>303    CDD:189968   20/51 (39%)
                     Med15                                               393..>785    CDD:312941   188/448 (42%)
                     Collagen                                            497..556     CDD:189968   31/58 (53%)
                     Collagen                                            807..865     CDD:189968   32/87 (37%)
                     PRK07764                                            <816..1057   CDD:236090   125/306 (41%)
                     Collagen                                            842..900     CDD:189968   33/78 (42%)
                     Collagen                                            1094..1151   CDD:189968   28/56 (50%)
                     Nonhelical region (C-terminal)                      1220..1246   -            5/25 (20%)
                     COLFI                                               1256..1491   CDD:366624   30/183 (16%)
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
Total           1             0.910
