DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and col4a2

DIOPT Version :9

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:XP_002933063.2 Gene:col4a2 / 100486079 XenbaseID:XB-GENE-5950101 Length:1722 Species:Xenopus tropicalis


Alignment Length:1871 Identity:773/1871 - (41%)
Similarity:966/1871 - (51%) Gaps:342/1871 - (18%)


- Green bases have known domain annotations that are detailed below.


  Fly    52 DDSYDIVDSAGVARGDLPPKNCTAGYAGCVPKCIAEKGNRGLPGPLGPTGLKGE---MGFPGMEG 113
            |.||         .|....::|:   .||  :|:.|||:||.|||||..|..|.   :|..|::|
 Frog    48 DKSY---------TGPCGGRDCS---QGC--QCLPEKGSRGQPGPLGGQGTSGPPGLIGIAGLQG 98

  Fly   114 PSGDKGQKGDPGPYGQRGDKGERGSPGLHGQAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLE 178
            ..||||::|.||..|..||||:.|..|..|..||||..|..|..|.|   |.|||:|..|..|:.
 Frog    99 RKGDKGERGFPGVTGPSGDKGQSGVTGFPGADGVPGHTGQGGPRGKP---GHDGCNGTQGDAGVG 160

  Fly   179 GLSGMPGPRGYAGQLGSKGEKGEP---AKENGDYAKGEKGEPGWR---------GTAGLAGPQGF 231
            |..|..||.|.:|..|:||:||||   :.|..:..:|:.||.|:.         ||||..||||:
 Frog   161 GSHGSRGPPGISGGFGAKGQKGEPFHVSIEVKNRLRGDPGEAGFNGIQGPRGSPGTAGFIGPQGY 225

  Fly   232 PGEKGERGDSGPYGAKGPRGEHGL--KGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTV- 293
               :|.||..||.|.:||:|..||  .||||.    |..||.||.:..:||.      :..|.: 
 Frog   226 ---RGPRGPPGPPGPQGPQGNRGLGYYGEKGE----PGDPGPPGEQPRQGER------EQKHKIE 277

  Fly   294 ------MGPRGDMGQKGEPGLVGRKGEPGPEGDTGLDGQKGEKGLPGGPGDRGRQGNFGPPGSTG 352
                  .|.:||.|:||.|   |.||:|      .:.|:|||:|:.|.||.||..||   .|::|
 Frog   278 ILLLQYKGAKGDQGEKGNP---GDKGQP------AVTGEKGEEGITGFPGQRGFPGN---DGNSG 330

  Fly   353 QKGDRGEPGLNGLPGNPGQKGEPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGP 417
            ..|:.|.||.||:||.||.||:.|..|.      |||.||.   ....|....||..|.:|.||.
 Frog   331 ASGEIGFPGANGVPGLPGTKGKKGEVGD------LGPQGPV---TFVNGQQKRKGYEGEIGFPGL 386

  Fly   418 QGLNGVDGLPGPQGYNGQKGGAGLPGR-PGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPE 481
            .|:.|.||.||..|:.||.|.   |.| ||..|..|.||::|..|..|..|.  |...|||||.:
 Frog   387 NGVYGDDGDPGDPGFPGQNGS---PSRFPGQTGDTGLKGQRGPKGYQGDTGL--PAFQPGPPGID 446

  Fly   482 ---GQKGDAGLPGY-----------GIQGSKGDAGIPGYPGLKGSKGERG----------FKGNA 522
               |::|..|.|||           |..|..|..|.||..||:|.|||:|          .:|.|
 Frog   447 GAPGKEGPPGQPGYPGGRIGMKGLPGFPGRDGQRGAPGSRGLRGIKGEQGECRCYEGDEAGRGPA 511

  Fly   523 GAPGDSKLGRPGTPGAAGAPGQKGDAGRPGTPGQKGDMGI---------KGDVGGKCSSCRAGPK 578
            |.||  .:|.||..|.||..|:.||.|..|.||..|.:|:         .|:.|.|..:...|.|
 Frog   512 GEPG--PVGFPGFQGHAGRKGEAGDRGVQGLPGAPGTVGLAGPAGFPGPAGEKGDKVFATEKGSK 574

  Fly   579 GDKGTSGLPGIPGKDGARGPPGERGYPGERGHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPAL 643
            |.:|.||.||:||:|         ||||..|.||..|..||.|    ||..||||:.|.||.|..
 Frog   575 GAQGDSGFPGVPGRD---------GYPGRNGRDGSPGFPGPSG----DGIKGLPGSHGFPGLPGN 626

  Fly   644 CDLSLIEPLKGDKGYPGAPGAKGVQGFKGAEGLPGIPGPKGEFGFKGEKGLSGAPGN-------- 700
            ..      .:|:.|:||       .||.|::||.|.||.:|..||.|..||||..|:        
 Frog   627 AG------ARGETGFPG-------PGFPGSKGLRGEPGNRGVSGFPGVPGLSGPQGDCYQETDEE 678

  Fly   701 -----DGT-------PGRAGRDGYPGIPG-QSIKGEPGF---HGRDGA---KGDKGSFGRSGEKG 746
                 :||       |..||..|.||:|| ..|:|:.||   ||.||.   ||.||..|.....|
 Frog   679 NEIGKEGTGCRLIAPPAPAGAPGPPGLPGYNGIRGQKGFPGPHGFDGVFGLKGIKGDRGTDSLPG 743

  Fly   747 EPGSCALDEIKMPAKGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQ--GPPGVEGPRGLN 809
            .||...       .:|::|..|..|:||.||..|.||.:|..|.||:.|.:  .|.|.:|..||.
 Frog   744 PPGFSG-------QRGDRGNAGLPGLPGVPGLSGKPGFKGVQGEKGSVGDRLGAPSGEQGNTGLP 801

  Fly   810 GPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPT 874
            |.:|.||.||..|:||..||.|..|.||..|..|..|.||:  ||..||..|             
 Frog   802 GLQGLKGTQGDQGIPGLQGKVGTPGTPGNKGIKGSPGAPGV--PGIPGPAAL------------- 851

  Fly   875 GPIGFPGADGSVGYPGDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGA 939
              .||||..|:.|.||.:|..|.||..|:.|..||||..||.|..|:.|..||.|..|||||.|.
 Frog   852 --FGFPGMPGNPGTPGPQGSMGPPGPPGQRGDEGEKGPPGPAGMKGLPGELGVGGFIGVRGRHGN 914

  Fly   940 KGEPGSPGLVGMPGNKGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDG 1004
            .|.||..|:.|:||.||.:|:||.:|.:|..|..|.|..||..|..|..|..|.||..|.:|..|
 Frog   915 PGIPGLVGMHGLPGVKGIKGSPGMEGFQGMKGAKGRPAWRGLKGPSGNIGFPGVKGGNGTSGTKG 979

  Fly  1005 PVGGRGPPGAP-----------------------GLMGIKGDQGLAGAPGQQGLDGMP------- 1039
            ..|.:||||.|                       |:.|:||.:|:.|.|||.||.|:|       
 Frog   980 DRGEQGPPGDPPKMPEMMLLTKGETGDQGVSGFKGISGLKGSKGMPGPPGQLGLPGLPGLRSFEF 1044

  Fly  1040 GEKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGP- 1103
            |:||..|.||:.|..|:.|:....|..|.||.:|.||..|.:|.||.|||:|..|..  |..|. 
 Frog  1045 GDKGETGTPGVIGNQGILGEVGPDGIIGFPGFTGPRGTPGISGFPGVPGERGKYGDI--GERGDT 1107

  Fly  1104 ---PGEKGDQGRS---GIDGRDGINGEK---GEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPG 1159
               |||:|.:|:.   |:.|:.||:|||   |::|.:|:.|..||.|..|..|.||..|:.|.||
 Frog  1108 IDLPGERGTKGQGGTPGLPGQRGIHGEKGTTGDEGFRGIEGVIGEHGQTGEKGFPGQQGLVGFPG 1172

  Fly  1160 AAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDI 1224
            :.|.||..|.||::|.      ||.|||.|:.|..|::|..|..|..|.:||.||||     .||
 Frog  1173 SQGFPGLPGVPGEKGS------SGFPGLPGQHGFPGIRGIAGLDGLPGTKGINGQPG-----ADI 1226

  Fly  1225 RGDKGSQGERGYTGEKGEQGER----GLTGPAGVAGAKGDRGLQ---GPPGASGLNGIPGAKGDI 1282
            .|.||.   :|.:|.||:.||.    |:.||||..||:||:||.   |.||..|:.|:||.....
 Frog  1227 IGLKGF---KGLSGGKGQPGEASTIVGMPGPAGSKGAEGDQGLPGLIGLPGTVGVRGLPGFSNKT 1288

  Fly  1283 GPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGLVGLPGPIGPAGSK 1347
            |..|::|.|     |..||||.||..||               |||..:||:.|:.|.:|..|:.
 Frog  1289 GLLGDVGPP-----GPPGLPGFPGPIGR---------------PGLPSKPGVKGVIGDLGLQGNY 1333

  Fly  1348 GERGLAGSPGQ---PGQDGFPGAPGLKGDTGPQGFKGERGLNGFEGQK---GDKGDRGLQGPSGL 1406
            |.:|:.|..|:   ||..|.|||.||||..|.|||.|..|..|.:|.|   ||.|:.||:||.|.
 Frog  1334 GAKGIPGDEGKVGVPGLAGIPGAKGLKGSAGFQGFVGVSGYRGDQGPKGLRGDIGEYGLKGPPGQ 1398

  Fly  1407 PGLVGQ----KGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGMLPPPGPK 1467
            ||.:.:    ..:.||||.:   |.:|..|.:|..|||   |..||||.|||.|:.|:       
 Frog  1399 PGPMSEPPLITAEQGYPGAH---GIIGHQGVQGDMGPK---GLVGTPGAPGQTGKEGL------- 1450

  Fly  1468 GEPGQPGRNGPKGEPGRPGERGLIGIQGERGEKGERGLIGETGNVGRPGPKGDRGEPGERGYEGA 1532
              ||.|      |.|..||.|                  ||.|.:||.||         .|::||
 Frog  1451 --PGVP------GWPAPPGSR------------------GEQGPMGRTGP---------NGFKGA 1480

  Fly  1533 IGLIGQKGEPGAPAPAALDYLTGILITRHSQSETVPACSAGHTELWTGYSLLYVDGNDYAHNQDL 1597
            .|:.|:.|.||.|..:.   ..|.|:.:|||::..|.|..|...|||||||||.:|.:.||||||
 Frog  1481 SGVPGRAGLPGMPGRSV---NIGYLLVKHSQTDEEPMCPVGMARLWTGYSLLYFEGQEKAHNQDL 1542

  Fly  1598 GSPGSCVPRFSTLPVLSCGQNNVCNYASRNDKTFWLTTNAAIPMMPVENIEIRQYISRCVVCEAP 1662
            |..|||:.||||:|.|.|...:||.||:||||::||:|.|.:|||||...|||.|||||.|||||
 Frog  1543 GLAGSCLQRFSTMPFLYCNPGDVCYYANRNDKSYWLSTTAPLPMMPVVEEEIRPYISRCSVCEAP 1607

  Fly  1663 ANVIAVHSQTIEVPDCPNGWEGLWIGYSFLMHTAVGNGGGGQALQSPGSCLEDFRATPFIECNGA 1727
            |..||||||.:.:|.||:||..||||||||||||.|:.||||:|.|||||||||||||||||||.
 Frog  1608 AVAIAVHSQDVSIPHCPDGWRSLWIGYSFLMHTAAGDEGGGQSLSSPGSCLEDFRATPFIECNGG 1672

  Fly  1728 KGTCHFYETMTSFWMYNLESSQPFE-RPQQQTIKAGERQSHVSRCQVCMKN 1777
            :||||::....|||:..::  :||: .|...|:|||..::|:|||||||||
 Frog  1673 RGTCHYFANKYSFWLTTID--EPFQSSPPADTLKAGLIRTHISRCQVCMKN 1721

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 Collagen 104..161 CDD:189968 26/59 (44%)
Collagen 322..380 CDD:189968 27/57 (47%)
Collagen 413..465 CDD:189968 23/52 (44%)
Collagen 499..561 CDD:189968 30/71 (42%)
Collagen 574..632 CDD:189968 25/57 (44%)
Collagen 657..714 CDD:189968 26/76 (34%)
Collagen 765..824 CDD:189968 27/60 (45%)
Collagen 854..911 CDD:189968 20/56 (36%)
Collagen 884..943 CDD:189968 29/58 (50%)
Collagen 923..982 CDD:189968 29/58 (50%)
Collagen 1028..1085 CDD:189968 26/63 (41%)
Collagen 1229..1287 CDD:189968 26/64 (41%)
Collagen 1318..1376 CDD:189968 23/60 (38%)
Collagen 1399..1458 CDD:189968 27/62 (44%)
Collagen 1477..1534 CDD:189968 15/56 (27%)
C4 1555..1662 CDD:128421 63/106 (59%)
C4 1663..1777 CDD:128421 71/114 (62%)
col4a2XP_002933063.2 Collagen 95..149 CDD:396114 26/56 (46%)
Collagen 509..563 CDD:396114 21/55 (38%)
Collagen 539..602 CDD:396114 26/71 (37%)
Med15 728..>1034 CDD:312941 130/329 (40%)
Collagen 1169..1224 CDD:396114 28/65 (43%)
Collagen 1434..1490 CDD:396114 31/97 (32%)
C4 1501..1605 CDD:396133 60/103 (58%)
C4 1611..1719 CDD:396133 68/109 (62%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 1 1.000 163 1.000 Domainoid score I3912
eggNOG 00.000 Not matched by this tool.
Hieranoid 00.000 Not matched by this tool.
Homologene 1 1.000 - - H1390
Inparanoid 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D41315at33208
OrthoFinder 1 1.000 - - FOG0000443
OrthoInspector 00.000 Not matched by this tool.
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
RoundUp 1 1.030 - avgDist Average_Evolutionary_Distance R2460
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 00.000 Not matched by this tool.
TreeFam 00.000 Not matched by this tool.
65.950

Return to query results.
Submit another query.