DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Ppn and col28a1

DIOPT Version :10

Sequence 1:NP_788752.2 Gene:Ppn / 43872 FlyBaseID:FBgn0003137 Length:2898 Species:Drosophila melanogaster
Sequence 2:XP_031760315.1 Gene:col28a1 / 100488728 XenbaseID:XB-GENE-956674 Length:1142 Species:Xenopus tropicalis


Alignment Length:1181 Identity:223/1181 - (18%)
Similarity:359/1181 - (30%) Gaps:369/1181 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly  1016 SSTTSNSASSETTGLSSDGSTTDATTAASDNTDITTDGSTDESTDGSSNASTEGSTEGASEDTTI 1080
            ||.....:.:|.||:.:.....::.|.....|      .|..:...::|......:.|..:...:
 Frog   108 SSVRIEQSFNEWTGVENFKRIVNSMTYIGQGT------YTYYAIMNATNIFKAHKSAGNVKVAIL 166

  Fly  1081 STESSGSTESTDA-IASDGSTTEGSTVEDLSSSTSSDVTSDSTITDSSPSTEVSGSTDS------ 1138
            .|:.....:|.|| .|||.:...|.....:..||..   ::.||.     .::||.:.|      
 Frog   167 MTDGIDHPKSPDARQASDFARAAGINFISIGLSTQK---ANKTIL-----FKISGQSLSEPVLIL 223

  Fly  1139 ------SSSTDGSSTDASSTEASSTDVTESTDSTVSGGTSDTTESGPTEESTTEGSTESTTEGST 1197
                  ....:..::.|::.....|.:.|..::..:|...::.|.|...:...:|....:.:|..
 Frog   224 GDPNLLQEILEKLASIANTQCDKGTCICEKGEAGPAGPQGNSGERGEKGDRGAKGEPGESAKGDP 288

  Fly  1198 DSTQSTDLDSTTSDIWSTSDKDDESESSTP-------------------YSFDSEVTKSKPRKCK 1243
            ..........|      ...|.|..|...|                   |    :.....|.:..
 Frog   289 GMKGEEGPPGT------EGPKGDRGECGKPGVKGDRGPGGPPGQIGPRGY----QGISGPPGQTG 343

  Fly  1244 PKKSTCAKSEYGCCPDGKSTPKGPFDEGCPIAKTCADTKYGCCLDGVSPAKGKNNK--------- 1299
            |:.:...|.|.|  |:|   |:||  :|.|          |....|....||:..:         
 Frog   344 PRGNQGDKGEPG--PEG---PRGP--DGIP----------GVGHQGAKGEKGEEGRIGPPGPPGI 391

  Fly  1300 ---GCPKSQCAETLFGCCPDKFTAADGENDE----------GCPETTTV-------PPTTTTEET 1344
               |.|.|..:|.:.|..........|:..|          |.|..:..       ||.|..:..
 Frog   392 GEPGSPGSPGSEGMPGERGQPGEGVAGQKGEKGSEGPQGRSGLPGLSIKGDKGDIGPPGTPGQLG 456

  Fly  1345 QPETTTEIEGSGQ------DSTTSEPDTKKSCSFSEFGCCPDAETSAKG---------PDFEGC- 1393
            .|       |||.      ......|..:.....|..|.  ..||..||         |.|.|. 
 Frog   457 LP-------GSGSPGPQGLQGIRGLPGPRGPQGLSIPGI--KGETGEKGTPGPAGLTTPGFPGLK 512

  Fly  1394 ---GLASPVAKGCA-----ESENGCCPDGQTPASGPNG---------EGCSGCTRERFGC----- 1436
               ||..|  ||.:     |.:.|  ..|:...|||.|         ||..|...|:...     
 Frog   513 GNPGLPGP--KGDSGEKGNEGKEG--KKGEQGISGPQGPEGRPGIGLEGQKGDQGEKGSVGLIGT 573

  Fly  1437 --CPDSQTPAHGPNKEGCCLDTQFGCCPDNILAARGPN---NEGCECHYTPYGCCPDNKSAATGY 1496
              .|....|...|...|             ::...||:   ..|.:....|.|  |:......|.
 Frog   574 RGIPGPPGPKGEPGING-------------LIGLPGPSVVGPPGLKGDIGPQG--PEGPVGDPGQ 623

  Fly  1497 NQEGCACETTQYGCCPDKITAAKGPKHEGCPCETTQFGC-CPDGLTFAKG---------PHHHGC 1551
            :.:|...:....|  |..||   |||.||.|      |. .|.|:..|:|         |...| 
 Frog   624 SVKGEKGDVGYVG--PPGIT---GPKGEGSP------GIPGPRGIPGAQGLPGEKGTGDPGQKG- 676

  Fly  1552 HCTQTEFKCCDDEKTPAKGPNG-DGCTCVESKFGCCPDGVTKATDEKFGGCENVQ-EPPQKACGL 1614
                         :...:||.| .|           |.|:.....:...|.:.:| .|.....|:
 Frog   677 -------------EPGIRGPQGLPG-----------PRGIGSPGSKGTMGQKGIQGTPGPTGYGV 717

  Fly  1615 PKETGTCNNYSVKYYFDTSYGGCARFWYG--GCDGND---------------------NRFESEA 1656
            |...|...       :..|.|......:|  |..||:                     .|...:.
 Frog   718 PGPKGESG-------YKGSIGPKGSIGHGVPGSKGNEGIMGESGMKGTKGEIGNPGSVGRMGPKG 775

  Fly  1657 E--------------------CKDTCQDYTGKHVCLL--PKSAGPCTGFTKKWYFD--VDR---- 1693
            |                    |...|:|.....|.::  .:|.||......|.:.:  :|:    
 Frog   776 EKGELGLTREDIIRLIIEICGCGKDCKDVPLDLVFIIDSSESVGPENFDIIKQFVNRVIDKISTD 840

  Fly  1694 ---------NRCEEFQYGGCYGTNNRFDSLEQC--------QGTCAASENLPTCEQPVESGPCAG 1741
                     |...:.:.....|..:..::|.:.        :||..|:....:.|...:|.....
 Frog   841 QSASKVGIINFSHKVEVVAHIGQLSNKENLREAINRMNYLGEGTYTATAIKKSTELFQQSRGDVK 905

  Fly  1742 NFERWYYDNETDICRPFTYGGCKGNKNNYPTE------HACNYNCRQPGVLKD--------RCAL 1792
            .......|.:.|:            ::|...:      |:.|......||:..        |..:
 Frog   906 KIAIVITDGQADV------------RDNLSLDLVVREAHSVNVEMYVIGVVDTHDPNYNLFRNEM 958

  Fly  1793 PKQTGDCSE----KLAKWH-FSESEKRCVPFYYSGCGGNKNNFPTLESCEDHCPRQVAKDICEIP 1852
            .....|..|    ::|.:. .||.|.:   .:...|  .|:::|.||:...      .:....:|
 Frog   959 NLIASDPDEEHVFQIADFSTLSELENK---LFRKIC--TKDSYPWLETLSR------IESAATVP 1012

  Fly  1853 AEVGECANYVTSWYYDTQDQACRQFYYGGCGGNENRFPTEESCLA---RCDRKPEPT-------T 1907
            :.|      ||..|...::|....:      ....:.|.::..:.   :...:|.||       .
 Frog  1013 SVV------VTEPYQTDREQEVIHY------PQSTQPPAQDQKVTVSIKDIAEPRPTQKPGALEV 1065

  Fly  1908 TTPATRP--QPSRQDV-------CDEEPAPGECSTWVLKWHFDRKIGACRQFYYGNCGGNGNRFE 1963
            ..||..|  |...||:       |.|:..||.|..:|:||::|:...:|.:|:||.|.||.|||:
 Frog  1066 VKPAGAPSTQTVVQDIRQEQDARCLEDMTPGTCRDYVVKWYYDKIADSCARFWYGGCEGNRNRFD 1130

  Fly  1964 TENDCQQRCLS 1974
            ||.|||..|::
 Frog  1131 TEKDCQTICMT 1141

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
PpnNP_788752.2 TSP1 60..111 CDD:214559
ADAMTS_CR_3 116..213 CDD:437068
ADAMTS_spacer1 222..329 CDD:461796
TSP1_ADAMTS 347..396 CDD:465950
TSP1_ADAMTS 400..457 CDD:465950
TSP1_ADAMTS 465..520 CDD:465950
TSP1_ADAMTS 580..632 CDD:465950
TSP1_ADAMTS 643..693 CDD:465950
MSCRAMM_ClfA <1049..1262 CDD:468110 39/244 (16%)
Kunitz_papilin_lacunin-like 1612..1663 CDD:438681 12/93 (13%)
Kunitz_BPTI 1670..1721 CDD:425421 10/75 (13%)
Kunitz-type 1730..1780 CDD:438633 7/55 (13%)
Kunitz_BPTI 1789..1840 CDD:425421 12/55 (22%)
Kunitz_papilin_mig6-like 1849..1899 CDD:438679 7/52 (13%)
Kunitz_papilin_mig6-like 1922..1972 CDD:438679 24/49 (49%)
Kunitz_papilin_mig6-like 2001..2051 CDD:438679
Kunitz-type 2071..2121 CDD:438633
Kunitz_BPTI 2127..2178 CDD:425421
Kunitz_BPTI 2193..2245 CDD:425421
Kunitz_BPTI 2252..2303 CDD:425421
Kunitz_BPTI 2317..2372 CDD:425421
WAP 2455..2497 CDD:459672
Ig 2530..2610 CDD:472250
Ig strand B 2539..2543 CDD:409353
Ig strand C 2552..2556 CDD:409353
Ig strand E 2576..2579 CDD:409353
Ig strand F 2589..2594 CDD:409353
Ig strand G 2603..2606 CDD:409353
IG_like 2627..2703 CDD:214653
Ig strand B 2636..2640 CDD:409353
Ig strand C 2649..2653 CDD:409353
Ig strand E 2670..2674 CDD:409353
Ig strand F 2684..2689 CDD:409353
Ig strand G 2697..2700 CDD:409353
I-set 2766..2845 CDD:400151
Ig strand B 2771..2775 CDD:409353
Ig strand C 2784..2788 CDD:409353
Ig strand F 2821..2826 CDD:409353
PLAC 2851..2883 CDD:462560
col28a1XP_031760315.1 VWA 56..235 CDD:459670 25/140 (18%)
gly_rich_SclB <270..>560 CDD:468478 64/329 (19%)
gly_rich_SclB <533..>781 CDD:468478 59/307 (19%)
VWA 807..985 CDD:459670 28/189 (15%)
Kunitz_collagen_alpha1_XXVIII 1089..1139 CDD:438671 24/49 (49%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.