DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Ppn and Col28a1

DIOPT Version :10

Sequence 1:NP_788752.2 Gene:Ppn / 43872 FlyBaseID:FBgn0003137 Length:2898 Species:Drosophila melanogaster
Sequence 2:NP_001418648.1 Gene:Col28a1 / 312115 RGDID:1564680 Length:1141 Species:Rattus norvegicus


Alignment Length:1260 Identity:245/1260 - (19%)
Similarity:355/1260 - (28%) Gaps:406/1260 - (32%)


- Green bases have known domain annotations that are detailed below.


  Fly   950 NSSDSTSDATSDSTASSDSTDSTSDQTTETTPESS---------TDSTESSTLDA--SSTTDASS 1003
            :||:|:.....|:  ..|..||.|::..:.||..|         ...:.|..:|.  ||..|..:
  Rat    54 DSSESSKIVLFDN--QKDFVDSLSEKLFQLTPGHSLRYDIKLAALQFSSSVQIDPPFSSWKDLRT 116

  Fly  1004 TSESSSESSTDGSSTTSNSASSETTGLSSDGSTTDATTAASDNTDITTDG-STDESTDGSSNAST 1067
            ..:.....:..|..|.|..|.|..|.|.......|....|.    :.||| ...:|.|       
  Rat   117 FKQRVKSLNLIGQGTFSYYAISNATRLLKKEGRKDGVKVAL----LMTDGIDHPKSPD------- 170

  Fly  1068 EGSTEGASEDTTISTESSGSTESTDAIASDGSTTEGSTVEDLSSSTSSD----VTSDSTITDSSP 1128
               .:..|||..|    ||.:..|..:    ||........|.|..||:    :.||.|:.|   
  Rat   171 ---VQSISEDARI----SGISFITVGL----STVVNEAKLRLISGDSSNEPVLLLSDPTLVD--- 221

  Fly  1129 STEVSGSTDSSSSTDGSSTDASSTEASSTDVTESTDSTVSGGTSDTTESGPTEESTTEGSTESTT 1193
              ::.|..|....   ...:....|....|..:.......|......|.||      :|:.....
  Rat   222 --KIRGRLDVLFE---RKCEHKICECEKGDPGDPGPPGTHGNPGIKGERGP------KGNPGDAQ 275

  Fly  1194 EGSTDSTQSTDLDSTTSDIWSTSD------KDDESESST--------------------PYSFDS 1232
            :|.|.......:.....|.....:      |.|:..:..                    |..|..
  Rat   276 KGETGERGPVGIPGYKGDKGERGECGKPGMKGDKGPAGPYGPKGPRGIQGIGGPPGDPGPKGFQG 340

  Fly  1233 EVTKSKPRKCKP--------------------KKSTCAKSEYGCCPDGKSTPKGPFDEGCPIAKT 1277
              .|.:|....|                    :..|.|....|....|:..|:||  ||.|    
  Rat   341 --NKGEPGPPGPYGPPGAPGIGQQGVKGERGQEGRTGAPGPIGIGEPGQPGPRGP--EGAP---- 397

  Fly  1278 CADTKYGCCLDGVSPAKG------------------KNNKG-----CPK---------SQCAETL 1310
               .:.|...:||...||                  |.:||     .|:         ||..:.:
  Rat   398 ---GERGLPGEGVPGPKGEKGSEGPTGPQGLQGLSIKGDKGDLGPVGPQGPAGIPGIGSQGEQGI 459

  Fly  1311 FGCCPDKFTAADGENDEGCP----ETTTVPPT------------------TTTEETQPETTTEIE 1353
            .|  |.......|...:|.|    |...:.||                  |.....||....|..
  Rat   460 QG--PTGPPGPQGPPGQGSPGPKGEVGRMGPTGPRGPMGIGIQGPKGEPGTVGLPGQPGVPGEDG 522

  Fly  1354 GSGQDSTTSEPDT-------------------KKSCSFSE--------------------FGCCP 1379
            .||:......|.|                   ||....::                    ||...
  Rat   523 ASGKKGEAGLPGTRGPEGMPGKGQPGPKGDEGKKGSKGNQGQRGFPGPEGPKGEPGIMGPFGMPG 587

  Fly  1380 DAETSAKGPDFEGCGLASPVAKGCAESENGCCPDGQTPASGPNGE-GCSGCTRERFGCCPDSQTP 1443
            .:.....||..:..|...|..||    |.|....|...|.||.|. |..|...:.:   |....|
  Rat   588 ASIPGPSGPKGDRGGPGMPGLKG----EPGLSVRGPKGAQGPRGPVGAPGLKGDGY---PGVAGP 645

  Fly  1444 AHGPNKEGCCLDTQFGCCPDNILAARGPNNEGCECHYTPYGCCPDNKSAATGYNQEGCACETTQ- 1507
            ...|.             |...:..||..:.|.:......|  |...|...|...:|...:|.| 
  Rat   646 RGLPG-------------PPGPMGLRGVGDTGAKGEPGVRG--PPGPSGPRGIGIQGPKGDTGQK 695

  Fly  1508 ----------YGCCPDKITAAKGPKHEGCPCETTQFGCCPDGLTFAKGPH-HHGCHCTQTEFKCC 1561
                      ||  ...|...:||  :|.|......|.   ||...||.| ..|....:.|....
  Rat   696 GLPGPPGPPGYG--SQGIKGEQGP--QGFPGSKGTVGL---GLPGQKGEHGERGDVGRKGEKGDI 753

  Fly  1562 DDEKTPAK----GPNGDGCTCVESKFGCCPDGVTKATDEKFGGCENVQEPPQKACGL--PKETGT 1620
            .:..:|.|    ||.||        .|...:.:.|...|..|.....:|.|.:...:  ..|:..
  Rat   754 GEPGSPGKQGLQGPKGD--------LGLTKEEIIKLIIEICGCGPKCKETPLELVFVIDSSESVG 810

  Fly  1621 CNNYSVKYYFDTSYG-------GCARFWYGGCDGNDNRFESEAECKDTCQDYTGKHVCLLPKSAG 1678
            ..|:.:...|..:..       |.||.   |.....::.|..|..|.                  
  Rat   811 PENFQIIQNFVKTLADRVALDLGTARI---GIINYSHKVEKVASLKQ------------------ 854

  Fly  1679 PCTGFTKKWYFDVDRNRCEEFQYGGCYGTNNRFDSLEQCQGTCAASENLPTCEQPVESGPCAGNF 1743
                |:.|   |..:...:..||.|              :||..|:......:...|:.|.....
  Rat   855 ----FSSK---DDFKLVVDNMQYLG--------------EGTYTATALQAANDMFKEARPGVKKV 898

  Fly  1744 ERWYYDNETDI-----------------CRPFTYGGCKGNKNNY------------PTEHACNYN 1779
            .....|.:||.                 ...|..|..|.:..|:            .:||.  |.
  Rat   899 ALVITDGQTDSRDKRKLADVVKDANDSKVEIFVIGVVKKDDPNFEIFHKEMNLIATDSEHV--YQ 961

  Fly  1780 CRQPGVLKDRCALPKQTGDCSEKLAKWHFSESEKRCVPF--YYSGCGGNKNNFPTLESCEDHCPR 1842
            ......|:|         ...:||:|       |.|..|  |.....|:.:..|.....|    |
  Rat   962 FDDFFTLQD---------TLKQKLSK-------KICEDFDSYLIQVFGSSSFQPEFGVSE----R 1006

  Fly  1843 QVAKDICEIPAEVGECANYVTSWYYDTQDQACRQFYYGGCGGNENRFP---TEESCLARCDRKPE 1904
            :|.....:...|:.:..|.                   ..|.||...|   ||...|| ....||
  Rat  1007 EVNISTPKPTKEISKLFNI-------------------SRGQNEETEPSVLTEAGNLA-IPTPPE 1051

  Fly  1905 PTTTTPATRPQPSRQDV-------------------CDEEPAPGECSTWVLKWHFDRKIGACRQF 1950
            .|.|.......|.|.:.                   |:|...||.|..:|::|::|:::.:|.:|
  Rat  1052 ATNTLKPLLSSPERVEARTPNPNLLQSEKSLYKDPRCEEALKPGNCGDYVVRWYYDKQVNSCARF 1116

  Fly  1951 YYGNCGGNGNRFETENDCQQRCLSQ 1975
            ::..|.|:||||.:|.:||:.|:.|
  Rat  1117 WFSGCNGSGNRFHSEKECQETCIKQ 1141

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
PpnNP_788752.2 TSP1 60..111 CDD:214559
ADAMTS_CR_3 116..213 CDD:437068
ADAMTS_spacer1 222..329 CDD:461796
TSP1_ADAMTS 347..396 CDD:465950
TSP1_ADAMTS 400..457 CDD:465950
TSP1_ADAMTS 465..520 CDD:465950
TSP1_ADAMTS 580..632 CDD:465950
TSP1_ADAMTS 643..693 CDD:465950
MSCRAMM_ClfA <1049..1262 CDD:468110 45/263 (17%)
Kunitz_papilin_lacunin-like 1612..1663 CDD:438681 10/59 (17%)
Kunitz_BPTI 1670..1721 CDD:425421 7/50 (14%)
Kunitz-type 1730..1780 CDD:438633 12/78 (15%)
Kunitz_BPTI 1789..1840 CDD:425421 10/52 (19%)
Kunitz_papilin_mig6-like 1849..1899 CDD:438679 10/52 (19%)
Kunitz_papilin_mig6-like 1922..1972 CDD:438679 19/49 (39%)
Kunitz_papilin_mig6-like 2001..2051 CDD:438679
Kunitz-type 2071..2121 CDD:438633
Kunitz_BPTI 2127..2178 CDD:425421
Kunitz_BPTI 2193..2245 CDD:425421
Kunitz_BPTI 2252..2303 CDD:425421
Kunitz_BPTI 2317..2372 CDD:425421
WAP 2455..2497 CDD:459672
Ig 2530..2610 CDD:472250
Ig strand B 2539..2543 CDD:409353
Ig strand C 2552..2556 CDD:409353
Ig strand E 2576..2579 CDD:409353
Ig strand F 2589..2594 CDD:409353
Ig strand G 2603..2606 CDD:409353
IG_like 2627..2703 CDD:214653
Ig strand B 2636..2640 CDD:409353
Ig strand C 2649..2653 CDD:409353
Ig strand E 2670..2674 CDD:409353
Ig strand F 2684..2689 CDD:409353
Ig strand G 2697..2700 CDD:409353
I-set 2766..2845 CDD:400151
Ig strand B 2771..2775 CDD:409353
Ig strand C 2784..2788 CDD:409353
Ig strand F 2821..2826 CDD:409353
PLAC 2851..2883 CDD:462560
Col28a1NP_001418648.1 vWFA_subfamily_ECM 47..212 CDD:238727 43/181 (24%)
gly_rich_SclB <257..446 CDD:468478 34/205 (17%)
gly_rich_SclB <363..632 CDD:468478 55/283 (19%)
Collagen 713..772 CDD:460189 18/71 (25%)
VWA 798..974 CDD:459670 33/228 (14%)
Kunitz-type 1088..1138 CDD:444694 19/49 (39%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.