DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Ppn and Papln

DIOPT Version :9

Sequence 1:NP_788752.2 Gene:Ppn / 43872 FlyBaseID:FBgn0003137 Length:2898 Species:Drosophila melanogaster
Sequence 2:XP_038969231.1 Gene:Papln / 314297 RGDID:1311176 Length:1328 Species:Rattus norvegicus


Alignment Length:2897 Identity:507/2897 - (17%)
Similarity:713/2897 - (24%) Gaps:1692/2897 - (58%)


- Green bases have known domain annotations that are detailed below.


  Fly    49 TPGGEGND----PDEWTPWSSPSDCSRTCGGGVSYQTRECL--RRDDRGEAVCSGGSRRYFSCNT 107
            |||....:    .|.|..|...|.||||||||:|::.|.|.  |||  |.|.|.|.:|.:.:|:|
  Rat    69 TPGSWAQNVRRQSDTWGAWGEWSPCSRTCGGGISFRERPCYSQRRD--GGASCVGPARSHRTCHT 131

  Fly   108 QDCPEEESDFRAQQCSRFDRQQFDGVFYEWVPYTNAPNPCELNCMPKGERFYYRQREKVVDGTRC 172
            :.||:...||||:||:.||...|.|..|.|:||..|||.|||||:|||:.||::.|:.|||||.|
  Rat   132 ESCPDSVRDFRAEQCAEFDGTDFQGRRYRWLPYYAAPNKCELNCIPKGQNFYFKHRDAVVDGTPC 196

  Fly   173 NDKDLDVCVNGECMPVGCDMMLGSDAKEDKCRKCGGDGSTCKTIRNTITTKDLAPGYNDLLLLPE 237
            .....|:||.|.|..||||..|.|..:||||.:||||||:|..|..|....||:.|||.:.::|.
  Rat   197 EPGKRDICVEGVCRVVGCDHKLDSTKQEDKCLQCGGDGSSCYPITGTFDANDLSRGYNQIFIIPA 261

  Fly   238 GATNIRIEETVPSSNYLACRNHSGHYYLNGDWRIDFPRPMFFANSWWNYQRKPMGFAAPDQLTCS 302
            |||:|.|||...|.|:||.::..|.|||||.|.|:..:.:..|::...|:|...|..||::|...
  Rat   262 GATSIHIEEADASRNFLAVKSIRGEYYLNGHWTIEEAQALPVASTVLQYERGVEGDLAPERLQAR 326

  Fly   303 GPISESLFIVMLVQEKNISLDYEYSIPESLSHSQQDTHTWTHHQFNACSASCGGGSQSRKVTC-- 365
            ||.||.|.|.:::||.|..:.|||.:|   .:....:.:|::..:..|||.||||.|||.|.|  
  Rat   327 GPTSEPLVIELIIQESNPGVHYEYYLP---VNDPGRSFSWSYGSWGDCSAECGGGHQSRLVFCTI 388

  Fly   366 NNRITLAEVNPS-LCDQKSKPVEEQACGTEPC--APHWVEGEWSKCSKGCGSDGFQNRSITCERI 427
            :|     |..|. :|..:.:|...::|.|.||  ...|..|.|:.||..||. |||:||:.|...
  Rat   389 DN-----EAYPDHMCQHQPRPAHRRSCNTHPCPKTKRWKVGPWTPCSVSCGG-GFQSRSVYCVSS 447

  Fly   428 SSSGEHTVEEDAVCLKEVGNKPATKQECNRDVKNCPKYHLGPWTPCDKLCGDGKQTRKVTCFIEE 492
            ..:|.....|:..|....| ||.|.|.||  ::.|..:.:.||..|...||.|.:.|.|||    
  Rat   448 DGTGGQEAAEETQCAGLAG-KPPTTQACN--LQRCAVWSVEPWGECSVTCGAGIRKRSVTC---- 505

  Fly   493 NGHKRVLPEEDCVEEKPETEKSCLLTPCEGVDWIISQWSGCNACGQNTETRTAICGNKEGKVYPE 557
                                                                  .|:::..|:|.
  Rat   506 ------------------------------------------------------RGDEDSLVHPA 516

  Fly   558 EFCEPEVPTLSRPCKSPKCEA----QWFSSEWSKCSAPCGKGVKSRIVICGEFDGKTVTPADDDS 618
            .....:.|||:.||....|..    .|....||.||..||.|::.|.|:|      |:       
  Rat   517 ACSLKDQPTLTEPCVREACPGFRGQAWHVGSWSLCSKSCGSGIRRRQVVC------TI------- 568

  Fly   619 KCNKETKPESEQDCEGEEKVCPGEWFTGPWGKC-----SKPCGGGERVREVLCLSNGTKSVNCDE 678
                                       ||.|:|     |||.       ||              
  Rat   569 ---------------------------GPPGRCVDLQSSKPA-------EV-------------- 585

  Fly   679 EKVEPLSEKCNSEACTEDEILPLTSTDKPIEDDEEDCDEDGIELISDGLSDDEKSEDVIDLEGTA 743
                   |.||.:.|.    ||                                           
  Rat   586 -------EACNRQPCH----LP------------------------------------------- 596

  Fly   744 KTETTPEAEDLMQSDSPTPYDEFESTGTTFEGSGYDSESTTDSGISTEGSGDDEETSEASTDLSS 808
                                                                             
  Rat   597 ----------------------------------------------------------------- 596

  Fly   809 STDSGSTSSDSTSSDSSSSISSDATSEAPASSVSDSSDSTDASTETTGVSDDSTDVSSSTEASAS 873
                                     .|.|  |:.||                             
  Rat   597 -------------------------QEVP--SIQDS----------------------------- 605

  Fly   874 ESTDVSGASDSTGSTNASDSTPESSTEASSSTDDSTDSSDNSSNVSESSTEASSSSVSDSNDSSD 938
                         .|:.||....|..:.                          |.|:|:.|.. 
  Rat   606 -------------RTHPSDRRMLSGPQV--------------------------SPVADARDQQ- 630

  Fly   939 GSTDGVSSTTENSSDSTSDATSDSTASSDSTDSTSDQTTETTPESSTDSTESSTLDASSTTDASS 1003
                                                                             
  Rat   631 ----------------------------------------------------------------- 630

  Fly  1004 TSESSSESSTDGSSTTSNSASSETTGLSSDGSTTDATTAASDNTDITTDGSTDESTDGSSNASTE 1068
                                                                             
  Rat   631 ----------------------------------------------------------------- 630

  Fly  1069 GSTEGASEDTTISTESSGSTESTDAIASDGSTTEGSTVEDLSSSTSSDVTSDSTITDSSPSTEVS 1133
                                                                             
  Rat   631 ----------------------------------------------------------------- 630

  Fly  1134 GSTDSSSSTDGSSTDASSTEASSTDVTESTDSTVSGGTSDTTESGPTEESTTEGSTESTTEGSTD 1198
                                                                             
  Rat   631 ----------------------------------------------------------------- 630

  Fly  1199 STQSTDLDSTTSDIWSTSDKDDESESSTPYSFDSEVTKSKPRKCKPKKSTCAKSEYGCCPDGKST 1263
                          |:|.::                                       |..:|.
  Rat   631 --------------WATLER---------------------------------------PRAQSN 642

  Fly  1264 PKGPFDEGCPIAKTCADTKYGCCLDGVSPAKGKNNKGCPKSQCAETLFGCCPDKFTAADGENDEG 1328
            |:                                                               
  Rat   643 PR--------------------------------------------------------------- 644

  Fly  1329 CPETTTVPPTTTTEETQPETTTEIEGSGQDSTTSEPDTKKSCSFSEFGCCPDAETSAKGPDFEGC 1393
                                      .||||           ..|..|..|..:.....|...  
  Rat   645 --------------------------EGQDS-----------HLSSAGRAPTLQHPPHQPPLR-- 670

  Fly  1394 GLASPVAKGCAESENGCCPDGQTPASGPNGEGCSGCTRERFGCCPDSQTPAHGPNKEGCCLDTQF 1458
              .|..|:.|..|.:||||||.||:.||..:||                |..|.:    ||.:::
  Rat   671 --PSSGARDCRHSPHGCCPDGHTPSLGPQWQGC----------------PLAGAS----CLKSRY 713

  Fly  1459 GCCPDNILAARGPNNEGCECHYTPYGCCPDNKSAATGYNQEGCACETTQYGCCPDKITAAKGPKH 1523
            |||||.:.||:||...||...::     .||    || |:.|.....::    ..||..::....
  Rat   714 GCCPDGVSAAKGPQQAGCTRSHS-----SDN----TG-NRPGSRAVVSK----DPKIHQSQAHPS 764

  Fly  1524 EGCPCETTQFGCCPDGLTFAKGPHHHGCHCTQTEFKCCDDEKTPAKGPNGDGCTCVESKFGCCPD 1588
            |...|.::|||||.|.:.|                         |.||.|:||.           
  Rat   765 EISECRSSQFGCCYDNVAF-------------------------AAGPLGEGCV----------- 793

  Fly  1589 GVTKATDEKFGGCENVQEPPQKACGLPKETGTCNNYSVKYYFDTSYGGCARFWYGGCDGNDNRFE 1653
                       |..:...|.:  |.||...|:|.:::.::||..|.|.|.|||||||.||.|.|.
  Rat   794 -----------GQPSYAYPVR--CLLPSAQGSCGDWAARWYFVGSVGRCNRFWYGGCHGNANNFA 845

  Fly  1654 SEAECKDTCQDYTGKHVCLLPKSAGPCTGFTKKWYFDVDRNRCEEFQYGGCYGTNNRFDSLEQCQ 1718
            :|.||.|||:   |:|        ||                                       
  Rat   846 TEQECMDTCR---GQH--------GP--------------------------------------- 860

  Fly  1719 GTCAASENLPTCEQPVESGPCAGNFERWYYDNETDICRPFTYGGCKGNKNNYPTEHACNYNCRQP 1783
                        .:| |:| .||:  |.:.|           ||.:|                 |
  Rat   861 ------------RRP-EAG-AAGH--RVHMD-----------GGQRG-----------------P 881

  Fly  1784 GVLKDRCALPKQTGDCSEKLAKWHFSESEKRCVPFYYSGCGGNKNNFPTLESCEDHCPRQVAKDI 1848
            |               .::...||..|:   .||                               
  Rat   882 G---------------GQQEPDWHRIEA---TVP------------------------------- 897

  Fly  1849 CEIPAEVGECANYVTSWYYDTQDQACRQFYYGGCGGNENRFPTEESCLARCDRKPEPTTTTPATR 1913
                                                                |.|.| :.:|.:|
  Rat   898 ----------------------------------------------------RLPSP-SGSPWSR 909

  Fly  1914 PQPSRQDVCDEEPAPGECSTWVLKWHFDRKIGACRQFYYGNCGGNGNRFETENDCQQRCLSQEPP 1978
                     ::||.||:.                                               
  Rat   910 ---------EQEPGPGQA----------------------------------------------- 918

  Fly  1979 APTPPRAPAPTRQPDPAPTVAQCSQPADPGQCDKWALHWNYNETEGRCQSFYYGGCGGNDNRFAT 2043
                |..|            ||..:|..||                                   
  Rat   919 ----PHIP------------AQGKRPRVPG----------------------------------- 932

  Fly  2044 EEECSARCSVNIDIRIGADPVEHDTSKCFLAFEPGNCYNNVTRWFYNSAEGLCDEFVYTGCGGNA 2108
                                ::.|                                         
  Rat   933 --------------------LDRD----------------------------------------- 936

  Fly  2109 NNYATEEECQNECNDAQTTCALPPVRGRCSDLSRRWYFDERSGECHEFEFTGCRGNRNNFVSQSD 2173
                                |.||||                                       
  Rat   937 --------------------ARPPVR--------------------------------------- 942

  Fly  2174 CLNFCIGEPVVEPSAPTYSVCAEPPEAGECDNRTTAWFYDSENMACTAFTYTGCGGNGNRFETRD 2238
                           ||:|                                              
  Rat   943 ---------------PTHS---------------------------------------------- 946

  Fly  2239 QCERQCGEFKGVDVCNEPVTTGPCTDWQTKYYFNTASQACEPFTYGGCDGTGNRFSDLFECQTVC 2303
                                        ..|..|                               
  Rat   947 ----------------------------PSYRIN------------------------------- 952

  Fly  2304 LAGREPRVGSAKEICLLPVATGRCNGPSVHERRWYYDDEAGNCVSFIYAGCSGNQNNFRSFEACT 2368
            ||..||        .|:..|.|:.                      :...|.||           
  Rat   953 LADSEP--------SLVQAAPGQA----------------------VQLFCPGN----------- 976

  Fly  2369 NQCRPEPNKQDNEIGQNPCDTFDAECQELRCPYGVRRVAARSQPECTQCICENPCEGYSCPEGQQ 2433
              ..||               |.|..|:                                 ||: 
  Rat   977 --IPPE---------------FQAGWQK---------------------------------EGR- 990

  Fly  2434 CAIDVASSDDRQFAPVCRDIYKPGECPALSANASGCARECYTDADCRGDNKCCSDGCGQLCVHPA 2498
                          |:..:.|:                   ..||            |.|.:.|.
  Rat   991 --------------PISSNRYQ-------------------LQAD------------GSLIISPL 1010

  Fly  2499 RPTQPPRTQAPVVSYPGDARAALEPKEAHELDVQTAIGGIAVL-----RCF-ATGNPAPNITWSL 2557
            ||     ..|.:.|. |..|:..||:   ::.::...|.:|||     |.| .|.||.|..:...
  Rat  1011 RP-----EDAGIYSC-GSQRSGHEPQ---KIQLRVTGGDMAVLSEVQPRHFPETRNPDPGHSPQN 1066

  Fly  2558 KNLVINTNKGRYVLTANGDLTIVQVRQTDDGTYVCVASNGLGEPVRREVALQVTEPVSQPAYIYG 2622
            :.:.:.. :|..||:::......::|.  |.|                          ||..|  
  Rat  1067 RGIGVGA-EGHRVLSSSHPRPATRLRL--DRT--------------------------QPGVI-- 1100

  Fly  2623 DKNVTQIVELNRPAVIRCPAGGFPEPHVSWWRNGQMFGLKNNLMARDYSLVFNSIQLSDLGLYTC 2687
            |.:..|.::|.      |.|.|||.|.:.|.|:||:.....:.:..|.|||.:.:.:.|.|.|||
  Rat  1101 DASPGQQIQLT------CHAEGFPVPTIEWQRDGQLVSSPRHQVQHDGSLVISRVTVEDGGFYTC 1159

  Fly  2688 EVYN----QRRPVSLRVTLKAVGPVRPLSPEEEQYMQYVLNPATRPVTQRPSYPYRPTRPAYVPE 2748
            ..:|    .:|.|.|||                          .|.:|                 
  Rat  1160 VAFNGHDRDQRWVQLRV--------------------------QRELT----------------- 1181

  Fly  2749 PTVNVHAVLALEPKNSYTPGSTIVMSCSVQGYPEPNVTWIKDDVPLY-NNERVQITYQPHR---- 2808
                   :..|.|..:...|.|..:.|.|.| ...|:.|.::.:|:. :..||      |:    
  Rat  1182 -------ITGLPPAMTVPEGDTARLLCVVAG-DSVNIRWSRNGLPIQADGHRV------HQSPDG 1232

  Fly  2809 -LVLSDVTSADSGKYTC---RASNAYTYANGEANVSIQSVVPVSP---------ECVDNPYFANC 2860
             |::.::...|.|.|||   |.|.|.:     .:..::..:|.:|         :|:|.|..|||
  Rat  1233 TLLIHNLRPRDEGSYTCSAFRGSQAVS-----RSTEVKVALPAAPAAQSRDLGKDCIDQPELANC 1292

  Fly  2861 KLIVKGRYCSNPYYTQFCCRSCTLAGQVASPPLHPNA 2897
            .||::.:.|.|.||:.|||.||:        ...|||
  Rat  1293 ALILQAQLCGNEYYSSFCCASCS--------RFQPNA 1321

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
PpnNP_788752.2 TSP1 60..111 CDD:214559 24/52 (46%)
ADAM_spacer1 214..329 CDD:283607 46/114 (40%)
TSP1 468..521 CDD:214559 10/52 (19%)
TSP_1 645..693 CDD:278517 12/52 (23%)
Kunitz_BPTI 1611..1663 CDD:278443 26/51 (51%)
Kunitz_BPTI 1670..1721 CDD:278443 2/50 (4%)
Kunitz_BPTI 1729..1781 CDD:278443 10/51 (20%)
Kunitz_BPTI 1789..1840 CDD:278443 5/50 (10%)
Kunitz_BPTI 1848..1899 CDD:278443 0/50 (0%)
KU 1920..1973 CDD:238057 4/52 (8%)
Kunitz_BPTI 2001..2051 CDD:278443 3/49 (6%)
KU 2071..2121 CDD:238057 0/49 (0%)
Kunitz_BPTI 2127..2178 CDD:278443 5/50 (10%)
KU 2192..2245 CDD:238057 1/52 (2%)
KU 2251..2304 CDD:238057 2/52 (4%)
KU 2316..2372 CDD:238057 6/55 (11%)
WAP 2457..2497 CDD:278522 4/39 (10%)
Ig 2521..2610 CDD:299845 18/94 (19%)
IG_like 2530..2610 CDD:214653 16/85 (19%)
IG_like 2627..2703 CDD:214653 27/79 (34%)
Ig 2636..2701 CDD:143165 24/68 (35%)
IG_like 2766..2841 CDD:214653 20/83 (24%)
Ig 2768..2830 CDD:299845 19/70 (27%)
PLAC 2851..2883 CDD:285849 16/31 (52%)
PaplnXP_038969231.1 TSP1 84..135 CDD:214559 24/52 (46%)
ADAM_spacer1 238..353 CDD:368694 46/114 (40%)
TSP1_ADAMTS 363..415 CDD:408800 20/56 (36%)
TSP1_ADAMTS 421..479 CDD:408800 23/61 (38%)
TSP1_ADAMTS 482..535 CDD:408800 18/110 (16%)
TSP1_ADAMTS 543..593 CDD:408800 24/117 (21%)
Kunitz_BPTI 803..855 CDD:394972 26/53 (49%)
Papilin_u7 863..947 CDD:374683 42/533 (8%)
Ig 957..>1020 CDD:416386 25/204 (12%)
Ig strand A' 960..964 CDD:409353 1/3 (33%)
Ig strand B 968..974 CDD:409353 0/27 (0%)
Ig strand C 982..987 CDD:409353 1/4 (25%)
Ig strand C' 989..992 CDD:409353 1/17 (6%)
Ig strand D 996..1000 CDD:409353 1/22 (5%)
Ig strand E 1003..1009 CDD:409353 2/5 (40%)
IGc2 1104..1165 CDD:197706 22/66 (33%)
Ig strand B 1108..1112 CDD:409353 1/9 (11%)
Ig strand C 1121..1125 CDD:409353 0/3 (0%)
Ig strand E 1142..1146 CDD:409353 2/3 (67%)
Ig strand F 1156..1161 CDD:409353 3/4 (75%)
IG 1186..1266 CDD:214652 21/91 (23%)
PLAC 1283..1315 CDD:400844 16/31 (52%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 1 1.000 135 1.000 Domainoid score I4858
eggNOG 00.000 Not matched by this tool.
Hieranoid 00.000 Not matched by this tool.
Homologene 1 1.000 - - H71541
Inparanoid 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D36783at33208
OrthoFinder 1 1.000 - - FOG0005760
OrthoInspector 1 1.000 - - oto98337
orthoMCL 1 0.900 - - OOG6_106772
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
SonicParanoid 1 1.000 - - X4788
SwiftOrtho 1 1.000 - -
TreeFam 00.000 Not matched by this tool.
98.820

Return to query results.
Submit another query.