DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG10211 and pxn-1

DIOPT Version :10

Sequence 1:NP_609883.1 Gene:CG10211 / 35106 FlyBaseID:FBgn0032685 Length:1394 Species:Drosophila melanogaster
Sequence 2:NP_505188.3 Gene:pxn-1 / 191484 WormBaseID:WBGene00004256 Length:1285 Species:Caenorhabditis elegans


Alignment Length:826 Identity:267/826 - (32%)
Similarity:387/826 - (46%) Gaps:130/826 - (15%)


- Green bases have known domain annotations that are detailed below.


  Fly   581 KEAIAELSPELIEAAVERAKQELE---ERKRFEYEVWRTKGGISARSPDGTAASFSKANLAALNL 642
            |.|..::..||:.|..::|:|.:|   |:.|.:.    |:..::..:.......||... .|:.|
 Worm   503 KAAAPQIDEELLRAIAQKARQNVENAVEKTRKQL----TQDKVTNTNDLKRLFRFSTPK-QAVEL 562

  Fly   643 ANSSLIFELTSNEIVKTL-NHITRRKRQIFNPNQNAFNRNELTDTLQTVDISGLLG------GAQ 700
            :.:..|:|    |.|:.: .|:  .|..|.|.::...........|....:..|:|      |..
 Worm   563 SKAREIYE----ESVRLVREHV--EKGLILNVDELHPKNVSYESVLHVTHVQALMGLSGCHTGQY 621

  Fly   701 KQ--LDTCPEPSQQCDANSPFRTLSGRCNNLRNPNWGKSLTTFSRLLPAQYEDGISAP------R 757
            |.  .|||        .:..:|:..|:|||...|..|.||....|||...||:|.:.|      |
 Worm   622 KNPCTDTC--------FHHRYRSFDGQCNNKNKPMTGVSLMPLRRLLKPVYENGFNTPVGWEKGR 678

  Fly   758 LTGVTGTALPNPRTIS------TTIHPDISNLHTRYSLMVMQFAQFVDHDL--TLTPIHKGFHES 814
            |  ..|..|||.|.:|      ..|.|     |::.|.||||:.|||||||  |:|.:.:..:.:
 Worm   679 L--YNGYPLPNVREVSRQLVATENITP-----HSKLSSMVMQWGQFVDHDLTHTVTALSRHSYAT 736

  Fly   815 IPSC-RPCNSRQTVHPECNPFPVPAGDFYYPEVNVTSGERFCFPSMRSL----PGQQSL-----G 869
            ...| |.|   :.:.| |...|:...|   |.|...|.:..|....||.    .|:.||     .
 Worm   737 GAFCNRTC---ENLDP-CFNIPLSPND---PRVKSGSAKYPCIEFERSAAVCGSGETSLVFNRVT 794

  Fly   870 PRDQINQNTHFLDGSMVYGETTCLSNKLRGFSG-----RMNSTQVRGKELLPL--GPHPECK--- 924
            .|:|:|..|.|||.|.|||.....:.:||....     |.:.|...|||.||.  ..:.:|:   
 Worm   795 YREQMNALTSFLDASNVYGSNEVQAQELRDTYNNNGMLRFDITSEAGKEYLPFEKDSNMDCRRNF 859

  Fly   925 -SRNGL-CFLGGDDRASEQPGLTAIHTAFLREHNRIVEGLRGVNPHWNGEQLFHHARKIVSAQVQ 987
             ..|.: |||.||.||:||..|.|.||.|:||||||.:.|:.:|.:|:||.::|..||||.|.:|
 Worm   860 SEENPIRCFLAGDLRANEQLALAATHTIFIREHNRIAKKLKSMNGNWDGEIIYHETRKIVGAMMQ 924

  Fly   988 HIVFNEFLPRILSWNA-VNLYGLKLLPQGYYKDYNPSCSPIVFNEFAAAAFRIGHSLLRPHIPRL 1051
            ||.:..::|.|....| :|.:      .|.|:.|:|.....|.|.||.||||.||:::.|.:.||
 Worm   925 HITYKHWMPIIFGGQAQMNKF------VGTYQGYDPDVDASVTNAFATAAFRFGHTIINPSLFRL 983

  Fly  1052 SVQHQPV-EPPLLLRDGFFRMDALLQPGIIDEILRGLVATPME--TLDQFITGEVTNHLF-EDRK 1112
            .....|: |..:.|...||..:.:|..|.:|.:||||.|:|::  ...|.:..|:...|| :..:
 Worm   984 GNDFMPIKEGHIALHKAFFTPELVLTQGGVDPLLRGLFASPLKHPMPTQLLNMELIEKLFMKGHE 1048

  Fly  1113 IPFSGIDLIALNIQRARDHGIPSYNNYRALCNLKRATNWNDLSREIPTE-VINRFQKIYASVDDI 1176
            :   .:||..:||||:||||:|||..||..|||.....|.|:...|..: :|.:.:.:|....:|
 Worm  1049 V---SLDLAVMNIQRSRDHGLPSYTEYRKFCNLPVPVQWEDMKGYIKDDMIIQKLRGLYGVPQNI 1110

  Fly  1177 DLFPGAMTERPLQGGLVGPTLACIIGIQFRQLRKCDRFWYENQNPEVKFTEAQLAEVRKVTLAKI 1241
            ||:.|.:.|..|:.||.|||.|||||.|||::|..||||||...   .||..||.|::|:|||::
 Worm  1111 DLWVGGIVEEKLENGLFGPTFACIIGEQFRKIRDGDRFWYEKDG---VFTPEQLREIKKITLARL 1172

  Fly  1242 VCENLEITGDMQRAAFDLPSNFLNPRVPCASMPQIDLNAWR---ENV------------------ 1285
            .|:|.:....:|:..|..|.........|.....::|.||.   :||                  
 Worm  1173 FCDNGDNIDRIQKDVFMYPGMDKENYGTCQETEMMNLRAWSKCCDNVCPTMLDRILRSRHRGSRL 1237

  Fly  1286 QGCQIGNRN-VRVGESAFPSP----CTSCVCSAEGAQCASLRITDC 1326
            .||   |:| :...|.|...|    ||.|||......|::..  ||
 Worm  1238 HGC---NQNGIWRPEGAKWIPQNEICTECVCQGSRVWCSTKE--DC 1278

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG10211NP_609883.1 An_peroxidase 69..537 CDD:460804
An_peroxidase 719..1257 CDD:460804 213/579 (37%)
pxn-1NP_505188.3 LRR <35..>174 CDD:443914
leucine-rich repeat 35..53 CDD:275380
leucine-rich repeat 54..77 CDD:275380
leucine-rich repeat 78..101 CDD:275380
leucine-rich repeat 102..117 CDD:275380
leucine-rich repeat 125..148 CDD:275380
leucine-rich repeat 149..172 CDD:275380
PCC 153..>267 CDD:188093
I-set 315..400 CDD:400151
Ig strand B 332..336 CDD:409353
Ig strand C 345..349 CDD:409353
Ig strand E 368..372 CDD:409353
Ig strand F 382..387 CDD:409353
Ig strand G 395..398 CDD:409353
IG_like 414..496 CDD:214653
Ig strand B 425..429 CDD:409353
Ig strand C 438..442 CDD:409353
Ig strand E 461..466 CDD:409353
Ig strand F 476..481 CDD:409353
Ig strand G 489..492 CDD:409353
peroxidasin_like 757..1203 CDD:188658 169/460 (37%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.