DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Pen and Kpna1

DIOPT Version :10

Sequence 1:NP_477041.1 Gene:Pen / 34338 FlyBaseID:FBgn0287720 Length:522 Species:Drosophila melanogaster
Sequence 2:NP_942021.2 Gene:Kpna1 / 288064 RGDID:735064 Length:538 Species:Rattus norvegicus


Alignment Length:532 Identity:236/532 - (44%)
Similarity:326/532 - (61%) Gaps:21/532 - (3%)


- Green bases have known domain annotations that are detailed below.


  Fly     7 NSRQGSYKANSINTQDSRMRRHEVTIELRKSKKEDQMFKRRNI------NDEDLTSP----LKEL 61
            |.|..|||..|:|..:.|.||.|..::|||.|:|:|:|||||:      .:|::.|.    ..::
  Rat     8 NFRLKSYKNKSLNPDEMRRRREEEGLQLRKQKREEQLFKRRNVATAEEETEEEVMSDGGFHEAQI 72

  Fly    62 NG--QSPVQLSVDEIVAAMNSEDQERQFLGMQSARKMLSRERNPPIDLMIG-HGIVPICIRFLQN 123
            |.  .:|..:...::...:.|...|:|....|..||:||:|.|||||.:|. .|:|...:.||:.
  Rat    73 NNMEMAPGGVITSDMTEMIFSNSPEQQLSATQKFRKLLSKEPNPPIDEVINTPGVVARFVEFLKR 137

  Fly   124 TNNSMLQFEAAWALTNIASGTSDQTRCVIEHNAVPHFVALLQSKSMNLAEQAVWALGNIAGDGAA 188
            ..|..||||:||.|||||||.|.|||.||:..|||.|:.||.|:..::.|||||||||||||...
  Rat   138 KENCTLQFESAWVLTNIASGNSLQTRIVIQAGAVPIFIELLSSEFEDVQEQAVWALGNIAGDSTM 202

  Fly   189 ARDIVIHHNVIDGILPLINNETPLSFLRNIVWLMSNLCRNKNPSPPFDQVKRLLPVLSQLLLSQD 253
            .||.|:..|::..:|.|.:.:..|:..||.||.:|||||.|:|.|.|.:|...|.|||.||...|
  Rat   203 CRDYVLDCNILPPLLQLFSKQNRLTMTRNAVWALSNLCRGKSPPPEFAKVSPCLNVLSWLLFVSD 267

  Fly   254 IQVLADACWALSYVTDDDNTKIQAVVDSDAVPRLVKLLQMDEPSIIVPALRSVGNIVTGTDQQTD 318
            ..|||||||||||::|..|.|||||:|:....|||:||..::..::.||||:|||||||.|.||.
  Rat   268 TDVLADACWALSYLSDGPNDKIQAVIDAGVCRRLVELLMHNDYKVVSPALRAVGNIVTGDDIQTQ 332

  Fly   319 VVIASGGLPRLGLLLQHNKSNIVKEAAWTVSNITAGNQKQIQAVIQAGIFQQLRTVLEKGDFKAQ 383
            |::....|..|..||...|.:|.|||.||:|||||||:.|||.||.|.:|..|.::|:..:|:.:
  Rat   333 VILNCSALQSLLHLLSSPKESIKKEACWTISNITAGNRAQIQTVIDANMFPALISILQTAEFRTR 397

  Fly   384 KEAAWAVTNTTTSGTPEQIVDLIEKYKILKPFIDLLDTKDPRTIKVVQTGLSNLFALAEKL---- 444
            ||||||:||.|:.|:.|||..|:| ...:||..|||...|.:.::|...||.|:..|.|:.    
  Rat   398 KEAAWAITNATSGGSAEQIKYLVE-LGCIKPLCDLLTVMDAKIVQVALNGLENILRLGEQEAKRN 461

  Fly   445 -GGTENLCLMVEEMGGLDKLETLQQHENEEVYKKAYAIIDTYFSNGDDEAEQELAPQEVNGALEF 508
             .|....|.::||..||||:|.||.|||:|:|:||:.:|:.||  |.::.:..:|||......::
  Rat   462 GSGINPYCALIEEAYGLDKIEFLQSHENQEIYQKAFDLIEHYF--GTEDEDSSIAPQVDLSQQQY 524

  Fly   509 NATQPKAPEGGY 520
            ...|.:||..|:
  Rat   525 IFQQCEAPMEGF 536

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
PenNP_477041.1 SRP1 2..517 CDD:227396 234/527 (44%)
HEAT repeat 70..102 CDD:293787 9/31 (29%)
armadillo repeat 108..140 CDD:293788 14/32 (44%)
armadillo repeat 148..184 CDD:293788 21/35 (60%)
armadillo repeat 190..225 CDD:293788 12/34 (35%)
armadillo repeat 359..396 CDD:293788 18/36 (50%)
armadillo repeat 402..437 CDD:293788 12/34 (35%)
Kpna1NP_942021.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..41 15/32 (47%)
SRP1 33..516 CDD:227396 219/485 (45%)
Nuclear localization signal. /evidence=ECO:0000250|UniProtKB:P52293 42..51 6/8 (75%)
ARM 1, truncated 77..117 11/39 (28%)
ARM 2. /evidence=ECO:0000305 118..161 22/42 (52%)
armadillo repeat 128..154 CDD:293788 12/25 (48%)
NLS binding site (major). /evidence=ECO:0000250|UniProtKB:P52293 149..241 48/91 (53%)
armadillo repeat 162..198 CDD:293788 21/35 (60%)
armadillo repeat 204..239 CDD:293788 12/34 (35%)
ARM 4. /evidence=ECO:0000305 207..245 15/37 (41%)
Binding to RAG1. /evidence=ECO:0000250|UniProtKB:P52294 245..437 98/192 (51%)
armadillo repeat 256..281 CDD:293788 17/24 (71%)
armadillo repeat 289..325 CDD:293788 18/35 (51%)
NLS binding site (minor). /evidence=ECO:0000250|UniProtKB:P52293 318..406 46/87 (53%)
armadillo repeat 373..409 CDD:293788 17/35 (49%)
ARM 9. /evidence=ECO:0000305 413..457 17/44 (39%)
armadillo repeat 416..450 CDD:293788 12/34 (35%)
ARM 10, atypical 460..504 19/43 (44%)

Return to query results.
Submit another query.