DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment sbr and Nxf1

DIOPT Version :10

Sequence 1:NP_524660.1 Gene:sbr / 43944 FlyBaseID:FBgn0003321 Length:672 Species:Drosophila melanogaster
Sequence 2:NP_067590.2 Gene:Nxf1 / 59087 RGDID:62014 Length:618 Species:Rattus norvegicus


Alignment Length:695 Identity:213/695 - (30%)
Similarity:330/695 - (47%) Gaps:123/695 - (17%)


- Green bases have known domain annotations that are detailed below.


  Fly     3 KRGGGSSQ----RYNNNVGNGGGRYNA---PEDFDDFDVEDRQRRKDRNKRRVSFKPSQCLHNKK 60
            |:|.|..:    ..|...|.||....:   .||..|..:.|.|     :..||.:.|.....|: 
  Rat    22 KKGRGPFRWKCGEGNRRSGRGGSGIRSSRFEEDDGDVAMNDPQ-----DGPRVRYNPYTSRPNR- 80

  Fly    61 DIKLRPEDLRR--WDEDDDMSDMTTAVKDRPTSRRRGSPIPRGKFGKLMPNSF-GWYQVTLQNAQ 122
                     ||  |.:.|         :...|.||..:|..||..|.....:. .|:::|:...:
  Rat    81 ---------RRDTWHDRD---------RIHVTVRRDRAPQERGGAGTSQDGTTKNWFKITIPYGK 127

  Fly   123 IYEKETLLSALLAAMS-PHVFIPQYWRVERNCVIFFTDDYEAAERIQHLGKNGHLPDGYRLMPRV 186
            .|:|..|||.:.:..| |  |.|..:..|.....||.:|...|..::.:.......:..|:...:
  Rat   128 KYDKMWLLSMIQSKCSVP--FNPIEFHYENTRAQFFVEDATTASALKAVNYKIQDRENRRISIII 190

  Fly   187 RSGIPLVAIDDAFK----EKMKVTMAKRYNIQTKALDLSRFHADPDL--KQVFCPLFRQNVMGAA 245
            .|..|...:.:..|    |::|:.|:|||:...:||||....:||||  :.:...|.|:..|.||
  Rat   191 NSSAPPYIVQNELKPEQVEQLKLIMSKRYDGSQQALDLKGLRSDPDLVAQNIDVVLNRRGCMAAA 255

  Fly   246 IDIMCDNIPDLEALNLNDNSISSMEAFKGVEKRLPNLKILYLGDNKIPSLAHLVVLRNLSILELV 310
            :.|:.:|||:|.:|||::|.:..::....:.::.||||||.|..|::.|...|..::.|.:.||.
  Rat   256 LRIIEENIPELLSLNLSNNRLYKLDDMSSIVQKAPNLKILNLSGNELKSEWELDKIKGLKLEELW 320

  Fly   311 LKNNPCRSRYKDSQQFISEVRRKFPKLVKLDGETLEPQITFDLSEQGRLLETKASYLCDVAGAE- 374
            |..||....:.|...:||.:|.:||||::|||..|.|.|.||:.....|...|.||.    |.| 
  Rat   321 LDRNPMCDTFLDQSTYISTIRERFPKLLRLDGHELPPPIAFDVEAPTMLPPCKGSYF----GTEN 381

  Fly   375 ---VVRQFLDQYFRIFDSGNRQALLDAYHEKAMLSISMPSASQ---AGRLNSFWKFNRNLRRLLN 433
               :|..||.||:.|:|||:||.||||||:.|..|:|.||..|   ...|..::..:||::: :.
  Rat   382 LKSLVLHFLQQYYAIYDSGDRQGLLDAYHDGACCSLSTPSNPQNPVRHNLAKYFNDSRNVKK-IK 445

  Fly   434 GEENRTRNLKYGRLACVSTLDEWPKTQHDRRTFTVDLTIYNTSMMVFTVTGLFKELNDETNNPAS 498
            ....|.|.||:.||..|:.|:|.|||.||..:|.||::...::::.|:|.|:|||::.::.:   
  Rat   446 DTTTRFRLLKHTRLNVVAFLNELPKTHHDVNSFVVDISAQTSTLLCFSVNGVFKEVDGKSRD--- 507

  Fly   499 MELYDVRHFARTYVVVP-QNNGFCIRNETIFITNATHEQVREFKRSQHQPAPGAMPSTSSAVTSP 562
                .:|.|.||::.|| .|:|.||.|:.:|:.||:.|   |.:|:...|||  .||:|      
  Rat   508 ----SLRAFTRTFIAVPASNSGLCIVNDELFVRNASPE---EIQRAFAMPAP--TPSSS------ 557

  Fly   563 QAGAAAGLQGRLNALGVATGPVAILSGDPLAATAPVNSGSAAISTTAVAPGAQDESTKMQMIEAM 627
                                ||..||.:                             :..|::|.
  Rat   558 --------------------PVPTLSQE-----------------------------QQDMLQAF 573

  Fly   628 SAQSQMNVIWSRKCLEETNWDFNHAAFVFEKLFKENKIPPEAFMK 672
            |.||.||:.||:|||::.|||:..:|..|..|..:.:||..||||
  Rat   574 STQSGMNLEWSQKCLQDNNWDYTRSAQAFTHLKAKGEIPEVAFMK 618

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
sbrNP_524660.1 Tap-RNA_bind 113..191 CDD:462696 18/78 (23%)
leucine-rich repeat 215..255 CDD:275382 16/41 (39%)
LRR <252..>315 CDD:443914 22/62 (35%)
leucine-rich repeat 256..281 CDD:275382 5/24 (21%)
leucine-rich repeat 282..305 CDD:275382 8/22 (36%)
leucine-rich repeat 306..336 CDD:275382 10/29 (34%)
NTF2 372..531 CDD:238403 63/166 (38%)
flgK <548..>613 CDD:235895 9/64 (14%)
TAP_C 609..671 CDD:197882 21/61 (34%)
Nxf1NP_067590.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..113 27/114 (24%)
Interaction with ALYREF/THOC4 and LUZP4. /evidence=ECO:0000250|UniProtKB:Q9UBU9 2..197 46/200 (23%)
RNA-binding (RBD). /evidence=ECO:0000250 2..117 27/118 (23%)
Minor non-specific RNA-binding. /evidence=ECO:0000250 2..59 10/36 (28%)
Major non-specific RNA-binding. /evidence=ECO:0000250 60..117 17/80 (21%)
RNA binding. /evidence=ECO:0000250 60..117 17/80 (21%)
Nuclear localization signal. /evidence=ECO:0000250 66..99 11/51 (22%)
Nuclear export signal. /evidence=ECO:0000250 82..109 9/35 (26%)
Tap-RNA_bind 118..197 CDD:462696 19/80 (24%)
leucine-rich repeat 223..265 CDD:275382 16/41 (39%)
LRR <262..>351 CDD:443914 32/88 (36%)
LRR 1 265..290 5/24 (21%)
leucine-rich repeat 266..291 CDD:275382 5/24 (21%)
LRR 2 291..314 9/22 (41%)
leucine-rich repeat 292..315 CDD:275382 8/22 (36%)
LRR 3 315..342 8/26 (31%)
leucine-rich repeat 316..346 CDD:275382 10/29 (34%)
LRR 4 343..370 12/26 (46%)
NTF2 381..537 CDD:238403 61/163 (37%)
TAP_C 555..617 CDD:197882 27/116 (23%)

Return to query results.
Submit another query.