DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Sox14 and Sox30

DIOPT Version :10

Sequence 1:NP_476894.1 Gene:Sox14 / 37822 FlyBaseID:FBgn0005612 Length:669 Species:Drosophila melanogaster
Sequence 2:NP_001414119.1 Gene:Sox30 / 689918 RGDID:1586382 Length:770 Species:Rattus norvegicus


Alignment Length:805 Identity:155/805 - (19%)
Similarity:240/805 - (29%) Gaps:315/805 - (39%)


- Green bases have known domain annotations that are detailed below.


  Fly     4 KPNQATTEPP-----------------------LSLRPGTV-----------PTVPATTPARPAT 34
            ||.|....||                       |.|||..:           |..|...|.:|.|
  Rat    84 KPEQVLLLPPGPLLPQAPDEGAAAAASSAQARLLQLRPELLLLPPQSAADGGPCRPELRPMQPRT 148

  Fly    35 ITIQ------------------------RRHPAPKADSTPHTLPPFSPSPSPASSPSPAPAQT-- 73
            :.::                        |...|.|.|.|...|.........|...:...::.  
  Rat   149 LLVKAEKQELGAGLDLSVGSRRTTEAGPRASRAAKLDGTGKALDGRRGDEKKAKLEAEEASRDAL 213

  Fly    74 ------------PGAQKTQ---------------SQAAITHPAAVASPSAPVAAAAPKTPKTPEP 111
                        .|..||:               :...:.|.:..|..:.|.:|..|.......|
  Rat   214 KGGEGRSLLAIGEGVIKTEEPERLRDDCRLGTEATSNGLVHSSKEAILAQPPSAFGPHQQDLRFP 278

  Fly   112 RSTHT-HTHTHSQHFSPPPRE-------------SEMDGERSPSHSGHEMTLSMDGIDSSLVFGS 162
            .:.|| ......|...|||.|             .:|.....||     :.:....:..:::...
  Rat   279 LTLHTVPPGARIQFQGPPPSELIRLSKVPLTPVPIKMQSLLEPS-----VKIETKDVPLTVLPSD 338

  Fly   163 ARVPVNSSTPYSDATRTKKHSPGHIKRPMNAFMVWSQMERRKICERTPDLHNAEISKELGRRWQL 227
            |.:|   .||:|      |...||:||||||||||:::.|..:.:..|..:|||||.:||..|..
  Rat   339 AGIP---DTPFS------KDRNGHVKRPMNAFMVWARIHRPALAKANPAANNAEISVQLGLEWNK 394

  Fly   228 LSKDDKQPYIIEAEKLRKLHMIEYPNYKYRPQK-KQTRSPGSLKPNQDADGCEARNDTTNNNNSL 291
            ||::.|:||..||:|:::.|..|:|.:.|:|:. |:.|.|.|:.        ...:.||.|    
  Rat   395 LSEEQKKPYYDEAQKIKEKHREEFPGWVYQPRPGKRKRFPLSVS--------NVFSGTTQN---- 447

  Fly   292 TTLAINGTTTAGRKSKRSTSTCQSGSASKRLRNDSGDTSSKPKYEVKMESAEQLNSADIILPSAD 356
             .::.|.||....:|                          |.|.|             ::|...
  Rat   448 -IISTNPTTIYPYRS--------------------------PTYSV-------------VIPGLQ 472

  Fly   357 NLISYQSSEYLPLSTLSNADCDEKLHSELSSGPLESRENLSEVVNRFLPLFLGGNEDSQLGVSSL 421
            |.|::...|..|...|.                       :..|.|..|:             :|
  Rat   473 NTITHPVGEAPPAIQLP-----------------------TPAVQRPSPI-------------TL 501

  Fly   422 TQSQHNQSDPTAGLMDNISDISPINDREELTEEVMRYLPYLEVNPSSDGLTLKVESSSLLGKPLN 486
            .|...:.:.|.|        :.|               |.|...|            ||..:..:
  Rat   502 FQPSVSSTGPVA--------VPP---------------PSLTPRP------------SLPPQRFS 531

  Fly   487 EPVFDSEDNIVNDANLHSASHQIPPYVPDSHDCFAEDCGGDSSSHQVEFEVVRPQTVTMTMTCTL 551
            .|   |:.::          |::|              .|.|.|      |.||..|::..|..:
  Rat   532 GP---SQTDV----------HRLP--------------SGTSRS------VKRPTPVSLESTNRI 563

  Fly   552 PYGGPDAGHTTFQADDFNAIPSAAEDSECSILTT-------------------------SNSPQI 591
            |.|...| |..|...........|..|.|...|.                         ...|:.
  Rat   564 PAGASTA-HARFATSPIQPPKEYAGVSTCPRSTPIPPATPIPHSHVYQPPPLGHPATLFGTPPRF 627

  Fly   592 GFNGSSFVEADAI--GSTCTYAQQ-----DYTGSVIET--HNDLNYAAHDNNGALLA-----YTF 642
            .|:...|:.....  .|||.|::.     ::..|:.|.  :.:..|..|:   |:.:     |.|
  Rat   628 SFHHPYFLPGPHYFPSSTCPYSRPPFGYGNFPSSMPECLGYYEDRYQKHE---AIFSALNRDYPF 689

  Fly   643 EDLPPQPTGSHLEFNTNKYEFASYY 667
            .|.|.:.|.|....:....:...||
  Rat   690 RDYPDEHTHSEDSRSCESMDGPPYY 714

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Sox14NP_476894.1 HMG-box_SoxC 186..261 CDD:438838 34/75 (45%)
Sox30NP_001414119.1 None

Return to query results.
Submit another query.