DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Sox100B and Sox4

DIOPT Version :10

Sequence 1:NP_651839.1 Gene:Sox100B / 45039 FlyBaseID:FBgn0024288 Length:529 Species:Drosophila melanogaster
Sequence 2:NP_001258134.1 Gene:Sox4 / 364712 RGDID:1309488 Length:440 Species:Rattus norvegicus


Alignment Length:472 Identity:128/472 - (27%)
Similarity:190/472 - (40%) Gaps:140/472 - (29%)


- Green bases have known domain annotations that are detailed below.


  Fly    38 GGRKEDERITTAVMKVLEGYDWNLVQASAKAPTDRKKEHIKRPMNAFMVWAQAARRVMSKQYPHL 102
            ||:.:|.             .|      .|.|:.    ||||||||||||:|..||.:.:|.|.:
  Rat    43 GGKADDP-------------SW------CKTPSG----HIKRPMNAFMVWSQIERRKIMEQSPDM 84

  Fly   103 QNSELSKSLGKLWKNLKDSDKKPFMEFAEKLRMTHKQEHPDYKYQPRRKKARVLPSQQSGEGGSP 167
            .|:|:||.|||.||.||||||.||::.||:||:.|..::|||||:||:|    :.|..:|.|   
  Rat    85 HNAEISKRLGKRWKLLKDSDKIPFIQEAERLRLKHMADYPDYKYRPRKK----VKSGNTGAG--- 142

  Fly   168 GPEMTLSATMGSSGKPRSSNSNGQRRAGKGNAAADLG---------SCASTISHANVGSNSSDVF 223
                  ||.....|:.....:.|...||.|:|....|         ||...::.::||.      
  Rat   143 ------SAATAKPGEKGDKVAGGSGHAGSGHAGGGAGGSSKPAPKKSCGPKVAGSSVGK------ 195

  Fly   224 SNEAFMKSLNSACAASLMEQSLIETGLDSPCSTASSMSSLTPPATPYNV-APSNAKASAANNPSL 287
            .:..|:.:.....|||...:.             :::..|..||..|.| .||.|..:|:::|| 
  Rat   196 PHAKFVPAGGGKAAASFSPEQ-------------AALLPLGEPAAVYKVRTPSAATPAASSSPS- 246

  Fly   288 LLRQLSEPVANAGDG-------YGVLLEAGREYVAIGEVNYQGQSAGVQSGAEGGGAGQEMDFLE 345
              ..|:.|..:..|.       :|.|   |.....:|.:......:......|.||.|...|. .
  Rat   247 --SALATPAKHPADKKVKRVYLFGSL---GASASPVGGLGASADPSDPLGLYEDGGPGCSPDG-R 305

  Fly   346 NINGYGGYTGSRVSYPAYS-YPANGGHFATEEQQQQQALQASEALNYKPAAADIDPKEIDQYFMD 409
            :::|    ..|..|.||.| .||:...:|        :|:|:     .||               
  Rat   306 SLSG----RSSAASSPAASRSPADHRGYA--------SLRAA-----SPA--------------- 338

  Fly   410 QMLPMTQHHHPHHTHPLHHPLHHSPPLNSSASLSSACSSASSQ--------QPVAEYYEHLGYSP 466
                           |...|.|.|..|:||:|.||..||:..:        .| :..:|.:....
  Rat   339 ---------------PSSAPSHASSSLSSSSSSSSGSSSSDDEFEDDLLDLNP-SSNFESMSLGS 387

  Fly   467 AASSASQNP----NFGP 479
            .:||::.:.    ||.|
  Rat   388 FSSSSALDRDLDFNFEP 404

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Sox100BNP_651839.1 HMG-box_SoxE 76..150 CDD:438840 45/73 (62%)
Sox4NP_001258134.1 HMG-box_SoxC 58..133 CDD:438838 46/74 (62%)

Return to query results.
Submit another query.