DRSC/TRiP Functional Genomics Resources




Protein Alignment Sox100B and Sox9

DIOPT Version: 10

Sequence 1: NP_651839.1  Gene: Sox100B / 45039  FlyBaseID: FBgn0024288  Length: 529  Species: Drosophila melanogaster
Sequence 2: NP_536328.1  Gene: Sox9 / 140586  RGDID: 620474  Length: 507  Species: Rattus norvegicus


Alignment Length: 584  Identity: 173/584 (29%)
Similarity: 231/584 (39%)  Gaps: 200/584 (34%)
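
The percentages above follow directly from the raw counts over the alignment length. The sketch below reproduces them; it assumes the summary truncates to a whole percent rather than rounding (173/584 is about 29.6%, reported as 29%), which is an inference from the numbers shown, not documented behavior. The helper name `percent_floor` is hypothetical.

```python
def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumption: the summary does not round)."""
    return (100 * count) // total

ALIGNMENT_LENGTH = 584
# Counts copied from the alignment summary above.
stats = {"Identity": 173, "Similarity": 231, "Gaps": 200}

for name, count in stats.items():
    pct = percent_floor(count, ALIGNMENT_LENGTH)
    print(f"{name}: {count}/{ALIGNMENT_LENGTH} ({pct}%)")
```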


- Green residues have known domain annotations that are detailed below.


  Fly     3 DSSSSNCSKDRAKPVE-TLVLANYALKAEQKKAQGQGGRKEDER------ITTAVMKVLEGYDWN 60
            ||:.|.|........| |....|...|.|...      :||.|.      |..||.:||:||||.
  Rat    29 DSAGSPCPSGSGSDTENTRPQENTFPKGEPDL------KKESEEDKFPVCIREAVSQVLKGYDWT 87

  Fly    61 LVQASAKA-PTDRKKEHIKRPMNAFMVWAQAARRVMSKQYPHLQNSELSKSLGKLWKNLKDSDKK 124
            ||....:. .:.:.|.|:||||||||||||||||.::.|||||.|:||||:|||||:.|.:|:|:
  Rat    88 LVPMPVRVNGSSKNKPHVKRPMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLNESEKR 152

  Fly   125 PFMEFAEKLRMTHKQEHPDYKYQPRRKKARVLPSQQSGEGGSPGPEMTLSATMGSSGKPRSSNSN 189
            ||:|.||:||:.||::||||||||||:|                                 |..|
  Rat   153 PFVEEAERLRVQHKKDHPDYKYQPRRRK---------------------------------SVKN 184

  Fly   190 GQRRAGKGNAAADLGSCASTISHANVGSNSSDVFSNEAFMKSLNSACAASLMEQSLIETGLDSPC 254
            ||..|.:          |:..:|          .|..|..|:|.:                |||.
  Rat   185 GQAEAEE----------ATEQTH----------ISPNAIFKALQA----------------DSPH 213

  Fly   255 STASSMSSL--------------TPPATPYNVAPSNAKASAANNPSLLLRQLSEPVANAGDGYGV 305
            | :|.||.:              |||.||    .::.:|...:     |::...|:|        
  Rat   214 S-SSGMSEVHSPGEHSGQSQGPPTPPTTP----KTDVQAGKVD-----LKREGRPLA-------- 260

  Fly   306 LLEAGRE------YVAIGEVNYQGQSAGVQSGAEGGGAGQEMDFLENINGYGGY--TGSRVSYPA 362
              |.||:      .|.|||:     |:.|.|..|.....:...:|.. ||:.|.  |..:|||..
  Rat   261 --EGGRQPPIDFRDVDIGEL-----SSDVISNIETFDVNEFDQYLPP-NGHPGVPATHGQVSYTG 317

  Fly   363 YSY--------PANGGHFATEEQQ------QQ--QALQASEALNYKPAAADIDPKEIDQYFMDQM 411
             ||        ||..||....:||      ||  ||.||.:|    |......|:       .|.
  Rat   318 -SYGISSTAPTPATAGHVWMSKQQAPPPPPQQPPQAPQAPQA----PPQQQAPPQ-------PQQ 370

  Fly   412 LPMTQHHHPHHTHPLHHPLHHSPPLNSSASLSSACSSASSQQPVAEYYEHLGYSPAASSAS--QN 474
            .|..|..|...|            |:|....|......:.|...:.|.|...:||...|.|  ..
  Rat   371 APQQQQAHTLTT------------LSSEPGQSQRTHIKTEQLSPSHYSEQQQHSPQQISYSPFNL 423

  Fly   475 PNFGPQQPYANGAASMTPTLGDPAPQQELQSQQQEQQHQNPSQHH---------LWGTYTYVNP 529
            |::.|..|        |.|          :||.....|||...::         |:.|:||:||
  Rat   424 PHYNPSYP--------TIT----------RSQYDYTDHQNSGSYYSHAAGQGSGLYSTFTYMNP 469
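
Identity and gap counts like those in the summary can be tallied column by column from the two aligned rows. The following is a minimal sketch, assuming `-` marks a gap as in the alignment above; the function name `alignment_counts` is hypothetical, and similarity is left out because it depends on a substitution matrix that this page does not name. The example rows are the first block of the alignment (Fly 3..60, Rat 29..87).

```python
def alignment_counts(seq1: str, seq2: str, gap: str = "-") -> dict:
    """Count identical columns and gap columns in two aligned rows."""
    if len(seq1) != len(seq2):
        raise ValueError("aligned rows must have the same number of columns")
    identities = 0
    gaps = 0
    for a, b in zip(seq1, seq2):
        if a == gap or b == gap:
            gaps += 1          # a gap in either row counts as one gap column
        elif a == b:
            identities += 1    # same residue in both rows
    return {"length": len(seq1), "identities": identities, "gaps": gaps}

# First alignment block, copied from the rows above.
fly = "DSSSSNCSKDRAKPVE-TLVLANYALKAEQKKAQGQGGRKEDER------ITTAVMKVLEGYDWN"
rat = "DSAGSPCPSGSGSDTENTRPQENTFPKGEPDL------KKESEEDKFPVCIREAVSQVLKGYDWT"
print(alignment_counts(fly, rat))
```

Summing these per-block counts over all blocks should give the totals for the full alignment, provided the rows are copied without losing any columns.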

Known Domains:


Indicated by green residues in the alignment.

Gene     Sequence     Domain                                                                Region    External ID Identity
Sox100B  NP_651839.1  HMG-box_SoxE                                                          76..150   CDD:438840  52/73 (71%)
Sox9     NP_536328.1  Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                     1..67     -           12/43 (28%)
                      Sox_N                                                                 22..94    CDD:463586  23/70 (33%)
                      Dimerization (DIM). /evidence=ECO:0000250|UniProtKB:P48436            63..103   -           13/39 (33%)
                      PQA. /evidence=ECO:0000250|UniProtKB:P48436                           63..103   -           13/39 (33%)
                      HMG-box_SoxE                                                          104..178  CDD:438840  52/73 (71%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                     160..250  -           39/163 (24%)
                      Transactivation domain (TAM). /evidence=ECO:0000250|UniProtKB:P48436  224..307  -           24/107 (22%)
                      9aaTAD 1. /evidence=ECO:0000250|UniProtKB:P48436                      275..284  -           5/13 (38%)
                      9aaTAD 2. /evidence=ECO:0000250|UniProtKB:P48436                      290..298  -           0/7 (0%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                     335..429  -           27/116 (23%)
                      Transactivation domain (TAC). /evidence=ECO:0000250|UniProtKB:P48436  392..507  -           23/96 (24%)
                      9aaTAD 3. /evidence=ECO:0000250|UniProtKB:P48436                      458..466  -           2/7 (29%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                     477..507  -           -
Blue background indicates that the domain is not in the aligned region.
