DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Sox100B and Sox7

DIOPT Version :10

Sequence 1:NP_651839.1 Gene:Sox100B / 45039 FlyBaseID:FBgn0024288 Length:529 Species:Drosophila melanogaster
Sequence 2:NP_001099515.1 Gene:Sox7 / 290317 RGDID:1310038 Length:383 Species:Rattus norvegicus


Alignment Length:517 Identity:115/517 - (22%)
Similarity:154/517 - (29%) Gaps:235/517 - (45%)


- Green bases have known domain annotations that are detailed below.


  Fly    53 VLEGYDW---------------NLVQASAKAPTDRK--KEHIKRPMNAFMVWAQAARRVMSKQYP 100
            :|..|.|               .|...:...|:..|  :..|:||||||||||:..|:.::.|.|
  Rat     4 LLGAYPWTEGLECPALEAELSDGLSPPAVPRPSGDKGSESRIRRPMNAFMVWAKDERKRLAVQNP 68

  Fly   101 HLQNSELSKSLGKLWKNLKDSDKKPFMEFAEKLRMTHKQEHPDYKYQPRRKKARVLPSQQSGEGG 165
            .|.|:||||.|||.||.|..|.|:|:::.||:||:.|.|::|:|||:|||||             
  Rat    69 DLHNAELSKMLGKSWKALTLSQKRPYVDEAERLRLQHMQDYPNYKYRPRRKK------------- 120

  Fly   166 SPGPEMTLSATMGSSGKPRSSNSNGQRRAGKGNAAADLGSCASTISHANVGSNSSDVFSNEAFMK 230
                                   .|:|...:    .|.|...|::|.                  
  Rat   121 -----------------------QGKRLCKR----VDPGFLLSSLSR------------------ 140

  Fly   231 SLNSACAASLMEQSLIETGLDSPCSTASSMSSLTPPATPYNVAPSNAKASAANNPSLLLRQLSEP 295
                                                  ..|..|.  |.|....|          
  Rat   141 --------------------------------------DQNTLPE--KNSIGRGP---------- 155

  Fly   296 VANAGDGYGVLLEAGREYVAIGEVNYQGQSA------GVQSGAEGGGAGQEMDFLENINGYGGYT 354
                                :||...:|:.|      |:.|....|.|..              .
  Rat   156 --------------------LGEKEDRGEYAPGATLPGLHSCYREGAAAA--------------P 186

  Fly   355 GSRVSYPAYSYPANGGHFATEEQQQQQALQASEALNYKPAAADIDPKEIDQYFMDQMLPMTQHHH 419
            ||..:|| |..|.                        .|..:.:|..|.:|.|..... ..:|.|
  Rat   187 GSVDTYP-YGLPT------------------------PPEMSPLDALEPEQTFFSSSC-QEEHGH 225

  Fly   420 PHHTHPLHH-------------PLHHSPPLNSSASLS----SACSSASSQQPVAEYYEHLGYSPA 467
            |||   |.|             |||.|.||.|.|...    |..||.....|...||.|..|.|.
  Rat   226 PHH---LPHLPGPPYSPEFTPSPLHCSHPLGSLALGQSPGVSMMSSVPGCPPSPAYYSHATYHPL 287

  Fly   468 ASSASQNPNF---------GPQQPYAN--GAASMTPTLGDPAPQQELQSQQQEQQHQNPSQH 518
                  :||.         .|:.|..:  ...|....|||       ..:.:..|:.|...|
  Rat   288 ------HPNLQAHLGQLSPPPEHPGFDTLDQLSQVELLGD-------MDRNEFDQYLNTPGH 336

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Sox100BNP_651839.1 HMG-box_SoxE 76..150 CDD:438840 40/73 (55%)
Sox7NP_001099515.1 HMG-box_SoxF_SOX7 34..121 CDD:438849 46/122 (38%)
Sox17_18_mid 178..222 CDD:463454 13/83 (16%)

Return to query results.
Submit another query.