DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment SoxN and Sry

DIOPT Version :10

Sequence 1:NP_001260269.1 Gene:SoxN / 44275 FlyBaseID:FBgn0029123 Length:761 Species:Drosophila melanogaster
Sequence 2:NP_035694.1 Gene:Sry / 21674 MGIID:98660 Length:395 Species:Mus musculus


Alignment Length:593 Identity:151/593 - (25%)
Similarity:196/593 - (33%) Gaps:266/593 - (44%)


- Green bases have known domain annotations that are detailed below.


  Fly   179 VKRPMNAFMVWSRGQRRKMASDNPKMHNSEISKRLGAQWKDLSESEKRPFIDEAKRLRAVHMKEH 243
            ||||||||||||||:|.|:|..||.|.|:||||:||.:||.|:|:|||||..||:||:.:|.:::
Mouse     5 VKRPMNAFMVWSRGERHKLAQQNPSMQNTEISKQLGCRWKSLTEAEKRPFFQEAQRLKILHREKY 69

  Fly   244 PDYKYRPRRKTKTLTKTKEKYPMGGLMPGQTVGGGAPGEPVTPTRVQG--QPGQNQSLNGSGGSA 306
            |:|||:|.|:.|...::      |.|.|.           |..|::..  |..:|..        
Mouse    70 PNYKYQPHRRAKVSQRS------GILQPA-----------VASTKLYNLLQWDRNPH-------- 109

  Fly   307 AAAAAAAAAAAQQARQD------MYQMNAPNGY-----MPNGYMMHADPAGAAAYQTSYMGQHYA 360
                      |...|||      :|..|..:.|     :|.|   |.........|..:...|..
Mouse   110 ----------AITYRQDWSRAAHLYSKNQQSFYWQPVDIPTG---HLQQQQQQQQQQQFHNHHQQ 161

  Fly   361 AQRYDMGHMYNNGYAMYQTVSGGQTSPYGSSLQQPGSPSPYGGSSLQQQPGSPTPYGGGGGGGGQ 425
            .|::                       |....||            |||.              |
Mouse   162 QQQF-----------------------YDHHQQQ------------QQQQ--------------Q 177

  Fly   426 VSCQSHSPSDSSIKSEPVSPSPSAIALNNNNNINNNHIMKREYSSAAAAAAAAAAAAAAGGGELN 490
            ...|.|...                  ......:::|..::::..                   :
Mouse   178 QQQQFHDHH------------------QQKQQFHDHHQQQQQFHD-------------------H 205

  Fly   491 HLMNMYHLPDEQRHLLHYQTDSPDLQQQHQSMQQQQQHLPQQHLSQQHQQIPQ--QHHTMQQQQQ 553
            |    :|..::|.|..|.|      |||....|||||...||.....|||..|  .||..|||||
Mouse   206 H----HHHQEQQFHDHHQQ------QQQFHDHQQQQQQQQQQQFHDHHQQKQQFHDHHHHQQQQQ 260

  Fly   554 QH-HLQHQQSLRAMAPLAHMXEMASAYGAAGSSVSVSPPLPPYMPTAPQQQQQ---QQQQQQQLL 614
            .| |.|.||.........|...                       ..|||:||   ..|||||. 
Mouse   261 FHDHQQQQQQFHDHQQQQHQFH-----------------------DHPQQKQQFHDHPQQQQQF- 301

  Fly   615 NGRPSPTASGSSGGRSSAHSSGHTAAHSPALAAAYQLTPSAATSSATSAAAAAATAAAVAAAAAA 679
                              |...|                                          
Mouse   302 ------------------HDHHH------------------------------------------ 306

  Fly   680 GSSFAASASMLDMQQQHLEEQQQHH------HLHYQQQQQ--QHYQQQQQ-----QQQQLQHLPQ 731
                         |||..::...||      |.|:||:||  .|:|||||     ||||.|   |
Mouse   307 -------------QQQQKQQFHDHHQQKQQFHDHHQQKQQFHDHHQQQQQFHDHHQQQQQQ---Q 355

  Fly   732 QQQHQLYH 739
            |||.|.:|
Mouse   356 QQQQQQFH 363

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
SoxNNP_001260269.1 HMG-box_SoxB 177..252 CDD:438790 46/72 (64%)
SryNP_035694.1 Sufficient for interaction with KPNB1. /evidence=ECO:0000250|UniProtKB:Q05066 4..81 47/75 (63%)
HMG-box_SoxA_SoxB_SoxG 4..79 CDD:438837 46/73 (63%)
Required for nuclear localization. /evidence=ECO:0000250|UniProtKB:Q05066 6..22 14/15 (93%)
Sufficient for interaction with EP300. /evidence=ECO:0000250|UniProtKB:Q05066 52..84 15/31 (48%)
Required for nuclear localization. /evidence=ECO:0000250|UniProtKB:Q05066 75..81 2/5 (40%)
Necessary for interaction with ZNF208 isoform KRAB-O. /evidence=ECO:0000269|PubMed:15469996 92..144 14/83 (17%)
Necessary for interaction with SLC9A3R2 and nuclear accumulation of SLC9A3R2. /evidence=ECO:0000269|PubMed:16166090 94..138 10/61 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 142..361 85/414 (21%)

Return to query results.
Submit another query.