DRSC/TRiP Functional Genomics Resources


Protein Alignment Sox100B and Sox1

DIOPT Version: 10

Sequence 1: NP_651839.1  Gene: Sox100B / 45039  FlyBaseID: FBgn0024288  Length: 529  Species: Drosophila melanogaster
Sequence 2: NP_033259.2  Gene: Sox1 / 20664  MGIID: 98357  Length: 391  Species: Mus musculus


Alignment Length: 506
Identity: 115/506 (22%)
Similarity: 168/506 (33%)
Gaps: 230/506 (45%)
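
The identity and gap figures above are column counts over the alignment length. As a minimal sketch of how such counts could be derived from two aligned strings, the hypothetical helper `alignment_stats` below (not part of DIOPT) tallies identical columns and gap columns. Similarity is left out because it depends on a substitution matrix that this page does not name.

```python
def alignment_stats(seq_a: str, seq_b: str, gap: str = "-") -> dict:
    """Count identical and gapped columns in two equal-length aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    identical = 0
    gaps = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            gaps += 1          # a gap in either sequence makes this a gap column
        elif a == b:
            identical += 1     # same residue in both sequences
    return {"length": len(seq_a), "identical": identical, "gaps": gaps}


# First ten columns of the first alignment block (Fly 70..79, Mouse 44..53).
stats = alignment_stats("TDRKKEHIKR", "TKANQDRVKR")
print(stats)
print(f"identity: {stats['identical'] / stats['length']:.1%}")
```

Applied to the full 506-column alignment, the same counting would be expected to give the totals reported above, although the page does not state how its percentages are rounded.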


- Green residues have known domain annotations, which are detailed below.


  Fly    70 TDRKKEHIKRPMNAFMVWAQAARRVMSKQYPHLQNSELSKSLGKLWKNLKDSDKKPFMEFAEKLR 134
            |...::.:||||||||||::..||.|:::.|.:.|||:||.||..||.:.:::|:||::.|::||
Mouse    44 TKANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLR 108

  Fly   135 MTHKQEHPDYKYQPRRKKARVLPSQQSGEGGSPGPEMTLSATMGSSGKPRSSNSNGQRRAGKGNA 199
            ..|.:|||||||:||||...:|...          :.:|:..:.::|            ||.|.|
Mouse   109 ALHMKEHPDYKYRPRRKTKTLLKKD----------KYSLAGGLLAAG------------AGGGGA 151

  Fly   200 AADLGSCASTISHANVGSNSSDVFSNEAFMKSLNSACAASLMEQSLIETGLDSPCSTASSMSSLT 264
            |..:|                                                            
Mouse   152 AVAMG------------------------------------------------------------ 156

  Fly   265 PPATPYNVAPSNAKASAANNPSLLLRQLSEPVANAGDGYGVLLEAGREYVAIGEVNYQGQSAGVQ 329
                                              .|.|.|.        .|:|:   :.:|.|  
Mouse   157 ----------------------------------VGVGVGA--------AAVGQ---RLESPG-- 174

  Fly   330 SGAEGGGAGQEMDFLENINGY--GGYTGSRVSYPAYSYPANGGHFATEEQQQQQALQASEALNYK 392
             ||.|||       ..::||:  |.|.||..:..|          |....|:.|.     |....
Mouse   175 -GAAGGG-------YAHVNGWANGAYPGSVAAAAA----------AAAMMQEAQL-----AYGQH 216

  Fly   393 PAAADIDPKEIDQYFMDQMLPMTQHHHPHHTHPLHHP-----------------LHHSPPLNSSA 440
            |.|....|                |.||.|.|| |||                 |.:||..||..
Mouse   217 PGAGGAHP----------------HAHPAHPHP-HHPHAHPHNPQPMHRYDMGALQYSPISNSQG 264

  Fly   441 SLSS-----------ACSSASSQQPVAEYYEHLGYSPAASSAS------------QNPNFGPQQP 482
            .:|:           |.::|::....|.....:..:.||::||            ..|:..|..|
Mouse   265 YMSASPSGYGGIPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLVKSEPSGSPPAP 329

  Fly   483 YANGAASMTPTLGD---------PA-----PQQELQSQQQEQQHQNPSQHH 519
                |.|..|..||         ||     |.....:..|.:.|..| ||:
Mouse   330 ----AHSRAPCPGDLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLP-QHY 375
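
Each sequence row in the blocks above has the form label, start position, aligned residues, end position; the rows between them carry only match symbols. The sketch below shows one way to read such rows and check that the printed coordinates agree with the number of non-gap residues. `parse_row` and `coordinates_consistent` are hypothetical helper names, and the example row is the Mouse 152..156 row with its gap run shortened.

```python
import re

# label, start coordinate, aligned sequence, end coordinate
ROW = re.compile(r"^\s*(\S+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def parse_row(line: str):
    """Return (label, start, sequence, end), or None for match-symbol or blank lines."""
    m = ROW.match(line)
    if m is None:
        return None
    label, start, seq, end = m.groups()
    return label, int(start), seq, int(end)


def coordinates_consistent(start: int, seq: str, end: int, gap: str = "-") -> bool:
    """True if end - start + 1 equals the number of non-gap residues in the row."""
    residues = sum(1 for c in seq if c != gap)
    return end - start + 1 == residues


row = parse_row("Mouse   152 AVAMG----- 156")  # gap run shortened for illustration
label, start, seq, end = row
print(label, start, end, coordinates_consistent(start, seq, end))
```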

Known Domains:


Indicated by green residues in the alignment.

Gene     Sequence     Domain                                             Region    External ID  Identity
Sox100B  NP_651839.1  HMG-box_SoxE                                       76..150   CDD:438840   38/73 (52%)
Sox1     NP_033259.2  Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  1..52                  1/7 (14%)
                      HMG-box_SoxB                                       49..128   CDD:438790   41/78 (53%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  214..249               12/51 (24%)
                      9aaTAD. /evidence=ECO:0000250|UniProtKB:P41225     342..350               0/7 (0%)
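
The Region values are written as `start..end` in residue coordinates of the respective protein. A small sketch, assuming the coordinates are 1-based and inclusive, shows how a region could be parsed and intersected with the aligned span of a sequence (Fly 70..519 and Mouse 44..375 in the alignment above). The function names are hypothetical. Note that the identity denominators in the table (for example 73 for the 75-residue region 76..150) are not simply the region lengths; the page does not state how they are computed.

```python
def parse_region(region: str) -> tuple:
    """Parse a 'start..end' string into an inclusive (start, end) pair."""
    start_text, end_text = region.split("..")
    start, end = int(start_text), int(end_text)
    if start > end:
        raise ValueError(f"region start exceeds end: {region}")
    return start, end


def region_length(region: str) -> int:
    """Number of residues in an inclusive region."""
    start, end = parse_region(region)
    return end - start + 1


def overlap(a: tuple, b: tuple):
    """Inclusive intersection of two (start, end) ranges, or None if disjoint."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return (start, end) if start <= end else None


print(region_length("76..150"))                    # 75
print(overlap(parse_region("1..52"), (44, 375)))   # (44, 52)
```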
