DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Sox100B and Sox17

DIOPT Version :10

Sequence 1:NP_651839.1 Gene:Sox100B / 45039 FlyBaseID:FBgn0024288 Length:529 Species:Drosophila melanogaster
Sequence 2:NP_035571.1 Gene:Sox17 / 20671 MGIID:107543 Length:419 Species:Mus musculus


Alignment Length:513 Identity:135/513 - (26%)
Similarity:181/513 - (35%) Gaps:154/513 - (30%)


- Green bases have known domain annotations that are detailed below.


  Fly     1 MSDSSSSNCSKDRAKPVETLVLANYALKAEQKKAQGQGGRKEDERITTAVMKVLEGYDW------ 59
            ||...:...|.|:::|               :.||            .|||..|....|      
Mouse     1 MSSPDAGYASDDQSQP---------------RSAQ------------PAVMAGLGPCPWAESLSP 38

  Fly    60 -------NLVQASAKAPTD-----RKKEHIKRPMNAFMVWAQAARRVMSKQYPHLQNSELSKSLG 112
                   ..|.||:.||..     :.:..|:||||||||||:..|:.:::|.|.|.|:||||.||
Mouse    39 LGDVKVKGEVVASSGAPAGTSGRAKAESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLG 103

  Fly   113 KLWKNLKDSDKKPFMEFAEKLRMTHKQEHPDYKYQPRRKK------------ARVLPSQQSG--- 162
            |.||.|..::|:||:|.||:||:.|.|:||:|||:|||:|            ...|...|:|   
Mouse   104 KSWKALTLAEKRPFVEEAERLRVQHMQDHPNYKYRPRRRKQVKRMKRVEGGFLHALVEPQAGALG 168

  Fly   163 -EGGS--------PGPEMTLSATMGSSGKPRSSNSNGQRR---AGKGNAAADLGSCASTISHANV 215
             |||.        |.||....|     |.|..|...|...   .|.|..|.|.....:..:....
Mouse   169 PEGGRVAMDGLGLPFPEPGYPA-----GPPLMSPHMGPHYRDCQGLGAPALDGYPLPTPDTSPLD 228

  Fly   216 GSNSSDVFSNEAFMKSLNSACAASLMEQSLIETGLDSPCSTASSMSSLTPPATPYNVAP---SNA 277
            |......|    |...|...|.|:         |..:....:....|:.|||.|..|.|   ..|
Mouse   229 GVEQDPAF----FAAPLPGDCPAA---------GTYTYAPVSDYAVSVEPPAGPMRVGPDPSGPA 280

  Fly   278 KASAANNPS---LLLRQLSEPVANAGDGYGVLLEAGREYVAIGEVNYQGQSAGVQSGAEGGGAGQ 339
            .......||   |....:..|.|:||.|:....:.        .:..|......|......|.||
Mouse   281 MPGILAPPSALHLYYGAMGSPAASAGRGFHAQPQQ--------PLQPQAPPPPPQQQHPAHGPGQ 337

  Fly   340 EMDFLENINGYGGYTGSRVSYPAYSYPANGGHFATEEQQQQQALQASEALNYKPAAADIDPKEID 404
            .                  |.|..:.|...|   ||..|..:.|            .::|..|.:
Mouse   338 P------------------SPPPEALPCRDG---TESNQPTELL------------GEVDRTEFE 369

  Fly   405 QYFMDQMLPMTQHHH---PHHTHPLHHPLHHSPPLNSSASLSSACSSASSQQPVAEYY 459
            ||     ||......   |:..|.....|.     :|..::||..|.|||    |.||
Mouse   370 QY-----LPFVYKPEMGLPYQGHDCGVNLS-----DSHGAISSVVSDASS----AVYY 413

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Sox100BNP_651839.1 HMG-box_SoxE 76..150 CDD:438840 42/73 (58%)
Sox17NP_035571.1 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..27 10/52 (19%)
HMG-box_SoxF_SOX17 62..142 CDD:438850 43/79 (54%)
Sox17_18_mid 203..253 CDD:463454 11/62 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 303..356 15/81 (19%)
9aaTAD. /evidence=ECO:0000250|UniProtKB:Q9H6I2 366..374 4/12 (33%)

Return to query results.
Submit another query.