DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Sox21a and SOX7

DIOPT Version :10

Sequence 1:NP_001261827.1 Gene:Sox21a / 39567 FlyBaseID:FBgn0036411 Length:407 Species:Drosophila melanogaster
Sequence 2:NP_113627.1 Gene:SOX7 / 83595 HGNCID:18196 Length:388 Species:Homo sapiens


Alignment Length:391 Identity:112/391 - (28%)
Similarity:154/391 - (39%) Gaps:132/391 - (33%)


- Green bases have known domain annotations that are detailed below.


  Fly    77 PPAAPTPVAPKMHQHTHHHGNSHHNAPTSHSNSNTGSHHNSHDHIKRPMNAFMVWSRGQRRKMAQ 141
            |||.|.|...|                            .|...|:|||||||||::.:|:::|.
Human    29 PPAVPRPPGDK----------------------------GSESRIRRPMNAFMVWAKDERKRLAV 65

  Fly   142 DNPKMHNSEISKRLGAEWKLLTEGQKRPFIDEAKRLRALHMKEHPDYKYRPRRK--PKTLNKSPV 204
            .||.:||:|:||.||..||.||..||||::|||:|||..||:::|:||||||||  .|.|.|...
Human    66 QNPDLHNAELSKMLGKSWKALTLSQKRPYVDEAERLRLQHMQDYPNYKYRPRRKKQAKRLCKRVD 130

  Fly   205 PG------------------GGGGGGGGGANGGVNAGGAG--------NSGPSGPGSVGSPKDMQ 243
            ||                  |..|..|...:.|..:.|..        :.||:|.|..|:|..:.
Human   131 PGFLLSSLSRDQNALPEKRSGSRGALGEKEDRGEYSPGTALPSLRGCYHEGPAGGGGGGTPSSVD 195

  Fly   244 ---------PQLSPLGQSLPHL----------HGHPHQSPY-QSHPHHPH--PHPHHVQLAAATL 286
                     |::|||....|..          ||||.:.|: ..||:.|.  |.|.|......:|
Human   196 TYPYGLPTPPEMSPLDVLEPEQTFFSSPCQEEHGHPRRIPHLPGHPYSPEYAPSPLHCSHPLGSL 260

  Fly   287 SAKYGFGSPLELSLPRLPNAFPGLAHYPLDPTLALDLQARLQAMYAGSI----YHPW-------- 339
            :.....|..:...:|..|   |..|:|  .|.....|.:.||| :.|.:    .||.        
Human   261 ALGQSPGVSMMSPVPGCP---PSPAYY--SPATYHPLHSNLQA-HLGQLSPPPEHPGFDALDQLS 319

  Fly   340 --------------RYLPLISPETPPSPPSSSGTGISSYGCVKSEKSSPNAVVASAASPPNIIXP 390
                          :||     .||..|.|::|                 |:..|...|.:.: |
Human   320 QVELLGDMDRNEFDQYL-----NTPGHPDSATG-----------------AMALSGHVPVSQVTP 362

  Fly   391 T 391
            |
Human   363 T 363

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Sox21aNP_001261827.1 HMG-box_SoxB 119..198 CDD:438790 46/80 (58%)
SOX7NP_113627.1 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 20..46 7/44 (16%)
HMG-box_SoxF_SOX7 34..121 CDD:438849 49/114 (43%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 140..197 11/56 (20%)
Sox17_18_mid <198..230 CDD:463454 6/31 (19%)

Return to query results.
Submit another query.