DRSC/TRiP Functional Genomics Resources


Protein Alignment ash1 and Setd1a

DIOPT Version: 10

Sequence 1:NP_524160.2 Gene:ash1 / 40133 FlyBaseID:FBgn0005386 Length:2226 Species:Drosophila melanogaster
Sequence 2:NP_821172.2 Gene:Setd1a / 233904 MGIID:2446244 Length:1716 Species:Mus musculus


Alignment Length:1830 Identity:349/1830 - (19%)
Similarity:543/1830 - (29%) Gaps:679/1830 - (37%)
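The summary figures above are simple ratios over the alignment length, so they can be recomputed from the counts. The following is a minimal sketch, not part of DIOPT: the function name `parse_summary` and the regular expression are illustrative assumptions, and the input string is copied from the two summary lines above.

```python
import re

# Summary lines copied from the alignment header above.
SUMMARY = (
    "Alignment Length:1830 Identity:349/1830 - (19%)\n"
    "Similarity:543/1830 - (29%) Gaps:679/1830 - (37%)"
)


def parse_summary(text):
    """Return {name: (count, total, exact_percent)} for each 'Name:count/total' field.

    Hypothetical helper for illustration; it only looks for fields written
    as 'Name:count/total', so 'Length:1830' (no slash) is skipped.
    """
    stats = {}
    for name, count, total in re.findall(r"(\w+):(\d+)/(\d+)", text):
        count, total = int(count), int(total)
        stats[name] = (count, total, 100.0 * count / total)
    return stats


if __name__ == "__main__":
    for name, (count, total, pct) in parse_summary(SUMMARY).items():
        print(f"{name}: {count}/{total} = {pct:.2f}%")
```

The page reports whole-number percentages. The exact similarity ratio, 543/1830, is about 29.7%, so the displayed 29% appears to be truncated rather than rounded.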


- Green bases have known domain annotations that are detailed below.


  Fly   140 DDLPLKVHQQRAPRVLLSAIIQAAQSASKPTL-DIGISSSDNELPNLVQAAI-----KRVESDTE 198
            :::.:.:|.:....:.|:.::..:...:|.|: ::.::|   .:.|::.|.:     :|::  ..
Mouse   121 EEVEILLHPRTRKHLGLARVLFTSTRGAKETVKNLHLTS---VMGNIIHAQLDIKGQQRMK--YY 180

  Fly   199 DTTVEGSFRKAAKDKNLPQYQSTLLQDFMEKTQMLGQTVNAKLAEEKVAKAKEETLVQTAVPRKR 263
            :..|.||:        .||...|..:...||.|..|                  ...:|...|:|
Mouse   181 ELIVNGSY--------TPQTVPTGGKALSEKFQGSG------------------AAAETTEARRR 219

  Fly   264 RGRPKKVVP---TV-PAPGNSGPAINESADSGVISTTSTTQSTTPSPKMQNENAVP---TGSLPI 321
            ........|   || ..|||..|.   |.|:...|:...|.|:......|:....|   .||.|.
Mouse   220 SSSDTAAYPAGTTVGGTPGNGTPC---SQDTNFSSSRQDTPSSFGQFTPQSSQGTPYTSRGSTPY 281

  Fly   322 ASSSKPKIDMAYLDKRMYATERVLYPPPRSKRRQNNKKTACSSSNKEELQLDPLWREIDVNKKFR 386
            :.      |.||..    :|....:.|.||                          |......|.
Mouse   282 SQ------DSAYSS----STTSTSFKPRRS--------------------------ENSYQDSFS 310

  Fly   387 LRSMSVGAASGTGASTTICSKVLAAKSGYVSDYGSVRHQRSSHNHNSGYK-SDASCKSRYSTKSC 450
            .|..|..:|..|.|:.|..:....|.|...|...|.....||.:..|.:: ||:|..:.|.:   
Mouse   311 RRHFSTSSAPATTATATSATAAATAASSSSSSSSSSSSSSSSSSSASQFRGSDSSYPAYYES--- 372

  Fly   451 MSRRSRAKSCGYRSDCKESGKSGLRMRRKRRASMLLKSSADDTVEDQDILQLAGLSLGQSSEESN 515
            .:|..|..|...|...:|. .||             .|.|::|.|.......:.|:...:.....
Mouse   373 WNRYQRHTSYPPRRATRED-PSG-------------ASFAENTAERFPPSYTSYLAPEPNRSTDQ 423

  Fly   516 EYISKPSLKSLPTTSASKKYGEINRYVTTGQYFGRGGSLSATNPDNFISKMMNQRKETPAPSKSS 580
            :|  :|.....|.....:..|........|...|.||.                  ..|:|.:..
Mouse   424 DY--RPPASEAPPPEPPEPGGGGGGSGGGGGGGGGGGG------------------GAPSPEREE 468

  Fly   581 CKIKSRRSSAASMCSSYVSGVSRMRRRHRRKSFSHNKSLNIDSKLLTEIEIITSTFNSRCRIQDD 645
            .:...|.:|.|.      ||.......:....|:.:.||  ||::...::...|.|:......::
Mouse   469 ARTPPRPASPAR------SGSPAPETTNESVPFAQHSSL--DSRIEMLLKEQRSKFSFLASDTEE 525

  Fly   646 RLTGSSGKEKLLADANKLQATLAAPSPAQQLTLNGGGPASTLSKPLKRGLKKRKLSEPLVDFAML 710
            ....||      |......|....||.|      |.||.:....|              .:|..:
Mouse   526 EEENSS------AGPGARDAGAEVPSGA------GHGPCTPPPAP--------------ANFEDV 564

  Fly   711 SASASGTPNG---SGSSNGNTKRRHKKSQSNDSSSPDDH-------------------------- 746
            :.:.||.|..   |..:||..:.....|..:...|.||.                          
Mouse   565 APTGSGEPGAARESPKANGQNQASPCSSGEDMEISDDDRGGSPPPAPTPPQQPPPPPPPPPPPPP 629

  Fly   747 ----KLPL----KKRHYLLTP---GERPPAE-----------VAFANG-----KLNAEAWAAAAA 784
                .|||    .:..|||.|   |..||..           ..|.|.     :|.|: |..   
Mouse   630 PYLASLPLGYPPHQPAYLLPPRPDGPPPPEYPPPPPPPPPHIYDFVNSLELMDRLGAQ-WGG--- 690

  Fly   785 AAKSTASTKSQAQFNARSVKSALTPKKRHLLEQPTSVSGA-----GSSASNSPLRIVVDNNSISG 844
               ...|.:.|.|...|          .|.|.|...::.|     |.:...:.|.......:..|
Mouse   691 ---MPMSFQMQTQMLTR----------LHQLRQGKGLTAASAGPPGGAFGEAFLPFPPPQEAAYG 742

  Fly   845 GKLLDISPSSLCSLKQQRRGGAAKQKVSAAKDLVQLQSPAGSYPPPGVFEPSVEL------EIQI 903
                  .|.:|.:..|:.||       |.:::...|..|..:.|.|.......|.      |.:|
Mouse   743 ------LPYALYTQGQEGRG-------SYSREAYHLPLPMAAEPLPSSSVSGEEARLPHREEAEI 794

  Fly   904 PLSKLNESVITKAEVESPLLSALDIKEDTKKEVGQRVVETLLH---------------------- 946
            ..||:..|..|...|.:.|:.  ::|...::::.:::||.:..                      
Mouse   795 AESKVLPSAGTVGRVLATLVQ--EMKSIMQRDLNRKMVENVAFGAFDQWWESKEEKAKPFQNAAK 857

  Fly   947 -------------------------KTGG--------------------NLLLKRK--------- 957
                                     |:||                    :..:|||         
Mouse   858 QQAKEEDKEKMKLKEPGMLSLVDWAKSGGITGIEAFAFGSGLRGALRLPSFKVKRKEPSEISEAS 922

  Fly   958 --------------------RKKINRTGFPTVRRKKR-----KVSVEQQTTAVID---------- 987
                                .|:....|.|..:..||     |...:.:.:..:|          
Mouse   923 EEKRPRPSTPAEEDEDDPEREKEAGEPGRPGTKPPKRDEERGKTQGKHRKSFTLDSEGEEASQES 987

  Fly   988 --EHEPEFDPDDEPLQSLRETRSSNNVNVQAAPNPPLDCERVPQA-------GEARETFVARTNQ 1043
              |.:.:.|.:||..:...|...:.....:|:.....|.:...|.       ||...|..:.:..
Mouse   988 SSEKDEDDDDEDEEDEEQEEAVDATKKEAEASDGEDEDSDSSSQCSLYADSDGENGSTSDSESGS 1052

  Fly  1044 KAPRLSVVAL------------ERLQRPQTPARGRPRGRKPKNREQAEAAPQPPPKSEPE----- 1091
            .:...|..:.            |..|....|:...|       ||..|..|.|..|.|.:     
Mouse  1053 SSSSSSSSSSSSSSSSSESSSEEEEQSAVIPSASPP-------REVPEPLPAPDEKPETDGLVDS 1110

  Fly  1092 -IRPAKKRGRQPKQPV--LEEPPPTPPPQQKKNKMEPNIRLPDG---IDPNTNFSCKIRL----- 1145
             :.|..::...|.||.  .|||||:.|    :...||....||.   :|...  |..|.|     
Mouse  1111 PVMPLSEKETLPTQPAGPAEEPPPSVP----QPPAEPPAGPPDAAPRLDERP--SSPIPLLPPPK 1169

  Fly  1146 KRRKNLE---------------AGTQPKKEKPVQ---PVTVEEIPPEIPVSQ--------EEI-- 1182
            ||||.:.               |..|.|...||.   |..||.....:|:..        ||:  
Mouse  1170 KRRKTVSFSAAEEAPVPEPSTAAPLQAKSSGPVSRKVPRVVERTIRNLPLDHASLVKSWPEEVAR 1234

  Fly  1183 -------------------------------------------------DAEA-----EAKR--- 1190
                                                             |:||     ||:|   
Mouse  1235 GGRNRAGGRVRSTEEEEATESGTEVDLAVLADLALTPARRGLATLPTGDDSEATETSDEAERPSP 1299

  Fly  1191 -LDSIPTEH------------------DPLPA--------------------SESHNPGPQ---- 1212
             |..|..||                  :|.||                    :|:..|..|    
Mouse  1300 LLSHILLEHNYALAIKPPPTTPAPRPLEPAPALAALFSSPADEVLEAPEVVVAEAEEPKQQLQQQ 1364

  Fly  1213 -----------DYASCSESSEDKASTTS-----LRKLSKVKKTYLVAGLFSNHYKQSLMPPPAKV 1261
                       |....|||||..:|::|     :|:.|....|        ...:..|.|||   
Mouse  1365 HPEQEGEEEEEDEEEESESSESSSSSSSDEEGAIRRRSLRSHT--------RRRRPPLPPPP--- 1418

  Fly  1262 NKKPGLEEQVGPASLLPPPPYCEKYLRRTEMD-FELPYDIWWAYTNSKLPTRNVVPSWNYRKIR- 1324
                            ||||..|.   |:|.: ..:.||||    ||.|...::    :|.::. 
Mouse  1419 ----------------PPPPSFEP---RSEFEQMTILYDIW----NSGLDLEDM----SYLRLTY 1456

  Fly  1325 -----------------------TNVYA--ESVRPNLAGFDHPTCNCKNQG-------EK-SCLD 1356
                                   ||:..  ...||.....:|.|.:.:::|       || ..||
Mouse  1457 ERLLQQTSGADWLNDTHWVQHTITNLSTPKRKRRPQDGPREHQTGSARSEGYYPISKKEKDKYLD 1521

  Fly  1357 NC---LNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTAD------------------- 1399
            .|   ..::...:...:|....|:...|:  |...|.|....|.:|                   
Mouse  1522 VCPVSARQLEGGDTQGTNRVLSERRSEQR--RLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRS 1584

  Fly  1400 --KGWGVRTKLPIAKGTYILEYVGEVV-------TEKEFKQR-MASIYLNDTHHYCLHLDGGLVI 1454
              ..||:....|||....::||||:.:       .||.:.|. :.|.||       ..:|...:|
Mouse  1585 RIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYL-------FRVDHDTII 1642

  Fly  1455 DGQRMGSDCRFVNHSCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPSEGQ--P 1517
            |..:.|:..||:||.|.|||..:..::....::|:::|:.|...||:||||.|    |.|..  |
Mouse  1643 DATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKF----PLEDNKIP 1703

  Fly  1518 CRCNTPQCRG 1527
            |.|.|..|||
Mouse  1704 CLCGTESCRG 1713
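Identity and gap counts for any block can be recounted directly from the two aligned rows, without relying on the match line between them. The sketch below assumes that `-` marks a gap and that both rows have been stripped of their species labels and coordinates. `column_stats` is a hypothetical helper name, not a DIOPT function.

```python
def column_stats(row_a, row_b, gap="-"):
    """Count identical columns, gapped columns, and total columns.

    row_a and row_b are the residue strings of one alignment block,
    already stripped of labels and start/end coordinates.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    # A column is identical when both rows carry the same residue (not a gap).
    identical = sum(1 for a, b in zip(row_a, row_b) if a == b and a != gap)
    # A column is gapped when either row carries the gap character.
    gapped = sum(1 for a, b in zip(row_a, row_b) if a == gap or b == gap)
    return identical, gapped, len(row_a)


if __name__ == "__main__":
    # Final block of the alignment above (Fly 1518..1527, Mouse 1704..1713).
    fly = "CRCNTPQCRG"
    mouse = "CLCGTESCRG"
    print(column_stats(fly, mouse))
```

For the final block this gives six identical columns out of ten with no gaps, which matches the six `|` characters in that block's match line.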

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence     Domain                                                                          Region       External ID  Identity
ash1    NP_524160.2  PHA03247                                                                        <1018..1281  CDD:223021   83/441 (19%)
                     AWS                                                                             1340..1388   CDD:197795   12/58 (21%)
                     SET_ASH1L                                                                       1391..1531   CDD:380951   49/168 (29%)
                     Bromo_ASH1                                                                      1680..1787   CDD:99955    -
                     PHD_ASH1L                                                                       1858..1900   CDD:277023   -
                     BAH_polybromo                                                                   1929..2073   CDD:240068   -
Setd1a  NP_821172.2  Interaction with WDR82. /evidence=ECO:0000250|UniProtKB:O15047                  60..89       -            -
                     RRM_Set1A                                                                       92..186      CDD:409964   9/69 (13%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               194..367     -            50/229 (22%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               380..499     -            26/158 (16%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               516..670     -            33/179 (18%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               849..869     -            0/19 (0%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               911..1206    -            60/307 (20%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               1230..1259   -            2/28 (7%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               1275..1297   -            5/21 (24%)
                     HCFC1-binding motif (HBM). /evidence=ECO:0000250|UniProtKB:O15047               1307..1311   -            2/3 (67%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               1355..1427   -            22/101 (22%)
                     Interaction with CFP1. /evidence=ECO:0000250|UniProtKB:O15047                   1424..1459   -            11/45 (24%)
                     N-SET                                                                           1434..1571   CDD:463344   28/146 (19%)
                     Interaction with ASH2L, RBBP5 and WDR5. /evidence=ECO:0000250|UniProtKB:O15047  1459..1546   -            14/86 (16%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                               1480..1508   -            6/27 (22%)
                     WDR5 interaction motif (WIN). /evidence=ECO:0000250|UniProtKB:O15047            1501..1506   -            0/4 (0%)
                     RxxxRR motif. /evidence=ECO:0000250|UniProtKB:P38827                            1546..1551   -            1/6 (17%)
                     SET_SETD1                                                                       1565..1712   CDD:380946   44/157 (28%)
Blue background indicates that the domain is not in the aligned region.
