DRSC/TRiP Functional Genomics Resources


Protein Alignment ash1 and Nsd2

DIOPT Version: 9

Sequence 1: NP_001246834.1   Gene: ash1 / 40133   FlyBaseID: FBgn0005386   Length: 2226   Species: Drosophila melanogaster
Sequence 2: NP_001074571.2   Gene: Nsd2 / 107823   MGIID: 1276574   Length: 1366   Species: Mus musculus


Alignment Length: 1520   Identity: 314/1520 (20%)
Similarity: 523/1520 (34%)   Gaps: 496/1520 (32%)
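
The summary percentages can be reproduced from the raw counts. The sketch below is illustrative only: the helper name `percent_floor` is hypothetical, and truncating (rather than rounding) is an assumption chosen because it matches the figures reported here (314/1520 is about 20.7%, shown as 20%).

```python
# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 1520
COUNTS = {"identity": 314, "similarity": 523, "gaps": 496}


def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated toward zero (integer arithmetic only)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return (100 * count) // total


# Recompute each percentage against the alignment length.
summary = {name: percent_floor(n, ALIGNMENT_LENGTH) for name, n in COUNTS.items()}
print(summary)
```

Truncation is used here only because it agrees with the reported values; how the tool itself derives them is not stated on the page.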


- Green bases have known domain annotations that are detailed below.


  Fly   234 GQTVNAKLAEE---KVAKAKEETLVQTAVPRKRRGRPKKVVPTVPAPG---------NSGPAINE 286
            |:|.|.::..|   .::||:....:|..|.:|..|.  ..:|.:||..         |..|..::
Mouse    33 GKTQNCEVNHECSVFLSKAQLSNSLQEGVMQKFNGH--DALPFLPAEKLKDLTSCVFNGEPGAHD 95

  Fly   287 SA---DSGVISTTSTTQSTTPSPKMQNENAVPTGSLPIAS---SSKP----------KIDMAYLD 335
            :.   ::..:....|..:|||.     :|..|...|.|..   :.||          ..|::..:
Mouse    96 TKLCFEAQEVKGIGTPPNTTPI-----KNGSPEIKLKITKTYMNGKPLFESSICGDGAADVSQSE 155

  Fly   336 KRMYATERVLYPPPRSKRRQNNKKTACSSSNKEELQLD---------PLWREIDVNKKFRLRSMS 391
            :....::        :|.|:|.|::....|..|:..::         |..::|.|.|:       
Mouse   156 ENEQKSD--------NKTRRNRKRSIKYDSLLEQGLVEAALVSKISSPADKKIPVKKE------- 205

  Fly   392 VGAASGTGASTTICSK------VLAAKSGY------------VSDYGSVRHQ-RSSHNHNSGYKS 437
              :...||....:..|      |.:..|||            :.::..::.| :|:..::..:..
Mouse   206 --SCPNTGRDRDLLLKYNVGDLVWSKVSGYPWWPCMVSADPLLHNHTKLKGQKKSARQYHVQFFG 268

  Fly   438 DASCKSRYSTKSCMSRRSRAKSCGYRSDCKESGK---------------SGLRMRRKRRASML-L 486
            ||..::....||.::.....:   :...|:||.|               || |:|.:....:: .
Mouse   269 DAPERAWIFEKSLVAFEGEEQ---FEKLCQESAKQAPTKAEKIKLLKPISG-RLRAQWEMGIVQA 329

  Fly   487 KSSADDTVEDQDILQLAGLSLG-------QSSEESNEYISKPSLKSLPTTSASKKYGEINRYVTT 544
            :.:|..::|::. .:...|.:|       |.::|:. .:::|..:.:.::.||::          
Mouse   330 EEAASMSIEERK-AKFTFLYVGDQLHLNPQVAKEAG-IVTEPLGEMVDSSGASEE---------- 382

  Fly   545 GQYFGRGGSLSATNPDNFISKMMNQRKETPAPSKSSCKIKSRRSSAASMCSSYVSGVSRMRRRHR 609
                      :|.:|.:.        :|...|:|                         .|||.:
Mouse   383 ----------AAVDPGSV--------REEDIPTK-------------------------RRRRTK 404

  Fly   610 RKSFSHNKSLNIDSKLLTEIEIITSTFNSRCRIQDDRLTGSSGKEK----LLADANKLQATLAAP 670
            |.|.:.|:.                              |..|.:|    .:|:|   :......
Mouse   405 RSSSAENQE------------------------------GDPGTDKSTPPKMAEA---EPKRGVG 436

  Fly   671 SPAQQLTLNGGGPAS--------------------TLSKPLKRG-------------LKKRKLSE 702
            |||.:....|..|.|                    ....|...|             |.:::.:.
Mouse   437 SPAGRKKSTGSAPRSRKGDSAAQFLVFCQKHRDEVVAEHPDASGEEIEELLGSQWSMLNEKQKAR 501

  Fly   703 PLVDFAMLSASASGTPNGSGSSNGNTKRRHKKSQSNDSSSPDDHKLPLKK----RHYLLTPGERP 763
            ....|:::.::.|...:|:|  ||. ||.|.|...:.:...|....|.|:    :|.|....|..
Mouse   502 YNTKFSLMISAQSEEDSGNG--NGK-KRSHTKRADDPAEDVDVEDAPRKRLRADKHSLRKQRETI 563

  Fly   764 PAEVAFANGKLNAEAWAAAAAAAKSTASTKSQAQFNARSVKSALTPKKRHLLEQPTSVSGAGSSA 828
            ..:.|..:.....|    ||::.||.|:||        ::..|..|.|:......|:.|..|.:.
Mouse   564 TDKTARTSSYKAIE----AASSLKSQAATK--------NLSDACKPLKKRNRASATASSALGFNK 616

  Fly   829 SNSPLRIVVDNN-SISGGKLLDISP---------SSLCSLKQQRRGGAAKQKVSAAKDLVQLQSP 883
            |:||...:.::. |.|.|.....||         .:..|.|:..||.|||::.     :.||...
Mouse   617 SSSPSASLTEHEVSDSPGDEPSESPYESADETQTEASVSSKKSERGMAAKKEY-----VCQLCEK 676

  Fly   884 AGSY-----PPPGVFEPSVELEIQIPLSKLNESVITKAEVESPLLSALDIKEDTKKEVGQRVVET 943
            .||.     |..|.|..:.     :.||:..|...|..|..|.:.|....|| :|.||       
Mouse   677 TGSLLLCEGPCCGAFHLAC-----LGLSRRPEGRFTCTECASGIHSCFVCKE-SKMEV------- 728

  Fly   944 LLHKTGGNLLLKRKRKKINRTGFPTVRRKKRKVSVEQQTTAVIDEHEPEFDPDDEPLQSLRETRS 1008
                         ||..:|:.|     :...:..|::....|.:......     ||.|.....:
Mouse   729 -------------KRCVVNQCG-----KFYHEACVKKYPLTVFESRGFRC-----PLHSCMSCHA 770

  Fly  1009 SNNVNVQAAPNPPLDCERVPQAGEARETFVARTNQKAPRLSVVALERLQRPQTPARGRPRGRKPK 1073
            ||..|.:.:....:.|.|.|.|....:..:      |...||:|...:     ...|....||.|
Mouse   771 SNPSNPRPSKGKMMRCVRCPVAYHGGDACL------AAGCSVIASNSI-----ICTGHFTARKGK 824

  Fly  1074 NREQ------------------AEAAPQPPPKSEPEIRPAKKRGRQPKQPVLEEPPPTPPPQQKK 1120
            ....                  .||.   |....|:..                           
Mouse   825 RHHTHVNVSWCFVCSKGGSLLCCEAC---PAAFHPDCL--------------------------- 859

  Fly  1121 NKMEPNIRLPDGIDPNTNFSCKIRLKRRKNLEAGTQPKKEK------------PVQPVTVEEIPP 1173
                 ||.:|||     ::.|       .:..||.:...:.            |.:....:.:||
Mouse   860 -----NIEMPDG-----SWFC-------NDCRAGKKLHFQDIIWVKLGNYRWWPAEVCHPKNVPP 907

  Fly  1174 EIPVSQEEIDAEAEAKRLDSIPTEHDPLPASESHNP----GPQDYASCSES------SEDKAST- 1227
            .|...:.||                       ...|    |.:||....::      ..|:.|. 
Mouse   908 NIQKMKHEI-----------------------GEFPVFFFGSKDYYWTHQARVFPYMEGDRGSRY 949

  Fly  1228 TSLRKLSKVKKTYLVAGLFSNHYKQSLMPPPAKVNK-KPGLEEQVGPASLLPPPPYCEKYLRRTE 1291
            ..:|.:.:|             :|.:|....|:.|: |...|.:....|...||||  |:::..:
Mouse   950 QGVRGIGRV-------------FKNALQEAEARFNEVKLQREARETQESERKPPPY--KHIKVNK 999

  Fly  1292 MDFELPYDIWWAYTNSKLPTRNVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKNQGEKSCLD 1356
                 ||.                        :..:|...:.      :.|.||||...|..|..
Mouse  1000 -----PYG------------------------KVQIYTADIS------EIPKCNCKPTDENPCGS 1029

  Fly  1357 N--CLNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLPIAKGTYILEY 1419
            :  |||||:..||.|..|||||.|:||...:... |..:...|..||||:..|..|.||.::.||
Mouse  1030 DSECLNRMLMFECHPQVCPAGEYCQNQCFTKRQY-PETKIIKTDGKGWGLVAKRDIRKGEFVNEY 1093

  Fly  1420 VGEVVTEKEFKQRMASIYLND-THHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVNG 1483
            |||::.|:|...|:...:.|| ||.|.|.:|...:||....|:..||:||||:||||..||:|||
Mouse  1094 VGELIDEEECMARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNG 1158

  Fly  1484 LSRMVLFAKRAIEEGEELTYDYNFSLFNPSEGQPCRCNTPQCRGVIGGKSQRVKPLPAVEAKPSG 1548
            .:|:.|||...|..|.|||::||..... :|...|||....|.|.:|.:       |...|..|.
Mouse  1159 DTRVGLFAVCDIPAGTELTFNYNLDCLG-NEKTVCRCGASNCSGFLGDR-------PKTSASLSS 1215

  Fly  1549 EGLSGRNGRQRKQKAKKHAQRQAGK 1573
            |    ..|::.|:|.::...:..||
Mouse  1216 E----EKGKKAKKKTRRRRAKGEGK 1236
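
Each alignment block has three rows: the fly sequence, a match line, and the mouse sequence. The page does not define the match-line symbols, so the sketch below assumes the common convention in which `|` marks identical residues and `:` and `.` mark stronger and weaker similarity. It tallies the final block shown above; the function name `summarize_block` is hypothetical.

```python
from collections import Counter

# Final alignment block, copied from above (Fly 1549-1573, Mouse 1216-1236).
FLY = "EGLSGRNGRQRKQKAKKHAQRQAGK"
MATCH = "|    ..|::.|:|.::...:..||"
MOUSE = "E----EKGKKAKKKTRRRRAKGEGK"


def summarize_block(top: str, match: str, bottom: str) -> dict:
    """Count identities, match-line symbols, and gap columns in one block."""
    if not (len(top) == len(match) == len(bottom)):
        raise ValueError("alignment rows must have equal length")
    symbols = Counter(match)
    pairs = list(zip(top, bottom))
    return {
        "columns": len(top),
        # Identities recomputed directly from the residues, ignoring gaps.
        "identical": sum(a == b and a != "-" for a, b in pairs),
        "pipe": symbols["|"],
        "colon": symbols[":"],
        "dot": symbols["."],
        "gap_columns": sum(a == "-" or b == "-" for a, b in pairs),
    }


block = summarize_block(FLY, MATCH, MOUSE)
print(block)
```

In this block the number of `|` symbols equals the number of columns where the two residues are the same, which is consistent with the assumed meaning of `|`.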

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence        Domain                                                                   Region       External ID Identity
ash1  NP_001246834.1  AWS                                                                      1340..1388   CDD:197795  23/49 (47%)
                      SET                                                                      1392..1512   CDD:214614  53/120 (44%)
                      Bromo_ASH1                                                               1680..1787   CDD:99955   -
                      PHD_ASH1L                                                                1858..1900   CDD:277023  -
                      BAH_polybromo                                                            1929..2073   CDD:240068  -
Nsd2  NP_001074571.2  Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                        149..169     -           3/27 (11%)
                      MSH6_like                                                                218..337     CDD:99898   20/122 (16%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                        373..455     -           24/167 (14%)
                      HMG-box                                                                  460..>505    CDD:238037  3/44 (7%)
                      TNG2                                                                     <546..711    CDD:227367  45/186 (24%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                        595..659     -           15/63 (24%)
                      PHD1_NSD1_2                                                              670..712     CDD:277118  12/46 (26%)
                      PHD2_NSD2                                                                717..763     CDD:277121  13/76 (17%)
                      PHD3_NSD2                                                                764..817     CDD:277124  12/63 (19%)
                      PHD_SF                                                                   834..874     CDD:389947  10/86 (12%)
                      WHSC1_related                                                            879..972     CDD:99899   17/128 (13%)
                      AWS                                                                      1013..1063   CDD:197795  23/49 (47%)
                      SET_NSD2                                                                 1063..1204   CDD:380988  60/142 (42%)
                      S-adenosyl-L-methionine binding. /evidence=ECO:0000250|UniProtKB:O96028  1116..1119   -           2/2 (100%)
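
The three ash1 domains listed without an identity value (Bromo_ASH1, PHD_ASH1L, BAH_polybromo) all start after residue 1573, the last fly residue shown in the alignment. The sketch below checks how much of each ash1 domain falls inside the aligned fly span (234 to 1573, read from the first and last alignment blocks). The helper `overlap_length` is hypothetical, and whether the tool computes coverage this way is an assumption.

```python
# Fly residue range covered by the alignment blocks above (first and last Fly coordinates).
FLY_ALIGNED = (234, 1573)

# ash1 domain regions copied from the domain table (1-based, inclusive).
ASH1_DOMAINS = {
    "AWS": (1340, 1388),
    "SET": (1392, 1512),
    "Bromo_ASH1": (1680, 1787),
    "PHD_ASH1L": (1858, 1900),
    "BAH_polybromo": (1929, 2073),
}


def overlap_length(region: tuple, span: tuple) -> int:
    """Number of residues shared by two inclusive 1-based ranges (0 if disjoint)."""
    start = max(region[0], span[0])
    end = min(region[1], span[1])
    return max(0, end - start + 1)


covered = {name: overlap_length(r, FLY_ALIGNED) for name, r in ASH1_DOMAINS.items()}
for name, n in covered.items():
    print(f"{name}: {n} residues inside the aligned span")
```

Domains with zero overlap have no aligned positions to compare, so a per-domain identity fraction cannot be formed for them.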