DRSC/TRiP Functional Genomics Resources


Protein Alignment ash1 and Nsd1

DIOPT Version: 9

Sequence 1:NP_001246834.1 Gene:ash1 / 40133 FlyBaseID:FBgn0005386 Length:2226 Species:Drosophila melanogaster
Sequence 2:NP_032765.3 Gene:Nsd1 / 18193 MGIID:1276545 Length:2691 Species:Mus musculus


Alignment Length: 2459   Identity: 484/2459 (19%)
Similarity: 792/2459 (32%)   Gaps: 886/2459 (36%)
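
The percentages above can be recomputed from the raw counts. The sketch below assumes the page truncates to a whole percent rather than rounding, since 484/2459 is about 19.68% yet is shown as 19%. The function name `percent_floor` is hypothetical and not part of DIOPT.

```python
def percent_floor(count: int, length: int) -> int:
    """Whole-number percentage, truncated (assumed to match the page's display)."""
    return (100 * count) // length


alignment_length = 2459
counts = {"identity": 484, "similarity": 792, "gaps": 886}

# Recompute each summary percentage from its raw count.
percents = {name: percent_floor(n, alignment_length) for name, n in counts.items()}
print(percents)  # {'identity': 19, 'similarity': 32, 'gaps': 36}
```

With conventional rounding the identity figure would come out as 20, so truncation is the only reading consistent with all three displayed values.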


- Green bases have known domain annotations that are detailed below.


  Fly    78 PKSRKMSTQDTESGCSEAKNRAVSKKVKVK-RKKLASSSGISKSDKVSKSKKSQISAFSS---DS 138
            |..||...|         |.:....||..| ..|..:|.|:::...|.|..|:|....||   ||
Mouse   397 PVLRKRGKQ---------KEKGYRHKVPQKILSKWEASVGLAEQYDVPKGSKNQKCVSSSVKLDS 452

  Fly   139 EDDLPLK--VHQQRAPRVLLSAIIQAAQSASKPTLDIGISSSDNELPNLVQAAIKRVESDTEDTT 201
            |:|:|.:  .:...:..:||:..:::.      ..|...|:.:.|.|    .|..||...::   
Mouse   453 EEDMPFEDCTNDPDSEHLLLNGCLKSL------AFDSEHSADEKEKP----CAKSRVRKSSD--- 504

  Fly   202 VEGSFRKAAKDKNLPQYQSTLLQDFMEKTQMLGQTVNAKLAEEKVAKAKEETLVQTAVPRKRRGR 266
               :.::.:..|:|..::|                            .|||          |||:
Mouse   505 ---NIKRTSVKKDLVPFES----------------------------RKEE----------RRGK 528

  Fly   267 PKKVVPTVPAPGN------SGPAINESADSGV--ISTTSTTQSTTPSPKMQNENAVPTGSLPIAS 323
                     .|.|      ||...::.|.:.:  |:.:.|..||.|            ||...:|
Mouse   529 ---------IPDNLGLDFISGGVSDKQASNELSRIANSLTGSSTAP------------GSFLFSS 572

  Fly   324 S----SKPKIDMAYLDKRMYATERVLYPPPRSKRRQNNKKTACSSSNKEELQLDPLWREIDVNKK 384
            |    :|...:....|.....:|..|......::::......|||    ::||..:...   :::
Mouse   573 SVQNTAKTDFETPDCDSLSGLSESALISKHSGEKKKLQPGQVCSS----KVQLCYVGAG---DEE 630

  Fly   385 FRLRSMSVGAASGTGASTTICSKVLAAKSGYVSDYGSVRHQRSSHNHNSGYK------SDASCKS 443
            .|..|:||...|..|     ||.:                  ....||||::      :||..| 
Mouse   631 KRSNSVSVSTTSDDG-----CSDL------------------DPTEHNSGFQNSVLGITDAFDK- 671

  Fly   444 RYSTKSCMSRRSRAKSCGYRSDCKESGKSGLRMRRKRRASMLLKSSADDTVEDQDILQLAGLSLG 508
               |::.:|        .::::.:.|........::::.|::..|.||..:.....::.....|.
Mouse   672 ---TENALS--------VHKNETQYSRYPVTNRIKEKQKSLITNSHADHLMGSTKTMEPETAELS 725

  Fly   509 QSSEESNEYISKPSLKSLP-------TTSASKKYGEINRYVTTGQYFGRGGSLSAT------NPD 560
            |.: .|:..||.|..|..|       ||..|...|..|....|     :||..:.|      ...
Mouse   726 QVN-LSDLKISSPIPKPQPEFRNDGLTTKFSAPPGIRNENPLT-----KGGLANQTLLPLKCRQP 784

  Fly   561 NFISKMMNQRKETPAPSKSS----------CKIKSRRSSAASMC-SSYVSGVSRMRRRHRRKSFS 614
            .|.| :..:.||:||.:::|          |...:..|..|::. |....|:..:...|.:...|
Mouse   785 KFRS-IKCKHKESPAVAETSATSEDLSLKCCSSDTNGSPLANISKSGKGEGLKLLNNMHEKTRDS 848

  Fly   615 HNKSLNIDSKLLTEIEIITSTFNSRCRIQDDRLTGSSGKEKLLADANKLQATLAAPSPAQQLTLN 679
            .:....:...:|:|::.:  ::.|......|..|..:.|..|.:.|          |....:.:.
Mouse   849 SDIETAVVKHVLSELKEL--SYRSLSEDVSDSGTAKASKPLLFSSA----------SSQNHIPIE 901

  Fly   680 GGGPASTLSKPLKRGLKKRKLSEPLVDFAMLSAS---------ASGTPNGS------GSSNGNTK 729
            .....|||...|| .:...|..|..:..|...||         :||:|.|:      |||..|::
Mouse   902 PDYKFSTLLMMLK-DMHDSKTKEQRLMTAQNLASYRTPDRGDCSSGSPVGTSKVLVLGSSTPNSE 965

  Fly   730 RRHKKSQSNDSSSPDDHKLPLKKRHYLLTPGERPPAEVAFANGKLNAEAWAAAAA-------AAK 787
            :....:|.:...||......|.        ||...:..:.|:.|....|.....:       ..:
Mouse   966 KPGDSTQDSVHQSPGGGDSALS--------GELSSSLSSLASDKRELPACGKIRSNCIPRRNCGR 1022

  Fly   788 STASTKSQAQFNARSVKSALTPK------KRHLLEQPT--------------SVSGAGSSASNSP 832
            :..|:|.:...:|:.||.::.||      ||.....|.              ||:|.....:..|
Mouse  1023 AKPSSKLRETISAQMVKPSVNPKALKTERKRKFSRLPAVTLAANRLGNKESGSVNGPSRGGAEDP 1087

  Fly   833 -----------LR-----------------------------------------IVVDNNSISGG 845
                       ||                                         :..:|:.:.| 
Mouse  1088 GKEEPLQQMDLLRNEDTHFSDVHFDSKAKQSDPDKNLEKEPSFENRKGPELGSEMNTENDELHG- 1151

  Fly   846 KLLDISPSSLCSLKQQRR----------------GGA--------AKQKVSAAKDLVQLQSPAGS 886
             :..:.|........|||                .||        |.||  |.:|.::.::|..|
Mouse  1152 -VNQVVPKKRWQRLNQRRPKPGKRANRFREKENSEGAFGVLLPADAVQK--AREDYLEQRAPPTS 1213

  Fly   887 YPPPGVFEPSVELEIQIPLSKLNESVITKAEVESPLLSALDIKEDTKKEVGQRVVETLLHKTG-G 950
            .|.....:|:.....:....:||  |..|:.|.         ..|.:||.|   :.:|:.:|. .
Mouse  1214 KPEDSAADPNHGSHSESVAPRLN--VCEKSSVG---------MGDVEKETG---IPSLMPQTKLP 1264

  Fly   951 NLLLKRKRKKINRTGFPT-------------VRRKKRKVSVEQQTTAVIDEHEPEFDPDDEPL-- 1000
            ...::.::|::.:   |:             ...||::..|::|.      |:.....:||.|  
Mouse  1265 EPAIRSEKKRLRK---PSKWLLEYTEEYDQIFAPKKKQKKVQEQV------HKVSSRCEDESLLA 1320

  Fly  1001 ---QSLRETRSSNNVNVQAAPNPP-LDCERVPQAGEARETFVARTNQKAPRLSVVALERLQRPQT 1061
               .|.:..:...|..:.....|| |:.|.....|...::.:..|:.:.|:|:      |..|..
Mouse  1321 RCQPSAQNKQVDENSLISTKEEPPVLEREAPFLEGPLAQSDLGVTHAELPQLT------LSVPVA 1379

  Fly  1062 PARGRPRGRKPKNREQAEAAPQPPPKSEPEI--RPA---KKRGRQPKQPVLEEPPPTPPPQQKK- 1120
            |                ||:|:|..:||..:  .|.   .||.|:|.:.:||.....|....|| 
Mouse  1380 P----------------EASPRPALESEELLVKTPGNYESKRQRKPTKKLLESNDLDPGFMPKKG 1428

  Fly  1121 -----NKMEPNIRLPDGIDPNTNFSCKIRLKRRKNLEAGT-----QPKKEKPVQPVTV------- 1168
                 .|.....|..:||..:...|      ..|....||     :|:|.|..:.||.       
Mouse  1429 DLGLSRKCFEASRSGNGIVESRATS------HLKEFSGGTTKIFDKPRKRKRQRLVTARVHYKKV 1487

  Fly  1169 --EEIPPEIPVSQEEI---DAEAEAKRLDSIPTEHDP---------------------------- 1200
              |::..:.|.|:.|:   ...|..|.:.....||||                            
Mouse  1488 KKEDLTKDTPSSEGELLIHRTAASPKEILEEGVEHDPGMSASKKLQVERGGGAALKENVCQNCEK 1552

  Fly  1201 ---------------------LP--------ASESHNPGPQDYASCSESSED------------- 1223
                                 ||        .:|.|. |......|.:|.||             
Mouse  1553 LGELLLCEAQCCGAFHLECLGLPEMPRGKFICNECHT-GIHTCFVCKQSGEDVKRCLLPLCGKFY 1616

  Fly  1224 ------------------------------------KASTTSLRKLSKVKKTY------LVAG-- 1244
                                                .||...|.:..:....|      |.||  
Mouse  1617 HEECVQKYPPTVTQNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYHANDFCLAAGSK 1681

  Fly  1245 -------LFSNHYKQ----------------------SLM----PPPA----------------- 1259
                   :..||:..                      ||:    .|.|                 
Mouse  1682 ILASNSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEGNWYC 1746

  Fly  1260 ---KVNKKPGLEE----QVG-----PASLLPP---PPYCEKYLRRTEMDFELPY----DIWWAY- 1304
               |..|||...|    :||     ||.:..|   |...:| :|....:|.:.:    |..|.: 
Mouse  1747 NDCKAGKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDK-MRHDVGEFPVLFFGSNDYLWTHQ 1810

  Fly  1305 ----------------------------------------------------TNSKLPTRNVVPS 1317
                                                                .|.|.|     |.
Mouse  1811 ARVFPYMEGDVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQEDRKNDKKP-----PP 1870

  Fly  1318 WNYRKI-----RTNVYAESVRPNLAGFDHPTCNCKNQGEKSC-LDN-CLNRMVYTECSPSNCPAG 1375
            :.:.|:     |..::...:.      :.|.||||...|..| :|: |:|||:..||.|:.||||
Mouse  1871 YKHIKVNRPIGRVQIFTADLS------EIPRCNCKATDENPCGIDSECINRMLLYECHPTVCPAG 1929

  Fly  1376 EKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLPIAKGTYILEYVGEVVTEKEFKQRMASIYLND 1440
            .:|:||...:... |.||.|.|..:|||:|||..|.||.::.|||||::.|:|.:.|:.....:|
Mouse  1930 VRCQNQCFSKRQY-PDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHD 1993

  Fly  1441 -THHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYD 1504
             |:.|.|.||...:||....|:..||:||.|:||||.|||||||.:|:.|||...|:.|.|||::
Mouse  1994 ITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFN 2058

  Fly  1505 YNFSLFNPSEGQPCRCNTPQCRGVIGGKSQRVKPLPAVEAKPSGEGLSGRNGRQRKQKAKKHAQR 1569
            ||.......: ..|:|..|.|.|.:|   .|.|..|.|..:.|           ||.|.|.|.:|
Mouse  2059 YNLECLGNGK-TVCKCGAPNCSGFLG---VRPKNQPIVTEEKS-----------RKFKRKPHGKR 2108

  Fly  1570 QAGKDISSAVAVAKLQPLSEKEKKLVRQFNTFLVRNFEKIRRCKAKRA-----SDAAATASSPAL 1629
            ::..:::             ||    |:...|...:..::..||....     :|.......|| 
Mouse  2109 RSQGEVT-------------KE----REDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPA- 2155

  Fly  1630 GTTNG---------DIPGRRPSTPSSPSLAAQISALCSPRNIKTR------------GLTQAVHD 1673
                |         |:.|:.         ||....:|.....|..            .|:...||
Mouse  2156 ----GKWECPWHQCDVCGKE---------AASFCEMCPSSFCKQHREGMLFISKLDGRLSCTEHD 2207

  Fly  1674 PELEKMAKMAVVLRDIC--SAMETLKMSDLLTTVSSKKKKPIKTTLSGKLGSTAATSKVEFRSIQ 1736
            |               |  :.:|..::.:.:...::....|  .|...:..|..||         
Mouse  2208 P---------------CGPNPLEPGEIREYVPPTATSPPSP--GTQPKEQSSEMAT--------- 2246

  Fly  1737 AQVEQGHYKTPQEFDDHMQQLFVEAKQQHGDDEGKEKALQSLKDSYEQQKIASYVQLVEILGD-- 1799
                ||..|:.|...|..|.|.:..|...|..:      :.|......::..|...|::.:.|  
Mouse  2247 ----QGPKKSDQPPTDATQLLPLSKKALTGSCQ------RPLLPERPPERTDSSSHLLDRIRDLA 2301

  Fly  1800 -----SESL-QSFKPKEVLSSEEEP-GKIAVKKSPGAKERDSPIV-------PLKVT-------- 1842
                 |:|| .|.:|::...::|.| .:...:.||..:...||.|       ||::|        
Mouse  2302 GSGTKSQSLVSSQRPQDRPPAKEGPRPQPPDRASPMTRPSSSPSVSSLPLERPLRMTDSRLDKSI 2366

  Fly  1843 ---PPPLLPIEASPDEDVIRCICGLYKDEGLMIQCSKCMV------WQHTECTKADIDADNYQCE 1898
               .|....:|.:|....:|...   .|..|.....|..:      ..|...|           :
Mouse  2367 GAASPKSQAVEKTPASTGLRLSS---PDRLLTTNSPKPQISDRPPEKSHASLT-----------Q 2417

  Fly  1899 RCEPRE----------VDRE---IPLEEFTEEGHRYYLSLMRGDLQVRQGDAVYVLRDI-PIKDE 1949
            |..|.|          |.:|   .|:::.|:..||..:.:...||..||.:.....::: |..||
Mouse  2418 RLPPPEKVLSAVVQSLVAKEKALRPVDQNTQSKHRPAVVMDLIDLTPRQKERAASPQEVTPQADE 2482

  Fly  1950 ----------------------------------SGKVLPTKKHTYETIGAIDY 1969
                                              ||||....:|.::.:.::.:
Mouse  2483 KTAMLESSSWPSSKGLGHIPRATEKISVSESLQPSGKVAAPSEHPWQAVKSLTH 2536
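
Each alignment block above is a triple of lines: the fly sequence, a match line, and the mouse sequence. The sketch below shows how identity, similarity, and gap counts could be tallied from one such triple. It assumes the match line follows the common EMBOSS-style convention (`|` identical, `:` similar, `.` mismatch, space opposite a gap) and that similarity includes identities. The function name and the ten-column toy alignment are hypothetical, not taken from the page.

```python
def alignment_stats(seq1: str, match: str, seq2: str) -> dict:
    """Tally per-column statistics for one aligned block (assumed markup)."""
    if not (len(seq1) == len(match) == len(seq2)):
        raise ValueError("all three lines must have the same length")
    identity = match.count("|")
    # Assumption: similar columns are those marked ':' plus the identical ones.
    similarity = identity + match.count(":")
    # A gap column is any column where either sequence has '-'.
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")
    return {
        "length": len(match),
        "identity": identity,
        "similarity": similarity,
        "gaps": gaps,
    }


# Hypothetical toy block, not a slice of the ash1/Nsd1 alignment.
toy = alignment_stats(
    "PKSRKMST-Q",
    "|.:||..| |",
    "PVTRKRGTAQ",
)
print(toy)  # {'length': 10, 'identity': 5, 'similarity': 6, 'gaps': 1}
```

Summing these tallies over every block of a full alignment would give totals comparable to the Identity, Similarity, and Gaps figures in the header, provided the markup assumption holds.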

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence        Domain         Region       External ID  Identity
ash1  NP_001246834.1  AWS            1340..1388   CDD:197795   22/49 (45%)
                      SET            1392..1512   CDD:214614   57/120 (48%)
                      Bromo_ASH1     1680..1787   CDD:99955    17/108 (16%)
                      PHD_ASH1L      1858..1900   CDD:277023   6/47 (13%)
                      BAH_polybromo  1929..2073   CDD:240068   12/76 (16%)
Nsd1  NP_032765.3     MSH6_like      320..430     CDD:99898    11/41 (27%)
                      TNG2           <1453..1587  CDD:227367   21/139 (15%)
                      PHD1_NSD1_2    1546..1587   CDD:277118   2/40 (5%)
                      PHD2_NSD1      1593..1639   CDD:277120   4/45 (9%)
                      PHD3_NSD1      1640..1693   CDD:277123   7/52 (13%)
                      PHD4_NSD1      1710..1749   CDD:277126   4/38 (11%)
                      WHSC1_related  1755..1849   CDD:99899    13/94 (14%)
                      AWS            1902..1940   CDD:375420   17/37 (46%)
                      SET_NSD1       1942..2083   CDD:380987   63/142 (44%)
                      PHD5_NSD1      2121..2163   CDD:277129   7/46 (15%)
                      C5HCH          2162..2211   CDD:375464   11/72 (15%)
                      PHA03307       2255..>2576  CDD:223039   54/302 (18%)
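
A domain row carries a domain name, a residue range, an external ID, and an identity fraction. The sketch below parses one row with a regular expression; the pattern and the function name `parse_domain_row` are hypothetical and only cover the row shapes seen on this page. A leading `<` or trailing `>` on a range boundary is recorded as `partial`, on the assumption that it marks a domain extending past the annotated region.

```python
import re

# Matches "<domain> <start>..<end> <external id> <identical>/<total> (<pct>%)"
ROW = re.compile(
    r"(?P<domain>\S+)\s+(?P<start><?\d+)\.\.(?P<end>>?\d+)\s+"
    r"(?P<ext_id>\S+)\s+(?P<identical>\d+)/(?P<total>\d+)\s+\((?P<pct>\d+)%\)$"
)


def parse_domain_row(line: str):
    """Return the row's fields as a dict, or None if the line does not match."""
    m = ROW.search(line.strip())
    if m is None:
        return None
    start, end = m["start"], m["end"]
    return {
        "domain": m["domain"],
        "start": int(start.lstrip("<")),
        "end": int(end.lstrip(">")),
        "partial": start.startswith("<") or end.startswith(">"),
        "external_id": m["ext_id"],
        "identical": int(m["identical"]),
        "total": int(m["total"]),
        "percent": int(m["pct"]),
    }


row = parse_domain_row("SET 1392..1512 CDD:214614 57/120 (48%)")
print(row["domain"], row["start"], row["end"], row["percent"])  # SET 1392 1512 48
```

Because the pattern is anchored only at the end of the line, a row that still has the gene and accession fused in front of the domain name parses the same way.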


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              BLAST Result  Score    Score Type                     Cluster ID
Compara         1             0.930           -             -                                       C167848350
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         1             1.010           -             -                                       D507784at2759
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           -             -
RoundUp         1             1.030           -             avgDist  Average_Evolutionary_Distance  R4489
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
Total           4             3.880
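
The final figure in the table reads as two totals run together: a simple score of 4 followed by a weighted score of 3.880. The sketch below recomputes both from the four tools that report a match, as a sanity check. The dictionary name `weighted_scores` is hypothetical; the values are copied from the rows above.

```python
# Weighted scores of the tools that matched this pair (simple score 1 each).
weighted_scores = {
    "Compara": 0.930,
    "OrthoDB": 1.010,
    "Phylome": 0.910,
    "RoundUp": 1.030,
}

simple_total = len(weighted_scores)  # one point per matching tool
# Round to three decimals to avoid floating-point noise in the sum.
weighted_total = round(sum(weighted_scores.values()), 3)

print(simple_total, f"{weighted_total:.3f}")  # 4 3.880
```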
