DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment ash1 and Nsd1

DIOPT Version :9

Sequence 1:NP_001246834.1 Gene:ash1 / 40133 FlyBaseID:FBgn0005386 Length:2226 Species:Drosophila melanogaster
Sequence 2:XP_006253682.1 Gene:Nsd1 / 306764 RGDID:1307748 Length:2689 Species:Rattus norvegicus


Alignment Length:2364 Identity:500/2364 - (21%)
Similarity:768/2364 - (32%) Gaps:749/2364 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly    78 PKSRKMSTQDTESGCSEAKNRAVSKKVKVK-RKKLASSSGISKSDKVSKSKKSQISAFSS---DS 138
            |..||...|         |.:....||..| ..|..:|.|:::...|.:..|:|....||   ||
  Rat   396 PVLRKRGKQ---------KEKGYRHKVPQKILSKWEASVGLAEQCDVPRGPKTQKCVPSSAKLDS 451

  Fly   139 EDDLPLK--------VHQQRAPRVLLSAIIQAAQSA---SKPTLDIGI-SSSDNELPNLVQAAIK 191
            |:|:|.:        .|.......|.|....:..||   .||.....: .||||.....|:..:.
  Rat   452 EEDMPFEDCANDPDSEHDLLLNGCLKSLAFDSEHSADEKEKPCAKSRVRKSSDNIKRTSVKKGLM 516

  Fly   192 RVESDTEDTTVEGSFRKAAKDKNLPQYQSTLLQDFM-------EKTQMLGQTVNAKLAE------ 243
            ..|:..|:       |:....:||.       .||:       :.:..|.:..|:....      
  Rat   517 PFEAQKEE-------RRGKSPENLG-------LDFLSGGVSDKQASNELSRIANSLTGSSAAPGQ 567

  Fly   244 -------EKVAKAKEET--------LVQTAVPRKRRGRPKKVVP----------TVPAPGNSGPA 283
                   :..||...||        |.::|:..|..|..||:.|          .....|:.   
  Rat   568 FLFSSCGQNTAKTDFETPNCDSLSGLSESALISKHSGEKKKLQPGQVCSSKVQLCYVGAGDE--- 629

  Fly   284 INESADSGVISTTSTTQSTTPSPKMQN-----------------ENAVPTGSLPIASSSKPKIDM 331
             .:.:||..:.|||...|:...|...|                 |||:          |..|.:.
  Rat   630 -EKRSDSVSVCTTSDDGSSDLDPTEHNSEFHKSVLEVTDALDKTENAL----------SMHKNET 683

  Fly   332 AYLDKRMYATERV------LYPPPRSKRRQNNKKTA------CSSSNKEELQL-------DPLWR 377
            .|  .|..||.||      |.....:....::.||.      .|..|..:|::       .|.:|
  Rat   684 KY--SRYPATNRVKEKQKSLITNSHTDHLMDSTKTVEPGTAEISQVNLSDLKISSPIPKPQPEFR 746

  Fly   378 EIDVNKKFR----LRSMSVGAASGTGASTTICSKVLAAKSGYVSDYGSVRHQRSSHNHNSGYKSD 438
            ...:..||.    :|:.:.....|....|.:..|....|         .|..:..|..:......
  Rat   747 NDGLTTKFNAPPGIRNENSLTKGGLANQTLLPLKCRQPK---------FRSIKCKHKESPTAAET 802

  Fly   439 ASCKSRYSTKSCMSRRSRAKSCGYRSDCKESGK-SGLR----MRRKRRASMLLKSSADDTVEDQD 498
            ::.....|.|.|.|..:.:.    .:...:||| .||:    |..|.|.|    |..:..|....
  Rat   803 SATSEDLSLKCCSSDTNGSP----MTSISKSGKGEGLKLLNNMHEKTRDS----SDIETAVVKHV 859

  Fly   499 ILQLAGLSLGQSSEESNEYISKPSLKSLPTTSASKKYGEINRYVTTGQYFGRGGSLSATNPDNFI 563
            :.:|..||....||:.::..:..|.|.|..:|||.:    |.......|  :..:|.....|...
  Rat   860 LSELKELSYRSLSEDVSDSGTSKSSKPLLFSSASSQ----NHIPIEPDY--KFSTLLMMLKDMHD 918

  Fly   564 SKMMNQRKETP-------APSKSSCKIKSRRSSAASMCSSYVSGVSRMRRRHRRKSFSHNKSLNI 621
            ||...||..|.       .|.:..|      ||.:.:.:|.|..:........:...|...|:.:
  Rat   919 SKTKEQRLMTAQNVASYRTPDRGDC------SSGSPVGTSKVLVLGGSTHNSEKPGDSTQDSVRL 977

  Fly   622 -----DSKLLTEIEIITSTFNSRCRIQDDRLTGSSGK--EKLLADANKLQATLAAPSPAQQLTLN 679
                 ||.|..|:....|...|     |.|...:.||  ...:...|..:|.|   ||..::|: 
  Rat   978 SPGGGDSALSGELSSSLSVLPS-----DKRDLPACGKIRSNCIPRRNCGRAKL---SPKLRVTI- 1033

  Fly   680 GGGPASTLSKPL--KRGLK---KRKLSE-PLVDFAMLSASASGTPNGSGSSNGNTK------RRH 732
                ::.::||.  .:.||   |||||. |.|   .|:|:..|.....||.||..|      .:.
  Rat  1034 ----STQMAKPSVNPKALKTERKRKLSRLPAV---TLAANGLGNKESGGSVNGPLKGGAQDPAKE 1091

  Fly   733 KKSQSND--------------SSSPDDHKLPLKKRHYLLTPGERPPAEVAFANG----------- 772
            :..|..|              .|.||  |:..|:..:....|....:|:...|.           
  Rat  1092 EPLQQMDLLRNEETHFDSKVKQSDPD--KILEKEPSFENRKGPEVGSEINIENDEPHGVDQVVPK 1154

  Fly   773 ----KLNAEAWAAAAAAAKSTASTKSQAQFNARSVKSALTPKKRHLLEQ---PTSV--------S 822
                :||.........|::......|:..|........:...:...|||   |||:        :
  Rat  1155 KRWQRLNQRRPKPGKRASRFREKENSEGAFGVLLPGDPVQKGRDDYLEQRAPPTSILEDSAADPN 1219

  Fly   823 GAGSSASNSPLRIVVDNNSISGGKLLDISPSSLCSLKQQRR---GGAAKQKVSAAKDLVQLQSPA 884
            ....|.|..|...|.|.:|:|.|.|  ...:.:.||..|.:   .....:|....|....|....
  Rat  1220 HVSHSESVGPRLNVCDKSSVSMGDL--EKETGIPSLTPQTKIPEPAVRSEKKRLRKPSKWLLEYT 1282

  Fly   885 GSYPPPGVFEP-SVELEIQIPLSKL------------------------NESVITKAE------- 917
            ..|..  :|.| ..:.::|..:.|:                        |..:.||.|       
  Rat  1283 EEYDQ--IFAPKKKQKKVQEQVHKVSSRCEDESLLARCRSSAQNKQVDENSLISTKEEPPVLERE 1345

  Fly   918 ---VESPL-----------LSALDIKEDTKKEVGQRVV---ETLLHKTGGNLLLKRKRKKINRTG 965
               :|.||           |..|.:......||..|..   |.||.||.||...||:||      
  Rat  1346 APFLEGPLVQSDLGVAHAELPQLTLSVPVAPEVSPRPTLESEELLVKTPGNYEGKRQRK------ 1404

  Fly   966 FPTVRRKKRKVSVEQQTTAVIDEHEPEFDPDDEPLQSLRETRSSNNVNVQAAPNPPLDCERVPQA 1030
             ||   ||...|         ::.:|.|.|....|...|:...|.::.              ...
  Rat  1405 -PT---KKLLES---------NDLDPGFMPKKGDLGLSRKCCESGHLE--------------NGV 1442

  Fly  1031 GEARETFVARTNQKAPRLSVVALERLQRPQTPARGRPRGRKP----------KNREQAEAAPQPP 1085
            |::|.|         |.     |:......|....:||.||.          |..::.::|.:.|
  Rat  1443 GDSRAT---------PH-----LKEFGGGTTRIFDKPRKRKRQRHGTARVHYKRVKKEDSARETP 1493

  Fly  1086 PKSEPEIRPAKKRGRQPKQPVLEEPPPTPPPQQKKNKME----------PNI------------- 1127
            ..:|.|:. ..:....||: :|||.....|......:::          .|:             
  Rat  1494 SSAEGELM-IHRTAASPKE-ILEEGIEHDPGMSASKRLQGERGGGAALKENVCQNCEKLGELLLC 1556

  Fly  1128 ----------------RLPDG--------IDPNTNFSCKIRLKRRKNLEAGTQPKK------EKP 1162
                            .:|.|        ...:|.|.||         ::|...|:      .|.
  Rat  1557 EAQCCGAFHLECLGLTEMPRGKFICNECRTGIHTCFVCK---------QSGEDVKRCLLPLCGKF 1612

  Fly  1163 VQPVTVEEIPPEI-------------------------------------PVSQEEID--AEAEA 1188
            .....|::.||.:                                     ||:....|  ..|.:
  Rat  1613 YHEECVQKYPPTVVQNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYHANDFCLAAGS 1677

  Fly  1189 KRLDS----IPTEHDPLPASESHNPGPQDYA-SCSESSE----DKASTTSLRKLSKVKKTYLVAG 1244
            |.|.|    .|....|.....:|......:. .|||...    |.......|:...:.   :..|
  Rat  1678 KILASNSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNID---IPEG 1739

  Fly  1245 LFSNHYKQSLMPPPAKVNKKPGLEE----QVG-----PASLLPP---PPYCEKYLRRTEMDFELP 1297
               |.|...     .|..|||...|    :||     ||.:..|   |...:| :|....:|.:.
  Rat  1740 ---NWYCND-----CKAGKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDK-MRHDVGEFPVL 1795

  Fly  1298 Y----DIWWAY-----------------------------------------------------T 1305
            :    |..|.:                                                     .
  Rat  1796 FFGSNDYLWTHQARVFPYMEGDVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQEDRK 1860

  Fly  1306 NSKLPTRNVVPSWNYRKI-----RTNVYAESVRPNLAGFDHPTCNCKNQGEKSC-LDN-CLNRMV 1363
            |.|.|     |.:.:.|:     |..::...:.      :.|.||||...|..| :|: |:|||:
  Rat  1861 NDKKP-----PPYKHIKVNRPIGRVQIFTADLS------EIPRCNCKATDENPCGIDSECINRML 1914

  Fly  1364 YTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLPIAKGTYILEYVGEVVTEKE 1428
            ..||.|:.||||.:|:||...:... |.||.|.|..:|||:|||..|.||.::.|||||::.|:|
  Rat  1915 LYECHPTVCPAGGRCQNQCFSKRQY-PDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEE 1978

  Fly  1429 FKQRMASIYLND-THHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVNGLSRMVLFAK 1492
            .:.|:.....:| |:.|.|.||...:||....|:..||:||.|:||||.|||||||.:|:.|||.
  Rat  1979 CRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFAL 2043

  Fly  1493 RAIEEGEELTYDYNFSLFNPSEGQPCRCNTPQCRGVIGGKSQRVKPLPAVEAKPSGEGLSGRNGR 1557
            ..|:.|.|||::||.......: ..|:|..|.|.|.:|   .|.|..|.|..:.|          
  Rat  2044 SDIKAGTELTFNYNLECLGNGK-TVCKCGAPNCSGFLG---VRPKNQPIVTEEKS---------- 2094

  Fly  1558 QRKQKAKKHAQRQAGKDISSAVAVAKLQPLSEKEKKLVRQFNTFLVRNFEKIRRCKAKRA----- 1617
             ||.|.|.|.:|::..:::             ||    |:...|...:..::..||....     
  Rat  2095 -RKFKRKPHGKRRSQGEVT-------------KE----REDECFSCGDGGQLVSCKKPGCPKVYH 2141

  Fly  1618 SDAAATASSPALGTTNG---------DIPGRRPSTPSSPSLAAQISALCSPRNIKTR-------- 1665
            :|.......||     |         |:.|:.         ||....:|.....|..        
  Rat  2142 ADCLNLTKRPA-----GKWECPWHQCDVCGKE---------AASFCEMCPSSFCKQHREGMLFIS 2192

  Fly  1666 ----GLTQAVHDPELEKMAKMAVVLRDICSAMETLKMSDLLTTVSSKKKKPIKTTLSGKLGSTAA 1726
                .|:...|||               |.. ..|:..::      ::..|...|||...|:...
  Rat  2193 KLDGRLSCTEHDP---------------CGP-NPLEPGEI------REYVPPTATLSPSPGTQQT 2235

  Fly  1727 TSKVEFRSIQAQVEQGHYKTPQEFDDHMQQLFVEAKQQHGDDEGKEKALQSLKDSYEQQKIASYV 1791
            ....|..:      ||..|:.|...|..|.|.:..|...|..:      :.|......::..|..
  Rat  2236 EQSSEMGT------QGPKKSDQPPTDATQMLSLSKKAVTGTCQ------RPLLPERPPERTDSSS 2288

  Fly  1792 QLVEILGD-------SESL-QSFKPKEVLSSEEEP-GKIAVKKSPGAKERDSPIV-------PLK 1840
            .|::.:.|       |:|| .|.:|::...::|.| .:...:.||..:...||.|       ||:
  Rat  2289 HLLDRIRDLAGSGTKSQSLVSSQRPQDRPPAKEGPRPQPPDRASPVTRPSSSPSVSSLPLERPLR 2353

  Fly  1841 VTPPPL-----------LPIEASPDEDVIRCICGLYKDEGLMIQCSKCMVWQHTECTKADI---- 1890
            :|.|.|           ..:|..|....:|    |...:.|:          :|...|..|    
  Rat  2354 MTEPRLDKSIGAASPKSQAVEKPPAPTGLR----LSSPDRLL----------NTNSPKPQISDRP 2404

  Fly  1891 -DADNYQ-CERCEPRE----------VDRE---IPLEEFTEEGHRYYLSLMRGDLQVRQGDAVYV 1940
             |..:.. .:|..|.|          |.:|   .|:::.|:..||..:.:...||..||.:....
  Rat  2405 PDKSHASLTQRLPPPEKVLSAVVQSLVAKEKALRPVDQNTQAKHRAAVVMDLIDLTPRQKERAAS 2469

  Fly  1941 LRDIPIK-DESGKVL-----PTKK 1958
            .:::..: ||...||     ||.:
  Rat  2470 PQEVTAQADEKTPVLESSSRPTSR 2493

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
ash1NP_001246834.1 AWS 1340..1388 CDD:197795 22/49 (45%)
SET 1392..1512 CDD:214614 57/120 (48%)
Bromo_ASH1 1680..1787 CDD:99955 18/106 (17%)
PHD_ASH1L 1858..1900 CDD:277023 7/47 (15%)
BAH_polybromo 1929..2073 CDD:240068 10/36 (28%)
Nsd1XP_006253682.1 MSH6_like 319..429 CDD:99898 11/41 (27%)
PHD1_NSD1_2 1543..1585 CDD:277118 2/41 (5%)
PHD2_NSD1 1590..1636 CDD:277120 10/54 (19%)
PHD3_NSD1 1637..1690 CDD:277123 8/52 (15%)
PHD4_NSD1 1707..1746 CDD:277126 8/49 (16%)
WHSC1_related 1752..1846 CDD:99899 13/94 (14%)
AWS 1889..1939 CDD:197795 22/49 (45%)
SET 1940..2063 CDD:214614 58/122 (48%)
PHD5_NSD1 2118..2160 CDD:277129 7/46 (15%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 1 0.930 - - C166351963
Domainoid 00.000 Not matched by this tool.
eggNOG 00.000 Not matched by this tool.
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D507784at2759
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 1 1.000 - -
TreeFam 00.000 Not matched by this tool.
43.850

Return to query results.
Submit another query.