DRSC/TRiP Functional Genomics Resources


Protein Alignment NSD and ASH1L

DIOPT Version: 9

Sequence 1: NP_733239.1     Gene: NSD / 43351     FlyBaseID: FBgn0039559  Length: 1427  Species: Drosophila melanogaster
Sequence 2: NP_001353106.1  Gene: ASH1L / 55870   HGNCID: 19088           Length: 2969  Species: Homo sapiens


Alignment Length: 1807
Identity:    357/1807 (19%)
Similarity:  596/1807 (32%)
Gaps:        599/1807 (33%)
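
The percentages above can be reproduced from the raw counts. The short sketch below assumes the page truncates each percentage to a whole number rather than rounding it. That is an inference from the figures (357/1807 is about 19.76%, yet 19% is shown), not documented DIOPT behavior. The function name percent_floor is hypothetical.

```python
# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 1807
COUNTS = {"Identity": 357, "Similarity": 596, "Gaps": 599}


def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumed to match the page)."""
    return count * 100 // total


for name, count in COUNTS.items():
    pct = percent_floor(count, ALIGNMENT_LENGTH)
    print(f"{name}: {count}/{ALIGNMENT_LENGTH} ({pct}%)")
```

With ordinary rounding the identity figure would read 20% instead of 19%, which is why truncation is assumed here.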


- Green residues have known domain annotations that are detailed below.


  Fly    10 EIEGDAAHGNVLCNSASDSLTATDEVAAGNDESVATEGDDVEIPRDTNNSTPVRLLDKPGQNPVQ 74
            |:|......||.|:|.|:|                             ||.|.:.:        :
Human   735 ELERSELFKNVSCSSLSNS-----------------------------NSEPAKFM--------K 762

  Fly    75 NGAQPAAEESELESQRQTPVQKQQQQRVSMVNRKRDLINLQSALSPKYIGYANANSPTPLSDSDD 139
            |...|:..:.:...:|...:.|.....:::         |..:..|.:..:|.....:.:..|.|
Human   763 NIGPPSFVDHDFLKRRLPKLSKSTAPSLAL---------LADSEKPSHKSFATHKLSSSMCVSSD 818

  Fly   140 TI------RTTRRRVNQAAALNNSSAGETLAHDNASPRTPGGGGGGGGDDSANQLLSKTYMSPIE 198
            .:      :..|.:..:...|..... .||       :.|           |:::.|.......|
Human   819 LLSDIYKPKRGRPKSKEMPQLEGPPK-RTL-------KIP-----------ASKVFSLQSKEEQE 864

  Fly   199 KLLIKNGASSPNSTGFEAGSEDLGIRPIVRK--HVKRKMKRVPKAK---------VTLELDEKNQ 252
            ..:::.....|:   |:.|   |.:.|..:|  ..||:|:...|.|         |..|...|.:
Human   865 PPILQPEIEIPS---FKQG---LSVSPFPKKRGRPKRQMRSPVKMKPPVLSVAPFVATESPSKLE 923

  Fly   253 QEVDEKSVKTEPIDEEVDRTDEAPTQEAQTTAISIKSETEAEHKAAVDVH------------IKQ 305
            .|.|.....::..:.|....|.....::...::...|:.|.|....:...            |.:
Human   924 SESDNHRSSSDFFESEDQLQDPDDLDDSHRPSVCSMSDLEMEPDKKITKRNNGQLMKTIIRKINK 988

  Fly   306 EDTIRLDIVNNPVESTSIVITEEPK---DLEKSTEELA--FALPLASSTEVDLK----------- 354
            ..|::...:.|.:.|:|:..:.:.|   .|..:...||  |...|.....|..|           
Human   989 MKTLKRKKLLNQILSSSVESSNKGKVQSKLHNTVSSLAATFGSKLGQQINVSKKGTIYIGKRRGR 1053

  Fly   355 SPPDLSSTALATSIKSPSSVSI-----DSAKG--LSIVTDPGWPTYQVGDLFWGKVFSYCFWPCM 412
            .|..:.:..|:   .||:|:::     ..|.|  |..:..|..|:         ...|....|..
Human  1054 KPKTVLNGILS---GSPTSLAVLEQTAQQAAGSALGQILPPLLPS---------SASSSEILPSP 1106

  Fly   413 VCPDPLGQIVGNMPSHPQRSSLDNANVP-----------IQVHVRFFADNGRRNWIKPENLLTFA 466
            :|....|...|..|.......::.::||           ||......|..|||. :.|..||..:
Human  1107 ICSQSSGTSGGQSPVSSDAGFVEPSSVPYLHLHSRQGSMIQTLAMKKASKGRRR-LSPPTLLPNS 1170

  Fly   467 GLKAFDDMREELRIKHGPKSAKYRQMVPKRTKVVIWRQAIEEAQAMTQIP-----------YSDR 520
            .             .|..:....::..|         ..|.|:.:...||           .|||
Human  1171 P-------------SHLSELTSLKEATP---------SPISESHSDETIPSDSGIGTDNNSTSDR 1213

  Fly   521 LEKFYQTYENVVTLNRQKRKRTKY-----MMQDTSDVGSSLYDSTDNLHNKQGTQLLAVKRERSE 580
            .|||.          .||::|..:     :..:||.|.|||.:...:...::....|:..     
Human  1214 AEKFC----------GQKKRRHSFEHVSLIPPETSTVLSSLKEKHKHKCKRRNHDYLSYD----- 1263

  Fly   581 SPFSPAFSPVKSKNEKRAKRRK---LSNGTEADTGSNSMAVTPSQTETTVDSSAYENPEFRQLLS 642
                      |.|.:||.:::|   |.|..:.|..:                      |..:|:|
Human  1264 ----------KMKRQKRKRKKKYPQLRNRQDPDFIA----------------------ELEELIS 1296

  Fly   643 AVMEYVMMNRSDEKVEKVLLSVVSNI---------------------WSLKQIQLRELERDLASG 686
            .:.|..:.:||...:.:.||..:..|                     ..||:.:.|..:...|..
Human  1297 RLSEIRITHRSHHFIPRDLLPTIFRINFNSFYTHPSFPLDPLHYIRKPDLKKKRGRPPKMREAMA 1361

  Fly   687 EI------------------------EEPLGSSVVGRG--------------------------- 700
            |:                        ..||.::.:|.|                           
Human  1362 EMPFMHSLSFPLSSTGFYPSYGMPYSPSPLTAAPIGLGYYGRYPPTLYPPPPSPSFTTPLPPPSY 1426

  Fly   701 -------SGVGTIKRLSNRLMTM--MVRRSMTPVVTPSTTPA-PSE-----------------PD 738
                   .......:..::|:..  .:..|.||:::.||.|: |.|                 .:
Human  1427 MHAGHLLLNPAKYHKKKHKLLRQEAFLTTSRTPLLSMSTYPSVPPEMAYGWMVEHKHRHRHKHRE 1491

  Fly   739 RRLSEPPKTKKPVNRPIEEVIEDILQLDSKYLFRGLSREPICKYCYQAGSDLV----------RC 793
            .|.||.|:......             .|:.:...|.|       |:.|.|.|          ||
Human  1492 HRSSEQPQVSMDTG-------------SSRSVLESLKR-------YRFGKDAVGERYKHKEKHRC 1536

  Fly   794 SRTC-------------SSWLHADCLERKVTGAPMPKIGSRKALVIPPTSKSPS-------PDED 838
            ..:|             ..|:|.:..|    .:|: .:|.:..|.|..:..|||       |:.:
Human  1537 HMSCPHLSPSKSLINREEQWVHREPSE----SSPL-ALGLQTPLQIDCSESSPSLSLGGFTPNSE 1596

  Fly   839 HVTADAKEVVAVGTSLVCHECNVGEPEGCVICHQVESPAVPSTP--------RKEDSSSHTPIED 895
            ..::|  |...:.||.: ..|.|..|.........:||.:.|..        |||...|:.... 
Human  1597 PASSD--EHTNLFTSAI-GSCRVSNPNSSGRKKLTDSPGLFSAQDTSLNRLHRKESLPSNERAV- 1657

  Fly   896 KLLTCSQPMCGKRFH-----TSCCKYWPQASSSKHS-------ARCPRHVCH------------- 935
            :.|..|||...|...     |:|.....::||...|       :|.||.|..             
Human  1658 QTLAGSQPTSDKPSQRPSESTNCSPTRKRSSSESTSSTVNGVPSRSPRLVASGDDSVDSLLQRMV 1722

  Fly   936 ----------------TCVSDDPS-------GKFQQLGS------------------SKLAKCVR 959
                            ...|..||       .|.:.||.                  |.|::.:.
Human  1723 QNEDQEPMEKSIDAVIATASAPPSSSPGRSHSKDRTLGKPDSLLVPAVTSDSCNNSISLLSEKLT 1787

  Fly   960 CPATYHQLSKCIPAGTQ-----MLNTTNIICPRHNIAKADAHVNVLWCYICVKGGELV------- 1012
            ...:.|.:.:.:....|     |.|...|:..:.|:    .|||.:     :|..:|.       
Human  1788 SSCSPHHIKRSVVEAMQRQARKMCNYDKILATKKNL----DHVNKI-----LKAKKLQRQARTGN 1843

  Fly  1013 --------CCETCPI--------------------------AVHAHCRNIP-------------- 1029
                    ....||:                          |:|.....:.              
Human  1844 NFVKRRPGRPRKCPLQAVVSMQAFQAAQFVNPELNRDEEGAALHLSPDTVTDVIEAVVQSVNLNP 1908

  Fly  1030 -----IKTNESYICEECESGRLPLYGEIVWAKFNNFRWWPAIILPPTEVPSNILKKAHGENDF-- 1087
                 :|.....:.|:....:.||..|.......:|...|..|..|:|.|:   |.:..|:..  
Human  1909 EHKKGLKRKGWLLEEQTRKKQKPLPEEEEQENNKSFNEAPVEIPSPSETPA---KPSEPESTLQP 1970

  Fly  1088 VVRFFGTHDHGWISRRRVY----LYIEGDTGDGHKT---KSQLFR------NYTTGVEEASRF-L 1138
            |:.............::.|    ||     .|.:||   ||:|.:      .||.|..|...| .
Human  1971 VLSLIPREKKPPRPPKKKYQKAGLY-----SDVYKTTDPKSRLIQLKKEKLEYTPGEHEYGLFPA 2030

  Fly  1139 PII-------KARRQEQ-DME-------RQSGNKLHPPP----YVKIKTN--KAVPPLRFSQNLE 1182
            ||.       |..||:: |.:       :...|:|:..|    |.||::|  ..|.||    :..
Human  2031 PIHVVFFVSGKYLRQKRIDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVDVKPL----SGY 2091

  Fly  1183 DLSTCNC-LPVDEHPCGPEAGCLNRMLFNECNPEYCKAGSLCENRMFEQRKSPR-LEVVYMNERG 1245
            :.:|||| .|.|:...|....|||||:|.||:|..|..|..|.|:..::.:..: ||.....|:|
Human  2092 EATTCNCKKPDDDTRKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKG 2156

  Fly  1246 FGLVNREPIAVGDFVIEYVGEVINHAEFQRRMEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARF 1310
            :|:..:||:..|.|:|||:|||::..||:.||.::..:..: :|.|.::...:||:...||.|||
Human  2157 WGIRTKEPLKAGQFIIEYLGEVVSEQEFRNRMIEQYHNHSD-HYCLNLDSGMVIDSYRMGNEARF 2220

  Fly  1311 MNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSELTFNYLWDDLMNNSKKACFCGAKRCSGEIG 1375
            :||||:||||.|||:||.::|:|::|:||:|..:|||::|.:.......::.|.||.::|.|.||
Human  2221 INHSCDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIG 2285

  Fly  1376 GKLKDDAVKAHAKLKQMRRAKASAVRIHVKPKKTPKVK----HISADDEPMD 1423
            ||.:.......:|..|.......:.|...|.|...|:|    |:|  :||.:
Human  2286 GKSQRVNGLTSSKNSQPMATHKKSGRSKEKRKSKHKLKKRRGHLS--EEPSE 2335
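
As an illustration of how the per-block figures can be checked, the sketch below counts identical and gapped columns in the final alignment block (Fly 1376..1423 against Human 2286..2335). It compares the two residue rows directly instead of reading the match line, so it does not depend on how the '|', ':' and '.' symbols are defined. The function name column_stats is hypothetical.

```python
def column_stats(row_a: str, row_b: str, gap: str = "-"):
    """Return (identical columns, gapped columns, total columns)."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    pairs = list(zip(row_a, row_b))
    # A column is identical when both rows carry the same residue.
    identical = sum(1 for a, b in pairs if a == b and a != gap)
    # A column is gapped when either row carries the gap character.
    gapped = sum(1 for a, b in pairs if a == gap or b == gap)
    return identical, gapped, len(pairs)


# Residue rows copied from the last block of the alignment above.
fly = "GKLKDDAVKAHAKLKQMRRAKASAVRIHVKPKKTPKVK----HISADDEPMD"
human = "GKSQRVNGLTSSKNSQPMATHKKSGRSKEKRKSKHKLKKRRGHLS--EEPSE"

print(column_stats(fly, human))  # (13, 6, 52)
```

The 13 identical columns agree with the number of '|' characters in that block's match line.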

Known Domains:


Indicated by green residues in alignment.

Gene   Sequence        Domain                                             Region      External ID  Identity
NSD    NP_733239.1     MSH6_like                                          391..508    CDD:99898    21/127 (17%)
                       PHD2_NSD                                           867..932    CDD:277040   19/84 (23%)
                       PHD3_NSD                                           933..988    CDD:277041   14/113 (12%)
                       PHD4_NSD                                           1001..1041  CDD:277042   8/99 (8%)
                       WHSC1_related                                      1047..1141  CDD:99899    27/109 (25%)
                       AWS                                                1183..1233  CDD:197795   20/50 (40%)
                       SET                                                1234..1354  CDD:214614   51/120 (43%)
ASH1L  NP_001353106.1  Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  1..70       -            -
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  118..143    -            -
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  501..525    -            -
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  537..583    -            -
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  824..845    -            2/21 (10%)
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  878..966    -            18/90 (20%)
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  1100..1128  -            5/27 (19%)
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  1151..1231  -            22/112 (20%)
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  1243..1281  -            9/52 (17%)
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  1489..1508  -            4/31 (13%)
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  1580..1711  -            32/134 (24%)
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  1741..1761  -            5/19 (26%)
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  1911..1991  -            15/82 (18%)
                       Catalytic domain                                   2069..2288  -            87/223 (39%)
                       AWS                                                2092..2143  CDD:197795   20/50 (40%)
                       SET                                                2146..2266  CDD:214614   51/120 (43%)
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  2288..2346  -            11/50 (22%)
                       Bromo_ASH1                                         2443..2548  CDD:99955    -
                       PHD_ASH1L                                          2586..2628  CDD:277023   -
                       BAH_polybromo                                      2665..2799  CDD:240068   -
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  2825..2856  -            -
                       Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  2876..2919  -            -

Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool             Simple Score  Weighted Score  Original Tool Information
                                               BLAST Result  Score    Score Type                     Cluster ID
Compara          0             0.000           Not matched by this tool.
Domainoid        0             0.000           Not matched by this tool.
eggNOG           0             0.000           Not matched by this tool.
Hieranoid        0             0.000           Not matched by this tool.
Homologene       0             0.000           Not matched by this tool.
Inparanoid       0             0.000           Not matched by this tool.
Isobase          0             0.000           Not matched by this tool.
OMA              0             0.000           Not matched by this tool.
OrthoDB          1             1.010           -             -                                       D507784at2759
OrthoFinder      0             0.000           Not matched by this tool.
OrthoInspector   0             0.000           Not matched by this tool.
orthoMCL         0             0.000           Not matched by this tool.
Panther          0             0.000           Not matched by this tool.
Phylome          1             0.910           -             -
RoundUp          1             1.030           -             avgDist  Average_Evolutionary_Distance  R4489
SonicParanoid    0             0.000           Not matched by this tool.
SwiftOrtho       0             0.000           Not matched by this tool.
TreeFam          0             0.000           Not matched by this tool.
User_Submission  0             0.000           Not matched by this tool.
Total            3             2.950
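
The closing figure of the table reads as two totals run together: a simple score of 3 and a weighted score of 2.950. The sketch below recomputes both from the three tools that reported a match, as a consistency check on that reading. Decimal is used so the three-decimal weights add exactly. The variable names are illustrative only.

```python
from decimal import Decimal

# (simple score, weighted score) for each tool that matched, from the table.
matched = {
    "OrthoDB": (1, Decimal("1.010")),
    "Phylome": (1, Decimal("0.910")),
    "RoundUp": (1, Decimal("1.030")),
}

simple_total = sum(simple for simple, _ in matched.values())
weighted_total = sum((weight for _, weight in matched.values()), Decimal("0"))

print(simple_total, weighted_total)  # 3 2.950
```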