DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment NSD and Kmt2a

DIOPT Version :10

Sequence 1:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster
Sequence 2:NP_001344478.1 Gene:Kmt2a / 214162 MGIID:96995 Length:3966 Species:Mus musculus


Alignment Length:1713 Identity:333/1713 - (19%)
Similarity:540/1713 - (31%) Gaps:592/1713 - (34%)


- Green bases have known domain annotations that are detailed below.


  Fly    22 CNSAS-----DSLTATDEVAAGNDESVATEGDDVEIPRDTNNSTPVRLLDKPGQNPVQN------ 75
            ||:.|     |.:.....|..|  :|...||...|:......|..|..|...|:|..:|      
Mouse  2477 CNNVSSEKIGDKVLPLSGVPKG--QSTQVEGSSKELQAPRKCSVKVTPLKMEGENQSKNTQKESG 2539

  Fly    76 GAQPAAEES----ELESQRQTP-----VQKQQQQRVSMVNRKRDLINL----QSALSPKYIGYAN 127
            ...||..||    |..|..::|     ||......:|...:..:..||    ::.:.|       
Mouse  2540 PGSPAHIESVCPAEPVSASRSPGAGPGVQPSPNNTLSQDPQSNNYQNLPEQDRNLMIP------- 2597

  Fly   128 ANSPTPLSDSDDTIRTTRRRVNQAAALNNSSAGETLA-------------HDNASPRTPGGGGGG 179
             :.|.|..|.....|..||   .|.|.:|...|.|..             :.|::.:..|.....
Mouse  2598 -DGPKPQEDGSFKRRYPRR---SARARSNMFFGLTPLYGVRSYGEEDIPFYSNSTGKKRGKRSAE 2658

  Fly   180 GGDDSANQLLSKTYMSPIEKLLIKNGASSPNSTGFEAGSEDLGIRPIVRKH-------------- 230
            |..|.|:.|.:    |..:.|...|...:..|:|   |.|.|....:.|:.              
Mouse  2659 GQVDGADDLST----SDEDDLYYYNFTRTVISSG---GEERLASHNLFREEEQCDLPKISQLDGV 2716

  Fly   231 ------------VKRKMKRVPKAKVTLELDEKNQQEVDEKSVKTEPIDEEVDRTDEAPTQE-AQT 282
                        ..||..::||         :|.:|...:::|       :||.::|..:| ...
Mouse  2717 DDGTESDTSVTATSRKSSQIPK---------RNGKENGTENLK-------IDRPEDAGEKEHVIK 2765

  Fly   283 TAISIKSETEAEHKAAVDVHIKQEDTIRLDIVNNPVESTSIVITEEPKD----------LEKS-- 335
            :|:..|:|.:.::..:|. .:|.:....|:...:.:||:..|.|..|.|          |.||  
Mouse  2766 SAVGHKNEPKLDNCHSVS-RVKAQGQDSLEAQLSSLESSRRVHTSTPSDKNLLDTYNAELLKSDS 2829

  Fly   336 ----TEELAFALPLASSTEVDLKSPPDLSSTALATSIKSPSSVSIDSAKGLSIVTDPGWPTYQVG 396
                :::....|| :...:..||:.|  |..||..|.:|.||..:...:||      |..:.:..
Mouse  2830 DNNNSDDCGNILP-SDIMDFVLKNTP--SMQALGESPESSSSELLTLGEGL------GLDSNREK 2885

  Fly   397 DLFWGKVFSYCFWPCMVCPDPLGQIVGNMPSHPQRSSLDNANVPIQVHVRFFADNGRRNWIKPEN 461
            |:...:|||    ..:...:|:...|.:..|..::..|     |:::.........|...:..:|
Mouse  2886 DIGLFEVFS----QQLPATEPVDSSVSSSISAEEQFEL-----PLELPSDLSVLTTRSPTVPSQN 2941

  Fly   462 LLTFAGLKAFDDMREELRIKHGPKSAKYRQMVPKRTKVVIWRQAIEEAQAMTQIPYSDRLEKFYQ 526
            .   :.|....|..|                  ||..:.....|..|.......|..|...:.:.
Mouse  2942 P---SRLAVISDSGE------------------KRVTITEKSVASSEGDPALLSPGVDPAPEGHM 2985

  Fly   527 TYENVVTLNRQKRKRTKYMMQDTSDVGSSLYDSTDNLH-NKQ------GTQLLAVKRERSESPFS 584
            |.::.:..:           .|...:.|....|.:..| |.|      ||..|.|       |.|
Mouse  2986 TPDHFIQGH-----------MDADHISSPPCGSVEQGHGNSQDLTRNSGTPGLQV-------PVS 3032

  Fly   585 PAFSPVKSKNEKRAKRRKLSNGTEADTGSNSMAVTPSQTETTVDSSAYENPEFRQLLSAVMEYVM 649
            |.   |..:|:|.......|.| .:...:.::..||...:...:.....|...:.|      ||:
Mouse  3033 PT---VPVQNQKYVPSSTDSPG-PSQISNAAVQTTPPHLKPATEKLIVVNQNMQPL------YVL 3087

  Fly   650 MNRSDEKVEKV-LLSVVSNIWSLKQIQLRELERDLASGEIEEPLGSSVVG-RGSGVGTIKRLSNR 712
            ....:...:|: |.|.||:..|:.:                  ..:||:| .|||:         
Mouse  3088 QTLPNGVTQKIQLTSPVSSTPSVME------------------TNTSVLGPMGSGL--------- 3125

  Fly   713 LMTMMVRRSMTPVVTPSTTPAPS---------------------EPDRRLSEPPKTKKP------ 750
                    ::|..:.||..|:||                     ....:.|.||....|      
Mouse  3126 --------TLTTGLNPSLPPSPSLFPPASKGLLSVPHHQHLHSFPAAAQSSFPPNISSPPSGLLI 3182

  Fly   751 -VNRPIEEVIEDILQLDSKYLFR------------GLSREPICKYCYQ--------------AGS 788
             |..|     .|...|.|:...|            ||.:.||.:...:              |.|
Mouse  3183 GVQPP-----PDPQLLGSEANQRTDLTTTVATPSSGLKKRPISRLHTRKNKKLAPSSAPSNIAPS 3242

  Fly   789 DLVRCSRTCSSWL------HADCLE----RKVTGAPMPKIGSR---------KALVIPPT----- 829
            |:| .:.|..::.      |...|:    ...:...:|.|..|         :|.::||.     
Mouse  3243 DVV-SNMTLINFTPSQLSNHPSLLDLGSLNPSSHRTVPNIIKRSKSGIMYFEQAPLLPPQSVGGT 3306

  Fly   830 ------SKSPSPDEDHVTADAKEVVAVGTSLVCHECNVGEPEGCVICHQVESPAVPSTPRKEDSS 888
                  |.:.|.|..|:|:.....:|.|:|::    ||         ..:::.|.|        :
Mouse  3307 AATAAGSSTISQDTSHLTSGPVSALASGSSVL----NV---------VSMQTTAAP--------T 3350

  Fly   889 SHTPIEDKLLTCSQPMCGKRFHTSCCKYWPQASSSKHSARCPRHVCHTCVSDDP------SGKFQ 947
            |.|.:...:...:|.:.|.          |...|..|......|. ...:.|.|      ||.|.
Mouse  3351 SSTSVPGHVTLANQRLLGT----------PDIGSISHLLIKASHQ-SLGIQDQPVALPPSSGMFP 3404

  Fly   948 QLGSSKLAKCVRCPATYHQLSKCIPAGTQMLNTTNIICPRHNIAKADAHVNVLWCYICVKGGELV 1012
            |||:|:........|.   .|.|:...:|....|....|    .:|:.|      |...:|.:|:
Mouse  3405 QLGTSQTPSAAAMTAA---SSICVLPSSQTAGMTAASPP----GEAEEH------YKLQRGNQLL 3456

  Fly  1013 CCETCPIAVHAHCRNIPIK------TNESYICEECESGRLPLYGEIVWAKFNNFRWWPAIILPPT 1071
            ..:|..:.  :.....|..      :|.:...|.....||.....:..||       ||....|.
Mouse  3457 AGKTGTLT--SQRDRDPDSAPGTQPSNFTQTAEAPNGVRLEQNKTLPSAK-------PASSASPG 3512

  Fly  1072 EVPSNILKKAHGENDFVVRFFGTHDHGWIS----------RRRVYLYIEGDTGDGHK-----TKS 1121
            ..||:      |:..           |..|          .:|:.|.::..:|..||     |.|
Mouse  3513 SSPSS------GQQS-----------GSSSVPGPTKPKPKAKRIQLPLDKGSGKKHKVSHLRTSS 3560

  Fly  1122 QLF-----------------------RNYTTGVEEASR-----------FLPIIKARR------- 1145
            :..                       :....|||:.|:           .||.::|.:       
Mouse  3561 EAHIPHRDTDPAPQPSVTRTPRANREQQDAAGVEQPSQKECGQPAGPVAALPEVQATQNPANEQE 3625

  Fly  1146 ----------------------QEQDMERQSGNKLHPPPYV---------------------KIK 1167
                                  |::...::|..:..|...:                     |..
Mouse  3626 NAEPKAMEEEESGFSSPLMLWLQQEQKRKESITERKPKKGLVFEISSDDGFQICAESIEDAWKSL 3690

  Fly  1168 TNKAVPPLRFSQNLEDLSTC--------------------------NCL----------PVDEHP 1196
            |:| |...|.:..|:.||..                          :|.          ..:|.|
Mouse  3691 TDK-VQEARSNARLKQLSFAGVNGLRMLGILHDAVVFLIEQLAGAKHCRNYKFRFHKPEEANEPP 3754

  Fly  1197 CGPEAGC-----LNRMLFNECN---------PEY---------------CKAGSL---CENRMFE 1229
            ..|....     |.:..|:..|         |||               .:|.|:   ...|...
Mouse  3755 LNPHGSARAEVHLRKSAFDMFNFLASKHRQPPEYNPNDEEEEEVQLKSARRATSMDLPMPMRFRH 3819

  Fly  1230 QRKSPRLEV-VYMNE-RGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRMEQKQRDRDENY---- 1288
            .:|:.:..| ||.:. .|.||..:..|..|:.||||.|.||...         |.|:.|.|    
Mouse  3820 LKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSI---------QTDKREKYYDSK 3875

  Fly  1289 ----YFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSELTFN 1349
                |...::...::||...||.|||:||||||||.::...::....:.|||::.|....|||::
Mouse  3876 GIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3940

  Fly  1350 YLW--DDLMNNSKKACFCGAKRC 1370
            |.:  :|..|  |..|.||||:|
Mouse  3941 YKFPIEDASN--KLPCNCGAKKC 3961

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
NSDNP_733239.1 PWWP_NSD_rpt1 395..514 CDD:438972 18/118 (15%)
PHD2_NSD 867..932 CDD:277040 9/64 (14%)
PHD3_NSD 933..988 CDD:277041 15/60 (25%)
PHD4_NSD 1001..1041 CDD:277042 7/45 (16%)
PWWP_NSD_rpt2 1048..1143 CDD:438963 23/143 (16%)
AWS 1183..1233 CDD:197795 15/117 (13%)
SET_NSD 1233..1375 CDD:380950 52/150 (35%)
Kmt2aNP_001344478.1 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..106
Menin-binding motif (MBM). /evidence=ECO:0000250|UniProtKB:Q03164 6..25
Integrase domain-binding motif 1 (IBM1). /evidence=ECO:0000250|UniProtKB:Q03164 121..132
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 130..231
Integrase domain-binding motif 2 (IBM2). /evidence=ECO:0000250|UniProtKB:Q03164 145..150
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 322..343
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 440..590
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 711..943
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 963..1003
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1034..1064
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1101..1161
zf-CXXC 1144..1191 CDD:366873
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1196..1390
PHD1_KMT2A 1432..1478 CDD:277063
PHD2_KMT2A 1480..1529 CDD:277065
PHD3_KMT2A 1567..1626 CDD:277067
Interaction with histone H3K4me3. /evidence=ECO:0000250|UniProtKB:Q03164 1583..1599
Bromo_ALL-1 1649..1779 CDD:99925
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1665..1714
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1807..1870
ePHD_KMT2A 1873..1985 CDD:277163
FYRN 2026..2073 CDD:461787
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2147..2174
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2214..2339
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2371..2619 36/154 (23%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2639..2673 7/37 (19%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2709..2759 10/65 (15%)
9aaTAD. /evidence=ECO:0000250|UniProtKB:Q03164 2843..2851 0/7 (0%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2958..3060 23/123 (19%)
Herpes_BLLF1 <3152..>3361 CDD:282904 41/235 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3164..3239 14/79 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3462..3640 29/203 (14%)
FYRC 3666..3749 CDD:197781 9/83 (11%)
WDR5 interaction motif (WIN). /evidence=ECO:0000250|UniProtKB:Q03164 3759..3764 0/4 (0%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3782..3805 3/22 (14%)
SET_KMT2A_2B 3813..3966 CDD:380947 54/160 (34%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.