DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment NSD and Kmt2b

DIOPT Version :10

Sequence 1:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster
Sequence 2:NP_001277502.1 Gene:Kmt2b / 75410 MGIID:109565 Length:2722 Species:Mus musculus


Alignment Length:1581 Identity:280/1581 - (17%)
Similarity:448/1581 - (28%) Gaps:659/1581 - (41%)


- Green bases have known domain annotations that are detailed below.


  Fly    79 PAAEESELESQR----------------------------QTPVQKQQQQRVS-MVNRKRDLINL 114
            |::..|.|.|||                            :.|.:|:.::... ||....:|:..
Mouse   134 PSSLRSALRSQRGRAPRGRGRKHKTTPLPPRLADVTPVPPKAPTRKRGEEGTERMVQALTELLRR 198

  Fly   115 QSALSPKYIGYANANSPTPLSDSDDTIRTTR-----------RRVNQAAAL-------------- 154
            ..|..|.. ..|.|..|:       |.|.:|           |:..||..|              
Mouse   199 SQAPQPPR-SRARAREPS-------TPRRSRGRPPGRPAGPCRKKQQAVVLAEAAVTIPKPEPPP 255

  Fly   155 ------NNSSAGETLAHDNASPRTP--------GGGGGGGGDDSANQLLSKTYMSPIEKLLI--- 202
                  |.:.:.:........|.||        ||.||.|.......|:.| ::|..:|:.:   
Mouse   256 PVVPVKNKAGSWKCKEGPGPGPGTPKRGGQPGRGGRGGRGRGRGGLPLMIK-FVSKAKKVKMGQL 319

  Fly   203 -----------KNGASSPNSTGFEAGSEDLGIRPIVRKHVKRKMKRVPKAKVTLELDEKNQQEVD 256
                       :.|.|..::...:.|.|.  .|...||..::|::         |.:|:.::|.:
Mouse   320 SQELESGQGHGQRGESWQDAPQRKDGDEP--ERGSCRKKQEQKLE---------EEEEEEEKEGE 373

  Fly   257 EKSVKTEPID----EEVDRTDEAPTQEAQTTAISIKSETEAE----------------------- 294
            ||..|.:..|    ||.:.|:.|..:|   .|:..|.:.||:                       
Mouse   374 EKEEKDDNEDNNKQEEEEETERAVAEE---EAMLAKEKEEAKLPSPPLTPPVPSPPPPLPPPSTS 435

  Fly   295 ------------------HKAAVDVHIKQEDTI-------------------------------- 309
                              .........|||::.                                
Mouse   436 PPPPASPLPPPVSPPPPLSPPPYPAPEKQEESPPLVPATCSRKRGRPPLTPSQRAEREAARSGPE 500

  Fly   310 -RLDIVNNPVESTSIVITEEPKDLEKSTEEL----AFALPLASS------------TEVDLKSPP 357
             .|....||..:|...:.:.|..:.|||..|    .|.:|:.|:            .:.|...||
Mouse   501 GTLSPTPNPSTTTGSPLEDSPTVVPKSTTFLKNIRQFIMPVVSARSSRVIKTPRRFMDEDPPKPP 565

  Fly   358 DLSST----ALATSIKSPSS-VSIDS---------------AKGLSIVTDP--GW---------- 390
            .:.::    .:|||..:|.. |.:.|               .|..||:.:|  .|          
Mouse   566 KVEASIVRPPVATSPPAPQEPVPVSSPPRVPTPPSTPVPLPEKRRSILREPTFRWTSLTRELPPP 630

  Fly   391 --------------------PTYQVGDLFWGKVFSYCFWPCMVCPDPLGQI---------VGNMP 426
                                |.......|.........:..::.|.|||.:         ..:.|
Mouse   631 PPAPPPAPSPPPAPATPSRRPLLLRAPQFTPSEAHLKIYESVLTPPPLGALETPEPELPPADDSP 695

  Fly   427 SHPQRSSLDNAN------------VPIQVHVRFFADNGRRNWIKPENLLTFAGLKAFDDMREELR 479
            :.|:..::...|            .|::|.|             |.:     |..|..: .::|:
Mouse   696 AEPEPRAVGRTNHLSLPRFVPVVTSPVKVEV-------------PPH-----GAPALSE-GQQLQ 741

  Fly   480 IKHGPKSAKYRQMVPKRTKVVIWRQAIEEAQAMTQIPYSDR----LEKFYQTYENVVTLNRQKRK 540
            ::. |..|...|::|         ||:...|...|.|.|.:    |||     ..|.:|.     
Mouse   742 LQQ-PPQALQTQLLP---------QALPPQQPQAQPPPSPQHTPPLEK-----ARVASLG----- 786

  Fly   541 RTKYMMQDTSDVGSSLYDSTDNLHNKQGTQLLAVKRERSES-----PFSPAFSP----------- 589
                 ....|.|...::    :|..:...||..:.:::.:.     |.|||...           
Mouse   787 -----SLPLSGVEEKMF----SLLKRAKVQLFKIDQQQQQKVAASMPLSPAVQTEEAVGTVKQTP 842

  Fly   590 ----VKSKNEKRAKRRKLSNGTEADTGS-------NSMAVTPSQTETTVDSSAYENPEFRQLLSA 643
                |:|::|....:|..::|.|:....       ...||...|....|       ||....|||
Mouse   843 DRGCVRSEDESMEAKRDRASGPESPLQGPRIKHVCRHAAVALGQARAMV-------PEDVPRLSA 900

  Fly   644 VMEYVMMNRSDEKVEKVLLSVVSNIWSLKQIQLRELERDLASGEIEEPLGSS------------- 695
            :   .:.:|.|...|..  |..|...|:.....||.......|...||.||:             
Mouse   901 L---PLRDRQDLATEDT--SSASETESVPSRSQREKVESAGPGGDSEPTGSTGALAHTPRRSLPS 960

  Fly   696 -------------------VVGRGS-----------GVGTIK---------RLSNRLMTMMVRRS 721
                               |...||           |..|.|         ::..|.|..:.::.
Mouse   961 HHGKKMRMARCGHCRGCLRVQDCGSCVNCLDKPKFGGPNTKKQCCVYRKCDKIEARKMERLAKKG 1025

  Fly   722 MT-------------PVVTP------------------STTPAPSEPDRRLSEPPKTKKPV-NRP 754
            .|             |..:|                  ..||.|.|.|..|.:....::.| .||
Mouse  1026 RTIVKTLLPWDSDESPEASPGPPGPRRGAGAGGSREEVGATPGPEEQDSLLLQRKSARRCVKQRP 1090

  Fly   755 IEEVIEDILQLDSKYLFRGLSREPICKYCYQAGSDLVRCSRTCSSWLHADCLE-------RKVTG 812
            ..:|.||  ..||         ||       .|....|  |..........||       ||.|.
Mouse  1091 SYDVFED--SDDS---------EP-------GGPPAPR--RRTPREHELPVLEPEEQSRPRKPTL 1135

  Fly   813 APMPKIGSRKAL---VIPP----------TSKSPSPDEDH-VTADAKE------VVAVGTSLVCH 857
            .|:.::.:|:.|   .:.|          |.|..|||..| |..|.||      |..:|...|..
Mouse  1136 QPVLQLKARRRLDKDALAPGPFASFPNGWTGKQKSPDGVHRVRVDFKEDCDLENVWLMGGLSVLT 1200

  Fly   858 ECNVGEPEGCVICHQVESPAVPSTPRKEDSSSHTPIEDKLLTCSQPMCGKRFHTSCCKYWPQASS 922
            ....|.|..|::|              .....|     :|:.|.  :|...||..|.:...:.|.
Mouse  1201 SVPGGPPMVCLLC--------------ASKGLH-----ELVFCQ--VCCDPFHPFCLEEAERPSP 1244

  Fly   923 SKHSARCPRH--VCHTCVSDDPSGKFQQLGSSKLAKCVRCPATYHQLS----------------- 968
            ......|.|.  .||.|      |: :..||..|.:|.||...||...                 
Mouse  1245 QHRDTWCCRRCKFCHVC------GR-KGRGSKHLLECERCRHAYHPACLGPSYPTRATRRRRHWI 1302

  Fly   969 -----KCIPAGTQMLNTTNI-------ICPRHNIAKADAHVNVLWCYICVK-------GGELVCC 1014
                 :|...|.......::       :|||    ..:.:....:|.||.:       ..:::.|
Mouse  1303 CSACVRCKSCGATPGKNWDVEWSGDYSLCPR----CTELYEKGNYCPICTRCYEDNDYESKMMQC 1363

  Fly  1015 ETCPIAVHAHCRNIPIKTNE---------SYICEECESGRLPLYGEIVWAKFNNFRWWPAII--- 1067
            ..|...|||.|..:..:..|         .|.|..|.....|             ||..|:.   
Mouse  1364 AQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGPCAGATQP-------------RWREALSGAL 1415

  Fly  1068 ----------LPPTEVPSNIL---------KKAHGENDFVVRFFGTHDHGWISRRRVYLYIEGDT 1113
                      |..::|...:|         |:.|.         |..|...:.:|    :.||  
Mouse  1416 QGGLRQVLQGLLSSKVAGPLLLCTQCGQDGKQLHP---------GPCDLQAVGKR----FEEG-- 1465

  Fly  1114 GDGHKTKSQLFRNYTTGVEEASRFLPIIKARRQEQDMERQSGNKL-----------------HPP 1161
                     |:::..:.:|:....|  ::...:.:..||::|:::                 |.|
Mouse  1466 ---------LYKSVHSFMEDVVAIL--MRHSEEGETPERRAGSQMKGLLLKLLESAFCWFDAHDP 1519

  Fly  1162 PYVKIKT--------NKAVPP 1174
            .|.:..|        |..:||
Mouse  1520 KYWRRSTRLPNGVLPNAVLPP 1540

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
NSDNP_733239.1 PWWP_NSD_rpt1 395..514 CDD:438972 22/139 (16%)
PHD2_NSD 867..932 CDD:277040 11/64 (17%)
PHD3_NSD 933..988 CDD:277041 16/83 (19%)
PHD4_NSD 1001..1041 CDD:277042 12/55 (22%)
PWWP_NSD_rpt2 1048..1143 CDD:438963 16/116 (14%)
AWS 1183..1233 CDD:197795
SET_NSD 1233..1375 CDD:380950
Kmt2bNP_001277502.1 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..65
Menin-binding motif (MBM). /evidence=ECO:0000250|UniProtKB:Q9UMN6 17..36
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 82..524 68/412 (17%)
PHA03378 <483..>702 CDD:223065 34/218 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 542..783 44/274 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 831..872 6/40 (15%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 899..964 15/69 (22%)
zf-CXXC 963..1010 CDD:366873 6/46 (13%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1032..1076 7/43 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1088..1138 17/69 (25%)
PHD1_KMT2B 1209..1255 CDD:277064 12/66 (18%)
PHD2_KMT2B 1257..1306 CDD:277066 12/55 (22%)
PHD3_KMT2B 1343..1399 CDD:277068 12/55 (22%)
Bromo_ALL-1 1380..1522 CDD:99925 26/180 (14%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1550..1572
ePHD_KMT2B 1587..1691 CDD:277164
FYRN 1739..1786 CDD:461787
PHA03307 1814..>2193 CDD:223039
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2065..2113
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2125..2169
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2288..2365
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2391..2417
FYRC 2420..2504 CDD:197781
WDR5 interaction motif (WIN). /evidence=ECO:0000250|UniProtKB:Q9UMN6 2515..2520
SET_KMT2A_2B 2569..2722 CDD:380947
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.