DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set2 and Kmt2a

DIOPT Version :9

Sequence 1:NP_572888.2 Gene:Set2 / 32301 FlyBaseID:FBgn0030486 Length:2362 Species:Drosophila melanogaster
Sequence 2:NP_001344478.1 Gene:Kmt2a / 214162 MGIID:96995 Length:3966 Species:Mus musculus


Alignment Length:2029 Identity:392/2029 - (19%)
Similarity:638/2029 - (31%) Gaps:640/2029 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly    22 RPPKVALSALGNTPPHINPSLKHAD--------------------------AEASPTAPEDQDSG 60
            |||.|        .|.||.:::|.|                          :..||..|..|.||
Mouse  2077 RPPVV--------EPDINSTVEHDDNRTIAHSPSSFIDASCKDSQSTAAILSPPSPDRPHSQTSG 2133

  Fly    61 Q-----------------SECRRSSRKK---------------------IIKFDVRDLLNKNRKA 87
            .                 |..:||...:                     ::...:|.:.::....
Mouse  2134 SCYYHVISKVPRIRTPSYSPTQRSPGCRPLPSAGSPTPTTHEIVTVGDPLLSSGLRSIGSRRHST 2198

  Fly    88 HK---IQIEARIDSNPSTGHSQSGTTAAST-SMSTATASAASASSAATVSRLFSMFEMSHQSLPP 148
            ..   ::.:.||.|...||.:.|.::.:|. |:.|||...|||.::.....|.|...:.|.:   
Mouse  2199 SSLSPLRSKLRIMSPVRTGSAYSRSSVSSVPSLGTATDPEASAKASDRGGLLSSSANLGHSA--- 2260

  Fly   149 PPPPPTALEIFAKPR---------------------PTQSLIVAQ---VTSEPSAVGGAHPVQTM 189
             ||..::.......:                     |..||:..:   .:|..|..|.||.. ..
Mouse  2261 -PPSSSSQRTVGGSKTSHLDGSSPSEVKRCSASDLVPKGSLVKGEKNRTSSSKSTDGSAHST-AY 2323

  Fly   190 AGLPPVTPRKR----GRPRKSQL---ADAAIIP-----TVIVPSC------SDSDTN-STSTTTS 235
            .|:|.:||:..    |....|::   |:.:.:|     ||..|..      ||.|.: ..|.:..
Mouse  2324 PGIPKLTPQVHNATPGELNISKIGSFAEPSTVPFSSKDTVSYPQLHLRGQRSDRDQHMDPSQSVK 2388

  Fly   236 NMSSDSGE-----LPGF-----------------------PIQKPKSKLRVSLKRLKLGGRLESS 272
            ...::.||     |||.                       ...|...|.:.|.|.....|::.:.
Mouse  2389 PSPNEDGEIKTLKLPGMGHRPSILHEHIGSSSRDRRQKGKKSSKETCKEKHSSKSYLEPGQVTTG 2453

  Fly   273 DSGNSPSSSSPEVEPPALQDE----NAMDER-----------PK----QEQNLSRMVDAEENSDS 318
            :.||.....:.||..|....:    |...|:           ||    |.:..|:.:.|......
Mouse  2454 EEGNLKPEFADEVLTPGFLGQRPCNNVSSEKIGDKVLPLSGVPKGQSTQVEGSSKELQAPRKCSV 2518

  Fly   319 DSQIIFIEIETESPKGEEEQEEGRPVEVE---PQDLIDIDMELAKQEPTPDPEEDLDEIMVEVLS 380
            ....:.:|.|.:|...::|...|.|..:|   |.:.:.     |.:.|...|         .|..
Mouse  2519 KVTPLKMEGENQSKNTQKESGPGSPAHIESVCPAEPVS-----ASRSPGAGP---------GVQP 2569

  Fly   381 GPPSLWSADDEAEE-----EEDATVQRATPPGKEPAADSCSSA--PRRSRRS--------APLSG 430
            .|.:..|.|.::..     |:|..:.  .|.|.:|..|.....  ||||.|:        .||.|
Mouse  2570 SPNNTLSQDPQSNNYQNLPEQDRNLM--IPDGPKPQEDGSFKRRYPRRSARARSNMFFGLTPLYG 2632

  Fly   431 ------------SSRQGKTLEETFAEIAAESSKQILEAEESQDQEEQHILIDLIEDTLSESEVTS 483
                        |:..||...:..||...:.:..:..::|.          ||.....:.:.::|
Mouse  2633 VRSYGEEDIPFYSNSTGKKRGKRSAEGQVDGADDLSTSDED----------DLYYYNFTRTVISS 2687

  Fly   484 SVSPTIEHMVVEEVVVEENQL----VDEADEILDSKQEFVIKKVFSESD-NIAASLNKDIFEPKV 543
            ...   |.:....:..||.|.    :.:.|.:.|.          :||| ::.|:..|....||.
Mouse  2688 GGE---ERLASHNLFREEEQCDLPKISQLDGVDDG----------TESDTSVTATSRKSSQIPKR 2739

  Fly   544 ETKATCGE--VVPRPEMVTEDVYITEGIAATLEKSAVVTKPTTEM-----IAETKLSDEVVIEPP 601
            ..|....|  .:.|||...|..::.        ||||..|...::     ::..|...:..:|..
Mouse  2740 NGKENGTENLKIDRPEDAGEKEHVI--------KSAVGHKNEPKLDNCHSVSRVKAQGQDSLEAQ 2796

  Fly   602 LKDESDPKQTEVELPESKPAVNIPKSERILSAEVETTSSP----LVPPECCTLESVSGPVLLETS 662
            |......::.....|..|..::...:| :|.::.:..:|.    ::|.:.......:.|.:....
Mouse  2797 LSSLESSRRVHTSTPSDKNLLDTYNAE-LLKSDSDNNNSDDCGNILPSDIMDFVLKNTPSMQALG 2860

  Fly   663 LSTEEKSNE------------NVETTPLKTEAAKEDSPPAAPEEEASNSSEEPNFLLEDYESNQE 715
            .|.|..|:|            |.|......|...:..|...|.:.:.:||....   |.:|...|
Mouse  2861 ESPESSSSELLTLGEGLGLDSNREKDIGLFEVFSQQLPATEPVDSSVSSSISAE---EQFELPLE 2922

  Fly   716 QVAEDEMMKCNNQKGQKQTP--LPEMKEP-EKPVAETVSKKEKAME--NPARSSPAIVDKKVRAG 775
            ..::..::...:.....|.|  |..:.:. ||.|  |:::|..|..  :||..||.:  .....|
Mouse  2923 LPSDLSVLTTRSPTVPSQNPSRLAVISDSGEKRV--TITEKSVASSEGDPALLSPGV--DPAPEG 2983

  Fly   776 EMEKKVVKSTKGTVPEKKMD----SKKSCAAVTPAKQKESGKSAKEAILKK-------------- 822
            .|       |.....:..||    |...|.:|      |.|....:.:.:.              
Mouse  2984 HM-------TPDHFIQGHMDADHISSPPCGSV------EQGHGNSQDLTRNSGTPGLQVPVSPTV 3035

  Fly   823 --ETEKEKSSAKLDSSSPNTLDKKGKDTA----------------QWSP--QLQTLPKSSTK--- 864
              :.:|...|: .||..|:.:......|.                ...|  .|||||...|:   
Mouse  3036 PVQNQKYVPSS-TDSPGPSQISNAAVQTTPPHLKPATEKLIVVNQNMQPLYVLQTLPNGVTQKIQ 3099

  Fly   865 --PPQESAPSVISKTTSNQPAPKEEQHAAKKGLSDNSPPSVLKAKEKAVSGFVECDAMF----KA 923
              .|..|.|||: :|.::...|.........||:.:.|||               .::|    |.
Mouse  3100 LTSPVSSTPSVM-ETNTSVLGPMGSGLTLTTGLNPSLPPS---------------PSLFPPASKG 3148

  Fly   924 MDLANAQLRLDEKNKKKLKKVPTKVEAPPK-----VEPP-------------------TAVPVPG 964
            :........|...........|..:.:||.     |:||                   .|.|..|
Mouse  3149 LLSVPHHQHLHSFPAAAQSSFPPNISSPPSGLLIGVQPPPDPQLLGSEANQRTDLTTTVATPSSG 3213

  Fly   965 QKKSLSGKTSLRRN---------------------TVYEDSPNLERNSSPSSDSAQANTSAGKLK 1008
            .||....:...|:|                     |:...:|:...|.....|....|.|:.:..
Mouse  3214 LKKRPISRLHTRKNKKLAPSSAPSNIAPSDVVSNMTLINFTPSQLSNHPSLLDLGSLNPSSHRTV 3278

  Fly  1009 PSKVKKK-----------INPRRSTICEAAKDLRSSSSSSTPTREVA--ASSPVSTSSDSSSKRN 1060
            |:.:|:.           :.|.:|....||    :::.|||.:::.:  .|.|||..:..||..|
Mouse  3279 PNIIKRSKSGIMYFEQAPLLPPQSVGGTAA----TAAGSSTISQDTSHLTSGPVSALASGSSVLN 3339

  Fly  1061 GSKRTTSDLDGGSKLDQRRYTICEDR--------------------------QPETAIPVPLTKR 1099
            .....|:.....|.......|:...|                          ||   :.:|.:..
Mouse  3340 VVSMQTTAAPTSSTSVPGHVTLANQRLLGTPDIGSISHLLIKASHQSLGIQDQP---VALPPSSG 3401

  Fly  1100 RF-----SMHPKASAN---------PLHDTLLQTAGKKRGRKEGKESLSRQNSL----------- 1139
            .|     |..|.|:|.         |...|...||....|..|....|.|.|.|           
Mouse  3402 MFPQLGTSQTPSAAAMTAASSICVLPSSQTAGMTAASPPGEAEEHYKLQRGNQLLAGKTGTLTSQ 3466

  Fly  1140 -DSSSSASQG-----------AP------KKKALKSAEILSAALLETESSESTSSGSKMSRWDVQ 1186
             |....::.|           ||      :.|.|.||:..|:|...:..|....|||.......:
Mouse  3467 RDRDPDSAPGTQPSNFTQTAEAPNGVRLEQNKTLPSAKPASSASPGSSPSSGQQSGSSSVPGPTK 3531

  Fly  1187 TSPEL------------------------EAANPFGDIAKFIEDGVNLLKRDKVDEDQRKEGQDE 1227
            ..|:.                        ||..|..|.....:..|....|      ..:|.||.
Mouse  3532 PKPKAKRIQLPLDKGSGKKHKVSHLRTSSEAHIPHRDTDPAPQPSVTRTPR------ANREQQDA 3590

  Fly  1228 VKREADPEEDEFAQ---RVANMETPATTPTPSPTQSNPEDSASTTTVLKELETGGG--------- 1280
            ...| .|.:.|..|   .||.:.....|..|:..|.|.|..|     ::|.|:|..         
Mouse  3591 AGVE-QPSQKECGQPAGPVAALPEVQATQNPANEQENAEPKA-----MEEEESGFSSPLMLWLQQ 3649

  Fly  1281 --VRRSHRIKQKPQ---------------------------GPRASQGR-----------GVASV 1305
              .|:....::||:                           ..:..:.|           ||..:
Mouse  3650 EQKRKESITERKPKKGLVFEISSDDGFQICAESIEDAWKSLTDKVQEARSNARLKQLSFAGVNGL 3714

  Fly  1306 ALAPISMD------EQLA---------------ELANIEAIN-------EQFLRSEGLNTFQLLK 1342
            .:..|..|      ||||               |.||...:|       |..||....:.|..|.
Mouse  3715 RMLGILHDAVVFLIEQLAGAKHCRNYKFRFHKPEEANEPPLNPHGSARAEVHLRKSAFDMFNFLA 3779

  Fly  1343 ENFYRCARQVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCSNGARCTNKRFQQ 1407
            ...    ||..:.|.        ..:||.:..|.......:..|.:   |:.....:.|:|.   
Mouse  3780 SKH----RQPPEYNP--------NDEEEEEVQLKSARRATSMDLPM---PMRFRHLKKTSKE--- 3826

  Fly  1408 HQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHLYSKDRNRHYYFMALRG 1472
                ...|:|:...|.|:..:..|..||.::||.|.||.|.:.::|:..|. .:....|...:..
Mouse  3827 ----AVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYD-SKGIGCYMFRIDD 3886

  Fly  1473 EAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEEITFDYQY-LRYGRDA 1536
            ..|:|||..||.:|:|||||:||..::...::|:..|..|:::.|..|||:|:||:: :....:.
Mouse  3887 SEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYDYKFPIEDASNK 3951

  Fly  1537 QRCYCEAANCRGWI 1550
            ..|.|.|..||.::
Mouse  3952 LPCNCGAKKCRKFL 3965

Known Domains:


Indicated by green bases in alignment.

Software error:

Illegal division by zero at /www/www.flyrnai.org/docroot/cgi-bin/DRSC_prot_align.pl line 591.

For help, please send mail to the webmaster (ritg@hms.harvard.edu), giving this error message and the time and date of the error.

GeneSequenceDomainRegion External IDIdentity
Set2NP_572888.2 AWS 1358..1410 CDD:197795 7/51 (14%)
SET 1414..1533 CDD:214614 40/119 (34%)
PostSET 1535..1551 CDD:214703 5/16 (31%)
WW 2014..2043 CDD:278809
SRI 2270..2348 CDD:285448
Kmt2aNP_001344478.1 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..106
Menin-binding motif (MBM). /evidence=ECO:0000250|UniProtKB:Q03164 6..25
Integrase domain-binding motif 1 (IBM1). /evidence=ECO:0000250|UniProtKB:Q03164 121..132
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 130..231
Integrase domain-binding motif 2 (IBM2). /evidence=ECO:0000250|UniProtKB:Q03164 145..150
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 322..343
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 440..590
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 711..943
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 963..1003
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1034..1064
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1101..1161
zf-CXXC 1144..1191 CDD:251032
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1196..1390
PHD1_KMT2A 1432..1478 CDD:277063
PHD2_KMT2A 1480..1529 CDD:277065
PHD3_KMT2A 1567..1626 CDD:277067
Interaction with histone H3K4me3. /evidence=ECO:0000250|UniProtKB:Q03164 1583..1599
Bromo_ALL-1 1649..1779 CDD:99925
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1665..1714
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1807..1870
ePHD_KMT2A 1873..1985 CDD:277163
FYRN 2026..2073 CDD:310506
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2147..2174 3/26 (12%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2214..2339 28/129 (22%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2371..2619 51/263 (19%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2639..2673 5/33 (15%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2709..2759 14/59 (24%)
9aaTAD. /evidence=ECO:0000250|UniProtKB:Q03164 2843..2851 0/7 (0%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2958..3060 21/117 (18%)
Herpes_BLLF1 <3152..>3361 CDD:330317 38/212 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3164..3239 13/74 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3462..3640 39/189 (21%)
FYRC 3666..3749 CDD:197781 9/82 (11%)
WDR5 interaction motif (WIN). /evidence=ECO:0000250|UniProtKB:Q03164 3759..3764 0/4 (0%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3782..3805 6/34 (18%)
SET 3828..3948 CDD:214614 40/120 (33%)