Sequence 1: | NP_001261819.1 | Gene: | upSET / 39551 | FlyBaseID: | FBgn0036398 | Length: | 3146 | Species: | Drosophila melanogaster |
---|---|---|---|---|---|---|---|---|---|
Sequence 2: | NP_071900.2 | Gene: | NSD1 / 64324 | HGNCID: | 14234 | Length: | 2696 | Species: | Homo sapiens |
Alignment Length: | 3045 | Identity: | 519/3045 - (17%) |
---|---|---|---|
Similarity: | 927/3045 - (30%) | Gaps: | 1026/3045 - (33%) |
- Green bases have known domain annotations that are detailed below.
Fly 450 VNGTQMTDELSARILQSMAQKSFSQQQRFHQVPATGSGNMPPPTQIIYSNSTSNGA--------- 505
Fly 506 ----------AASSPGG------------NASGNMLLAHYQAAGTKPVSSA-----SFITVTGT- 542
Fly 543 --------PPVTV--------------ATTPSVSISSHGFASGSAAISSYMSSATAARRQSVSAP 585
Fly 586 SSRAVSLERKQHHQQLQHDVIGGGRKAP-------------TVIEYYNKHGVNSIVGSSNNLAQS 637
Fly 638 NSMSNLAGPRSNSGSGFATTTPTP-------ATPLHLT---------------PVNV---PV--- 674
Fly 675 ---------------HVEA-APPSSPALVKGSSQPPAQPQQQQQQAHPLGPNQLNANDEELYIEE 723
Fly 724 VRPVPVLTQDLRLQQLHAIMQDHTYASQQQQQQPQQAAGD-TTNPGAAQQVQQPQQWSLGGIGVT 787
Fly 788 VSGSQGTPTAVGGYCSYFGQQIARSQADDDAHSAISSSSRMGLASTDIDPGEETETAPEAEAEDD 852
Fly 853 SVTRCICELTHDDGYMICCDKCSAWQHVDCMGIDRQNIPEEYMCELCQPRAVDKARARALQRQKR 917
Fly 918 KEHMLLVA-TQAANGAAAVAAGTTLSGGLGSGLPMSEELQHRLASGLNGG--------------- 966
Fly 967 -----FATGTG-----------MSKKSKKTKENSGSTSTLKKTKKSAVGMGGEKNASGSGT--PT 1013
Fly 1014 GSSGKTSKKSSKRKSKSGGDGSSGGGSSPALTAAEKHAANL-----RQWIENYEYAVTNHYSPEL 1073
Fly 1074 RARLHAIQKQPSLLQSIQNTENKALRQIQQQLSTAGSAEQLEQRAQLIPYAGAKVLISSVDLSPH 1138
Fly 1139 APIHE---------LRGKYMLTTQFRTQNPTVNMNTPPPSNYLNSFKAHKTPGQFVFFYQLPGVE 1194
Fly 1195 APMQTLRPDGSVPQVAQQPP-----SYLKGPEVCVDTRTYGNDARFVRRSCRPNA----ELQHYF 1250
Fly 1251 EKGTLHLYIVALTHIRAQ-TEITIRHEPHDLT-AVEQKKSHAAVIQPTSTRCACDMGSDCLFALP 1313
Fly 1314 LAVQQQLQAPPTQPRS--------SHRNKAAAAAAAAAAANSAAAIQLTMGLGVGATVAAGASVL 1370
Fly 1371 PNSRNRSTSSSGESSQMGLN-SPQLGQLNLGFKTSVTATSLTAPVPGVHCNNSGGSSSSSNNSCS 1434
Fly 1435 VSMSSVLHDSGICTSSSSPSVSIPSPTPTQMQSPTLQQHP---------QQIPQQQL-SLLQ--- 1486
Fly 1487 ----------RSPTQQHQQQILAAL---------------------PTPMLTPMLS------PQL 1514
Fly 1515 PKPAQQQ------AHVVLPQSQQTSLLQQ-----------QQSQQSQEPLAVIAAAAAAQQPMAT 1562
Fly 1563 YFVRQPQQQQQQQSPK---PQALVAQQQHVVGAQQQQHFLQQQQKQQQQQMADEARMAVSALQTL 1624
Fly 1625 HAAPTSHIVSPIKVAAVQQQSQ---------------------PQQQQQNTHQQPHNQQAVQQQS 1668
Fly 1669 NQLQQQQSQQPN----------YPQSPQRQQKPQPVQHQP--QIVISTGAQAIPATMPTKLSSPT 1721
Fly 1722 KSAAPVISNNNITVSAQSSVVGGKKTPAKHPQQQQQQQQQPVTPV--SAATAPAATPSSSE---S 1781
Fly 1782 K--------EDDVSASSTTTPTT-----RTPAKDKPKQSREDRKLEAILRAIEKMEKQEARGKKD 1833
Fly 1834 TRQSSG--GKRQASNSPASPNKRNSSNSISEDVETPTST----NSAAAAAQRRN--KKKRKVSRS 1890
Fly 1891 LNNNTNGLGS-------------GGGSNNKRRKSI----VVESDGES------------------ 1920
Fly 1921 -----------------------HALTNSE-SEDQG------QHPQSHHSGSEDQAAGLLLALAH 1955
Fly 1956 NNSSPNE--PFKSPLSQSH-------------SL---PATPASVSSACLLIEAAMGPLQQQPAPA 2002
Fly 2003 SASPSLAE--------FKYPPGGAKTKKSLMSS----------------------WFQQA----- 2032
Fly 2033 -------EQQHASGLDSLVQAAM-------SEINGERE--QLQRQPQGESLPAP-ALLKVEQFIH 2080
Fly 2081 QAESTTA----VPAREQLHLPLQNNSSVKKRWLRQAISEETTPV--DELQQSQNQSVTATPSPQP 2139
Fly 2140 VPTVSPLANGFSTPLKKRRLVVVSNGTNVESDETHIDVIGEPKDEAEENVAMTELKVEIENHHQE 2204
Fly 2205 QDD--------DVDILRSPSPGTHQIVAEDNLVKI-----EPE----------DTSA---AADDV 2243
Fly 2244 KIDVE------------------------------REESQACDKFEEMVKVKREEEEQREKEIKQ 2278
Fly 2279 LQERQEH----------------EQPKVEPAPVEPKLENTVAKAEPKVEPSQEIVSKKEPTKVEP 2327
Fly 2328 KPGESLLRSTATVTATPTAATIAATTLLDVSKVAFKTRPPLKLEDEPQKKKP----KLESILPAP 2388
Fly 2389 VATVPPVSVPPIPAASNATTSAVTNTAAASLTTTTAPSSTKNLTEHDIQERLLSFHAANISYLQS 2453
Fly 2454 RNKKATAA-----LTSASPSQKSNS---------SSGGSGTESKK--SSK-----------DKDE 2491
Fly 2492 KRDKEKQL-KKSKKEKKKSKDKEKQKAAV------NVNSTSQIVDTKKKTTQP------------ 2537
Fly 2538 --SKPDSKSSIAPV---------LVPPSLPVATANGKT---KHTAYNNVDQQQQQQMRRRTMSMC 2588
Fly 2589 ITPVTPTPVVTPSPLHGTPPSTKKRQTNFEQELTKPNSQILSSSILLNSSKGLG-LP-------- 2644
Fly 2645 ---------LAAPT------VVSVPTA-VQQQQHRKENNHQEATPASG----------GPMSLAA 2683
Fly 2684 AIASGKLNAISRRRESMCGSRQQQALIAAALKKEKKEKKKSKKKDREKQKHDKQKGKEKEREKDK 2748
Fly 2749 EKDNKQKTNHIQKPAHPTTVPANSMPISAPAPVPVLVPTPVTTPKAAPIPVLITQPTPS----PI 2809
Fly 2810 HVTQPLVNNCSTKVASLPFYNTIY--GKLQDPSTPTSSPIPVTNTMPSLAEYLES 2862 |
Gene | Sequence | Domain | Region | External ID | Identity |
---|---|---|---|---|---|
upSET | NP_001261819.1 | PHD_MLL5 | 856..899 | CDD:277025 | 6/42 (14%) |
SET | <1221..1273 | CDD:214614 | 7/56 (13%) | ||
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 207..252 | 10/45 (22%) | |
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 281..311 | 7/34 (21%) | |||
MSH6_like | 319..429 | CDD:99898 | 21/143 (15%) | ||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 487..514 | 4/33 (12%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 872..891 | 3/18 (17%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 936..1035 | 25/131 (19%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1067..1093 | 3/25 (12%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1112..1134 | 3/21 (14%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1243..1272 | 3/28 (11%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1294..1344 | 8/49 (16%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1382..1428 | 11/58 (19%) | |||
ING | <1431..1587 | CDD:331088 | 29/162 (18%) | ||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1480..1534 | 10/54 (19%) | |||
PHD1_NSD1_2 | 1545..1587 | CDD:277118 | 5/41 (12%) | ||
PHD2_NSD1 | 1592..1638 | CDD:277120 | 3/45 (7%) | ||
PHD3_NSD1 | 1639..1692 | CDD:277123 | 10/52 (19%) | ||
PHD4_NSD1 | 1709..1748 | CDD:277126 | 8/38 (21%) | ||
WHSC1_related | 1754..1848 | CDD:99899 | 10/93 (11%) | ||
AWS | 1891..1941 | CDD:197795 | 8/50 (16%) | ||
SET | 1942..2065 | CDD:214614 | 26/148 (18%) | ||
S-adenosyl-L-methionine binding | 1952..1954 | 0/1 (0%) | |||
S-adenosyl-L-methionine binding | 1994..1997 | 0/2 (0%) |