Sequence | Accession | Gene | Database ID | Length | Species
---|---|---|---|---|---
Sequence 1 | NP_572888.2 | Set2 / 32301 | FlyBaseID: FBgn0030486 | 2362 | Drosophila melanogaster
Sequence 2 | NP_071900.2 | NSD1 / 64324 | HGNCID: 14234 | 2696 | Homo sapiens

Alignment Length | Identity | Similarity | Gaps
---|---|---|---
2951 | 546/2951 (18%) | 912/2951 (30%) | 1120/2951 (37%)
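The summary percentages can be reproduced from the raw counts. The sketch below assumes the report truncates to a whole percent rather than rounding; that is inferred from the numbers shown (546/2951 is about 18.5% but is reported as 18%), not stated by the report. The helper name `percent_floor` is illustrative.

```python
def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated toward zero (assumed convention)."""
    return (100 * count) // total


ALIGNMENT_LENGTH = 2951

# Raw counts copied from the alignment summary.
counts = {"identity": 546, "similarity": 912, "gaps": 1120}

percentages = {
    name: percent_floor(n, ALIGNMENT_LENGTH) for name, n in counts.items()
}
print(percentages)
```

With integer division, the three values come out to the 18%, 30% and 37% given in the summary.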
- Green residues have known domain annotations that are detailed below.
Fly 11 SPVASRGRGRGRPPKVALS---ALGNTPP---HINPSLKH---------------ADAEASPTAP 54
Fly 55 EDQDSGQSEC-RRSSRKKIIKFDVRDLLNKNRKAHKIQIEARIDSNPSTGHSQSGTTAASTSMST 118
Fly 119 ATAS-------AASASSAATVSRLFSM-------------FEMSHQSLPPPPPPPTALEIFAKPR 163
Fly 164 PTQSLIVAQVTSEPSAVG------GAHPVQTMAGLPPVTPRKRGRPRKSQLADAAII-------- 214
Fly 215 --PTVIVPSCSDSDTNSTSTTTSNMSSDSGELPGFPIQKPKSKLRVSLKRLKLGGRLESSDSGNS 277
Fly 278 PSSSSPEVEPPAL---QDENAMDERP-------KQEQNLSRMVDAEENSDSDSQIIFIEIETESP 332
Fly 333 KGEEEQEEGRPVEVEPQDL-IDIDMELAKQEPTPDPEEDLDEIMVEVLSGPPSL-WSADDEAEEE 395
Fly 396 EDATVQRATPPGKEPAADSCSSAPRRSRRSA--PLSGSSRQGKTLEETFAEIAAESSKQILEAEE 458
Fly 459 SQDQEEQHILIDLIEDTLSESEVTSSVSPTIEHMVVEEVVVEENQLVDEADEILDSKQEFVIKKV 523
Fly 524 FSESDNIAASLNKDIFEPKVETKATCGEVVPRPEMVTED----VYITEGIAATLEK-SAVVTKPT 583
Fly 584 TEMIA-------------ETKLSDEVVI----------EPPLKDESDPKQTEVELPE----SKPA 621
Fly 622 VNIPKSERILSAEVETTSSPLVPPECCTLESVSGPVLLETSLSTEEKSNENVETTPLKTEAAKED 686
Fly 687 SPPAAPEEEASNSSEEPNFLLEDYESNQEQVAEDEMMKCNNQKGQKQTPLPEMKEPEKPVAETVS 751
Fly 752 KKEKAMENPARSSPAIVDKKVRAGEMEKKVVKS-TKGT-------------------VPEKKMDS 796
Fly 797 KKSCAAVTP---AKQKE-SGKSAKEAILKKETEKEK-----SSAKLDSSSP-------NTL---- 841
Fly 842 ----DKKGKDTAQWSPQLQTLPKS------STKPPQESAPSVIS-------------KTTSNQPA 883
Fly 884 PKEEQHAAKKGLSDNSPPSVLKAKEKAVSGFVECDAM-----------FKAMDLANAQL------ 931
Fly 932 --RLDEKNKKKLKKVPT-------------------KVEAPPKVEPPTAVPVPGQ---------- 965
Fly 966 --------KKSLSGKTSLRRNTVYEDSPNLERNSSPSSDSAQANTSAGKLKPSKVKKKINPRRST 1022
Fly 1023 ICEAAKDLRSSSSSSTPTREVAASSPVSTSSDSSSKRNGSKRTTS---------DLDGGSKLDQR 1078
Fly 1079 --RYTICE-------DRQPETAIPVPLTKRRFSMHPKASANPLHDTLLQTAGKKRGRKEGKESLS 1134
Fly 1135 RQNSLDSSSSASQGAPKKKALKSAEILS--AALLETESSESTSSGSKMSRWDVQTS--------P 1189
Fly 1190 ELEAANPF--GDIAK---------------FIEDGVNLLKRDKVDEDQ---RKEGQDEVKREADP 1234
Fly 1235 EEDEFAQRVANMETPATTP---------------------TPSPTQSNPEDSASTTTVL------ 1272
Fly 1273 ----------------------KELETGGGVRRSHRIKQKPQ-----------GPRAS---QG-R 1300
Fly 1301 GVASVALAPISMD-EQLAELANIEA------------INE----QFLRSE---GLNTFQLLKEN- 1344
Fly 1345 -------------FY--------------------------------------------RCAR-- 1350
Fly 1351 ---------------------------------------------QVSQENAEMQC--------- 1361
Fly 1362 ----------------DC----------------------------------------------- 1363
Fly 1364 -------------------FLTGD--------------------------EE--AQGHL------ 1375
Fly 1376 -------------------------------------------SCG--AGCINRMLMIECGP-LC 1394
Fly 1395 SNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHLYSK 1459
Fly 1460 DRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEEIT 1524
Fly 1525 FDYQYLRYGRDAQRCYCEAANCRGWIGGEPDSDEGEQLDEESDSDAEMDEEELEAEPEEGQP--- 1586
Fly 1587 -RKSAKAKAKSKLKAKLPLATGRKRKEQTKPKDRE---YKAGRWLKPSATGSSSSAEKPPKKPKV 1647
Fly 1648 NKFQAMLEDPDVVEELSLLRRGGLKNQQDTLRFSRCLVRAKLLKTRLALLRVLTHGELPCRRLFL 1712
Fly 1713 DYHGLRLLHAWISENGNDDQLREALLDTLESLPI-PNRTMLSDSRVYQSVQLWSNSLEQQLAVVP 1776
Fly 1777 QEKQAALHKRMVALLQKWQALPEIFRIPKRERIEQMKEHEREADRQQKHVHASTALEDQRERESS 1841
Fly 1842 NDRFRQDRFRRDTTSSRIGKPIRMSGNNTIC-TITTQQKGSNGAP--DGMTRNDNRRRSDIGPPS 1903
Fly 1904 EQRRTLSKELRRSLFERKVALDEAERRVCTEDRLEHELRCEFFGADINTDPKQLPFYQKTDTNEW 1968
Fly 1969 FNSDDVPVPAPPRTELLTKALLSPDIDVGQGATDVEYKLPPGVDPLPPAWNWQVTSD-------- 2025
Fly 2026 -GDIYYYNLRERISQWEPPSPEQRL----QTLLEENTTQQPLHELQIDPAVLENELIQVDTDYVG 2085
Fly 2086 SLSAKSLAQYIEAKVRERRDL---RRSKLVSIRLISPRRDEDRLYNQLESRKYKENKEKIRRRKE 2147
Fly 2148 LYRRRKIEVLPDAV------DEIPVPGKAL----------------------PIQPYLFSSDEEE 2184
Fly 2185 TKVAAIEQPAAEEEQDSLNMAPSTSHAAMAALG----KAVAQPTG--LGTVGKRKLPMPPSVTVK 2243
Fly 2244 KHRQEQRSKKVKSSQSP--LTATSAR 2267
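Each alignment row carries its own start and end coordinates, so a row can be sanity-checked by counting its non-gap characters. This is a minimal sketch that assumes rows have the whitespace-separated layout `label start sequence end` seen above and that `-` is the only gap character; `parse_row` and `row_is_consistent` are hypothetical helper names.

```python
def parse_row(row: str):
    """Split 'label start SEQUENCE end' into typed fields (extra tokens ignored)."""
    label, start, seq, end = row.split()[:4]
    return label, int(start), seq, int(end)


def row_is_consistent(row: str) -> bool:
    """True if the number of non-gap residues matches end - start + 1."""
    _, start, seq, end = parse_row(row)
    residues = sum(1 for ch in seq if ch != "-")
    return residues == end - start + 1


# First row of the alignment, copied verbatim.
row = "Fly 11 SPVASRGRGRGRPPKVALS---ALGNTPP---HINPSLKH---------------ADAEASPTAP 54"
print(row_is_consistent(row))
```

The same check applied row by row would flag any line where residues were lost or coordinates were shifted during extraction.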
Gene | Sequence | Domain | Region | External ID | Identity
---|---|---|---|---|---
Set2 | NP_572888.2 | AWS | 1358..1410 | CDD:197795 | 27/222 (12%)
Set2 | NP_572888.2 | SET | 1414..1533 | CDD:214614 | 50/118 (42%)
Set2 | NP_572888.2 | PostSET | 1535..1551 | CDD:214703 | 6/15 (40%)
Set2 | NP_572888.2 | WW | 2014..2043 | CDD:278809 | 8/37 (22%)
Set2 | NP_572888.2 | SRI | 2270..2348 | CDD:285448 |
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256\|SAM:MobiDB-lite | 207..252 | | 7/44 (16%)
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256\|SAM:MobiDB-lite | 281..311 | | 8/55 (15%)
NSD1 | NP_071900.2 | MSH6_like | 319..429 | CDD:99898 | 24/141 (17%)
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256\|SAM:MobiDB-lite | 487..514 | | 11/44 (25%)
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256\|SAM:MobiDB-lite | 872..891 | | 3/18 (17%)
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256\|SAM:MobiDB-lite | 936..1035 | | 17/98 (17%)
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256\|SAM:MobiDB-lite | 1067..1093 | | 4/28 (14%)
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256\|SAM:MobiDB-lite | 1112..1134 | | 6/22 (27%)
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256\|SAM:MobiDB-lite | 1243..1272 | | 8/40 (20%)
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256\|SAM:MobiDB-lite | 1294..1344 | | 11/49 (22%)
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256\|SAM:MobiDB-lite | 1382..1428 | | 9/48 (19%)
NSD1 | NP_071900.2 | ING | <1431..1587 | CDD:331088 | 28/155 (18%)
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256\|SAM:MobiDB-lite | 1480..1534 | | 10/53 (19%)
NSD1 | NP_071900.2 | PHD1_NSD1_2 | 1545..1587 | CDD:277118 | 9/41 (22%)
NSD1 | NP_071900.2 | PHD2_NSD1 | 1592..1638 | CDD:277120 | 4/45 (9%)
NSD1 | NP_071900.2 | PHD3_NSD1 | 1639..1692 | CDD:277123 | 3/52 (6%)
NSD1 | NP_071900.2 | PHD4_NSD1 | 1709..1748 | CDD:277126 | 3/38 (8%)
NSD1 | NP_071900.2 | WHSC1_related | 1754..1848 | CDD:99899 | 3/93 (3%)
NSD1 | NP_071900.2 | AWS | 1891..1941 | CDD:197795 | 17/49 (35%)
NSD1 | NP_071900.2 | SET | 1942..2065 | CDD:214614 | 50/122 (41%)
NSD1 | NP_071900.2 | S-adenosyl-L-methionine binding | 1952..1954 | | 0/1 (0%)
NSD1 | NP_071900.2 | S-adenosyl-L-methionine binding | 1994..1997 | | 0/2 (0%)
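
The Region column uses inclusive `start..end` coordinates, so a domain's length is `end - start + 1`. The sketch below is illustrative only: it assumes a leading `<` or `>` marks a partial boundary, as in sequence feature-table notation, and can be dropped when computing a length. `region_length` is a hypothetical helper name.

```python
def region_length(region: str) -> int:
    """Length of an inclusive 'start..end' region.

    A leading '<' or '>' is assumed to mark a partial boundary and is ignored.
    """
    start, end = region.split("..")
    return int(end.lstrip("<>")) - int(start.lstrip("<>")) + 1


# SET domain regions copied from the table for the fly and human proteins.
set_regions = {"Set2": "1414..1533", "NSD1": "1942..2065"}
set_lengths = {gene: region_length(r) for gene, r in set_regions.items()}
print(set_lengths)
```

Note that these lengths are spans on each protein's own coordinates. The denominators in the Identity column are reported separately by the source and need not equal them.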