DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment E(z) and Nsd3

DIOPT Version :10

Sequence 1:NP_001261682.1 Gene:E(z) / 39203 FlyBaseID:FBgn0000629 Length:765 Species:Drosophila melanogaster
Sequence 2:NP_001074738.1 Gene:Nsd3 / 234135 MGIID:2142581 Length:1446 Species:Mus musculus


Alignment Length:937 Identity:189/937 - (20%)
Similarity:311/937 - (33%) Gaps:332/937 - (35%)


- Green bases have known domain annotations that are detailed below.


  Fly    19 IKIRQQKRYKRADE-----IKEAWIRNWDEHNHNVQDLYCESKVWQAKPYDPPHVDCVKRAEVTS 78
            ::.:.|:|:...:|     :|.||                  |...|:...|        |.:|.
Mouse   457 LRRQSQRRHTSLEEEEPPPVKIAW------------------KTAAARKSLP--------ASITM 495

  Fly    79 YNGIPSGPQKVPICVINAVTPIPTMYTWAPTQQNFMVEDETVLHNIPYMGDEVLDKDGKFIEELI 143
            :.| ....||   |.::.|..|         :|.|.:::.|              .|||||::.:
Mouse   496 HKG-SLDLQK---CNMSPVVKI---------EQVFALQNAT--------------GDGKFIDQFV 533

  Fly   144 KNYDGKVHGDKDPSFM--DDAIFVELVHALMRSYSKELEEAAPGTATAIKTETLAKSKQGEDDGV 206
              |..|..|:|....:  .|.:   ::.:..:...|..:.|:...||:.....:.|.:|..    
Mouse   534 --YSTKGIGNKTEISVRGQDRL---IISSPSQRSEKPAQSASSPEATSGSAGPVEKKQQRR---- 589

  Fly   207 VDVDADGESPMKLEKTDSKGDLTEVEKKETE-EPLETEDADVKPDVEEVKD-----KLPFPAPII 265
             .:....||....|....|    :::|::.| .|..:....::....|:.|     |....|...
Mouse   590 -SIRTRSESEKSAEVVPKK----KIKKEQVETAPQASLKTGLQKGASEISDSCKPLKKRSRASTD 649

  Fly   266 FQAISANFPDKGTAQE--LKEKYIELTEHQDPERPQECTPNIDGIKAESV----SRERTMHSFHT 324
            .:..|..:.|...:..  |.:..:...:..|   ....|.:.|...|:||    ||.....|...
Mouse   650 VETASCTYRDTSDSDSRGLSDGQVGFGKQVD---SPSATADADASDAQSVDSSLSRRGVGTSKKD 711

  Fly   325 LFCRRCFKY-DCFLH------RH-HVQGL------QGH------------------AGPNLQK-- 355
            ..|:.|.|. ||.:.      || ||:.|      :||                  :|.::::  
Mouse   712 TVCQVCEKAGDCLVACEGECCRHFHVECLGLTAVPEGHFTCEECETGQHPCFSCKVSGKDVKRCS 776

  Fly   356 -------------RRYP----ELKPFAEP--CSNSCYMLID---GMKEKLAADSKTPPI----DS 394
                         |::|    |.|.|..|  |.:||.|..|   ..|.::....:.|..    |:
Mouse   777 VSVCGKFYHEACVRKFPTAIFESKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYHVGDA 841

  Fly   395 CNEA------------SSEDSNDSNSQFSNKDFNHENSKDNGLTV-------------------- 427
            |..|            |:.....|.|...|..|....::  ||.|                    
Mouse   842 CVAAGSVSVSSHILICSNHSKRSSQSAAINVGFCFVCAR--GLIVQDHSDPMFSSYAYKSHYLLS 904

  Fly   428 --NSAAVAEINSIMAG--------------------------MMNITSTQCVWTGADQALYRVLH 464
              |.|.:.::..|.:.                          .::|...:..|...|....:.||
Mouse   905 ESNRAELMKLPMIPSSSASKKRCEKGGRLLCCESCPASFHPECLSIDMPEGCWNCNDCKAGKKLH 969

  Fly   465 K-----VYLKNY-------CA----------IAHNM----------------------------- 478
            .     |.|.||       |:          :.|::                             
Mouse   970 YKQIVWVKLGNYRWWPAEICSPRSVPLNIQGLKHDLGDFPVFFFGSHDYYWVHQGRVFPYVEGDK 1034

  Fly   479 --------LTKTCRQVYEFAQKEDAEF-----SFEDLRQDFT----PPRKKKKKQRLWSLHCRKI 526
                    :.||.::..|.|.|...|.     |.|.|..:.|    ||.|..|..::    ..|:
Mouse  1035 HFAEGQTSINKTFKKALEEAAKRFQELKAQRESKEALEMERTSRKPPPYKHIKANKV----IGKV 1095

  Fly   527 QLKKDSSSNHVYNYTPCDHPGHPCDMNCSCIQTQNFCE-KFCNCSSDCQNRFPGCRCKAQCNTKQ 590
            |                             :|..:..| ..|||....:|   .|..::||..: 
Mouse  1096 Q-----------------------------VQVADLSEIPRCNCKPGDEN---PCGLESQCLNR- 1127

  Fly   591 CPCYLAVRECDPDLCQACGADQFKLTKITCKNVCVQRGLHKHLLMAPSDIAGWGIFLKEGAQKNE 655
                ::..||.|.:|.|  .|:       |:|.|..:.|:....:..::..|||:..|...:|.|
Mouse  1128 ----MSQYECHPQVCPA--GDR-------CQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGE 1179

  Fly   656 FISEYCGEIISQDEADRRGK-VYDKYMCSF-LFNLNNDFVVDATRKGNKIRFANHSINPNCYAKV 718
            |::||.||:|.::|...|.| .::..:.:| :..:..|.::||..|||..||.|||.||||..:.
Mouse  1180 FVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQK 1244

  Fly   719 MMVTGDHRIGIFAKRAIQPGEELFFDY 745
            ..|.||.|:|:||...|..|.||.|:|
Mouse  1245 WTVNGDVRVGLFALCDIPAGMELTFNY 1271

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
E(z)NP_001261682.1 PRC2_HTH_1 151..291 CDD:436286 23/149 (15%)
SANT 452..492 CDD:238096 14/98 (14%)
preSET_CXC 579..610 CDD:408079 8/30 (27%)
SET_EZH2 628..747 CDD:380995 45/120 (38%)
Nsd3NP_001074738.1 KIKL. /evidence=ECO:0000250|UniProtKB:Q9BZ95 164..167
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 191..257
PWWP_NSD3_rpt1 278..407 CDD:438991
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 354..377
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 411..476 3/18 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 550..705 29/169 (17%)
PHD_SF 713..755 CDD:473978 12/41 (29%)
PHD2_NSD3 761..807 CDD:277122 7/45 (16%)
PHD_SF 808..860 CDD:473978 11/51 (22%)
PHD4_NSD3 927..962 CDD:277128 2/34 (6%)
PWWP_NSD3_rpt2 969..1063 CDD:438994 13/93 (14%)
AWS 1114..1152 CDD:465559 13/54 (24%)
SET_NSD3 1154..1295 CDD:380989 44/118 (37%)
PHD5_NSD3 1332..1374 CDD:277131
C5HCH 1373..1422 CDD:465605
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.