DRSC/TRiP Functional Genomics Resources

Protein Alignment egg and NSD1

DIOPT Version: 9

Sequence 1: NP_611966.3  Gene: egg / 37962  FlyBaseID: FBgn0086908  Length: 1262  Species: Drosophila melanogaster
Sequence 2: NP_071900.2  Gene: NSD1 / 64324  HGNCID: 14234  Length: 2696  Species: Homo sapiens


Alignment Length: 1435  Identity: 286/1435 (19%)
Similarity: 452/1435 (31%)  Gaps: 482/1435 (33%)
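
The percentages in the summary follow from the counts divided by the alignment length. The sketch below reproduces them in Python, assuming the page truncates rather than rounds (286/1435 is about 19.9% but is reported as 19%). The function name `percent_truncated` is illustrative and is not part of DIOPT.

```python
def percent_truncated(count: int, total: int) -> int:
    """Integer percentage, truncated toward zero (assumed to match the summary)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return count * 100 // total


ALIGNMENT_LENGTH = 1435  # from the summary above

# Counts copied from the Identity, Similarity and Gaps fields.
summary = {
    "Identity": percent_truncated(286, ALIGNMENT_LENGTH),
    "Similarity": percent_truncated(452, ALIGNMENT_LENGTH),
    "Gaps": percent_truncated(482, ALIGNMENT_LENGTH),
}

for name, pct in summary.items():
    print(f"{name}: {pct}%")
```

Rejecting a non-positive total keeps the helper from dividing by zero when an alignment or region is empty.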


- Green residues have known domain annotations, which are detailed below.


  Fly     2 SGQPTAVD-CLESSGSTVE-----DVQETPASREKSYGLPVRKGENSLESPAEQAAK-DVEIEEL 59
            :..|..|. .|.|.|||..     |..:..|:...|.|.....||.|...|...:.| |:.....
Human   946 TNSPVGVSKVLVSGGSTHNSEKKGDGTQNSANPSPSGGDSALSGELSASLPGLLSDKRDLPASGK 1010

  Fly    60 THSEAIAATGSTRKQCPYGGKAPDEPGKLAD--------ESEDRKGENTKAIASSPVLVAVDSDS 116
            :.|:.:     ||:.|  |...|.  .||.|        .:.:||...|:.......|.:|..|:
Human  1011 SRSDCV-----TRRNC--GRSKPS--SKLRDAFSAQMVKNTVNRKALKTERKRKLNQLPSVTLDA 1066

  Fly   117 SVELIESPVKFSSANESEKDPPKPD---AVNEAAAKEAEEMTDSSISSPTSES------------ 166
            .::  ....:..|.....:||.|.|   .:....:::.:..:|....|...:|            
Human  1067 VLQ--GDRERGGSLRGGAEDPSKEDPLQIMGHLTSEDGDHFSDVHFDSKVKQSDPGKISEKGLSF 1129

  Fly   167 ----FPEKDEKTNKENEQ--------------------EPPGMEVDQDVEESISR---------- 197
                .||.|...|.||::                    ..|...:::..|:..|.          
Human  1130 ENGKGPELDSVMNSENDELNGVNQVVPKKRWQRLNQRRTKPRKRMNRFKEKENSECAFRVLLPSD 1194

  Fly   198 PAEEYKIENTLKGHKRISLTEIEEHKIVDKKDDVLEVELEKGTAPKAAEDEKLNALLSDGDVFYD 262
            |.:|.:.|  ...|:..|.:.:||........|.|:     ...|:....:|.:|.:.|    .:
Human  1195 PVQEGRDE--FPEHRTPSASILEEPLTEQNHADCLD-----SAGPRLNVCDKSSASIGD----ME 1248

  Fly   263 KECVNCNCTKLHKQYVLANMATLNFYQVLRKSSKQQFLCMGCHDTAMDLYEEYAGQLMA---KQP 324
            ||   .....|..|..|...|..:..:.|||.||          ..::..||| .|:.|   ||.
Human  1249 KE---PGIPSLTPQAELPEPAVRSEKKRLRKPSK----------WLLEYTEEY-DQIFAPKKKQK 1299

  Fly   325 LLLKDFHQDHADFVALDSSDEEEEEKQPEKSDFSKNKLQLIEDELDDAIKNVLNKVDFTAQLSWS 389
            .:.:..|:        .||..|||.........::||                 :||..:.:|..
Human  1300 KVQEQVHK--------VSSRCEEESLLARGRSSAQNK-----------------QVDENSLISTK 1339

  Fly   390 K--TILQAKADHLERQFALADV---ELEKVQTTADKMHCALYNSCPVAHKHLPTLDIEPSDYVHE 449
            :  .:|:.:|..||...|.:::   ..|..|.|.         |.|||.:..|...:|..:.:.:
Human  1340 EEPPVLEREAPFLEGPLAQSELGGGHAELPQLTL---------SVPVAPEVSPRPALESEELLVK 1395

  Fly   450 VPPPGEIVR---PPIQLGETY-----YAVKNKAIASWVSIKVIE-------FTESTA------IN 493
            .|...|..|   |..:|.|:.     :..|...:.  :|.|..|       .|||.|      ..
Human  1396 TPGNYESKRQRKPTKKLLESNDLDPGFMPKKGDLG--LSKKCYEAGHLENGITESCATSYSKDFG 1458

  Fly   494 GNTMKSY-------KIRYLNTPYQMIKTV---TAKHIAYFEPPPVRLTIGTRVIAYFDGTTLSRG 548
            |.|.|.:       :.|:.....|..|..   ::|.|...|         ..::.:...|:....
Human  1459 GGTTKIFDKPRKRKRQRHAAAKMQCKKVKNDDSSKEIPGSE---------GELMPHRTATSPKET 1514

  Fly   549 KDKGVVQSAFYPGIIAEPLKQANRYRYLIFYDDGYTQYVPHRDVRLVCQASEKVWEDVHAASRDF 613
            .::||...   ||:.|....|..|.......::             |||..||:.|.:...::..
Human  1515 VEEGVEHD---PGMPASKKMQGERGGGAALKEN-------------VCQNCEKLGELLLCEAQCC 1563

  Fly   614 IQKYVEKYSVDRPMVQCTRGQSMTTESN---GTWLYARVIDIDCSLVLMQFEGDKNHTEWI---- 671
            ...::|...    :.:..||:.:..|..   .|....:....|....|:...|...|.|.:    
Human  1564 GAFHLECLG----LTEMPRGKFICNECRTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYP 1624

  Fly   672 --------YRGSLRL-------GPVFRETQNNMNSSSAQQLRVPRRTEPFIRYTKEMESSSKVNQ 721
                    :|.||.:       .|.      |:::|..:.:|..|.  |...:..:.        
Human  1625 PTVMQNKGFRCSLHICITCHAANPA------NVSASKGRLMRCVRC--PVAYHANDF-------- 1673

  Fly   722 QMRAFARKSSASAQNNALAAASSAATPAGGRTN--------------AGGVSTSNSASAVRHLNN 772
               ..|..|...|.|:.:  ..:..||..|..|              .|.:...:|..|..|...
Human  1674 ---CLAAGSKILASNSII--CPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHREC 1733

  Fly   773 STIYVDDEN--------RPKGH---VVYFTAKRNLPPKMYK------CH-ECSPNCLFKIVHRLD 819
            ..|.:.:.|        ..|.|   :|:....|      |:      || ...|:.:.|:.|.:.
Human  1734 LNIDIPEGNWYCNDCKAGKKPHYREIVWVKVGR------YRWWPAEICHPRAVPSNIDKMRHDVG 1792

  Fly   820 SYSPL--------------AKPLLSGWERLVMRQKTKKSV--VYKGPCGKSLRSLAEVHRYLRAT 868
            .:..|              ..|.:.|  .:..:.|..|.|  .||    |:|:..|.....|:|.
Human  1793 EFPVLFFGSNDYLWTHQARVFPYMEG--DVSSKDKMGKGVDGTYK----KALQEAAARFEELKAQ 1851

  Fly   869 ENVLNVDNFDFTPDLKCLAEYSIDPSIVKDTDISKGQEKMAIPLVNYYDNTLPPPCTYAKQRIPT 933
            :            :|:.|.|            ..|..:|             |||..:.|...|.
Human  1852 K------------ELRQLQE------------DRKNDKK-------------PPPYKHIKVNRPI 1879

  Fly   934 EGVHL-NLDEEFLLCCDC----EDDCSDKSKCACWQLTVAGVRYCNPKKPIEEIGYQYKRLHEHV 993
            ..|.: ..|...:..|:|    |:.|...|:|....|                            
Human  1880 GRVQIFTADLSEIPRCNCKATDENPCGIDSECINRML---------------------------- 1916

  Fly   994 PTGIYECN-SRCKCKKNCLNRVVQFSLEMKLQVFKTSNRGWGLRCVNDIPKGAFICIYAGHLLTE 1057
               :|||: :.|.....|.|:........::::|:|..||||||...||.||.|:..|.|.|:  
Human  1917 ---LYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELI-- 1976

  Fly  1058 TMANEGGQDAGDEYFADLDYIEVAEQLKEGYESEVDHSDPDAEEDNGGPDAEDDDDFRPNYHYQR 1122
                                                                |:::.|....|  
Human  1977 ----------------------------------------------------DEEECRARIRY-- 1987

  Fly  1123 KIKRSSRSGSTQNSSTQSSELDSQERAVINFNPNADLDETVRENSVRRLFGKDEAPYIMDAKTTG 1187
                                  :||..:.||. ...||             ||.   |:||...|
Human  1988 ----------------------AQEHDITNFY-MLTLD-------------KDR---IIDAGPKG 2013

  Fly  1188 NLGRYFNHSCSPNLFVQNVFVDTHDLRFPWVAFFSAAHIRSGTELTWNYNYEVGVVPGKVLYCQC 1252
            |..|:.||.|.||...|...|: .|.|   |..|:.:.|::|||||:|||.|. :..||.: |:|
Human  2014 NYARFMNHCCQPNCETQKWSVN-GDTR---VGLFALSDIKAGTELTFNYNLEC-LGNGKTV-CKC 2072

  Fly  1253 GAPNC 1257
            |||||
Human  2073 GAPNC 2077
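
Judging from the blocks above, the middle line of each block marks identical residues with `|`, similar residues with `:`, mismatches with `.`, and gap columns with a space. That reading is inferred from this page rather than taken from DIOPT documentation. Under that assumption, and assuming that the Similarity count includes identical columns, per-block counts can be tallied as in the Python sketch below. The names `BlockStats` and `block_stats` are hypothetical.

```python
from typing import NamedTuple


class BlockStats(NamedTuple):
    length: int
    identical: int
    similar: int  # identical columns plus ':' columns (assumed convention)
    gaps: int


def block_stats(seq1: str, match: str, seq2: str) -> BlockStats:
    """Tally one alignment block given its two sequence rows and the match row."""
    if not (len(seq1) == len(match) == len(seq2)):
        raise ValueError("all three rows must have the same number of columns")
    identical = match.count("|")
    similar = identical + match.count(":")
    # A gap column is one where either sequence row holds '-'.
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")
    return BlockStats(len(seq1), identical, similar, gaps)


# First 24 columns of the first block (Fly from residue 2, Human from 946).
fly = "SGQPTAVD-CLESSGSTVE-----"
match = ":..|..|. .|.|.|||..     "
human = "TNSPVGVSKVLVSGGSTHNSEKKG"

stats = block_stats(fly, match, human)
print(stats)
```

Summing these tuples over all blocks should give totals comparable to the summary at the top of the page, provided the residue columns are sliced out of each row at the same offsets.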

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence     Domain                                              Region       External ID  Identity
egg   NP_611966.3  HMT_MBD                                             823..879     CDD:238689   13/71 (18%)
                   Pre-SET                                             901..1013    CDD:282838   21/117 (18%)
                   SET                                                 1022..1239   CDD:214614   53/216 (25%)
NSD1  NP_071900.2  Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   207..252
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   281..311
                   MSH6_like                                           319..429     CDD:99898
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   487..514
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   872..891
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   936..1035                 26/97 (27%)
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   1067..1093                5/27 (19%)
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   1112..1134                2/21 (10%)
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   1243..1272                7/35 (20%)
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   1294..1344                13/74 (18%)
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   1382..1428                9/45 (20%)
                   ING                                                 <1431..1587  CDD:331088   34/184 (18%)
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   1480..1534                12/65 (18%)
                   PHD1_NSD1_2                                         1545..1587   CDD:277118   10/45 (22%)
                   PHD2_NSD1                                           1592..1638   CDD:277120   8/45 (18%)
                   PHD3_NSD1                                           1639..1692   CDD:277123   10/73 (14%)
                   PHD4_NSD1                                           1709..1748   CDD:277126   6/38 (16%)
                   WHSC1_related                                       1754..1848   CDD:99899    20/105 (19%)
                   AWS                                                 1891..1941   CDD:197795   13/80 (16%)
                   SET                                                 1942..2065   CDD:214614   54/222 (24%)
                   S-adenosyl-L-methionine binding                     1952..1954                1/1 (100%)
                   S-adenosyl-L-methionine binding                     1994..1997                1/2 (50%)
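
Each domain row carries a name, a residue range, an optional CDD identifier, and an optional identity fraction over the aligned part of that region. The Python sketch below parses one row (with the gene and sequence columns already removed) and recomputes its percentage. It assumes the per-domain values are rounded to the nearest integer, which matches 21/117 (about 17.9%) being shown as 18%. `DomainRow`, `parse_domain_row` and `rounded_percent` are hypothetical names.

```python
import re
from typing import NamedTuple, Optional


class DomainRow(NamedTuple):
    name: str
    start: int
    end: int
    external_id: Optional[str]
    identical: Optional[int]
    aligned: Optional[int]


# name, region (optionally marked partial with '<' or '>'), optional CDD id,
# optional "identical/aligned (NN%)" field.
ROW_RE = re.compile(
    r"^(?P<name>.+?)\s+<?(?P<start>\d+)\.\.>?(?P<end>\d+)"
    r"(?:\s+(?P<ext>CDD:\d+))?"
    r"(?:\s+(?P<ident>\d+)/(?P<aligned>\d+)\s+\(\d+%\))?\s*$"
)


def parse_domain_row(line: str) -> DomainRow:
    m = ROW_RE.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised domain row: {line!r}")
    ident, aligned = m.group("ident"), m.group("aligned")
    return DomainRow(
        name=m.group("name"),
        start=int(m.group("start")),
        end=int(m.group("end")),
        external_id=m.group("ext"),
        identical=int(ident) if ident is not None else None,
        aligned=int(aligned) if aligned is not None else None,
    )


def rounded_percent(row: DomainRow) -> Optional[int]:
    """Rounded identity percentage, or None when the row has no usable fraction."""
    if row.identical is None or not row.aligned:
        return None  # also avoids dividing by zero for an empty region
    return round(100 * row.identical / row.aligned)


row = parse_domain_row("Pre-SET 901..1013 CDD:282838 21/117 (18%)")
print(row.name, row.start, row.end, rounded_percent(row))
```

Rows without an identity field, such as the first `Disordered` entries, parse with `identical` and `aligned` set to `None`.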