DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment G9a and NSD1

DIOPT Version :9

Sequence 1:NP_001259088.1 Gene:G9a / 30971 FlyBaseID:FBgn0040372 Length:1657 Species:Drosophila melanogaster
Sequence 2:NP_071900.2 Gene:NSD1 / 64324 HGNCID:14234 Length:2696 Species:Homo sapiens


Alignment Length:1919 Identity:343/1919 - (17%)
Similarity:618/1919 - (32%) Gaps:655/1919 - (34%)


- Green bases have known domain annotations that are detailed below.


  Fly     9 NSMSSTFNSDCATSTAEGGTLLN---LNLAEDKTLKWRN------LANNQFASKEKKHKDKEEEE 64
            |.:|...||...::||.|..|.:   .|.|: |..:..|      |......||..:.|:|.:. 
Human   549 NELSRIANSLTGSNTAPGSFLFSSCGKNTAK-KEFETSNGDSLLGLPEGALISKCSREKNKPQR- 611

  Fly    65 RKEARNQEEIEDIKALLADVVDAAAVKL------EEEEAQNAEKVEPHTKCEIEEEGRKEM---- 119
                              .:|..:.|||      :||:..::..:     |...::|..::    
Human   612 ------------------SLVCGSKVKLCYIGAGDEEKRSDSISI-----CTTSDDGSSDLDPIE 653

  Fly   120 ---EYDQDVAKQDSEMEKKQNGKATSITVKMESNERAEKHATEIATTSTERWENESFKTEQQNKK 181
               |.|..|.:.....::.:|      .:.|:.||:. |::...||       |...|.:|: ..
Human   654 HSSESDNSVLEIPDAFDRTEN------MLSMQKNEKI-KYSRFAAT-------NTRVKAKQK-PL 703

  Fly   182 AAEKEEEPILAATQKLEANAEPLTTTRIEVAVASPLVVSSASVKLAADATNQMRAATSAGAATLA 246
            .:....:.::..|:..|...|   |:::.:   |.|..|:...|..:|.||...:.....:::::
Human   704 ISNSHTDHLMGCTKSAEPGTE---TSQVNL---SDLKASTLVHKPQSDFTNDALSPKFNLSSSIS 762

  Fly   247 DKNVQVSPGGTRRS----------------RRTPRPIDTPTSVTDEHVQVE-------------N 282
            .:|..:..|...::                :....|:.....|.:|...::             .
Human   763 SENSLIKGGAANQALLHSKSKQPKFRSIKCKHKENPVMAEPPVINEECSLKCCSSDTKGSPLASI 827

  Fly   283 KKFGKSE---------QYTDCSSHLERFTLDDNTAIVR---LQLK----------------SEPD 319
            .|.||.:         :.|..||.:|       ||:|:   .:||                |:|.
Human   828 SKSGKVDGLKLLNNMHEKTRDSSDIE-------TAVVKHVLSELKELSYRSLGEDVSDSGTSKPS 885

  Fly   320 KPSLTALSPEENSAPAPKRGRGRARKIRPDAEVETSEVILPCEDSLGEKKPGRKRKLPDEPIDQQ 384
            ||.|.:.:..:|..|           |.||.:..|..::|              :.:.|....:|
Human   886 KPLLFSSASSQNHIP-----------IEPDYKFSTLLMML--------------KDMHDSKTKEQ 925

  Fly   385 QL---SDLVVVKTEQEELGD----APLGDVKRMRRSVRLGNRLHADGSPWEEVKTEALHPQPS-A 441
            :|   .:||..::...  ||    :|:| |.::..|   |...|......:..:..| :|.|| .
Human   926 RLMTAQNLVSYRSPGR--GDCSTNSPVG-VSKVLVS---GGSTHNSEKKGDGTQNSA-NPSPSGG 983

  Fly   442 ELSFAEVTSEILPLAVLDEKTPPKKRGRKAKTPCVKLESETSCGLPFANGNKKTNSSGGCELQLP 506
            :.:.:...|..||..:.|::..|  ...|:::.||   :..:||                     
Human   984 DSALSGELSASLPGLLSDKRDLP--ASGKSRSDCV---TRRNCG--------------------- 1022

  Fly   507 KRSKRRIKPTPKI--------LENDELRCEFETKHIERMTQWESA---AAVDGDFETPTTGGNGS 560
                 |.||:.|:        ::|...|...:|:...::.|..|.   |.:.||.|   .||:..
Human  1023 -----RSKPSSKLRDAFSAQMVKNTVNRKALKTERKRKLNQLPSVTLDAVLQGDRE---RGGSLR 1079

  Fly   561 NSSTSRQKSD---------KSDGSNF----------EGGPGHPAGTSAIKKRLFSKSQRDIENYG 606
            ..:....|.|         ..||.:|          :..||.           .|:.....||  
Human  1080 GGAEDPSKEDPLQIMGHLTSEDGDHFSDVHFDSKVKQSDPGK-----------ISEKGLSFEN-- 1131

  Fly   607 AAMLAKSKLPPCPDVEQFLNDIKASRINANRSPEERKLN-------KKQQRKLAKQKEKHLKHLG 664
                .|.     |:::..:|.            |..:||       ||:.::|.:::.|..|.:.
Human  1132 ----GKG-----PELDSVMNS------------ENDELNGVNQVVPKKRWQRLNQRRTKPRKRMN 1175

  Fly   665 LQKNHRDE--------PSDNDSSNTDNEFFPTTRVQVGKPSVTLRVRNSVTKELPTTATLKSRRN 721
            ..|...:.        |||......|.  ||..|.    ||.::       .|.|.|   :....
Human  1176 RFKEKENSECAFRVLLPSDPVQEGRDE--FPEHRT----PSASI-------LEEPLT---EQNHA 1224

  Fly   722 PVVQAAKLTRRIGARAAGEVTEAAR-ASVPISTPDAEQLHSLDTSIQADVTPIRDLDMRPSTSRV 785
            ..:.:|.....:..:::..:.:..: ..:|..||.||               :.:..:|....|:
Human  1225 DCLDSAGPRLNVCDKSSASIGDMEKEPGIPSLTPQAE---------------LPEPAVRSEKKRL 1274

  Fly   786 SKFICLCQKPSQYYARNAPDSSYCCAIDHIDDQKIGCCNELSSEVHNLLRPSQRVSYMILCDEHK 850
                   :|||::..                        |.:.|...:..|.::          :
Human  1275 -------RKPSKWLL------------------------EYTEEYDQIFAPKKK----------Q 1298

  Fly   851 KRLQSHNCCAGCGIFCTQGKFVLCKQQHFFHPDCAQRFIL----STSYEKELGDEEDQGVKFSSP 911
            |::|                    :|.|.....|.:..:|    |::..|::.:......|...|
Human  1299 KKVQ--------------------EQVHKVSSRCEEESLLARGRSSAQNKQVDENSLISTKEEPP 1343

  Fly   912 VLVLKCP--------------HCGLDTPERTSTVTMKCQSLP--------VFLRT-----QKYKI 949
            ||..:.|              |..|  |:.|.:|.:..:..|        :.::|     .|.:.
Human  1344 VLEREAPFLEGPLAQSELGGGHAEL--PQLTLSVPVAPEVSPRPALESEELLVKTPGNYESKRQR 1406

  Fly   950 KPARLTTSSHLTQFGTVENANTPGATAR--NKGGLSTAVTLSAASSPASKTNGAQRGRAGTS--- 1009
            ||.:....|:....|.:......|.:.:  ..|.|...:|.|.|:|.:....|      ||:   
Human  1407 KPTKKLLESNDLDPGFMPKKGDLGLSKKCYEAGHLENGITESCATSYSKDFGG------GTTKIF 1465

  Fly  1010 -----NSNSRHA--------LNSINFAQLIPESVMNVVLRGHVVSASGRVTAEFTPRDMYYAVQN 1061
                 ....|||        :.:.:.::.||.|  ...|..|..:.|.:.|.|       ..|::
Human  1466 DKPRKRKRQRHAAAKMQCKKVKNDDSSKEIPGS--EGELMPHRTATSPKETVE-------EGVEH 1521

  Fly  1062 DDLERVAEILAADFNVLTPIREYLNGTCLHLVAHSGTLQM-------AYLLLCKGASSPDFVNIV 1119
            |.....::.:..:......::|.:...|..|    |.|.:       |:.|.|.|.:.......:
Human  1522 DPGMPASKKMQGERGGGAALKENVCQNCEKL----GELLLCEAQCCGAFHLECLGLTEMPRGKFI 1582

  Fly  1120 DYELRTAL----MCAVMNE---KCDMLNLFLQCG-----------ADVAIKGPDGKTSLHI---- 1162
            ..|.||.:    :|....|   :|    |...||           ....::....:.||||    
Human  1583 CNECRTGIHTCFVCKQSGEDVKRC----LLPLCGKFYHEECVQKYPPTVMQNKGFRCSLHICITC 1643

  Fly  1163 -AAQLGNLEATQLIVDSYRTSRNITSFLSFIDAQDEGGWTAMVWAAELGHTDIVRLASLPQAVFL 1226
             ||...|:.|::                                    |.  ::|....|.|.. 
Human  1644 HAANPANVSASK------------------------------------GR--LMRCVRCPVAYH- 1669

  Fly  1227 KLINIFLFIS----FLLNQDADPNI------CDNDNNTVLHWSTLHNDG-----LDTITVLLQSG 1276
              .|.|...:    ...|....||.      |.|..:..:.|..:.::|     .|:......  
Human  1670 --ANDFCLAAGSKILASNSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFH-- 1730

  Fly  1277 ADC-NVQNVEGD-------------------------------------TPLHI-ACRHSVTRMC 1302
            .:| |:...||:                                     .|.:| ..||.|....
Human  1731 RECLNIDIPEGNWYCNDCKAGKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFP 1795

  Fly  1303 IALIANGADLMIKNKAEQLPF-DCIPNEESECGRTVGFNMQMRSFRPLGLRTFVVCADASNGREA 1366
            : |.....|.:..::|...|: :...:.:.:.|:.|. ....::.:....|...:.|.    :|.
Human  1796 V-LFFGSNDYLWTHQARVFPYMEGDVSSKDKMGKGVD-GTYKKALQEAAARFEELKAQ----KEL 1854

  Fly  1367 RPIQVVRNELAMSENEDEADSLMWPDFRYVTQCIIQQNSVQIDRRVSQMRICSCLDSCSSDRCQC 1431
            |.:|           ||..:....|.:::          ::::|.:.:::|.:. |.....||.|
Human  1855 RQLQ-----------EDRKNDKKPPPYKH----------IKVNRPIGRVQIFTA-DLSEIPRCNC 1897

  Fly  1432 NGASSQNWYTAESRLNADFNYEDPAVIFECN-DVCGCNQLSCKNRVVQNGTRTPLQIVECEDQAK 1495
            ..       |.|:....|....:..:::||: .||.... .|:|:.........::|.  ....:
Human  1898 KA-------TDENPCGIDSECINRMLLYECHPTVCPAGG-RCQNQCFSKRQYPEVEIF--RTLQR 1952

  Fly  1496 GWGVRALANVPKGTFVGSYTGEILTAMEADRR--------TDDSYYFDLDNGHCIDANYYGNVTR 1552
            |||:|...::.||.||..|.||::...|...|        ..:.|...||....|||...||..|
Human  1953 GWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYAR 2017

  Fly  1553 FFNHSCEPNVLPVRVFYEHQDYRF---PKIAFFSCRDIDAGEEICFDYGEKFWRVEHRSCVG--- 1611
            |.||.|:||.       |.|.:..   .::..|:..||.||.|:.|:|        :..|:|   
Human  2018 FMNHCCQPNC-------ETQKWSVNGDTRVGLFALSDIKAGTELTFNY--------NLECLGNGK 2067

  Fly  1612 --CRCLTTTCKYASQSSSTNASPTNATTAPENET 1643
              |:|....|     |......|.|...|.|.::
Human  2068 TVCKCGAPNC-----SGFLGVRPKNQPIATEEKS 2096

Known Domains:


Indicated by green bases in alignment.

Software error:

Illegal division by zero at /www/www.flyrnai.org/docroot/cgi-bin/DRSC_prot_align.pl line 591.

For help, please send mail to the webmaster (ritg@hms.harvard.edu), giving this error message and the time and date of the error.

GeneSequenceDomainRegion External IDIdentity
G9aNP_001259088.1 ATP-synt_B 150..248 CDD:304375 19/97 (20%)
Ank_2 1056..1152 CDD:289560 20/120 (17%)
ANK 1088..1217 CDD:238125 26/158 (16%)
ANK repeat 1088..1120 CDD:293786 8/38 (21%)
ANK repeat 1124..1153 CDD:293786 8/46 (17%)
Ank_2 1127..1249 CDD:289560 23/154 (15%)
ANK 1155..1306 CDD:238125 31/209 (15%)
ANK repeat 1155..1196 CDD:293786 8/45 (18%)
ANK repeat 1199..1249 CDD:293786 9/59 (15%)
Ank_2 1205..1316 CDD:289560 25/164 (15%)
ANK repeat 1251..1283 CDD:293786 5/37 (14%)
ANK repeat 1285..1316 CDD:293786 9/68 (13%)
PreSET 1357..1466 CDD:128744 19/109 (17%)
SET 1495..1602 CDD:214614 38/117 (32%)
NSD1NP_071900.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 207..252
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 281..311
MSH6_like 319..429 CDD:99898
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 487..514
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 872..891 5/18 (28%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 936..1035 26/136 (19%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1067..1093 7/28 (25%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1112..1134 5/38 (13%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1243..1272 6/43 (14%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1294..1344 10/79 (13%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1382..1428 7/45 (16%)
ING <1431..1587 CDD:331088 32/174 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1480..1534 10/62 (16%)
PHD1_NSD1_2 1545..1587 CDD:277118 9/45 (20%)
PHD2_NSD1 1592..1638 CDD:277120 7/49 (14%)
PHD3_NSD1 1639..1692 CDD:277123 13/93 (14%)
PHD4_NSD1 1709..1748 CDD:277126 7/40 (18%)
WHSC1_related 1754..1848 CDD:99899 12/95 (13%)
AWS 1891..1941 CDD:197795 12/57 (21%)
SET 1942..2065 CDD:214614 40/139 (29%)
S-adenosyl-L-methionine binding 1952..1954 0/1 (0%)
S-adenosyl-L-methionine binding 1994..1997 0/2 (0%)