| Sequence 1: | NP_001259088.1 | Gene: | G9a / 30971 | FlyBaseID: | FBgn0040372 | Length: | 1657 | Species: | Drosophila melanogaster |
|---|---|---|---|---|---|---|---|---|---|
| Sequence 2: | XP_683890.5 | Gene: | nsd1a / 556086 | ZFINID: | ZDB-GENE-080519-3 | Length: | 2053 | Species: | Danio rerio |
| Alignment Length: | 1962 | Identity: | 349/1962 - (17%) |
|---|---|---|---|
| Similarity: | 594/1962 - (30%) | Gaps: | 738/1962 - (37%) |
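
The percentages in the summary above are whole numbers derived from the raw counts over the 1962 alignment columns. As a minimal sketch (the `percent` helper and the `stats` dictionary are hypothetical names introduced here, not part of the source tool), the unrounded values can be recomputed from the `n/d` fields. They come out near 17.8%, 30.3% and 37.6%, so the displayed figures appear to be truncated rather than rounded, though the source does not state its rounding rule.

```python
import re
from fractions import Fraction

# Matches the count pair in fields such as "349/1962 - (17%)" or "63/250 (25%)".
FIELD = re.compile(r"(\d+)\s*/\s*(\d+)")


def percent(field: str) -> float:
    """Return the unrounded percentage encoded by an 'n/d' count field."""
    match = FIELD.search(field)
    if match is None:
        raise ValueError(f"no n/d pair in {field!r}")
    numerator, denominator = map(int, match.groups())
    return float(Fraction(numerator, denominator) * 100)


# Values copied from the alignment summary; the dict itself is illustrative.
stats = {
    "Identity": "349/1962 - (17%)",
    "Similarity": "594/1962 - (30%)",
    "Gaps": "738/1962 - (37%)",
}

for name, field in stats.items():
    print(f"{name}: {percent(field):.2f}%")
```

The same helper accepts the per-domain identity fields in the annotation table, for example `percent("63/250 (25%)")`.
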
- Green residues have known domain annotations, which are detailed below.
|
Fly 1 MTDFVELMNSMSSTFNSDC-ATSTAEGGTLLNLNLAEDKTLKWRNLANNQFASKEKKHKDKEEEE 64
Fly 65 RKEARNQEEIEDIKALLADVVDAAAVKLEEE-----EAQNAEKVEPHTKCE-------------- 110
Fly 111 -----------------------------IEEEGR-KEMEYDQDVAKQDSEMEKKQNGKATSITV 145
Fly 146 KMESNERAEKHATEIATTSTERWENESFKTEQQNKKAAEKEEEPILAATQKLEANAEPLTTTRIE 210
Fly 211 VAVASPLVVSSASVKLAADATNQMRAATSAGAATLADKNVQVSPGGTRRSRRTPRPIDTPTS--- 272
Fly 273 ---------VTDEHVQVENKK--FGKSEQYTDCSSHLERFTLD------DNTAIVRLQL-KSEPD 319
Fly 320 KPSLTALSP--EENSAPAPKRGRGRARKIRPDAEVETSEVILPCEDSL-------GEKKPGRKRK 375
Fly 376 LPDEPIDQQQLSD--------------LVVVKTEQEE----LGDAPLG--DVKRMRRSVRLGNRL 420
Fly 421 HADGSPWEEVKTEALHPQPSAELSFAEVTSEILP---LAVLDEKTPPKKRGRKA--KTPCVKLES 480
Fly 481 ETSCGLPFANGNKK-------------TNSSGGCELQLPKRS-KRRIKPTPKIL-----ENDELR 526
Fly 527 CEFETKHIERMTQWESAAAVDGDFETPTTGGNGSNS-----STSRQKSDKSDGSNFEGGPGHPAG 586
Fly 587 TSAIKKRLFSKSQRDIENYGAAMLAKS-----KLPPCPDVEQFLNDIKASRINANRSPEER-KLN 645
Fly 646 KKQQRKLAKQKEKHLKHL-----------------GLQKN-HRDEPSDNDSSNTD---------- 682
Fly 683 ------NEFFP--------------------TTRVQVGKPSVTLRVRNSVTKELPTTATLKSRRN 721
Fly 722 PVV-------------------------QAAKLTRRIGARAAGEVTEAA---------------- 745
Fly 746 -----------------RASVPISTPDAEQLHS---LDTS--IQADVTPIRDLDMRPSTSRVSKF 788
Fly 789 ICLCQKP---------SQYYARNAPDSSYCCAIDHIDDQKIG-CCNELSSE-------VHNLLRP 836
Fly 837 SQRVSYMILCDEHKKRLQSHNCCAGCGIFC------TQGKFV------------LCKQQ------ 877
Fly 878 -------HFFHPDCAQRFILSTSYEKELGDEEDQGVKFSSPVLVLKCPHCGLDTPERTSTVTMKC 935
Fly 936 QSLPVFLRTQKYKIKPARLTTSSHLTQFGTVENANTPGATARNKGGLSTAVTLSAASSPASKTNG 1000
Fly 1001 AQRGRAGTSNSNSRHALNSINFAQLIPESVMNVVLRGHVVSASGRVTAEFTPRDMYYAVQNDDLE 1065
Fly 1066 RVAEILAADFNVLTPIREYLNGTCLHLVAHSGTLQMAYLLLCKGASSPDFVNIVDYELRTALMCA 1130
Fly 1131 VMNEKCDMLNLFLQ-----CGADVAIKGPDGKTSLHIAAQLGNLEATQLIVDSYRTSRNITSFLS 1190
Fly 1191 FIDAQDEGGWTAMVWAAELGHTDIVRLASLPQAVFLKLINIFLFISFLLNQDADPNICDNDNNTV 1255
Fly 1256 LHWSTLHNDGLDTITVLLQSGADCNVQNVEGDTPLHIACRHSVTRMCIALIANGADLMIKNKAEQ 1320
Fly 1321 LPFDCIPNEESECGRTVGFNMQMRSFRPLGLRTFVVCADASNGREARPIQVVRNELAMSENEDEA 1385
Fly 1386 DSLMWPDFRYVTQCIIQQNSVQIDRRVSQMRICSCLDSCSSDRCQCNGASSQNWYTAESRLNADF 1450
Fly 1451 NYEDPAVIFEC-NDVCGCNQLSCKNRVVQNGTRTPLQIVECE---DQAKGWGVRALANVPKGTFV 1511
Fly 1512 GSYTGEILTAMEADRRTDDS--------YYFDLDNGHCIDANYYGNVTRFFNHSCEPNVLPVRVF 1568
Fly 1569 YEHQDYRF---PKIAFFSCRDIDAGEEICFDYGEKFWRVEHRSCVG-----CRCLTTTCKYASQS 1625
Fly 1626 SSTNASPTNATT 1637 |
| Gene | Sequence | Domain | Region | External ID | Identity |
|---|---|---|---|---|---|
| G9a | NP_001259088.1 | PTZ00121 | <68..>188 | CDD:173412 | 24/168 (14%) |
| | | EHMT_ZBD | 786..933 | CDD:411018 | 30/194 (15%) |
| | | ANKYR | 1063..1322 | CDD:440430 | 32/263 (12%) |
| | | ANK repeat | 1088..1120 | CDD:293786 | 6/31 (19%) |
| | | ANK repeat | 1124..1153 | CDD:293786 | 6/33 (18%) |
| | | ANK repeat | 1155..1196 | CDD:293786 | 5/40 (13%) |
| | | ANK repeat | 1199..1249 | CDD:293786 | 6/49 (12%) |
| | | ANK repeat | 1251..1283 | CDD:293786 | 0/31 (0%) |
| | | ANK repeat | 1285..1316 | CDD:293786 | 4/30 (13%) |
| | | SET_EHMT | 1391..1622 | CDD:380941 | 63/250 (25%) |
| nsd1a | XP_683890.5 | None | | | |