DRSC/TRiP Functional Genomics Resources


Protein Alignment NSD and ehmt1a

DIOPT Version: 10

Sequence 1: NP_733239.1  Gene: NSD / 43351  FlyBaseID: FBgn0039559  Length: 1427  Species: Drosophila melanogaster
Sequence 2: XP_005171835.2  Gene: ehmt1a / 100005761  ZFINID: ZDB-GENE-040724-44  Length: 1059  Species: Danio rerio


Alignment Length: 1117
Identity: 233/1117 (20%)
Similarity: 356/1117 (31%)
Gaps: 378/1117 (33%)
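The summary percentages can be reproduced from the raw counts. The sketch below is illustrative only: the helper name `percent_floor` is hypothetical, and the use of truncation rather than rounding is an assumption inferred from the numbers shown (233/1117 is about 20.9%, yet the page reports 20%); it is not documented behavior of DIOPT.

```python
# Minimal sketch: recompute the summary percentages from the counts above.
# ASSUMPTION: the page truncates percentages instead of rounding them.

ALIGNMENT_LENGTH = 1117  # alignment columns, including gap columns

# Counts copied from the summary lines.
SUMMARY_COUNTS = {"Identity": 233, "Similarity": 356, "Gaps": 378}


def percent_floor(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage, truncated."""
    return (100 * count) // total


for label, count in SUMMARY_COUNTS.items():
    pct = percent_floor(count, ALIGNMENT_LENGTH)
    print(f"{label}: {count}/{ALIGNMENT_LENGTH} ({pct}%)")
```

All three values use the full alignment length as the denominator, so gap columns lower the identity and similarity percentages.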


- Green residues have known domain annotations, which are detailed below.


  Fly   458 KPENL--LTFAGLKAFDDMREELRIKHGPKSAKY--RQMVPKRTKVV-------IWRQAIEEAQA 511
            |||.|  .:|..    |:..:.:.:...|::|:.  .|..|:..|||       .|:..:..|.|
Zfish    75 KPETLEKQSFGS----DNADKRITVVSVPEAAREEDEQREPEGVKVVKPAPPRLAWKTMVRSAPA 135

  Fly   512 MTQIPYSDRLEKFYQTYENVVTLNRQKRKRTKYMMQDTSDVGSS----------LYDSTDNLHNK 566
            ...:.                ||||:   |:.|    ..||.||          |.|||.....|
Zfish   136 PPALK----------------TLNRE---RSGY----ERDVESSPTQHLSLPQELKDSTQTAPKK 177

  Fly   567 QGTQLLAVKRERSESPFSPAFSP---------VKSKNEKRAKRRKLSNG-----TEADTGSNSMA 617
            :       ||:.....|.|...|         :.|.|.:.|.|...|.|     .||.....|..
Zfish   178 K-------KRKMGLYNFVPKKKPKGSKKANVSLSSSNLQEALRMTGSKGGQMSIEEAFRNKASSE 235

  Fly   618 VTPSQTETTV--DSSAYENPEFRQLLSAVMEYVMMNRSDEKVEKVLLSVV-SNIWSLKQIQLRE- 678
            ....||.|.|  |:..::|.|        .|......::|..|..|.::. .::::.....||| 
Zfish   236 KPKKQTSTAVIEDAEPHKNKE--------TEDAPEGPAEEYTELPLHALAHDSLFTAIPTGLRED 292

  Fly   679 ---------LERDLASGEIEEPLGSSVVGRGSGVGTIKRLSNRLMTMMVRRSMTPVVTPSTTPAP 734
                     ||..|.|..:|.|....||                 |:...:.|          |.
Zfish   293 KENMETDDTLEPPLCSCRMETPKNQDVV-----------------TLAEGKCM----------AV 330

  Fly   735 SEPDRRLSEPPKT--KKPVNRPIEEVIEDILQLDSKYLFRGLSRE---PICKYCYQAGSDLVRCS 794
            ...|.:||...|.  |:.:.||...:.:.:|..|.:   .|:.:.   |.|.:..:||: .:.|.
Zfish   331 ESVDGKLSLCRKVIMKQELMRPSLRIPQLVLCEDHR---AGMVKHQSCPGCGFFCRAGT-FMECQ 391

  Fly   795 ----------RTCSSWLHAD--CLERKVTGAPMPKIGSR----KALVIP--------PTSKSPSP 835
                      |:|:|.:...  |          |..|..    |.:.||        ||:..|:.
Zfish   392 PDGNISHRFHRSCASLIRGQVYC----------PHCGEEASHAKEVTIPKPDCVASVPTAAVPAH 446

  Fly   836 DEDHVTADAKEVVAVGTSLVCHECNVGEPEGCVICHQVESPAV----PSTPRKED-SSSHTPIED 895
            ..|:...|                            :|:|.:|    ...|.||. .:....:||
Zfish   447 KRDNALMD----------------------------RVKSDSVMVDGAGDPAKESLENVLNALED 483

  Fly   896 ------KLLTCSQPMCGKR------FHTSCCKYWPQASSSKHSARCPRH---------VCHTCVS 939
                  ||.........||      .|.......|.........:.|.|         :||..| 
Zfish   484 GKYKKFKLPPKQLYFSAKRGELQRILHMLVEGVDPNLRMDSEKRKTPLHCAAEEGHKDICHVLV- 547

  Fly   940 DDPSGKFQQLGSSKLAKC---VRCPATY-------HQLSKCIPAGT-----QMLNTTNIICPRHN 989
                    |.|:: |..|   .|.|..|       ..:...:.||.     .|..:|.:    |.
Zfish   548 --------QAGAN-LDMCDIEQRTPLMYACNNNHLENVKYLLKAGASSAHKDMRGSTCL----HL 599

  Fly   990 IAKADAHVNVLWCYICVKGGELVC-----------------CETCPIAVHAHCRNIPIKTNESYI 1037
            .|:| .|.|||...:.:...::.|                 .|...:.:.|.. ::.|:..|..:
Zfish   600 AARA-GHTNVLQYLLSLPSIDVNCKDDGGWAPLTWATENMRLEQVKMLISAGA-DVQIRDKEENL 662

  Fly  1038 CEECESGRLPLYGEIVWAKFNNFRWWPAIILPPTEVPSNILK--------KAHGEN--------- 1085
            |             :.||.|:..          .|:...:|.        ..||:.         
Zfish   663 C-------------LHWAAFSGC----------DEIAQLLLDHRSDLHAVNVHGDTPLHIAVRQN 704

  Fly  1086 --DFVVRFFGTHDHGWISRRRVYLYIEGDTGDG---HKTKSQLFRNYTTGVEEASRFLPIIKARR 1145
              |.|:.|........:..|      :|:|..|   ..:|.....|....:.:|.|....::.|.
Zfish   705 QLDCVMLFLSRGADVNLKNR------DGETPLGCCNINSKMWTILNTNKKLTDARRGRESLRERL 763

  Fly  1146 QEQDMERQSGNKLHPPPYVK-------------IKTNKAVPPLRFSQNLEDLSTCNCLPVDEHPC 1197
            ..:|:.|  |.:..|.|.|.             |..|.....:...:|::.|..|:|    :..|
Zfish   764 LCRDVSR--GYEDIPVPCVNGVDHEPCPSNFKYIPENCFTSQVNIDENIKHLQHCSC----KDDC 822

  Fly  1198 GPEAGCL-------------NRML----------FNECNPEYCKAGSLCENRMFEQRKSPRLEVV 1239
            . .:.|:             .|:|          ..||| ..|.....|.||:.:.....||:|.
Zfish   823 A-SSSCICGQLSMHCWYGKDGRLLKEFCRDDPPFLFECN-HACSCWRTCRNRVIQNGLRLRLQVF 885

  Fly  1240 YMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRMEQKQRDRDENYYF---LGVEKDFIIDA 1301
            .....|:|:...:.|..|.||.|:.||:|:..|...|       .:::|.|   ..|.:.:.||.
Zfish   886 RTERMGWGVRTLQDIPEGGFVCEFAGEIISDGEANIR-------ENDSYMFNLDNKVGEAYCIDG 943

  Fly  1302 GPKGNLARFMNHSCEPNCETQKWTVNCIH------RVGIFAIKDIPVNSELTFNYLWDDLMNNSK 1360
            ...||::|||||.||||....:  |...|      |:..||.|.|....||.|:| .|......|
Zfish   944 QFYGNVSRFMNHLCEPNLFPVR--VFTKHQDMRFPRIAFFASKHIQAGDELGFDY-GDHYWQIKK 1005

  Fly  1361 K--ACFCGAKRC 1370
            |  .|.||:.:|
Zfish  1006 KYFRCQCGSGKC 1017
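Each alignment block is a triple of lines: the fly sequence, a match line, and the zebrafish sequence. The sketch below tallies the columns of one block, using the final block above as input. The function name `tally_block` is hypothetical, and reading `:` and `.` as conservative and semi-conservative substitutions is an assumption based on common pairwise-alignment conventions; the page itself does not define these symbols or say how its Similarity count is derived.

```python
# Minimal sketch: classify the columns of one alignment block.
# ASSUMPTION: '|' marks identical residues; ':' and '.' mark similar
# residues (common convention, not stated on the page).
from collections import Counter


def tally_block(top: str, match: str, bottom: str) -> dict:
    """Count gap, identical, similar-symbol and mismatch columns."""
    if len(top) != len(bottom):
        raise ValueError("sequence lines must have equal length")
    # Trailing blanks on the match line are often stripped; pad them back.
    match = match.ljust(len(top))
    counts = Counter()
    for a, m, b in zip(top, match, bottom):
        if a == "-" or b == "-":
            counts["gap"] += 1
        elif m == "|":
            counts["identical"] += 1
        elif m in ":.":
            counts["similar_symbol"] += 1
        else:
            counts["mismatch"] += 1
    return dict(counts)


# Final block of the alignment (fly 1361..1370, zebrafish 1006..1017).
last_block = tally_block("K--ACFCGAKRC", "|  .|.||:.:|", "KYFRCQCGSGKC")
print(last_block)
```

Summing such tallies over every block would give totals comparable to the summary counts, subject to the assumption about which symbols count toward similarity.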

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain         Region      External ID  Identity
NSD     NP_733239.1     PWWP_NSD_rpt1  395..514    CDD:438972   16/66 (24%)
                        PHD2_NSD       867..932    CDD:277040   15/81 (19%)
                        PHD3_NSD       933..988    CDD:277041   14/69 (20%)
                        PHD4_NSD       1001..1041  CDD:277042   6/56 (11%)
                        PWWP_NSD_rpt2  1048..1143  CDD:438963   18/116 (16%)
                        AWS            1183..1233  CDD:197795   14/72 (19%)
                        SET_NSD        1233..1375  CDD:380950   49/149 (33%)
ehmt1a  XP_005171835.2  None
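The domain regions are given in fly protein coordinates, while the alignment shown covers fly residues 458 to 1370 only. The sketch below clips each domain to that range to show how many of its residues actually fall inside the alignment. The names `DOMAINS`, `ALIGNED_FLY_RANGE` and `residues_in_range` are hypothetical. Reading the denominators in the Identity column as alignment columns spanned by the domain (residues plus gap columns) is an assumption, though it is consistent with every denominator being at least as large as the clipped residue count.

```python
# Minimal sketch: how much of each annotated NSD domain lies inside the
# aligned fly range. Coordinates are copied from the table and alignment.

ALIGNED_FLY_RANGE = (458, 1370)  # first and last fly residue in the alignment

# (domain name, start, end) in fly protein coordinates, inclusive.
DOMAINS = [
    ("PWWP_NSD_rpt1", 395, 514),
    ("PHD2_NSD", 867, 932),
    ("PHD3_NSD", 933, 988),
    ("PHD4_NSD", 1001, 1041),
    ("PWWP_NSD_rpt2", 1048, 1143),
    ("AWS", 1183, 1233),
    ("SET_NSD", 1233, 1375),
]


def residues_in_range(start: int, end: int, lo: int, hi: int) -> int:
    """Number of residues of start..end that fall within lo..hi (inclusive)."""
    return max(0, min(end, hi) - max(start, lo) + 1)


lo, hi = ALIGNED_FLY_RANGE
for name, start, end in DOMAINS:
    covered = residues_in_range(start, end, lo, hi)
    full = end - start + 1
    print(f"{name}: {covered} of {full} residues inside the alignment")
```

Only the first and last domains are clipped: PWWP_NSD_rpt1 starts before the aligned range and SET_NSD extends past its end.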
