DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment G9a and NSD

DIOPT Version :10

Sequence 1:NP_001259088.1 Gene:G9a / 30971 FlyBaseID:FBgn0040372 Length:1657 Species:Drosophila melanogaster
Sequence 2:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster


Alignment Length:1686 Identity:321/1686 - (19%)
Similarity:528/1686 - (31%) Gaps:543/1686 - (32%)


- Green bases have known domain annotations that are detailed below.


  Fly   122 DQDVAKQDSEME-KKQNGKATSITVKMESNERAEKHATEIATTSTERWENESFKTEQQNKKAAEK 185
            |:.||.:..::| .:....:|.:.:..:..:...::..:.|...:|        .|.|.:...:|
  Fly    40 DESVATEGDDVEIPRDTNNSTPVRLLDKPGQNPVQNGAQPAAEESE--------LESQRQTPVQK 96

  Fly   186 EEEPILAATQK------LEANAEPLTTTRIEVAVASPLVVSSASVKLAADATNQMRAATSAGAA- 243
            :::..::...:      |::...|...........:||..|..:::......||..|..::.|. 
  Fly    97 QQQQRVSMVNRKRDLINLQSALSPKYIGYANANSPTPLSDSDDTIRTTRRRVNQAAALNNSSAGE 161

  Fly   244 TLADKNVQV-SPGGTRRSRRTPRPIDTPTSVTDEHVQVENKKFGKSEQYTDCSSHLERFTLDDNT 307
            |||..|... :|||....           ...|...|:.:|.:         .|.:|:..:.:..
  Fly   162 TLAHDNASPRTPGGGGGG-----------GGDDSANQLLSKTY---------MSPIEKLLIKNGA 206

  Fly   308 AIVRLQLKSEPDKPSLTALSPEENSAPAPKRGRGRARKIRPDAEVETSEVILPCEDSLGEKKPGR 372
                    |.|:.....|.|.:....|..::...|..|..|.|:| |.|:....:..:.||    
  Fly   207 --------SSPNSTGFEAGSEDLGIRPIVRKHVKRKMKRVPKAKV-TLELDEKNQQEVDEK---- 258

  Fly   373 KRKLPDEPID-----------QQQLSDLVVVKTEQEELGDAPLGDVKRMRRSVRLGNRLHADGSP 426
              .:..||||           |:..:..:.:|:|.|....|.:....:...::    ||....:|
  Fly   259 --SVKTEPIDEEVDRTDEAPTQEAQTTAISIKSETEAEHKAAVDVHIKQEDTI----RLDIVNNP 317

  Fly   427 WEE---VKTEALHP----QPSAELSFAEVTSEILPLA---VLDEKTPPKKRGRKAKTPCVKLESE 481
            .|.   |.||  .|    :.:.||:||      ||||   .:|.|:||........|   .::|.
  Fly   318 VESTSIVITE--EPKDLEKSTEELAFA------LPLASSTEVDLKSPPDLSSTALAT---SIKSP 371

  Fly   482 TSCGLPFANGNKKTNSSGGCELQL---------------------------------PKRSK--- 510
            :|..:..|.|.......|....|:                                 |:||.   
  Fly   372 SSVSIDSAKGLSIVTDPGWPTYQVGDLFWGKVFSYCFWPCMVCPDPLGQIVGNMPSHPQRSSLDN 436

  Fly   511 ----------------RR--IKPTPKILEN----------DELRCEFETKH----------IERM 537
                            ||  |||     ||          |::|.|...||          :.:.
  Fly   437 ANVPIQVHVRFFADNGRRNWIKP-----ENLLTFAGLKAFDDMREELRIKHGPKSAKYRQMVPKR 496

  Fly   538 TQ---W----ESAAAVD------------GDFETPTTGGNGSNSSTSRQKSDKSDGSNFEGGPGH 583
            |:   |    |.|.|:.            ..:|...|........|.....|.||          
  Fly   497 TKVVIWRQAIEEAQAMTQIPYSDRLEKFYQTYENVVTLNRQKRKRTKYMMQDTSD---------- 551

  Fly   584 PAGTSAIKKRLFSKSQRDIENYGAAMLA----KSKLPPCPDVEQFLNDIKASRINANRSPEERKL 644
             .|:|     |:..:.......|..:||    :|:.|..|..                ||.:.|.
  Fly   552 -VGSS-----LYDSTDNLHNKQGTQLLAVKRERSESPFSPAF----------------SPVKSKN 594

  Fly   645 NKK-QQRKLAKQKEKHLKHLGLQKNHRDEPSDNDSSNTDNEFFPTTRVQVGKPSVTLRVRNSVTK 708
            .|: ::|||:...|.......:........:..|||..:|..|......|.:..:..|....|.|
  Fly   595 EKRAKRRKLSNGTEADTGSNSMAVTPSQTETTVDSSAYENPEFRQLLSAVMEYVMMNRSDEKVEK 659

  Fly   709 ELPTTA----TLKSRRNPVVQAAKLTRRIGARAAGEVTEAARASVPISTPDAEQLHSLDTSIQAD 769
            .|.:..    :||.     :|..:|.|.:   |:||:.|...:||              ....:.
  Fly   660 VLLSVVSNIWSLKQ-----IQLRELERDL---ASGEIEEPLGSSV--------------VGRGSG 702

  Fly   770 VTPIRDLDMRPSTSRVSKFICLCQKPSQYYARNAPDSSYCCAIDHIDDQKIGCCNELSSEVHNLL 834
            |..|:.|..|..|..|.:.:.....||...|.:.||..               .:|.......:.
  Fly   703 VGTIKRLSNRLMTMMVRRSMTPVVTPSTTPAPSEPDRR---------------LSEPPKTKKPVN 752

  Fly   835 RPSQRVSYMILCDEHK---KRLQSHNCCAGCGIFCTQGKFVLCKQ--QHFFHPDCAQRFILSTSY 894
            ||.:.|...||..:.|   :.|.....|..|  :......|.|.:  ..:.|.||.:|.:.....
  Fly   753 RPIEEVIEDILQLDSKYLFRGLSREPICKYC--YQAGSDLVRCSRTCSSWLHADCLERKVTGAPM 815

  Fly   895 EKELGD---------------EEDQGVKFSSPVLV----LKCPHCGLDTPERTSTVTMKCQSL-- 938
            .| :|.               :||.....:..|:.    |.|..|.:..||.    .:.|..:  
  Fly   816 PK-IGSRKALVIPPTSKSPSPDEDHVTADAKEVVAVGTSLVCHECNVGEPEG----CVICHQVES 875

  Fly   939 PVFLRTQKYKIKPARLTTSSHLTQFGTVENANTPGATARNKGGLSTAVTLSAASSP-ASKTNGAQ 1002
            |....|      |.:..:|||           ||   ..:|        |...|.| ..|.....
  Fly   876 PAVPST------PRKEDSSSH-----------TP---IEDK--------LLTCSQPMCGKRFHTS 912

  Fly  1003 RGRAGTSNSNSRHALNSINFAQLIPESVMNVVLRGHVVSASGRVTAEFTPRDMYYAVQNDDLERV 1067
            ..:.....|:|:|:..                                .||.:.:...:||    
  Fly   913 CCKYWPQASSSKHSAR--------------------------------CPRHVCHTCVSDD---- 941

  Fly  1068 AEILAADFNVLTPIREYLNGTCLHLVAHSGTLQMAYLLLCKGA---------SSPDFVNIVDYEL 1123
               .:..|..|                  |:.::|..:.|...         :....:|      
  Fly   942 ---PSGKFQQL------------------GSSKLAKCVRCPATYHQLSKCIPAGTQMLN------ 979

  Fly  1124 RTALMCAVMN-EKCDMLNLFLQCGADVAIKGPDGKTSLHIAAQLGNLEATQLIVDSYRTSRNITS 1187
            .|.::|...| .|.|.....|.|  .:.:||          .:|...|...:.|.::..:..|.:
  Fly   980 TTNIICPRHNIAKADAHVNVLWC--YICVKG----------GELVCCETCPIAVHAHCRNIPIKT 1032

  Fly  1188 FLSFIDAQDEGGWTAMVWAAELGHTDIVRLASLPQAVFLKLINIFLFISFLLNQDADP-NI---C 1248
            ..|:|..:.|.|                ||....:.|:.|..|...:.:.:|.....| ||   .
  Fly  1033 NESYICEECESG----------------RLPLYGEIVWAKFNNFRWWPAIILPPTEVPSNILKKA 1081

  Fly  1249 DNDNNTVLHWSTLHNDG-LDTITVLLQSGADCNVQNVEGDTPLHIACRHSVTRMCIALIANGADL 1312
            ..:|:.|:.:...|:.| :....|.|.         :||||                  .:|   
  Fly  1082 HGENDFVVRFFGTHDHGWISRRRVYLY---------IEGDT------------------GDG--- 1116

  Fly  1313 MIKNKAEQLPFDCIPNEESECGRTVGFNMQMRSFRPLGLRTFVVCADASNGREARPIQVVRNELA 1377
               :|.:...|........|..|          |.|:                   |:..|.|..
  Fly  1117 ---HKTKSQLFRNYTTGVEEASR----------FLPI-------------------IKARRQEQD 1149

  Fly  1378 MSENEDEADSLMWPDFRYVTQCIIQQNSVQIDRRVSQMRICSCLDSCSSDRCQCNGASSQNWYTA 1442
            |   |.::.:.:.|. .||        .::.::.|..:|....|:..|:  |.|...........
  Fly  1150 M---ERQSGNKLHPP-PYV--------KIKTNKAVPPLRFSQNLEDLST--CNCLPVDEHPCGPE 1200

  Fly  1443 ESRLNADFNYEDPAVIF-ECN-DVCGCNQLSCKNRVVQNGTRTPLQIVECEDQAKGWGVRALANV 1505
            ...||        .::| ||| :.|....| |:||:.:......|::|...:  :|:|:.....:
  Fly  1201 AGCLN--------RMLFNECNPEYCKAGSL-CENRMFEQRKSPRLEVVYMNE--RGFGLVNREPI 1254

  Fly  1506 PKGTFVGSYTGEILTAMEADRR-------TDDSYYF-DLDNGHCIDANYYGNVTRFFNHSCEPNV 1562
            ..|.||..|.||::...|..||       .|::||| .::....|||...||:.||.|||||||.
  Fly  1255 AVGDFVIEYVGEVINHAEFQRRMEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNC 1319

  Fly  1563 LPVRVFYEHQDYR---FPKIAFFSCRDIDAGEEICFDYGEKFW-RVEHRSCVGCRCLTTTC 1619
                   |.|.:.   ..::..|:.:||....|:.|:|   .| .:.:.|...|.|....|
  Fly  1320 -------ETQKWTVNCIHRVGIFAIKDIPVNSELTFNY---LWDDLMNNSKKACFCGAKRC 1370

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
G9aNP_001259088.1 PTZ00121 <68..>188 CDD:173412 10/66 (15%)
EHMT_ZBD 786..933 CDD:411018 31/170 (18%)
ANKYR 1063..1322 CDD:440430 45/273 (16%)
ANK repeat 1088..1120 CDD:293786 4/40 (10%)
ANK repeat 1124..1153 CDD:293786 7/29 (24%)
ANK repeat 1155..1196 CDD:293786 6/40 (15%)
ANK repeat 1199..1249 CDD:293786 10/53 (19%)
ANK repeat 1251..1283 CDD:293786 6/32 (19%)
ANK repeat 1285..1316 CDD:293786 5/30 (17%)
SET_EHMT 1391..1622 CDD:380941 64/243 (26%)
NSDNP_733239.1 PWWP_NSD_rpt1 395..514 CDD:438972 20/123 (16%)
PHD2_NSD 867..932 CDD:277040 18/124 (15%)
PHD3_NSD 933..988 CDD:277041 10/85 (12%)
PHD4_NSD 1001..1041 CDD:277042 9/51 (18%)
PWWP_NSD_rpt2 1048..1143 CDD:438963 25/156 (16%)
AWS 1183..1233 CDD:197795 14/60 (23%)
SET_NSD 1233..1375 CDD:380950 44/150 (29%)

Return to query results.
Submit another query.