DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment NSD and Ehmt1

DIOPT Version :10

Sequence 1:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster
Sequence 2:XP_006498485.1 Gene:Ehmt1 / 77683 MGIID:1924933 Length:1312 Species:Mus musculus


Alignment Length:1562 Identity:309/1562 - (19%)
Similarity:490/1562 - (31%) Gaps:508/1562 - (32%)


- Green bases have known domain annotations that are detailed below.


  Fly     7 AHSEIEGDAAHGNVLCNSASDSLTATDEVAAGNDES---VATEGD------------DVEIPRDT 56
            |..|.:.|......|.....|:..|.||.:....|.   :|.:|:            .:..|:.|
Mouse    26 AKQETKQDCCMKTELLREGKDTPMAADEGSTEKQEGETPMAADGETNGSCEKSGDPSHLNAPKHT 90

  Fly    57 NNSTPVRLLDKPGQNPVQNGAQPAAEESELESQRQTPVQKQQQQRVSMVNRKRDLINLQSALSPK 121
            ..:|  |...:.|.|.|...|:....|.:.|..:|..|......:.|::......:| :.||..:
Mouse    91 QENT--RASPQEGTNRVSRVAENGVSERDTEVGKQNHVTADDFMQTSVIGSNGYFLN-KPALQGQ 152

  Fly   122 YIGYANANSPTPLSDSDDTIRTTRRRVNQAAAL-NNSSAGETLAHDNASPRTPGGGGGGGGDDSA 185
            .:...|             |.|:....:.|..| ..:|...||   :|.|:||.......|:.||
Mouse   153 PLRTPN-------------ILTSSLPGHAAKTLPGGASKCRTL---SALPQTPTTAPTVPGEGSA 201

  Fly   186 NQLLSKTYMSPIEKLLIKNGASSPNSTGFEAGSEDLGIRPIVRKHVKRKMKRVPKAKVTLELDEK 250
            :....|                 |.::|.:           ||.|  |..|.:||:.:.|....|
Mouse   202 DTEDRK-----------------PTASGTD-----------VRVH--RARKTMPKSILGLHAASK 236

  Fly   251 NQQEVDEKSVKTEPID---EEVDRTDEAPTQEA--------QTTAISIKSETEAEHKAAVDVHIK 304
            :.:||.:.....|.|:   .|..|....||..|        |....:.||:|..           
Mouse   237 DHREVQDHKEPKEDINRNISECGRQQLLPTFPALHQSLPQNQCYMATTKSQTAC----------- 290

  Fly   305 QEDTIRLDIVNNPVESTSIVITEEPKDLEKSTEELAFALPLASSTE----------VDLKSPPDL 359
                                              |.|.|..|.|.:          |..|....|
Mouse   291 ----------------------------------LPFVLAAAVSRKKKRRMGTYSLVPKKKTKVL 321

  Fly   360 SSTALATSIKSPSSVSIDSAKGLSIVTDPGWPTYQVGDLFWGKVFSYCFWPCMVCPDPLGQIVGN 424
            ....:....||.:..:: .|||                                           
Mouse   322 KQRTVIEMFKSITHSTV-GAKG------------------------------------------- 342

  Fly   425 MPSHPQRSSLDNANVPIQVHVRFFADNGRRNWIKPENLLTFAGLKAFDDMREELR--IKHGPKSA 487
                  ..:||::    .:||     ||       |:|    .:.:.|:..:||.  ..||.:.|
Mouse   343 ------EKALDDS----ALHV-----NG-------ESL----EMDSEDEDSDELEDDEDHGAEQA 381

  Fly   488 KYRQMVPKRTKVVIWRQAIEEAQAMTQIPYSDRLEKFYQTYENVVTLNRQKRKRTKYMMQDTSDV 552
            ........||.    ::::.|         :||..|.....|.               .|::.|.
Mouse   382 AAFPTEDSRTS----KESMSE---------TDRAAKMDGDSEE---------------EQESPDT 418

  Fly   553 GSSLYDSTD--NLHNKQGTQLLAVKRE-RSESPFSPAFSPVKSKNEKRAKRRKLSNGTEADTGSN 614
            |.. .|..|  :|.::...:...:||. :::||:      :|...::|.:.||..:..   .||.
Mouse   419 GED-EDGGDESDLSSESSIKKKFLKRRGKTDSPW------IKPARKRRRRSRKKPSSM---LGSE 473

  Fly   615 SMAVTPSQTETTV--DSSAYENPEFRQLLSAVMEYVMMNRSDEKVEKVLLSVVSNIWSLKQIQLR 677
            :...:|...|...  ||:.|            || |.::..|.:|..:|.|...|          
Mouse   474 ACKSSPGSMEQAALGDSAGY------------ME-VSLDSLDLRVRGILSSQTEN---------- 515

  Fly   678 ELERDLASGEIEEPLGSSVVGRGSGVGTIKRLSNRLMTMMVRRSMTPVVTPSTTPAPSEPDRRLS 742
               ..||||  .:.||:      .|:..:...|.|:.|...|...|  :..:...|....|..|.
Mouse   516 ---EGLASG--PDVLGT------DGLQEVPLCSCRMETPKSREIST--LANNQCMATESVDHELG 567

  Fly   743 EPPKT------KKPVNR-PIEEVIEDILQLDSKYLFRGLSRE----PICKYCYQAGSDLVRC--S 794
            ....:      .:|.|: |:..:.||         .||...:    |.|.|...|| :.:.|  .
Mouse   568 RCTNSVVKYELMRPSNKAPLLVLCED---------HRGRMVKHQCCPGCGYFCTAG-NFMECQPE 622

  Fly   795 RTCSSWLHADCLERKVTGAPMPKIGSR----------KALVIPPTSKSPSPDEDHVTADAKEVVA 849
            .:.|...|.||..|....:..|..|..          ||......:.:|. .|..:.|:.:....
Mouse   623 SSISHRFHKDCASRVNNASYCPHCGEEASKAKEVTIAKADTTSTVTLAPG-QEKSLAAEGRADTT 686

  Fly   850 VGTSLVCHECNVGEPEG---------CVICHQVESPA--VPSTPRKEDSSSHTPIEDKLLTCSQP 903
            .|:.       .|.||.         ...|.....||  |..|...........:|..|:.....
Mouse   687 TGSI-------AGAPEDERSQSTAPQAPECFDPAGPAGLVRPTSGLSQGPGKETLESALIALDSE 744

  Fly   904 MCGK-RFHTSCCKYW------------------PQASSSKHSARCPRH---------VCH----- 935
            ...| |||.....:.                  |.......|.|.|.|         :||     
Mouse   745 KPKKLRFHPKQLYFSARQGELQKVLLMLVDGIDPNFKMEHQSKRSPLHAAAEAGHVDICHMLVQA 809

  Fly   936 -----TCVSDDPSGKFQQLGSSKLAKCVRCPATYHQLSKCIPAGTQMLNTTNIICPRH------- 988
                 ||..|..:...:...::.|          ..:...|.||.|       :.|:.       
Mouse   810 GANIDTCSEDQRTPLMEAAENNHL----------DAVKYLIKAGAQ-------VDPKDAEGSTCL 857

  Fly   989 NIAKADAHVNVLWCYICVKGGELVCCET----CPI---AVHAHCR----------NIPIKTNESY 1036
            ::|....|.:|:. |:...|...|.|:.    .|:   ..:.|..          :|.|:.||..
Mouse   858 HLAAKKGHYDVVQ-YLLSNGQMDVNCQDDGGWTPMIWATEYKHVELVKLLLSKGSDINIRDNEEN 921

  Fly  1037 ICEECESGRLPLYGEIVWAKFNNFRWWPAIILPPT----------EVPSNILKKAHGEN--DFVV 1089
            ||             :.||.|:.......|:|...          :.|.:|   |..||  |.||
Mouse   922 IC-------------LHWAAFSGCVDIAEILLAAKCDLHAVNIHGDSPLHI---AARENRYDCVV 970

  Fly  1090 RFFGTHDHGWISRRRVYLYIEGDTG-DGHKTKSQLFR--NYTTGVEEASRFLPIIKARRQEQDME 1151
            .|........:..:      ||:|. ......||::.  ..:..:.:::...|:...:...:|:.
Mouse   971 LFLSRDSDVTLKNK------EGETPLQCASLSSQVWSALQMSKALRDSAPDKPVAVEKTVSRDIA 1029

  Fly  1152 R-----------QSGNKLHPPPYVKIKTNKAVPPLRFSQNLEDLSTCNCLPVDEHP-----CG-- 1198
            |           ...::|.|..|..:..|....|:...:|:..|..|.|  ||:..     ||  
Mouse  1030 RGYERIPIPCVNAVDSELCPTNYKYVSQNCVTSPMNIDRNITHLQYCVC--VDDCSSSTCMCGQL 1092

  Fly  1199 -------------PEAGCLNRMLFNECNPEYCKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVN 1250
                         ||.......|..||| ..|.....|.||:.:.....||::....:.|:|:.:
Mouse  1093 SMRCWYDKDGRLLPEFNMAEPPLIFECN-HACSCWRNCRNRVVQNGLRARLQLYRTQDMGWGVRS 1156

  Fly  1251 REPIAVGDFVIEYVGEVINHAEFQRRMEQKQRDRDENYYFLGVEKD---FIIDAGPKGNLARFMN 1312
            .:.|.:|.||.|||||:|:.:|...|.|       ::|.|....||   :.|||...||::||:|
Mouse  1157 LQDIPLGTFVCEYVGELISDSEADVREE-------DSYLFDLDNKDGEVYCIDARFYGNVSRFIN 1214

  Fly  1313 HSCEPNCETQKWTVNCIH------RVGIFAIKDIPVNSELTFNY---LWDDLMNNSKKACFCGAK 1368
            |.||||....:  |...|      |:..|:.:.|....:|.|:|   .||  :.....:|.||:.
Mouse  1215 HHCEPNLVPVR--VFMSHQDLRFPRIAFFSTRLIQAGEQLGFDYGERFWD--VKGKLFSCRCGSS 1275

  Fly  1369 RC 1370
            :|
Mouse  1276 KC 1277

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
NSDNP_733239.1 PWWP_NSD_rpt1 395..514 CDD:438972 17/120 (14%)
PHD2_NSD 867..932 CDD:277040 15/85 (18%)
PHD3_NSD 933..988 CDD:277041 11/64 (17%)
PHD4_NSD 1001..1041 CDD:277042 12/56 (21%)
PWWP_NSD_rpt2 1048..1143 CDD:438963 20/109 (18%)
AWS 1183..1233 CDD:197795 17/69 (25%)
SET_NSD 1233..1375 CDD:380950 48/150 (32%)
Ehmt1XP_006498485.1 EHMT_ZBD 530..660 CDD:411018 28/141 (20%)
ANKYR 734..990 CDD:440430 54/295 (18%)
ANK repeat 788..817 CDD:293786 6/28 (21%)
ANK repeat 819..850 CDD:293786 7/47 (15%)
ANK repeat 852..884 CDD:293786 7/32 (22%)
ANK repeat 886..917 CDD:293786 4/30 (13%)
ANK repeat 919..950 CDD:293786 8/43 (19%)
ANK repeat 952..983 CDD:293786 9/33 (27%)
SET_EHMT1 1050..1280 CDD:380933 69/242 (29%)

Return to query results.
Submit another query.