DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set2 and Nsd2

DIOPT Version :10

Sequence 1:NP_572888.2 Gene:Set2 / 32301 FlyBaseID:FBgn0030486 Length:2362 Species:Drosophila melanogaster
Sequence 2:NP_001178481.2 Gene:Nsd2 / 680537 RGDID:1307955 Length:1365 Species:Rattus norvegicus


Alignment Length:1546 Identity:322/1546 - (20%)
Similarity:535/1546 - (34%) Gaps:470/1546 - (30%)


- Green bases have known domain annotations that are detailed below.


  Fly   247 FPIQKPKSKLRVSLKRLKLGGRLESSDSGN----------------SPSSSSPEVEPPALQDENA 295
            |.|:|....::..:|.:|:....|...|.|                |.:..|..::...:|..|.
  Rat     3 FSIRKSPLSVQKVVKCMKMKQAPEILGSANGKTQNCEVNHECSVFLSKAQLSNSLQEGVMQKFNG 67

  Fly   296 MDERP----KQEQNLSRMVDAEENSDSDSQIIFIEIETESPKGEEEQEEGRPVEVEPQDLIDIDM 356
            .|..|    ::.::|:..|...|....|:::.|   ||:..||........|::           
  Rat    68 HDALPFLPAEKLKDLTSCVFNGEPGAHDTKLCF---ETQEVKGIGTPPNTTPIK----------- 118

  Fly   357 ELAKQEPTPDPEEDLDEIMVEVLSGPPSLWS------ADDEAEEEEDATVQRATPPGKEPAADSC 415
                   ...||..| :|....::|.|...|      |.|.::.||:         |::    |.
  Rat   119 -------NGSPEIKL-KITKTYMNGKPLFESSICGDGAADMSQSEEN---------GQK----SD 162

  Fly   416 SSAPRRSRRSAPLSGSSRQGKTLEETFAEIAAESSKQILEAEES--QDQEEQHILIDLIEDTLSE 478
            :...|..:||........||.......::|::...|:|...:||  ....::.:|:......|..
  Rat   163 NKTRRNRKRSIKYDSLLEQGLVEAALVSKISSPEDKKIPVKKESCPNSGRDRDLLLKYNVGDLVW 227

  Fly   479 SEVTSSVSPTIEHMVVEEVVVEENQLVDEADEILDSKQEFVIKK---------VFSESDNIAASL 534
            |:|  |..|....||             .||.:|.:..:...:|         .|.::...|...
  Rat   228 SKV--SGYPWWPCMV-------------SADPLLHNHTKLKGQKKSARQYHVQFFGDAPERAWIF 277

  Fly   535 NKDI--FEPKVETKATCGEVVPRPEMVTEDVYITEGIAATLEKSAVVTKPTTEMIAETKLSDEVV 597
            .|.:  ||.:.:.:..|.|...:.....|.:.:.:.|:..|.                       
  Rat   278 EKSLVAFEGEEQFEKLCQESAKQAPTKAEKIKLLKPISGRLR----------------------- 319

  Fly   598 IEPPLKDESDPKQTEVELPESKPAVNIPKSERILSAEVETTSSPLVPPECCTLESVSGPVLLETS 662
                       .|.|:.:.:::.|.::...||............|         .::..|..|..
  Rat   320 -----------AQWEMGIVQAEEAASMSVEERKAKFTFLYVGDQL---------RLNPQVAKEAG 364

  Fly   663 LSTEEKSNENVETTPLKTEAA------KEDSPPAAPEEEASNSSEEPNFLLEDYESNQEQVAEDE 721
            ::| |...|.|:::....|||      :|:..|......|..||                     
  Rat   365 IAT-EPLGEMVDSSVANEEAAVDPGTMREEDIPVKRRRRAKRSS--------------------- 407

  Fly   722 MMKCNNQKG----QKQTPLPEM--KEPEKPVAETVSKKEKAMENPARSSPAIVDKKVRAGEMEKK 780
              ...||:|    :|.|| |:|  .||::.|.....:| ::..:.:||         |.|:...:
  Rat   408 --SAENQEGDPGTEKSTP-PKMADAEPKRGVGSPAGRK-RSTGSASRS---------RKGDSAAQ 459

  Fly   781 VVKSTKGTVPEKKMDSKKSCAAVTPAKQKESGKSAKEAILKKE----TEKEKS----------SA 831
            .:     ...:|..|.       ..|:..::.:...|.:|..:    .||:|:          ||
  Rat   460 FL-----VFCQKHRDE-------VVAEHPDASEEEIEELLGSQWSMLNEKQKARYNTKFSLMISA 512

  Fly   832 KLDSSSPNTLDKKGKDTAQWSPQLQTLPKSSTKPPQ----ESAPSVISKTTSNQPAPKEEQHAAK 892
            :.:..|.||..||           :|..|.:..||:    |.||....:|         ::|:.:
  Rat   513 QSEEDSGNTSGKK-----------RTHTKRTDDPPEDVDVEDAPRKRLRT---------DKHSLR 557

  Fly   893 K--GLSDNSP-PSVLKAKEKAVSGFVECDAMFKAMDLANAQLRLDEKNKKKLKKVPTKVEAPPKV 954
            |  .::|.:. .|..||.|.|.|  ::..|..|  :|::|...|.::|:..              
  Rat   558 KRETITDKTARTSSYKAIEAASS--LKSQAATK--NLSDACKPLKKRNRAS-------------- 604

  Fly   955 EPPTAVPVPGQKKSLSGKTSLRRNTVYEDSPNLERNSSP--SSDSAQANTSAGKLKPSKVKKKIN 1017
              .||....|..||.|...||..|.| .|:|..|.:.||  |:|..|...|   :...|.::.:.
  Rat   605 --ATASSALGFNKSSSPSASLTENEV-SDNPGDEPSESPYESADETQTEAS---VSSKKSERGMA 663

  Fly  1018 PRRSTICEAAKDLRSSSSSSTPTREVAASSPVSTSSDSSSKRNGSKRTTSDLDGG---------S 1073
            .::..:|:..:     .:.|....|.........:....|:|...:.|.::...|         |
  Rat   664 AKKEYVCQLCE-----KTGSLLLCEGPCCGAFHLACLGLSQRPEGRFTCTECASGIHSCFVCKES 723

  Fly  1074 KLDQRRYTI--CEDRQPETAI-PVPLT---KRRFSMHPKASANPLHDTLLQTAGKKRGRKEGKES 1132
            |::.:|..:  |.....|..: ..|||   .|.|..       |||..:...|......:..|..
  Rat   724 KMEVKRCMVNQCGKFYHEACVKKYPLTVFESRGFRC-------PLHSCMSCHASNPSNPRPSKGK 781

  Fly  1133 LSR-----------------------QNSLDSSS--SASQG--------------APKKKALKSA 1158
            :.|                       .||:..:.  :|.:|              ..|..:|...
  Rat   782 MMRCVRCPVAYHGGDACLAAGCSVIASNSIICTGHFTARKGKRHHTHVNVSWCFVCSKGGSLLCC 846

  Fly  1159 EILSAALLETESSESTSSGSKMSRW---DVQTSPELEAANPFGDIAKFIEDGVNLLKRDKVDEDQ 1220
            |...||......|.....||    |   |.:...:|.    |.||                    
  Rat   847 EACPAAFHPDCLSIEMPDGS----WFCNDCRAGKKLH----FQDI-------------------- 883

  Fly  1221 RKEGQDEVKREADPEEDEFAQRVANME-TPATTPTPSPTQSNPEDSASTTTVLKELETGGGVRRS 1284
                              ...::.|.. .||....|.....|.:                  :..
  Rat   884 ------------------IWVKLGNYRWWPAEVCHPKNVPPNIQ------------------KMK 912

  Fly  1285 HRIKQKP---------------------QGPRASQGRGVASVA-LAPISMDEQLAELANIEAINE 1327
            |.|.:.|                     :|.|.|:.:||..:. :...::.|..|.      .||
  Rat   913 HEIGEFPVFFFGSKDYYWTHQARVFPYMEGDRGSRYQGVRGIGRVFKNALQEAEAR------FNE 971

  Fly  1328 QFLRSEGLNT---------FQLLKENF-YRCARQVSQENAEM-QCDCFLTGDEEAQGHLSCGAG- 1380
            ..|:.|...|         ::.:|.|. |...:..:.:.:|: :|:|..| ||.     .||:. 
  Rat   972 IKLQREARETQESERKPPPYKHIKVNKPYGKVQIYTADISEIPKCNCKPT-DEN-----PCGSDS 1030

  Fly  1381 -CINRMLMIECGP-LCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGE 1443
             |:|||||.||.| :|..|..|.|:.|.:.|....::.:|:.||.|:.|:..|..|||:.|||||
  Rat  1031 ECLNRMLMFECHPQVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGE 1095

  Fly  1444 VIDSEEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELR 1508
            :||.||...|.....::...|:|.:.:..:.:|||..|||.||::||||.||.||.||||||:.|
  Rat  1096 LIDEEECMARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTR 1160

  Fly  1509 IGFFSVKPIQPGEEITFDYQYLRYGRDAQRCYCEAANCRGWIGGEPDSDEGEQLDEESDSDAEMD 1573
            :|.|:|..|..|.|:||:|.....|.:...|.|.|:||.|::|..|.:......:|:|       
  Rat  1161 VGLFAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKTSTSLSSEEKS------- 1218

  Fly  1574 EEELEAEPEEGQPRKSAKAKAKSKLKAKLPLATGRKRKEQTKPKDREYKAG 1624
                              .|||.|.:.:.....|:::.|     |..::.|
  Rat  1219 ------------------KKAKKKTRRRRAKGEGKRQSE-----DECFRCG 1246

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set2NP_572888.2 valS <709..832 CDD:237855 25/142 (18%)
AWS 1358..1410 CDD:197795 22/55 (40%)
SET_SETD2 1410..1551 CDD:380949 60/140 (43%)
WW 2014..2043 CDD:459800
SRI 2266..2355 CDD:462404
Nsd2NP_001178481.2 PWWP_NSD2_rpt1 220..347 CDD:438990 26/175 (15%)
HMG-box_NSD2 452..507 CDD:438807 10/66 (15%)
TNG2 <546..710 CDD:227367 42/201 (21%)
PHD1_NSD1_2 669..711 CDD:277118 6/46 (13%)
PHD_SF 716..762 CDD:473978 11/52 (21%)
PHD3_NSD2 763..816 CDD:277124 5/52 (10%)
PHD_SF 833..873 CDD:473978 10/43 (23%)
PWWP_NSD2_rpt2 879..974 CDD:438993 20/160 (13%)
AWS 1012..1062 CDD:197795 22/55 (40%)
SET_NSD2 1062..1203 CDD:380988 60/140 (43%)
PHD5_NSD2 1241..1283 CDD:277130 1/6 (17%)
C5HCH 1282..1327 CDD:465605
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.