DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set2 and Nsd2

DIOPT Version :10

Sequence 1:NP_572888.2 Gene:Set2 / 32301 FlyBaseID:FBgn0030486 Length:2362 Species:Drosophila melanogaster
Sequence 2:NP_001074571.2 Gene:Nsd2 / 107823 MGIID:1276574 Length:1366 Species:Mus musculus


Alignment Length:1423 Identity:297/1423 - (20%)
Similarity:505/1423 - (35%) Gaps:415/1423 - (29%)


- Green bases have known domain annotations that are detailed below.


  Fly   367 PEEDLDEIMVEVLSGPPSLWSADD-----EAEEEEDATVQRATPPGKEPAADSCSSAPRRSRRSA 426
            |.|.|.::...|.:|.|   .|.|     ||:|.:..    .|||...|..:.......:     
Mouse    75 PAEKLKDLTSCVFNGEP---GAHDTKLCFEAQEVKGI----GTPPNTTPIKNGSPEIKLK----- 127

  Fly   427 PLSGSSRQGKTLEETFAEIAAESSKQILEAEESQDQEE--------QHILID-LIEDTLSESEVT 482
             ::.:...||.|.|  :.|..:.:..:.::||::.:.:        :.|..| |:|..|.|:.:.
Mouse   128 -ITKTYMNGKPLFE--SSICGDGAADVSQSEENEQKSDNKTRRNRKRSIKYDSLLEQGLVEAALV 189

  Fly   483 SSV-SPTIEHMVVE-----------EVVVEEN---------------QLVDEADEILDSKQEFVI 520
            |.: ||..:.:.|:           :::::.|               ..:..||.:|.:..:...
Mouse   190 SKISSPADKKIPVKKESCPNTGRDRDLLLKYNVGDLVWSKVSGYPWWPCMVSADPLLHNHTKLKG 254

  Fly   521 KK---------VFSESDNIAASLNKDI--FEPKVETKATCGEVVPRPEMVTEDVYITEGIAATL- 573
            :|         .|.::...|....|.:  ||.:.:.:..|.|...:.....|.:.:.:.|:..| 
Mouse   255 QKKSARQYHVQFFGDAPERAWIFEKSLVAFEGEEQFEKLCQESAKQAPTKAEKIKLLKPISGRLR 319

  Fly   574 -----------------------------------------EKSAVVTKPTTEMIAETKLSDEVV 597
                                                     :::.:||:|..||:..:..|:|..
Mouse   320 AQWEMGIVQAEEAASMSIEERKAKFTFLYVGDQLHLNPQVAKEAGIVTEPLGEMVDSSGASEEAA 384

  Fly   598 IEP-PLKDESDPKQTEVELPESKPAVNIPKSERILSAEVETTSSPLVPPECCTLESVSGPVLLET 661
            ::| .:::|..|.:.......|..|.|         .|.:..:....||:....|...|      
Mouse   385 VDPGSVREEDIPTKRRRRTKRSSSAEN---------QEGDPGTDKSTPPKMAEAEPKRG------ 434

  Fly   662 SLSTEEKSNENVETTPLKTEAAKEDSPPAAPEEEASNSSEEPNFLLEDYESNQEQVAEDEMMKCN 726
                            :.:.|.::.|..:||.....:|:.:  ||:...:...|.|||..     
Mouse   435 ----------------VGSPAGRKKSTGSAPRSRKGDSAAQ--FLVFCQKHRDEVVAEHP----- 476

  Fly   727 NQKGQKQTPLPEMKEPEKPVAETVSKKEKAMENPARSSPAIVDKKVRAGEMEKKVVKSTK-GTVP 790
            :..|:      |::|........:::|:||..|...|.......:..:|....|....|| ...|
Mouse   477 DASGE------EIEELLGSQWSMLNEKQKARYNTKFSLMISAQSEEDSGNGNGKKRSHTKRADDP 535

  Fly   791 EKKMDSKKSCAAVTPAKQKESGKSAKEAILKKETEKEKSSAKLDSSSPNTLDKKGKDTAQWSPQL 855
            .:.:|.:.:     |.|:..:.|.:    |:|:.|              |:..|   ||      
Mouse   536 AEDVDVEDA-----PRKRLRADKHS----LRKQRE--------------TITDK---TA------ 568

  Fly   856 QTLPKSSTKPPQESAPSVISKTTSNQPAPKEEQHAAKKGLSDNSPPSVLKAKEKAVSGFVECDAM 920
                ::|:....|:|.|:.|:             ||.|.|||...|  ||.:.:|.:        
Mouse   569 ----RTSSYKAIEAASSLKSQ-------------AATKNLSDACKP--LKKRNRASA-------- 606

  Fly   921 FKAMDLANAQLRLDEKNKKKLKKVPTKVEAPPKVEPPTAVPVPGQKKSLSGKTSLRRNTVYEDSP 985
                                                 ||....|..||.|...||..:.| .|||
Mouse   607 -------------------------------------TASSALGFNKSSSPSASLTEHEV-SDSP 633

  Fly   986 NLERNSSP--SSDSAQANTSAGKLKPSKVKKKINPRRSTICEAAKDLRSSSSSSTPTREVAASSP 1048
            ..|.:.||  |:|..|...|   :...|.::.:..::..:|:..:     .:.|....|......
Mouse   634 GDEPSESPYESADETQTEAS---VSSKKSERGMAAKKEYVCQLCE-----KTGSLLLCEGPCCGA 690

  Fly  1049 VSTSSDSSSKRNGSKRTTSDLDGG---------SKLDQRRYTI--CEDRQPETAI-PVPLT---K 1098
            ...:....|:|...:.|.::...|         ||::.:|..:  |.....|..: ..|||   .
Mouse   691 FHLACLGLSRRPEGRFTCTECASGIHSCFVCKESKMEVKRCVVNQCGKFYHEACVKKYPLTVFES 755

  Fly  1099 RRFSMHPKASANPLHDTLLQTAGKKRGRKEGKESLSRQNSLDSSSSASQGAPKKKALKSAEILSA 1163
            |.|..       |||..:...|......:..|..:.|   ......|..|.....|...:.|.|.
Mouse   756 RGFRC-------PLHSCMSCHASNPSNPRPSKGKMMR---CVRCPVAYHGGDACLAAGCSVIASN 810

  Fly  1164 ALLETESSESTSSGSKMSRWDVQTSPELEAANPFGDI-------AKFIEDGVNLLKRD------K 1215
            :::.|  ...|:...|.....|..| .....:..|.:       |.|..|.:|:...|      .
Mouse   811 SIICT--GHFTARKGKRHHTHVNVS-WCFVCSKGGSLLCCEACPAAFHPDCLNIEMPDGSWFCND 872

  Fly  1216 VDEDQRKEGQDEVKREADPEEDEFAQRVANME-TPATTPTPSPTQSNPEDSASTTTVLKELETGG 1279
            ....::...||.:           ..::.|.. .||....|.....|.:                
Mouse   873 CRAGKKLHFQDII-----------WVKLGNYRWWPAEVCHPKNVPPNIQ---------------- 910

  Fly  1280 GVRRSHRIKQKP---------------------QGPRASQGRGVASVALAPISMDEQLAELANIE 1323
              :..|.|.:.|                     :|.|.|:.:||..:.        ::.:.|..|
Mouse   911 --KMKHEIGEFPVFFFGSKDYYWTHQARVFPYMEGDRGSRYQGVRGIG--------RVFKNALQE 965

  Fly  1324 A---INEQFLRSEGLNT---------FQLLKENF-YRCARQVSQENAEM-QCDCFLTGDEEAQGH 1374
            |   .||..|:.|...|         ::.:|.|. |...:..:.:.:|: :|:|..| ||.    
Mouse   966 AEARFNEVKLQREARETQESERKPPPYKHIKVNKPYGKVQIYTADISEIPKCNCKPT-DEN---- 1025

  Fly  1375 LSCGAG--CINRMLMIECGP-LCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEF 1436
             .||:.  |:|||||.||.| :|..|..|.|:.|.:.|....::.:|:.||.|:.|:..|..|||
Mouse  1026 -PCGSDSECLNRMLMFECHPQVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEF 1089

  Fly  1437 IMEYVGEVIDSEEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKW 1501
            :.|||||:||.||...|.....::...|:|.:.:..:.:|||..|||.||::||||.||.||.||
Mouse  1090 VNEYVGELIDEEECMARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKW 1154

  Fly  1502 TVNGELRIGFFSVKPIQPGEEITFDYQYLRYGRDAQRCYCEAANCRGWIGGEPDSDEGEQLDEES 1566
            ||||:.|:|.|:|..|..|.|:||:|.....|.:...|.|.|:||.|::|..|            
Mouse  1155 TVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRP------------ 1207

  Fly  1567 DSDAEMDEEELEAEPEEGQPRKSAKAKAKSKLKAKLPLATGRKRKEQTKPKDREYKAG 1624
            .:.|.:..|      |:|:       |||.|.:.:.....|:::.|     |..::.|
Mouse  1208 KTSASLSSE------EKGK-------KAKKKTRRRRAKGEGKRQSE-----DECFRCG 1247

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set2NP_572888.2 valS <709..832 CDD:237855 24/123 (20%)
AWS 1358..1410 CDD:197795 22/55 (40%)
SET_SETD2 1410..1551 CDD:380949 60/140 (43%)
WW 2014..2043 CDD:459800
SRI 2266..2355 CDD:462404
Nsd2NP_001074571.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 149..169 2/19 (11%)
PWWP_NSD2_rpt1 220..347 CDD:438990 15/126 (12%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 373..455 18/112 (16%)
HMG-box_NSD2 452..507 CDD:438807 14/67 (21%)
TNG2 <546..711 CDD:227367 50/264 (19%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 595..659 23/114 (20%)
PHD1_NSD1_2 670..712 CDD:277118 6/46 (13%)
PHD2_NSD2 717..763 CDD:277121 11/52 (21%)
PHD3_NSD2 764..817 CDD:277124 9/57 (16%)
PHD_SF 834..874 CDD:473978 6/39 (15%)
PWWP_NSD2_rpt2 880..975 CDD:438993 20/131 (15%)
AWS 1013..1063 CDD:197795 22/55 (40%)
SET_NSD2 1063..1204 CDD:380988 60/140 (43%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1207..1233 9/50 (18%)
PHD5_NSD2 1242..1284 CDD:277130 1/6 (17%)
C5HCH 1283..1328 CDD:465605
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1330..1366
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.