DRSC/TRiP Functional Genomics Resources


Protein Alignment NSD and Nsd1

DIOPT Version: 9

Sequence 1: NP_733239.1  Gene: NSD / 43351  FlyBaseID: FBgn0039559  Length: 1427  Species: Drosophila melanogaster
Sequence 2: NP_032765.3  Gene: Nsd1 / 18193  MGIID: 1276545  Length: 2691  Species: Mus musculus


Alignment Length: 1635  Identity: 439/1635 (26%)
Similarity: 664/1635 (40%)  Gaps: 367/1635 (22%)
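
The percentages in the summary follow directly from the counts and the alignment length. The short Python sketch below recomputes them; the function names are illustrative and not part of DIOPT. With these counts, the whole-number values on the page agree with truncation rather than rounding (for example, 439/1635 is about 26.85%, reported as 26%).

```python
# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 1635
COUNTS = {"Identity": 439, "Similarity": 664, "Gaps": 367}


def exact_percent(count, length):
    """Return count/length as a floating-point percentage."""
    return 100.0 * count / length


def truncated_percent(count, length):
    """Return count/length as a whole-number percentage, rounded down."""
    return (100 * count) // length


if __name__ == "__main__":
    for name, count in COUNTS.items():
        exact = exact_percent(count, ALIGNMENT_LENGTH)
        whole = truncated_percent(count, ALIGNMENT_LENGTH)
        print(f"{name}: {count}/{ALIGNMENT_LENGTH} = {exact:.2f}% (truncated: {whole}%)")
```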


- Green bases have known domain annotations that are detailed below.


  Fly    31 ATDEVAAGNDESVATEGDD--VEIPRDTNNSTPVRLLDKPG-QNPVQNGAQPAAEESELESQRQT 92
            |.||....|..||:|..||  .::....:||         | ||.|. |...|.:::|    ...
Mouse   626 AGDEEKRSNSVSVSTTSDDGCSDLDPTEHNS---------GFQNSVL-GITDAFDKTE----NAL 676

  Fly    93 PVQKQQQQ--RVSMVNRKRD-------------LINLQSALSPKYIGYANAN-------SPTPLS 135
            .|.|.:.|  |..:.||.::             |:.....:.|:....:..|       ||.|..
Mouse   677 SVHKNETQYSRYPVTNRIKEKQKSLITNSHADHLMGSTKTMEPETAELSQVNLSDLKISSPIPKP 741

  Fly   136 DSD---DTIRTTRRRVNQAAALNNSSAGETLAHDNASPRTPGGGGGGGGDDSANQ-LLSKTYMSP 196
            ..:   |.:.|            ..||...:.::|  |.|.||        .||| ||......|
Mouse   742 QPEFRNDGLTT------------KFSAPPGIRNEN--PLTKGG--------LANQTLLPLKCRQP 784

  Fly   197 IEKLLIKNGASSPNSTGFEAGSEDLGIRPIVRKHVKRKMKRVPKA------KVTLELDEKNQQEV 255
            ..:.:......||......|.||||.::..........:..:.|:      |:...:.||.:...
Mouse   785 KFRSIKCKHKESPAVAETSATSEDLSLKCCSSDTNGSPLANISKSGKGEGLKLLNNMHEKTRDSS 849

  Fly   256 D-EKSVKTEPIDEEVDRTDEAPTQEAQTTAISIKSETEAEHKAAVDVHIKQEDTIRLDIVNNPVE 319
            | |.:|....:.|..:.:..:.:::...:..:..|:......|:...||             |:|
Mouse   850 DIETAVVKHVLSELKELSYRSLSEDVSDSGTAKASKPLLFSSASSQNHI-------------PIE 901

  Fly   320 -----STSIVITEEPKDLEKSTEELAFALPLASSTEVD----LKSPPDLSSTALATSIKSPSSVS 375
                 ||.:::.::..|.:...:.|..|..|||....|    ....|..:|..|.....:|:|..
Mouse   902 PDYKFSTLLMMLKDMHDSKTKEQRLMTAQNLASYRTPDRGDCSSGSPVGTSKVLVLGSSTPNSEK 966

  Fly   376 IDSAKGLSIVTDPGWPTYQV-GDL---------------FWGKVFSYCFWPCMVCPDPLGQIVGN 424
            ...:...|:...||.....: |:|               ..||:.|.|. |...|    |:.   
Mouse   967 PGDSTQDSVHQSPGGGDSALSGELSSSLSSLASDKRELPACGKIRSNCI-PRRNC----GRA--- 1023

  Fly   425 MPSHPQRSSLDNANVPIQVHVRFFADNGRRNWIKPENLLTFAGLKAFDDMREELRIK-----HGP 484
            .||...|.::....|...|:.:.         :|.|....|:.|.|.......|..|     :||
Mouse  1024 KPSSKLRETISAQMVKPSVNPKA---------LKTERKRKFSRLPAVTLAANRLGNKESGSVNGP 1079

  Fly   485 ---------KSAKYRQMVPKRTKVVIWRQAIEEAQAMTQIPYSDRLEKFYQTYENVVTLNRQKRK 540
                     |....:||...|.:...:.....:::|....| ...||| ..::||        ||
Mouse  1080 SRGGAEDPGKEEPLQQMDLLRNEDTHFSDVHFDSKAKQSDP-DKNLEK-EPSFEN--------RK 1134

  Fly   541 RTKYMMQDTSDVGSSLYDSTDNLHN------KQGTQLLAVKR-------------ERSESPFS-- 584
                    ..::||.:....|.||.      |:..|.|..:|             |.||..|.  
Mouse  1135 --------GPELGSEMNTENDELHGVNQVVPKKRWQRLNQRRPKPGKRANRFREKENSEGAFGVL 1191

  Fly   585 -PAFSPVKSKN---EKRAKRRKLSNGTEADT--GSNSMAVTPS--------------QTETTVDS 629
             ||.:..|::.   |:||........:.||.  ||:|.:|.|.              :.||.:.|
Mouse  1192 LPADAVQKAREDYLEQRAPPTSKPEDSAADPNHGSHSESVAPRLNVCEKSSVGMGDVEKETGIPS 1256

  Fly   630 ----SAYENPEFRQ-----------LLSAVMEYVMM-------NRSDEKVEKV--------LLSV 664
                :....|..|.           ||....||..:       .:..|:|.||        ||:.
Mouse  1257 LMPQTKLPEPAIRSEKKRLRKPSKWLLEYTEEYDQIFAPKKKQKKVQEQVHKVSSRCEDESLLAR 1321

  Fly   665 VSNIWSLKQ------IQLRE----LERDLASGEIEEPLGSSVVGRGSGVGTIKRLSNRLMTMMVR 719
            .......||      |..:|    |||:  :..:|.||..|.:|       :.......:|:.| 
Mouse  1322 CQPSAQNKQVDENSLISTKEEPPVLERE--APFLEGPLAQSDLG-------VTHAELPQLTLSV- 1376

  Fly   720 RSMTPVVTPSTTPAPS-EPDRRLSEPPKT--KKPVNRPIEEVIEDILQLDSKYLFR----GLSRE 777
                 .|.|..:|.|: |.:..|.:.|..  .|...:|.::::|. ..||..::.:    ||||:
Mouse  1377 -----PVAPEASPRPALESEELLVKTPGNYESKRQRKPTKKLLES-NDLDPGFMPKKGDLGLSRK 1435

  Fly   778 PICKYCYQA---GSDLVRCSRTCSSWLHADCLERKVTGAPMPKIGSRKALVIPP----------- 828
                 |::|   |:.:|. ||..|..........|:...|..:  .|:.||...           
Mouse  1436 -----CFEASRSGNGIVE-SRATSHLKEFSGGTTKIFDKPRKR--KRQRLVTARVHYKKVKKEDL 1492

  Fly   829 TSKSPSPDED---HVT-ADAKEVVAVGT-----------------------SLVCHEC-NVGE-- 863
            |..:||.:.:   |.| |..||::..|.                       ..||..| .:||  
Mouse  1493 TKDTPSSEGELLIHRTAASPKEILEEGVEHDPGMSASKKLQVERGGGAALKENVCQNCEKLGELL 1557

  Fly   864 --PEGCVICHQVESPAVPSTPRKE--DSSSHTPIE---------DKLLTCSQPMCGKRFHTSCCK 915
              ...|.....:|...:|..||.:  .:..||.|.         :.:..|..|:|||.:|..|.:
Mouse  1558 LCEAQCCGAFHLECLGLPEMPRGKFICNECHTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQ 1622

  Fly   916 YWPQASSSKHSARCPRHVCHTCVSDDPSGKFQQLGSSKLAKCVRCPATYHQLSKCIPAGTQMLNT 980
            .:|...:.....|||.|:|.||.:.:|:......|  :|.:|||||..||....|:.||:::|.:
Mouse  1623 KYPPTVTQNKGFRCPLHICITCHAANPANVSASKG--RLMRCVRCPVAYHANDFCLAAGSKILAS 1685

  Fly   981 TNIICPRHNIAKADA----HVNVLWCYICVKGGELVCCETCPIAVHAHCRNIPIKTNESYICEEC 1041
            .:||||.|...:...    ||||.||::|.:||.|:||::||.|.|..|.||.|.....| |.:|
Mouse  1686 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEGNWY-CNDC 1749

  Fly  1042 ESGRLPLYGEIVWAKFNNFRWWPAIILPPTEVPSNILKKAHGENDFVVRFFGTHDHGWISRRRVY 1106
            ::|:.|.|.||||.|...:|||||.|..|..|||||.|..|...:|.|.|||::|:.|..:.||:
Mouse  1750 KAGKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFGSNDYLWTHQARVF 1814

  Fly  1107 LYIEGDTGDGHKTKSQLFRNYTTGVEEASRFLPIIKARRQEQDMERQSGNKLHPPPYVKIKTNKA 1171
            .|:|||.....|....:...|...::||:.....:||:::.:.::....|...||||..||.|:.
Mouse  1815 PYMEGDVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQEDRKNDKKPPPYKHIKVNRP 1879

  Fly  1172 VPPLR-FSQNLEDLSTCNCLPVDEHPCGPEAGCLNRMLFNECNPEYCKAGSLCENRMFEQRKSPR 1235
            :..:: |:.:|.::..|||...||:|||.::.|:||||..||:|..|.||..|:|:.|.:|:.|.
Mouse  1880 IGRVQIFTADLSEIPRCNCKATDENPCGIDSECINRMLLYECHPTVCPAGVRCQNQCFSKRQYPD 1944

  Fly  1236 LEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRMEQKQRDRDENYYFLGVEKDFIID 1300
            :|:....:||:||..:..|..|:||.|||||:|:..|.:.|:...|.....|:|.|.::||.|||
Mouse  1945 VEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIID 2009

  Fly  1301 AGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSELTFNYLWDDLMNNSKKACFC 1365
            ||||||.||||||.|:||||||||:||...|||:||:.||...:||||||.. :.:.|.|..|.|
Mouse  2010 AGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNL-ECLGNGKTVCKC 2073

  Fly  1366 GAKRCSGEIGGKLKDDAV-----------KAHAKLKQMRRAKASAVR---------------IHV 1404
            ||..|||.:|.:.|:..:           |.|.|    ||::....:               :..
Mouse  2074 GAPNCSGFLGVRPKNQPIVTEEKSRKFKRKPHGK----RRSQGEVTKEREDECFSCGDAGQLVSC 2134

  Fly  1405 KPKKTPKVKH 1414
            |....|||.|
Mouse  2135 KKPGCPKVYH 2144

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence     Domain         Region        External ID  Identity
NSD   NP_733239.1  MSH6_like      391..508      CDD:99898    27/146 (18%)
                   PHD2_NSD       867..932      CDD:277040   19/75 (25%)
                   PHD3_NSD       933..988      CDD:277041   21/54 (39%)
                   PHD4_NSD       1001..1041    CDD:277042   18/39 (46%)
                   WHSC1_related  1047..1141    CDD:99899    38/93 (41%)
                   AWS            1183..1233    CDD:197795   23/49 (47%)
                   SET            1234..1354    CDD:214614   63/119 (53%)
Nsd1  NP_032765.3  MSH6_like      320..430      CDD:99898
                   TNG2           <1453..1587   CDD:227367   25/135 (19%)
                   PHD1_NSD1_2    1546..1587    CDD:277118   10/40 (25%)
                   PHD2_NSD1      1593..1639    CDD:277120   11/45 (24%)
                   PHD3_NSD1      1640..1693    CDD:277123   21/54 (39%)
                   PHD4_NSD1      1710..1749    CDD:277126   18/39 (46%)
                   WHSC1_related  1755..1849    CDD:99899    38/93 (41%)
                   AWS            1902..1940    CDD:375420   19/37 (51%)
                   SET_NSD1       1942..2083    CDD:380987   72/141 (51%)
                   PHD5_NSD1      2121..2163    CDD:277129   5/24 (21%)
                   C5HCH          2162..2211    CDD:375464
                   PHA03307       2255..>2576   CDD:223039
Blue background indicates that the domain is not in the aligned region.
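
Whether a domain lies in the aligned region can be checked by comparing its coordinates with the span covered by the alignment. The Python sketch below does this for the mouse sequence, assuming the aligned span is the first and last mouse residue numbers shown in the alignment (626 and 2144) and treating any overlap as "in the aligned region". The overlap rule is an assumption for illustration, not DIOPT's documented method, and the open-ended region `2255..>2576` is entered with 2576 as its end.

```python
# Aligned span of the mouse sequence, read from the first and last
# alignment rows above (assumption: the span is continuous).
MOUSE_ALIGNED = (626, 2144)

# A few Nsd1 domain regions copied from the Known Domains table.
MOUSE_DOMAINS = {
    "MSH6_like": (320, 430),
    "SET_NSD1": (1942, 2083),
    "PHD5_NSD1": (2121, 2163),
    "C5HCH": (2162, 2211),
    "PHA03307": (2255, 2576),  # listed as 2255..>2576 (open-ended)
}


def overlaps(region, aligned):
    """Return True if the two closed intervals share at least one position."""
    start, end = region
    aligned_start, aligned_end = aligned
    return start <= aligned_end and end >= aligned_start


if __name__ == "__main__":
    for name, region in MOUSE_DOMAINS.items():
        status = "in aligned region" if overlaps(region, MOUSE_ALIGNED) else "outside aligned region"
        print(f"{name} {region[0]}..{region[1]}: {status}")
```

Under this rule, the three Nsd1 domains that have no Identity value in the table (MSH6_like, C5HCH, PHA03307) fall outside the aligned span, while PHD5_NSD1 overlaps it only partially.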


Information from Original Tools:


                                              Original Tool Information
Tool            Simple Score  Weighted Score  BLAST Result  Score    Score Type                      Cluster ID
Compara         0             0.000           Not matched by this tool.
Domainoid       1             1.000           129           1.000    Domainoid score                 I5226
eggNOG          1             0.900           -             -                                        E2759_KOG1081
Hieranoid       1             1.000           -             -
Homologene      0             0.000           Not matched by this tool.
Inparanoid      1             1.050           470           1.000    Inparanoid score                I1501
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         1             1.010           -             -                                        D38976at33208
OrthoFinder     1             1.000           -             -                                        FOG0001511
OrthoInspector  1             1.000           -             -                                        otm42471
orthoMCL        1             0.900           -             -                                        OOG6_102558
Panther         1             1.100           -             -        O                               PTHR22884
Phylome         1             0.910           -             -
RoundUp         1             1.030           -             avgDist  Average_Evolutionary_Distance   R4489
SonicParanoid   1             1.000           -             -                                        X847
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         1             0.960           -             -
Total           13            12.860
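
The final row of the table gives the column totals: 13 tools matched and a combined weighted score of 12.860. The Python sketch below re-adds the per-tool values from the table as a consistency check; the dictionary layout is illustrative, and `Decimal` is used only to avoid floating-point noise in the sum.

```python
from decimal import Decimal

# (simple score, weighted score) per tool, copied from the table above.
TOOL_SCORES = {
    "Compara": (0, "0.000"),
    "Domainoid": (1, "1.000"),
    "eggNOG": (1, "0.900"),
    "Hieranoid": (1, "1.000"),
    "Homologene": (0, "0.000"),
    "Inparanoid": (1, "1.050"),
    "Isobase": (0, "0.000"),
    "OMA": (0, "0.000"),
    "OrthoDB": (1, "1.010"),
    "OrthoFinder": (1, "1.000"),
    "OrthoInspector": (1, "1.000"),
    "orthoMCL": (1, "0.900"),
    "Panther": (1, "1.100"),
    "Phylome": (1, "0.910"),
    "RoundUp": (1, "1.030"),
    "SonicParanoid": (1, "1.000"),
    "SwiftOrtho": (0, "0.000"),
    "TreeFam": (1, "0.960"),
}


def totals(scores):
    """Return (sum of simple scores, sum of weighted scores as Decimal)."""
    simple_total = sum(simple for simple, _ in scores.values())
    weighted_total = sum((Decimal(w) for _, w in scores.values()), Decimal("0"))
    return simple_total, weighted_total


if __name__ == "__main__":
    simple_total, weighted_total = totals(TOOL_SCORES)
    print(f"Simple score total: {simple_total}")
    print(f"Weighted score total: {weighted_total}")
```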
