DRSC/TRiP Functional Genomics Resources



Protein Alignment NSD and Nsd1

DIOPT Version: 9

Sequence 1: NP_733239.1  Gene: NSD / 43351  FlyBaseID: FBgn0039559  Length: 1427  Species: Drosophila melanogaster
Sequence 2: XP_006253682.1  Gene: Nsd1 / 306764  RGDID: 1307748  Length: 2689  Species: Rattus norvegicus


Alignment Length: 1578  Identity: 413/1578 (26%)
Similarity: 640/1578 (40%)  Gaps: 421/1578 (26%)
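
The percentages in the summary can be recomputed from the raw counts. The sketch below is illustrative only: the helper names are hypothetical and not part of DIOPT. Truncating to a whole percent reproduces all three reported values, whereas rounding to the nearest integer would give 41% similarity and 27% gaps. This suggests, but does not confirm, that the page truncates.

```python
# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 1578
COUNTS = {"identity": 413, "similarity": 640, "gaps": 421}


def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated (integer division)."""
    return count * 100 // total


def percent_rounded(count: int, total: int) -> int:
    """Whole-number percentage, rounded to the nearest integer."""
    return round(count * 100 / total)


for name, count in COUNTS.items():
    print(
        f"{name}: {count}/{ALIGNMENT_LENGTH} "
        f"truncated={percent_floor(count, ALIGNMENT_LENGTH)}% "
        f"rounded={percent_rounded(count, ALIGNMENT_LENGTH)}%"
    )
```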


- Green bases have known domain annotations that are detailed below.


  Fly    27 DSLTATDEVAAGNDESVATEGDDVEIPRDTNNSTPVRLLDKPGQNP----VQNGAQPAAEESELE 87
            :|.||.:..|...|.|:.....|       .|.:|:..:.|.|:..    :.|..:...:.|::|
  Rat   795 ESPTAAETSATSEDLSLKCCSSD-------TNGSPMTSISKSGKGEGLKLLNNMHEKTRDSSDIE 852

  Fly    88 SQRQTPVQK---QQQQRVSMVNRKRDLINLQSALSPKYIGYANANSPT---------------PL 134
                |.|.|   .:.:.:|..:...|:.:..::.|.|.:.:::|:|..               .|
  Rat   853 ----TAVVKHVLSELKELSYRSLSEDVSDSGTSKSSKPLLFSSASSQNHIPIEPDYKFSTLLMML 913

  Fly   135 SDSDDTIRTTRRRVNQAAALNNSSAGETLAHDNASPRTPGGGGGGGGDDSANQLLSKTYMSPI-- 197
            .|..|: :|..:|:             ..|.:.||.|||..|....|             ||:  
  Rat   914 KDMHDS-KTKEQRL-------------MTAQNVASYRTPDRGDCSSG-------------SPVGT 951

  Fly   198 EKLLI---------KNGASSPNSTGFEAG----------SEDLGIRP----------------IV 227
            .|:|:         |.|.|:.:|.....|          |..|.:.|                |.
  Rat   952 SKVLVLGGSTHNSEKPGDSTQDSVRLSPGGGDSALSGELSSSLSVLPSDKRDLPACGKIRSNCIP 1016

  Fly   228 RKHVKRKMKRVPKAKVTLELDEKNQQEVDEKSVKTEPIDEEVDRTDEAPTQEAQTTAISIKSETE 292
            |::..| .|..||.:||:. .:..:..|:.|::|||       |..:.....|.|.|.:.....|
  Rat  1017 RRNCGR-AKLSPKLRVTIS-TQMAKPSVNPKALKTE-------RKRKLSRLPAVTLAANGLGNKE 1072

  Fly   293 A------EHKAAVDVHIKQEDTIRLDIVNNPVEST---SIVITEEP-KDLEKSTEELAFALPLAS 347
            :      ..|.......|:|...::|::.|  |.|   |.|...:| |.|||   |.:|.     
  Rat  1073 SGGSVNGPLKGGAQDPAKEEPLQQMDLLRN--EETHFDSKVKQSDPDKILEK---EPSFE----- 1127

  Fly   348 STEVDLKSPPDLSSTALATSIKSPSSVSIDSAKGL-SIVTDPGWPTYQVGDLFWGKVFSYCFWPC 411
                :.|.|          .:.|..::..|...|: .:|....|                     
  Rat  1128 ----NRKGP----------EVGSEINIENDEPHGVDQVVPKKRW--------------------- 1157

  Fly   412 MVCPDPLGQIVGNMPSHPQRSS----LDNA---------NVPIQVHVRFFADNGRRNWIK----P 459
                ..|.|   ..|...:|:|    .:|:         ..|:|        .||.::::    |
  Rat  1158 ----QRLNQ---RRPKPGKRASRFREKENSEGAFGVLLPGDPVQ--------KGRDDYLEQRAPP 1207

  Fly   460 ENLLTFAGLKAFDDMREELRIKH----GP------KSAKYRQMVPKRTKVVIWRQAIEEAQAMTQ 514
            .::|.       |...:...:.|    ||      ||:.....:.|.|       .|......|:
  Rat  1208 TSILE-------DSAADPNHVSHSESVGPRLNVCDKSSVSMGDLEKET-------GIPSLTPQTK 1258

  Fly   515 IPY-SDRLEKFYQTYENVVTLNRQKRKRTKYMMQDTSDVGSSLYDSTDNLHNKQGTQLLAVKRER 578
            ||. :.|.||            ::.||.:|::::.|.:     ||           |:.|.|:::
  Rat  1259 IPEPAVRSEK------------KRLRKPSKWLLEYTEE-----YD-----------QIFAPKKKQ 1295

  Fly   579 SESPFSPAFSPVKSKNEKR---AKRRKLSNGTEADTGSNSMAVTPSQTETTVDSSAY-ENP---- 635
            .:  .......|.|:.|..   |:.|..:...:.|  .||:..|..:.......:.: |.|    
  Rat  1296 KK--VQEQVHKVSSRCEDESLLARCRSSAQNKQVD--ENSLISTKEEPPVLEREAPFLEGPLVQS 1356

  Fly   636 -------EFRQL-LSAVMEYVMMNRSDEKVEKVLLSVVSNIWSLKQ-------IQLRELE----- 680
                   |..|| ||..:...:..|...:.|::|:....|....:|       ::..:|:     
  Rat  1357 DLGVAHAELPQLTLSVPVAPEVSPRPTLESEELLVKTPGNYEGKRQRKPTKKLLESNDLDPGFMP 1421

  Fly   681 --RDLA-------SGEIEEPLGSS-----VVGRGSGVGTI-----KRLSNRLMTMMVR-RSMTPV 725
              .||.       ||.:|..:|.|     :...|.|...|     ||...|..|..|. :.:...
  Rat  1422 KKGDLGLSRKCCESGHLENGVGDSRATPHLKEFGGGTTRIFDKPRKRKRQRHGTARVHYKRVKKE 1486

  Fly   726 VTPSTTPAPSEPD---RRLSEPPKTKKPVNRPIEEVIEDILQLD-----SKYL--FRG---LSRE 777
            .:...||:.:|.:   .|.:..||          |::|:.::.|     ||.|  .||   ..:|
  Rat  1487 DSARETPSSAEGELMIHRTAASPK----------EILEEGIEHDPGMSASKRLQGERGGGAALKE 1541

  Fly   778 PICKYCYQAGSDLVRCSRTCSSWLHADCLERKVTGAPMPKIGSRKALVIPPTSKSPSPDEDHVTA 842
            .:|:.|.:.| :|:.|...|....|.:||                .|...|..|           
  Rat  1542 NVCQNCEKLG-ELLLCEAQCCGAFHLECL----------------GLTEMPRGK----------- 1578

  Fly   843 DAKEVVAVGTSLVCHECNVGEPEGCVICHQVESPAVPSTPRKEDSSSHTPIEDKLLTCSQPMCGK 907
                       .:|:||..| ...|.:|.|          ..||          :..|..|:|||
  Rat  1579 -----------FICNECRTG-IHTCFVCKQ----------SGED----------VKRCLLPLCGK 1611

  Fly   908 RFHTSCCKYWPQASSSKHSARCPRHVCHTCVSDDPSGKFQQLGSSKLAKCVRCPATYHQLSKCIP 972
            .:|..|.:.:|.........|||.|:|.||.:.:|:......|  :|.:|||||..||....|:.
  Rat  1612 FYHEECVQKYPPTVVQNKGFRCPLHICITCHAANPANVSASKG--RLMRCVRCPVAYHANDFCLA 1674

  Fly   973 AGTQMLNTTNIICPRHNIAKADA----HVNVLWCYICVKGGELVCCETCPIAVHAHCRNIPIKTN 1033
            ||:::|.:.:||||.|...:...    ||||.||::|.:||.|:||::||.|.|..|.||.|...
  Rat  1675 AGSKILASNSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEG 1739

  Fly  1034 ESYICEECESGRLPLYGEIVWAKFNNFRWWPAIILPPTEVPSNILKKAHGENDFVVRFFGTHDHG 1098
            ..| |.:|::|:.|.|.||||.|...:|||||.|..|..|||||.|..|...:|.|.|||::|:.
  Rat  1740 NWY-CNDCKAGKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFGSNDYL 1803

  Fly  1099 WISRRRVYLYIEGDTGDGHKTKSQLFRNYTTGVEEASRFLPIIKARRQEQDMERQSGNKLHPPPY 1163
            |..:.||:.|:|||.....|....:...|...::||:.....:||:::.:.::....|...||||
  Rat  1804 WTHQARVFPYMEGDVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQEDRKNDKKPPPY 1868

  Fly  1164 VKIKTNKAVPPLR-FSQNLEDLSTCNCLPVDEHPCGPEAGCLNRMLFNECNPEYCKAGSLCENRM 1227
            ..||.|:.:..:: |:.:|.::..|||...||:|||.::.|:||||..||:|..|.||..|:|:.
  Rat  1869 KHIKVNRPIGRVQIFTADLSEIPRCNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQC 1933

  Fly  1228 FEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRMEQKQRDRDENYYFLG 1292
            |.:|:.|.:|:....:||:||..:..|..|:||.|||||:|:..|.:.|:...|.....|:|.|.
  Rat  1934 FSKRQYPDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLT 1998

  Fly  1293 VEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSELTFNYLWDDLMN 1357
            ::||.|||||||||.||||||.|:||||||||:||...|||:||:.||...:||||||.. :.:.
  Rat  1999 LDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNL-ECLG 2062

  Fly  1358 NSKKACFCGAKRCSGEIGGKLKDDAV-----------KAHAKLKQMRRAKASAVR---------- 1401
            |.|..|.|||..|||.:|.:.|:..:           |.|.|    ||::....:          
  Rat  2063 NGKTVCKCGAPNCSGFLGVRPKNQPIVTEEKSRKFKRKPHGK----RRSQGEVTKEREDECFSCG 2123

  Fly  1402 -----IHVKPKKTPKVKH 1414
                 :..|....|||.|
  Rat  2124 DGGQLVSCKKPGCPKVYH 2141

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence        Domain         Region      External ID  Identity
NSD   NP_733239.1     MSH6_like      391..508    CDD:99898    21/143 (15%)
                      PHD2_NSD       867..932    CDD:277040   16/64 (25%)
                      PHD3_NSD       933..988    CDD:277041   21/54 (39%)
                      PHD4_NSD       1001..1041  CDD:277042   18/39 (46%)
                      WHSC1_related  1047..1141  CDD:99899    38/93 (41%)
                      AWS            1183..1233  CDD:197795   23/49 (47%)
                      SET            1234..1354  CDD:214614   63/119 (53%)
Nsd1  XP_006253682.1  MSH6_like      319..429    CDD:99898
                      PHD1_NSD1_2    1543..1585  CDD:277118   14/80 (18%)
                      PHD2_NSD1      1590..1636  CDD:277120   16/65 (25%)
                      PHD3_NSD1      1637..1690  CDD:277123   21/54 (39%)
                      PHD4_NSD1      1707..1746  CDD:277126   18/39 (46%)
                      WHSC1_related  1752..1846  CDD:99899    38/93 (41%)
                      AWS            1889..1939  CDD:197795   23/49 (47%)
                      SET            1940..2063  CDD:214614   63/123 (51%)
                      PHD5_NSD1      2118..2160  CDD:277129   5/24 (21%)
Domains listed without an identity value lie outside the aligned region (shown with a blue background on the source page).
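
Whether a domain falls inside the aligned region can be checked against the first and last residue numbers shown in the alignment (795 and 2141 for the rat sequence). The following is a minimal sketch, assuming each domain entry is available as a whitespace-separated string of name, region, external ID, and optional identity; `Domain`, `parse_domain`, and `overlaps` are hypothetical helpers, not DIOPT functions.

```python
import re
from typing import NamedTuple, Optional


class Domain(NamedTuple):
    name: str
    start: int
    end: int
    external_id: str
    identity: Optional[str]  # None when the page reports no identity


# Assumed row layout: "<name> <start>..<end> CDD:<id> [<n>/<m> (<p>%)]"
ROW = re.compile(
    r"^(?P<name>\S+)\s+(?P<start>\d+)\.\.(?P<end>\d+)\s+(?P<ext>CDD:\d+)"
    r"(?:\s+(?P<identity>\d+/\d+\s+\(\d+%\)))?$"
)


def parse_domain(row: str) -> Domain:
    """Parse one domain entry into a Domain record."""
    m = ROW.match(row.strip())
    if m is None:
        raise ValueError(f"unrecognized domain row: {row!r}")
    return Domain(m["name"], int(m["start"]), int(m["end"]), m["ext"], m["identity"])


def overlaps(domain: Domain, aligned_start: int, aligned_end: int) -> bool:
    """True if the domain shares at least one residue with the aligned range."""
    return domain.start <= aligned_end and domain.end >= aligned_start


# First and last rat residue numbers shown in the alignment above.
RAT_ALIGNED = (795, 2141)

for row in ("MSH6_like 319..429 CDD:99898",
            "PHD5_NSD1 2118..2160 CDD:277129 5/24 (21%)"):
    d = parse_domain(row)
    print(d.name, "in aligned region:", overlaps(d, *RAT_ALIGNED))
```

Under these assumptions, the rat MSH6_like domain (319..429) ends before the aligned region begins, while PHD5_NSD1 (2118..2160) overlaps its end.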


Information from Original Tools:


                                                Original Tool Information
Tool            Simple Score  Weighted Score  BLAST Result  Score  Score Type        Cluster ID
Compara         0             0.000           Not matched by this tool.
Domainoid       1             1.000           129           1.000  Domainoid score   I5113
eggNOG          1             0.900           -             -                        E2759_KOG1081
Hieranoid       1             1.000           -             -
Homologene      0             0.000           Not matched by this tool.
Inparanoid      1             1.050           492           1.000  Inparanoid score  I1355
OMA             0             0.000           Not matched by this tool.
OrthoDB         1             1.010           -             -                        D38976at33208
OrthoFinder     1             1.000           -             -                        FOG0001511
OrthoInspector  1             1.000           -             -                        otm44534
orthoMCL        1             0.900           -             -                        OOG6_102558
Panther         1             1.100           -             -      O                 PTHR22884
Phylome         1             0.910           -             -
SonicParanoid   1             1.000           -             -                        X847
SwiftOrtho      1             1.000           -             -
TreeFam         0             0.000           Not matched by this tool.
Total           12            11.870
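
The final row of the table is consistent with column totals: 12 tools report the pair, and their weighted scores sum to 11.870. The sketch below recomputes both from the per-tool values listed above; the `SCORES` mapping and variable names are illustrative, and `Decimal` is used so the three-decimal weights add exactly.

```python
from decimal import Decimal

# (simple score, weighted score) per tool, copied from the table above.
SCORES = {
    "Compara": (0, "0.000"),
    "Domainoid": (1, "1.000"),
    "eggNOG": (1, "0.900"),
    "Hieranoid": (1, "1.000"),
    "Homologene": (0, "0.000"),
    "Inparanoid": (1, "1.050"),
    "OMA": (0, "0.000"),
    "OrthoDB": (1, "1.010"),
    "OrthoFinder": (1, "1.000"),
    "OrthoInspector": (1, "1.000"),
    "orthoMCL": (1, "0.900"),
    "Panther": (1, "1.100"),
    "Phylome": (1, "0.910"),
    "SonicParanoid": (1, "1.000"),
    "SwiftOrtho": (1, "1.000"),
    "TreeFam": (0, "0.000"),
}

# Number of tools that matched the pair, and the sum of their weights.
simple_total = sum(simple for simple, _ in SCORES.values())
weighted_total = sum(Decimal(weight) for _, weight in SCORES.values())

print(simple_total, weighted_total)
```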
