DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set2 and Nsd1

DIOPT Version :9

Sequence 1:NP_572888.2 Gene:Set2 / 32301 FlyBaseID:FBgn0030486 Length:2362 Species:Drosophila melanogaster
Sequence 2:NP_032765.3 Gene:Nsd1 / 18193 MGIID:1276545 Length:2691 Species:Mus musculus


Alignment Length:2888 Identity:547/2888 - (18%)
Similarity:926/2888 - (32%) Gaps:1041/2888 - (36%)


- Green bases have known domain annotations that are detailed below.


  Fly    11 SPVASRGRGRGRPPKVALS---ALGNTPP---HINPSLKH---------------ADAEASPTAP 54
            ||:.......|.|..:|:.   ...|:|.   .:..:.|:               .|:|..|..|
Mouse   110 SPIVCTSLSPGGPTALAMKQEPTCNNSPELQLRVTKTTKNGFLHFENFTGVDDADVDSEMDPEQP 174

  Fly    55 EDQDSGQSEC-RRSSRKKIIKFDVRDLLNKNRKAHKIQIEARIDSNPSTGH-------------- 104
            ..:|....|. ..:.......::     .|:....::.:.:..||.|.:.|              
Mouse   175 VTEDESIEEIFEETQTNATCNYE-----PKSENGVEVAMGSEQDSMPESRHGAVERPFLPLAPQT 234

  Fly   105 ------SQSGTTAASTSMSTATASAASASSAATVSRLFSMFEMSHQSLPPPPPPPTALEIFAKPR 163
                  .:|....::...:...|..:...:..||...|:...:|.|..|.           :.|.
Mouse   235 EKQKNKQRSEVDGSNEKTALLPAPTSLGDTNVTVEEQFNSINLSFQDDPD-----------SSPS 288

  Fly   164 PTQSLIVAQVTSEPSAVGGAHPVQTMAGLPPVTPRKRGRPRKSQLADAAII----------PTVI 218
            |..:::....||.||         |...||...|:|:..|.|.::.|  :|          |..|
Mouse   289 PLGNMLEIPGTSSPS---------TSQELPFCQPKKKSTPLKYEVGD--LIWAKFKRRPWWPCRI 342

  Fly   219 VPSCSDSDTNSTSTTTSNMSSDSGELPGFPIQKPKSKLRVSLKR----------------LKLGG 267
               |||                       |:....||::|:.:|                ..:.|
Mouse   343 ---CSD-----------------------PLINTHSKMKVANRRPYREYYVEAFGDPSEKAWVAG 381

  Fly   268 RLESSDSGNSPSSSSPEVEPPALQDENAMDER---------------------PKQEQN-----L 306
            :......|.......|.:.....|.|.....:                     ||..:|     .
Mouse   382 KAIVMFEGRHQFEELPVLRKRGKQKEKGYRHKVPQKILSKWEASVGLAEQYDVPKGSKNQKCVSS 446

  Fly   307 SRMVDAEEN-------SDSDSQIIFIE-------IETESPKGEEEQEEGRPVEVEPQDLID---- 353
            |..:|:||:       :|.||:.:.:.       .::|....|:|:...:....:..|.|.    
Mouse   447 SVKLDSEEDMPFEDCTNDPDSEHLLLNGCLKSLAFDSEHSADEKEKPCAKSRVRKSSDNIKRTSV 511

  Fly   354 ----IDMELAKQE---PTPDPEEDLDEIMVEVLSGPPSLWSADDEAEEEEDATVQRATPPG---- 407
                :..|..|:|   ..|      |.:.::.:||..|...|.:|.....::....:|.||    
Mouse   512 KKDLVPFESRKEERRGKIP------DNLGLDFISGGVSDKQASNELSRIANSLTGSSTAPGSFLF 570

  Fly   408 ------------KEPAADSCSSAPRRSRRSAPLSGSSRQGKTLEETFAEIAAESSKQILEAEESQ 460
                        :.|..||.|..    ..||.:|..|.:.|.|:.  .::.  |||..|....:.
Mouse   571 SSSVQNTAKTDFETPDCDSLSGL----SESALISKHSGEKKKLQP--GQVC--SSKVQLCYVGAG 627

  Fly   461 DQEEQHILIDLIEDTLSESEVTSSVSPTIEHMVVEEVVVEENQLVDEADEILDSKQEFVIKKVFS 525
            |:|::.   :.:..:.:..:..|.:.||..:...:..|:.                   |...|.
Mouse   628 DEEKRS---NSVSVSTTSDDGCSDLDPTEHNSGFQNSVLG-------------------ITDAFD 670

  Fly   526 ESDNIAASLNKDIFEPKVETKATCGEVVPRPEMVTEDVYITEGIAATLEKSAVVTKPTTEMIAET 590
            :::| |.|::|:      ||:.:...|..|.:...:.: ||...|..|..|....:|.|..:::.
Mouse   671 KTEN-ALSVHKN------ETQYSRYPVTNRIKEKQKSL-ITNSHADHLMGSTKTMEPETAELSQV 727

  Fly   591 KLSDEVVIEP---------------------------PLKDESDPKQTEVELPESKP---AVNIP 625
            .|||..:..|                           ||.......||.:.|...:|   ::...
Mouse   728 NLSDLKISSPIPKPQPEFRNDGLTTKFSAPPGIRNENPLTKGGLANQTLLPLKCRQPKFRSIKCK 792

  Fly   626 KSERILSAEVETTSSPLVPPECCTLESVSGPVL------------LETSLSTEEKSNENVETTPL 678
            ..|....||...||..| ..:||:.::...|:.            |..::..:.:.:.::||..:
Mouse   793 HKESPAVAETSATSEDL-SLKCCSSDTNGSPLANISKSGKGEGLKLLNNMHEKTRDSSDIETAVV 856

  Fly   679 K--TEAAKE-----------DSPPAAPEEE---ASNSSE-----EPNF-------LLEDYESNQE 715
            |  ....||           ||..|...:.   :|.||:     ||::       :|:|...::.
Mouse   857 KHVLSELKELSYRSLSEDVSDSGTAKASKPLLFSSASSQNHIPIEPDYKFSTLLMMLKDMHDSKT 921

  Fly   716 QVAEDEMMKCNNQKGQKQTPLPEMKEPEK-------PV--------AETVSKKEK---AMENPAR 762
            :  |..:|...|        |...:.|::       ||        ..:....||   :.::...
Mouse   922 K--EQRLMTAQN--------LASYRTPDRGDCSSGSPVGTSKVLVLGSSTPNSEKPGDSTQDSVH 976

  Fly   763 SSPAIVDKKVRAGEMEKKV--VKSTKGTVPE-KKMDS----KKSCAAVTP-AKQKE--SGKSAKE 817
            .||...|..: :||:...:  :.|.|..:|. .|:.|    :::|....| :|.:|  |.:..|.
Mouse   977 QSPGGGDSAL-SGELSSSLSSLASDKRELPACGKIRSNCIPRRNCGRAKPSSKLRETISAQMVKP 1040

  Fly   818 AILKK--ETEKEKSSAKL-------------DSSSPNTLDKKG-KDTAQWSP--QLQTLPKSST- 863
            ::..|  :||:::..::|             :|.|.|...:.| :|..:..|  |:..|....| 
Mouse  1041 SVNPKALKTERKRKFSRLPAVTLAANRLGNKESGSVNGPSRGGAEDPGKEEPLQQMDLLRNEDTH 1105

  Fly   864 --------KPPQESAPSVISKTTS-----------------------NQPAPKE-----EQHAAK 892
                    |..|......:.|..|                       ||..||:     .|...|
Mouse  1106 FSDVHFDSKAKQSDPDKNLEKEPSFENRKGPELGSEMNTENDELHGVNQVVPKKRWQRLNQRRPK 1170

  Fly   893 KGLSDNSPPSVLKAKEKAVSGF---VECDAMFKA---------------------------MDLA 927
            .|...|.    .:.||.:...|   :..||:.||                           .:..
Mouse  1171 PGKRANR----FREKENSEGAFGVLLPADAVQKAREDYLEQRAPPTSKPEDSAADPNHGSHSESV 1231

  Fly   928 NAQLRLDEKNKKKLKKVPTKVEAPPKVEPPTAVPVPGQKKSLSGKTSLRRNT--VYEDSPNLERN 990
            ..:|.:.||:...:..| .|....|.:.|.|.:|.|..:   |.|..||:.:  :.|.:...::.
Mouse  1232 APRLNVCEKSSVGMGDV-EKETGIPSLMPQTKLPEPAIR---SEKKRLRKPSKWLLEYTEEYDQI 1292

  Fly   991 SSPSSDSAQANTSAGKLKPSKVKKKINPRRSTICEAAKDLRSSSSSSTPTREVAASSPVSTSSDS 1055
            .:|            |.|..||::::: :.|:.|| .:.|.:....|...::|..:|.:||..:.
Mouse  1293 FAP------------KKKQKKVQEQVH-KVSSRCE-DESLLARCQPSAQNKQVDENSLISTKEEP 1343

  Fly  1056 SSKRNGSKRTTSDLDGGSKLDQRRYTICEDRQPETAIPVPLTKRRFSMHPKASANPL---HDTLL 1117
            ..    .:|....|:|  .|.|....:.....|:..:.||:.       |:||..|.   .:.|:
Mouse  1344 PV----LEREAPFLEG--PLAQSDLGVTHAELPQLTLSVPVA-------PEASPRPALESEELLV 1395

  Fly  1118 QTAG---KKRGRKEGKESLSRQNSLDSSSSASQG------------------------------- 1148
            :|.|   .||.||..|: |...|.||......:|                               
Mouse  1396 KTPGNYESKRQRKPTKK-LLESNDLDPGFMPKKGDLGLSRKCFEASRSGNGIVESRATSHLKEFS 1459

  Fly  1149 --------APKKKALKSAEILSAAL-----------LETESSE-------STSSGSKMSRWDVQT 1187
                    .|:|:  |...:::|.:           .:|.|||       :.:|..::....|:.
Mouse  1460 GGTTKIFDKPRKR--KRQRLVTARVHYKKVKKEDLTKDTPSSEGELLIHRTAASPKEILEEGVEH 1522

  Fly  1188 SPELEAANPFGDIAKFIEDGVNLLKRDKVDEDQRKEGQ--------------------------- 1225
            .|.:.|:....     :|.|.....::.|.::..|.|:                           
Mouse  1523 DPGMSASKKLQ-----VERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKF 1582

  Fly  1226 -------------------DEVKREADPEEDEFAQRVANMETPATTPTPSPTQ------------ 1259
                               ::|||...|...:|.......:.|     |:.||            
Mouse  1583 ICNECHTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYP-----PTVTQNKGFRCPLHICI 1642

  Fly  1260 ----SNPED-SASTTTVLK-------------ELETGG------------------GVRRSHRIK 1288
                :||.: |||...:::             .|..|.                  |.|....:.
Mouse  1643 TCHAANPANVSASKGRLMRCVRCPVAYHANDFCLAAGSKILASNSIICPNHFTPRRGCRNHEHVN 1707

  Fly  1289 QK------------------------------PQG------------------------------ 1293
            ..                              |:|                              
Mouse  1708 VSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEGNWYCNDCKAGKKPHYREIVWVKVGRYRWWP 1772

  Fly  1294 -----PRA---------------------------------------------SQGRGVASVALA 1308
                 |||                                             ..|:||......
Mouse  1773 AEICHPRAVPSNIDKMRHDVGEFPVLFFGSNDYLWTHQARVFPYMEGDVSSKDKMGKGVDGTYKK 1837

  Fly  1309 PISMDEQLAELANIEAINE-QFLRSEGLN-----TFQLLKENFYRCARQVSQENAEM----QCDC 1363
              ::.|..|....::|..| :.|:.:..|     .::.:|.|  |...:|....|::    :|:|
Mouse  1838 --ALQEAAARFEELKAQKELRQLQEDRKNDKKPPPYKHIKVN--RPIGRVQIFTADLSEIPRCNC 1898

  Fly  1364 FLTGDEEAQGHLSCG--AGCINRMLMIECGP-LCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGI 1425
            ..| ||.     .||  :.||||||:.||.| :|..|.||.|:.|.:.|.....:|||.::|.|:
Mouse  1899 KAT-DEN-----PCGIDSECINRMLLYECHPTVCPAGVRCQNQCFSKRQYPDVEIFRTLQRGWGL 1957

  Fly  1426 TAELLIPPGEFIMEYVGEVIDSEEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINH 1490
            ..:..|..|||:.|||||:||.||...|.....:....::|.:.|..:.:|||..|||.:|::||
Mouse  1958 RTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNH 2022

  Fly  1491 SCDPNAETQKWTVNGELRIGFFSVKPIQPGEEITFDYQYLRYGRDAQRCYCEAANCRGWIGGEPD 1555
            .|.||.|||||:|||:.|:|.|::..|:.|.|:||:|.....|.....|.|.|.||.|::|..| 
Mouse  2023 CCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRP- 2086

  Fly  1556 SDEGEQLDEESDSDAEMDEEELEAEPEEGQP----RKSAKAKAKSKLKAKLPLATGRKRKEQTKP 1616
                                       :.||    .||.|.|.|       |....|.:.|.||.
Mouse  2087 ---------------------------KNQPIVTEEKSRKFKRK-------PHGKRRSQGEVTKE 2117

  Fly  1617 KDRE-YKAGRWLKPSATGSSSSAEKP--PK----------KPKVNKFQAMLEDPDVVEE-----L 1663
            ::.| :..|      ..|...|.:||  ||          |....|::......||..:     .
Mouse  2118 REDECFSCG------DAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFC 2176

  Fly  1664 SLLRRGGLKNQQDTLRFSRCLVRAKLLKTRLALLRVLTHGELPCRRLFLDYHGLRLLHAWISENG 1728
            .:......|..::.:.|.     :||            .|.|.|..           |.....|.
Mouse  2177 EMCPSSFCKQHREGMLFI-----SKL------------DGRLSCTE-----------HDPCGPNP 2213

  Fly  1729 -NDDQLREALLDTLESLPIPNRTMLSDSRVYQSVQLWSNSLEQQLAVVPQEKQAALHKRMVALLQ 1792
             ...::||.:..|..|.|.|                         ...|:|:.:.:..:.     
Mouse  2214 LEPGEIREYVPPTATSPPSP-------------------------GTQPKEQSSEMATQG----- 2248

  Fly  1793 KWQALPEIFRIPKRERIEQMKEHEREADRQQKHVHASTALEDQRERESSNDRFRQDRFRRDTTSS 1857
                       ||       |..:...|..|....:..||....:|....:|..:   |.|::|.
Mouse  2249 -----------PK-------KSDQPPTDATQLLPLSKKALTGSCQRPLLPERPPE---RTDSSSH 2292

  Fly  1858 RIGKPIRMSGNNTIC-TITTQQKGSNGAP--DGMTRNDNRRRSDIGPPSEQRRTLSKELRRSLFE 1919
            .:.:...::|:.|.. ::.:.|:..:..|  :|.......|.|.:..||......|..|.|.|  
Mouse  2293 LLDRIRDLAGSGTKSQSLVSSQRPQDRPPAKEGPRPQPPDRASPMTRPSSSPSVSSLPLERPL-- 2355

  Fly  1920 RKVALDEAERRVCTEDRLEHELRCEFFGADINTDPKQLPFYQKTDTNEWFNSDDVPVPAPPRTEL 1984
                      |: |:.||:..:     ||   ..||. ...:||..     |..:.:.:|.|  |
Mouse  2356 ----------RM-TDSRLDKSI-----GA---ASPKS-QAVEKTPA-----STGLRLSSPDR--L 2393

  Fly  1985 LTKALLSPDIDVGQGATDVEYKLPPGVDPLPPAWNWQVTSDGDIYYYNLRERISQWEPPSPEQRL 2049
            ||..  ||               .|.:...||          :..:.:|.:|:     |.||:.|
Mouse  2394 LTTN--SP---------------KPQISDRPP----------EKSHASLTQRL-----PPPEKVL 2426

  Fly  2050 QTLLEENTTQQPLHELQIDPAVLENELIQVDTDYVGSLSAKSLAQYIEAKVRERRDL---RRSKL 2111
            ..:::....:             |..|..||.:        :.:::..|.|.:..||   ::.:.
Mouse  2427 SAVVQSLVAK-------------EKALRPVDQN--------TQSKHRPAVVMDLIDLTPRQKERA 2470

  Fly  2112 VSIRLISPRRDEDRLYNQLESRKYKENKEKIRRRKELYRRRKIEVLPDAVDEIPVPGKALPIQPY 2176
            .|.:.::|:.||...  .|||..:..:|             .:..:|.|.::|.|   :..:|| 
Mouse  2471 ASPQEVTPQADEKTA--MLESSSWPSSK-------------GLGHIPRATEKISV---SESLQP- 2516

  Fly  2177 LFSSDEEETKVAAIEQPAAEEEQDSLNMAPSTSHA-------AMAALGKAVAQPTGLGTVGKRKL 2234
                   ..||||   |:....|    ...|.:||       |.|.|.::..|.:|...||..:.
Mouse  2517 -------SGKVAA---PSEHPWQ----AVKSLTHARFLSPPSAKAFLYESATQASGRTPVGAEQT 2567

  Fly  2235 PMPPSVT---VKKHRQEQRSKKVKSSQS 2259
            |.|||..   ||:.:|..|....||.||
Mouse  2568 PGPPSPAPGLVKQVKQLSRGLTAKSGQS 2595

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set2NP_572888.2 AWS 1358..1410 CDD:197795 22/58 (38%)
SET 1414..1533 CDD:214614 50/118 (42%)
PostSET 1535..1551 CDD:214703 6/15 (40%)
WW 2014..2043 CDD:278809 4/28 (14%)
SRI 2270..2348 CDD:285448
Nsd1NP_032765.3 MSH6_like 320..430 CDD:99898 18/137 (13%)
TNG2 <1453..1587 CDD:227367 17/140 (12%)
PHD1_NSD1_2 1546..1587 CDD:277118 3/40 (8%)
PHD2_NSD1 1593..1639 CDD:277120 9/50 (18%)
PHD3_NSD1 1640..1693 CDD:277123 7/52 (13%)
PHD4_NSD1 1710..1749 CDD:277126 2/38 (5%)
WHSC1_related 1755..1849 CDD:99899 8/95 (8%)
AWS 1902..1940 CDD:375420 19/42 (45%)
SET_NSD1 1942..2083 CDD:380987 57/140 (41%)
PHD5_NSD1 2121..2163 CDD:277129 10/47 (21%)
C5HCH 2162..2211 CDD:375464 10/76 (13%)
PHA03307 2255..>2576 CDD:223039 91/438 (21%)
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 1 0.930 - - C167848347
Domainoid 00.000 Not matched by this tool.
eggNOG 00.000 Not matched by this tool.
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D507784at2759
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
RoundUp 1 1.030 - avgDist Average_Evolutionary_Distance R4489
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 00.000 Not matched by this tool.
TreeFam 00.000 Not matched by this tool.
43.880

Return to query results.
Submit another query.