DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set2 and Nsd1

DIOPT Version :9

Sequence 1:NP_572888.2 Gene:Set2 / 32301 FlyBaseID:FBgn0030486 Length:2362 Species:Drosophila melanogaster
Sequence 2:XP_006253682.1 Gene:Nsd1 / 306764 RGDID:1307748 Length:2689 Species:Rattus norvegicus


Alignment Length:2897 Identity:551/2897 - (19%)
Similarity:906/2897 - (31%) Gaps:1061/2897 - (36%)


- Green bases have known domain annotations that are detailed below.


  Fly    11 SPVASRGRGRGRPPKVALS--ALGNTPPHINPSL-------------------KHADAEASPTAP 54
            ||:.......|.|..:|::  ...|.||.:..|:                   ...|:|..|..|
  Rat   109 SPIVCTSLSPGGPTALAMNQEPTCNNPPELQLSVTKTTKNGFLHFDNFTCVDDADVDSEMDPEQP 173

  Fly    55 EDQDSGQSEC-RRSSRKKIIKFDVRDLLNKNRKAHKIQIEARIDSNPSTGH-------------- 104
            ..:|....|. ..:.......::     .|:....::.:.:..||.|.:.|              
  Rat   174 VTEDESIEEIFEETQTNASCNYE-----PKSENGVEMAMGSEQDSMPESRHGAVEQPFVPLAPQT 233

  Fly   105 ------SQSGTTAASTSMSTATASAASASSAATVSRLFSMFEMSHQSLPPPPPPPTALEIFAKPR 163
                  .:|....::...:...|..:...:..|:...|:...:|.|..|.           :...
  Rat   234 EKQKNKQRSEVDGSNEKTALLPAPNSLGDTNVTIEEQFNSINLSFQDDPD-----------SSTS 287

  Fly   164 PTQSLIVAQVTSEPSAVGGAHPVQTMAGLPPVTPRKRGRPRKSQLADAAII----------PTVI 218
            ...:::....||.||         |...||...|:|:..|.|.::.|  :|          |..|
  Rat   288 TLGNMLELPGTSSPS---------TSQELPFCHPKKKSTPLKYEVGD--LIWAKFKRRPWWPCRI 341

  Fly   219 VPSCSDS--DTNSTSTTTSNMSSDSGELPGFPIQKPKSKLRVSLKRL----------------KL 265
               |||.  :|:|.....:........:..|  ..|..|..|:.|.:                |.
  Rat   342 ---CSDPLINTHSKMKVANRRPYREYYVEAF--GDPSEKAWVAGKAIVMFEGRHQFEELPVLRKR 401

  Fly   266 GGRLESSDSGNSPSSSSPEVEPP-ALQDENAMDERPKQEQNL--SRMVDAEEN-------SDSDS 320
            |.:.|.......|.....:.|.. .|.::..:...||.::.:  |..:|:||:       :|.||
  Rat   402 GKQKEKGYRHKVPQKILSKWEASVGLAEQCDVPRGPKTQKCVPSSAKLDSEEDMPFEDCANDPDS 466

  Fly   321 Q--------IIFIEIETESPKGEEEQEEGRPVEVEPQDLID--------IDMELAKQEPTPDPEE 369
            :        :..:..::|....|:|:...:....:..|.|.        :..|..|:|......|
  Rat   467 EHDLLLNGCLKSLAFDSEHSADEKEKPCAKSRVRKSSDNIKRTSVKKGLMPFEAQKEERRGKSPE 531

  Fly   370 DLDEIMVEVLSGPPSLWSADDEAEEEEDATVQRATPPG----------------KEPAADSCSSA 418
            :|.   ::.|||..|...|.:|.....::....:..||                :.|..||.|..
  Rat   532 NLG---LDFLSGGVSDKQASNELSRIANSLTGSSAAPGQFLFSSCGQNTAKTDFETPNCDSLSGL 593

  Fly   419 PRRSRRSAPLSGSSRQGKTLEETFAEIAAESSKQILEAEESQDQEEQHILIDLIEDTLSESEVTS 483
                ..||.:|..|.:.|.|:.  .::.  |||..|....:.|:|::.   |.:....:..:.:|
  Rat   594 ----SESALISKHSGEKKKLQP--GQVC--SSKVQLCYVGAGDEEKRS---DSVSVCTTSDDGSS 647

  Fly   484 SVSPTIEHMVVEEVVVEENQLVDEADEILDSKQEFVIKKVFSESDNIAASLNKDIFEPKVETKAT 548
            .:.||..:....:.|:|....:|:.:.                    |.|::|:      |||.:
  Rat   648 DLDPTEHNSEFHKSVLEVTDALDKTEN--------------------ALSMHKN------ETKYS 686

  Fly   549 CGEVVPRPEMVTE--DVYITEGIAATLEKSAVVTKPTTEMIAETKLSDEVVIEPPLKDESD---- 607
               ..|....|.|  ...||......|..|....:|.|..|::..|||..:..|..|.:.:    
  Rat   687 ---RYPATNRVKEKQKSLITNSHTDHLMDSTKTVEPGTAEISQVNLSDLKISSPIPKPQPEFRND 748

  Fly   608 -----------------------PKQTEVELPESKP---AVNIPKSERILSAEVETTSSPLVPPE 646
                                   ..||.:.|...:|   ::.....|...:||...||..| ..:
  Rat   749 GLTTKFNAPPGIRNENSLTKGGLANQTLLPLKCRQPKFRSIKCKHKESPTAAETSATSEDL-SLK 812

  Fly   647 CCTLESVSGPVLLETSLSTEEK---------------SNENVETTPLK----------TEAAKED 686
            ||:.::...|:   ||:|...|               .:.::||..:|          ..:..||
  Rat   813 CCSSDTNGSPM---TSISKSGKGEGLKLLNNMHEKTRDSSDIETAVVKHVLSELKELSYRSLSED 874

  Fly   687 ------SPPAAPEEEASNSSE-----EPNF-------LLEDYESNQEQVAEDEMMKCNN------ 727
                  |..:.|...:|.||:     ||::       :|:|...::.:  |..:|...|      
  Rat   875 VSDSGTSKSSKPLLFSSASSQNHIPIEPDYKFSTLLMMLKDMHDSKTK--EQRLMTAQNVASYRT 937

  Fly   728 -QKGQKQTPLP-----------EMKEPEKPVAETVSKKEKAMENPARSSPAIVDKKVRAGEMEKK 780
             .:|...:..|           .....|||...|        ::..|.||...|..: :||:   
  Rat   938 PDRGDCSSGSPVGTSKVLVLGGSTHNSEKPGDST--------QDSVRLSPGGGDSAL-SGEL--- 990

  Fly   781 VVKSTKGTVPEKKMD------------SKKSC--AAVTP-AKQKESGKSAKEAILKK--ETEKEK 828
              .|:...:|..|.|            .:::|  |.::| .:...|.:.||.::..|  :||:::
  Rat   991 --SSSLSVLPSDKRDLPACGKIRSNCIPRRNCGRAKLSPKLRVTISTQMAKPSVNPKALKTERKR 1053

  Fly   829 SSAKLD--SSSPNTLDKK-------------GKDTAQWSP--QLQTLPKSST----KPPQESAPS 872
            ..::|.  :.:.|.|..|             .:|.|:..|  |:..|....|    |..|.....
  Rat  1054 KLSRLPAVTLAANGLGNKESGGSVNGPLKGGAQDPAKEEPLQQMDLLRNEETHFDSKVKQSDPDK 1118

  Fly   873 VISKTTS-----------------------------------NQPAPKEEQHAAK---------- 892
            ::.|..|                                   ||..||..:.|::          
  Rat  1119 ILEKEPSFENRKGPEVGSEINIENDEPHGVDQVVPKKRWQRLNQRRPKPGKRASRFREKENSEGA 1183

  Fly   893 -----------KG----LSDNSPPSVLKAKEKAVSGFVECDAMFKAMDLANAQLRLDEKNKKKLK 942
                       ||    |...:||:.:.....|....|      ...:....:|.:.:|:...:.
  Rat  1184 FGVLLPGDPVQKGRDDYLEQRAPPTSILEDSAADPNHV------SHSESVGPRLNVCDKSSVSMG 1242

  Fly   943 KVPTKVEAPPKVEPPTAVPVPGQKKSLSGKTSLRRNT--VYEDSPNLERNSSPSSDSAQANTSAG 1005
            .: .|....|.:.|.|.:|.|..:   |.|..||:.:  :.|.:...::..:|            
  Rat  1243 DL-EKETGIPSLTPQTKIPEPAVR---SEKKRLRKPSKWLLEYTEEYDQIFAP------------ 1291

  Fly  1006 KLKPSKVKKKINPRRSTICEAAKDLRSSSSSSTPTREVAASSPVSTSSDSSSKRNGSKRTTSDLD 1070
            |.|..||::::: :.|:.|| .:.|.:...||...::|..:|.:||..:...    .:|....|:
  Rat  1292 KKKQKKVQEQVH-KVSSRCE-DESLLARCRSSAQNKQVDENSLISTKEEPPV----LEREAPFLE 1350

  Fly  1071 GGSKLDQRRYTICEDRQPETAIPVPLTKRRFSMHPKASANPL---HDTLLQTAGKKRGRKEGK-- 1130
            |  .|.|....:.....|:..:.||:.       |:.|..|.   .:.|::|.|...|:::.|  
  Rat  1351 G--PLVQSDLGVAHAELPQLTLSVPVA-------PEVSPRPTLESEELLVKTPGNYEGKRQRKPT 1406

  Fly  1131 ESLSRQNSLDSSSSASQGAPKKKALKSAEILSAALLETESSESTSSGSKMSRWDVQTSPELEAAN 1195
            :.|...|.||...     .|||..|.    ||....|:...|:...       |.:.:|.|:   
  Rat  1407 KKLLESNDLDPGF-----MPKKGDLG----LSRKCCESGHLENGVG-------DSRATPHLK--- 1452

  Fly  1196 PFGDIAKFIEDGVNLLKRDKVDEDQRKEGQDEVKREADPEEDEFAQRVANMETPA---------- 1250
            .||.....|.|.....||       ::.|...|..:...:||      :..|||:          
  Rat  1453 EFGGGTTRIFDKPRKRKR-------QRHGTARVHYKRVKKED------SARETPSSAEGELMIHR 1504

  Fly  1251 TTPTPSP-----TQSNPEDSASTTTVLKEL--ETGGGVRRSHRIKQK------------------ 1290
            |..:|..     .:.:|..|||     |.|  |.|||......:.|.                  
  Rat  1505 TAASPKEILEEGIEHDPGMSAS-----KRLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGAF 1564

  Fly  1291 ----------PQG---------------------------------------------PRASQGR 1300
                      |:|                                             |...|.:
  Rat  1565 HLECLGLTEMPRGKFICNECRTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYPPTVVQNK 1629

  Fly  1301 GVASVALAP--ISMDEQLAELANIEAINEQFLR------SEGLNTF------QLLKENFYRCARQ 1351
            |..    .|  |.:....|..||:.|...:.:|      :...|.|      ::|..|...|...
  Rat  1630 GFR----CPLHICITCHAANPANVSASKGRLMRCVRCPVAYHANDFCLAAGSKILASNSIICPNH 1690

  Fly  1352 -------------------VSQENAEMQC-------------------------DC--------- 1363
                               |..|...:.|                         ||         
  Rat  1691 FTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEGNWYCNDCKAGKKPHYR 1755

  Fly  1364 ---------------------------------------------------------FLTGD--- 1368
                                                                     ::.||   
  Rat  1756 EIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFGSNDYLWTHQARVFPYMEGDVSS 1820

  Fly  1369 -----------------------EE--AQGHL--------------------------------- 1375
                                   ||  ||..|                                 
  Rat  1821 KDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQEDRKNDKKPPPYKHIKVNRPIGRVQIFTA 1885

  Fly  1376 ----------------SCG--AGCINRMLMIECGP-LCSNGARCTNKRFQQHQCWPCRVFRTEKK 1421
                            .||  :.||||||:.||.| :|..|.||.|:.|.:.|.....:|||.::
  Rat  1886 DLSEIPRCNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIFRTLQR 1950

  Fly  1422 GCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISR 1486
            |.|:..:..|..|||:.|||||:||.||...|.....:....::|.:.|..:.:|||..|||.:|
  Rat  1951 GWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYAR 2015

  Fly  1487 YINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEEITFDYQYLRYGRDAQRCYCEAANCRGWIG 1551
            ::||.|.||.|||||:|||:.|:|.|::..|:.|.|:||:|.....|.....|.|.|.||.|::|
  Rat  2016 FMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLG 2080

  Fly  1552 GEPDSDEGEQLDEESDSDAEMDEEELEAEPEEGQP----RKSAKAKAKSKLKAKLPLATGRKRKE 1612
            ..|                            :.||    .||.|.|.|       |....|.:.|
  Rat  2081 VRP----------------------------KNQPIVTEEKSRKFKRK-------PHGKRRSQGE 2110

  Fly  1613 QTKPKDRE-YKAGRWLKPSATGSSSSAEKPPKKPKVNKFQAMLEDPDVVEELSLLRRGGLKNQQD 1676
            .||.::.| :..|       .|....:.|.|..|||  :.|        :.|:|.:|...|.:  
  Rat  2111 VTKEREDECFSCG-------DGGQLVSCKKPGCPKV--YHA--------DCLNLTKRPAGKWE-- 2156

  Fly  1677 TLRFSRCLVRAKLLKTRLALLRVLTHGELPCRRLFLDYHG-----LRLLHAWISENGND------ 1730
             ..:.:|.|..|         ...:..|: |...|...|.     :..|...:|...:|      
  Rat  2157 -CPWHQCDVCGK---------EAASFCEM-CPSSFCKQHREGMLFISKLDGRLSCTEHDPCGPNP 2210

  Fly  1731 ---DQLREALLDTLESLPIPNRTMLSDSRVYQSVQLWSNSLEQQLAVVPQE---------KQAAL 1783
               .::||.:..|....|.|     ...:..||.::.:..        |::         :..:|
  Rat  2211 LEPGEIREYVPPTATLSPSP-----GTQQTEQSSEMGTQG--------PKKSDQPPTDATQMLSL 2262

  Fly  1784 HKRMVALLQKWQALPEIFRIPKRERIEQMKEHEREADRQQKHVHASTALEDQRERESSN---DRF 1845
            .|:.|....:...|||  |.|:|                               .:||:   ||.
  Rat  2263 SKKAVTGTCQRPLLPE--RPPER-------------------------------TDSSSHLLDRI 2294

  Fly  1846 RQDRFRRDTTSSRIGKPIRMSGNNTIC-TITTQQKGSNGAP--DGMTRNDNRRRSDIGPPSEQRR 1907
            |.                 ::|:.|.. ::.:.|:..:..|  :|.......|.|.:..||....
  Rat  2295 RD-----------------LAGSGTKSQSLVSSQRPQDRPPAKEGPRPQPPDRASPVTRPSSSPS 2342

  Fly  1908 TLSKELRRSLFERKVALDEAERRVCTEDRLEHELRCEFFGADINTDPKQLPFYQKTDTNEWFNSD 1972
            ..|..|.|.|            |: ||.||:..:     ||   ..||               |.
  Rat  2343 VSSLPLERPL------------RM-TEPRLDKSI-----GA---ASPK---------------SQ 2371

  Fly  1973 DVPVPAPPRTELLTKALLSPDIDVGQGATDVEYKLPPGVDPLPPAWNWQVTSDGDIYYYNLRERI 2037
            .|..|..| |.|   .|.|||..:...:.      .|.:...||          |..:.:|.:|:
  Rat  2372 AVEKPPAP-TGL---RLSSPDRLLNTNSP------KPQISDRPP----------DKSHASLTQRL 2416

  Fly  2038 SQWEPPSPEQRLQTLLEENTTQQPLHELQIDPAVLENELIQVDTDYVGSLSAKSLAQYIEAKVRE 2102
                 |.||:.|..:::....:             |..|..||.:        :.|::..|.|.:
  Rat  2417 -----PPPEKVLSAVVQSLVAK-------------EKALRPVDQN--------TQAKHRAAVVMD 2455

  Fly  2103 RRDLRRSKLVSIRLISPRRDEDRLYNQLESRKYKENKEKIRRRKELYRRRKIEVLPDAVDEIPVP 2167
            ..||           :||:.| |..:..|.....:.|..:.........|.:..:|.||::   .
  Rat  2456 LIDL-----------TPRQKE-RAASPQEVTAQADEKTPVLESSSRPTSRGLGHVPRAVEK---G 2505

  Fly  2168 GKALPIQPYLFSSDEEETKVAAIEQPAAEEEQDSLNMAPSTSHA-------AMAALGKAVAQPTG 2225
            |.:.|:||...|:...|....|::               |.:||       |.|.|.::..|.:|
  Rat  2506 GVSDPLQPPGKSAAPSEHPWQAVK---------------SLTHARFLSPPSAKAFLYESAIQASG 2555

  Fly  2226 LGTVGKRKLPMPPSVT---VKKHRQEQRSKKVKSSQS 2259
            ...||..:.|.|||..   :|:.:|..|....||.||
  Rat  2556 RAPVGSEQTPGPPSPAPGLLKQVKQLSRGLTAKSGQS 2592

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set2NP_572888.2 AWS 1358..1410 CDD:197795 27/222 (12%)
SET 1414..1533 CDD:214614 50/118 (42%)
PostSET 1535..1551 CDD:214703 6/15 (40%)
WW 2014..2043 CDD:278809 5/28 (18%)
SRI 2270..2348 CDD:285448
Nsd1XP_006253682.1 MSH6_like 319..429 CDD:99898 21/116 (18%)
PHD1_NSD1_2 1543..1585 CDD:277118 3/41 (7%)
PHD2_NSD1 1590..1636 CDD:277120 4/49 (8%)
PHD3_NSD1 1637..1690 CDD:277123 11/52 (21%)
PHD4_NSD1 1707..1746 CDD:277126 3/38 (8%)
WHSC1_related 1752..1846 CDD:99899 3/93 (3%)
AWS 1889..1939 CDD:197795 17/49 (35%)
SET 1940..2063 CDD:214614 50/122 (41%)
PHD5_NSD1 2118..2160 CDD:277129 13/61 (21%)
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 1 0.930 - - C166351960
Domainoid 00.000 Not matched by this tool.
eggNOG 00.000 Not matched by this tool.
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D507784at2759
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 1 1.000 - -
TreeFam 00.000 Not matched by this tool.
43.850

Return to query results.
Submit another query.