DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment egg and Nsd1

DIOPT Version :9

Sequence 1:NP_611966.3 Gene:egg / 37962 FlyBaseID:FBgn0086908 Length:1262 Species:Drosophila melanogaster
Sequence 2:XP_006253682.1 Gene:Nsd1 / 306764 RGDID:1307748 Length:2689 Species:Rattus norvegicus


Alignment Length:1596 Identity:294/1596 - (18%)
Similarity:462/1596 - (28%) Gaps:651/1596 - (40%)


- Green bases have known domain annotations that are detailed below.


  Fly    35 PVRKGENSL---------------ESPAEQAAKDVEIEELTHSEAIAATGSTRKQCPYGGKAPDE 84
            |..:.||||               ..|..::.|....|..|.:|..|.:.....:|    .:.|.
  Rat   758 PGIRNENSLTKGGLANQTLLPLKCRQPKFRSIKCKHKESPTAAETSATSEDLSLKC----CSSDT 818

  Fly    85 PGK-LADESEDRKGENTKAI--------ASSPVLVAV-------------------DSDSSVELI 121
            .|. :...|:..|||..|.:        .||.:..||                   .|||.....
  Rat   819 NGSPMTSISKSGKGEGLKLLNNMHEKTRDSSDIETAVVKHVLSELKELSYRSLSEDVSDSGTSKS 883

  Fly   122 ESPVKFSSANESEKDPPKPD-----------AVNEAAAKEAEEMTDSSISS-------PTSESFP 168
            ..|:.||||:.....|.:||           .::::..||...||..:::|       ..|...|
  Rat   884 SKPLLFSSASSQNHIPIEPDYKFSTLLMMLKDMHDSKTKEQRLMTAQNVASYRTPDRGDCSSGSP 948

  Fly   169 EKDEKT----NKENEQEPPGMEVDQDVEESISRPAEEYKIENTLKGHKRISLTEIEEHKIVDKKD 229
            ....|.    ...:..|.||......|..|   |...   ::.|.|....||:.:..    ||:|
  Rat   949 VGTSKVLVLGGSTHNSEKPGDSTQDSVRLS---PGGG---DSALSGELSSSLSVLPS----DKRD 1003

  Fly   230 DVLEVELEKGTAP-----KAAEDEKLNALLSDGDVFYDKECVNCNCTKLHKQYVLANM--ATLNF 287
            .....::.....|     :|....||...:|   ....|..||....|..::..|:.:  .||..
  Rat  1004 LPACGKIRSNCIPRRNCGRAKLSPKLRVTIS---TQMAKPSVNPKALKTERKRKLSRLPAVTLAA 1065

  Fly   288 YQVLRKSSKQQF---LCMGCHDTAMDLYEEYAGQLMAKQPLLLKDF---HQDHADFVALDSSDEE 346
            ..:..|.|....   |..|..|.|.:            :||...|.   .:.|.|.....|..::
  Rat  1066 NGLGNKESGGSVNGPLKGGAQDPAKE------------EPLQQMDLLRNEETHFDSKVKQSDPDK 1118

  Fly   347 EEEKQPEKSDFSKNKLQLIEDELDDAIKNVLNK----VDFTAQLSWSKTILQAKADHLERQFALA 407
            ..||:|   .|...|...:..|:     |:.|.    ||                          
  Rat  1119 ILEKEP---SFENRKGPEVGSEI-----NIENDEPHGVD-------------------------- 1149

  Fly   408 DVELEKVQTTADKMHCALYNSCPVAHKHLPTL-DIEPSDYVHEVPPPGEIVRPPIQLGETYYAVK 471
                   |....|....|....|...|..... :.|.|:....|..||:    |:|.|...|..:
  Rat  1150 -------QVVPKKRWQRLNQRRPKPGKRASRFREKENSEGAFGVLLPGD----PVQKGRDDYLEQ 1203

  Fly   472 NKAIASWVSIKVIEFTESTAINGNTMKSYKIRYLNTPYQMIKTVTAKHIAYFEPPPVRLTIGTRV 536
            .....|        ..|.:|.:.|                       |:::.|....||.:    
  Rat  1204 RAPPTS--------ILEDSAADPN-----------------------HVSHSESVGPRLNV---- 1233

  Fly   537 IAYFDGTTLSRG---KDKGVVQSAFYPGI-----IAEPLKQANRYR------YLIFYDDGYTQ-Y 586
               .|.:::|.|   |:.|:      |.:     |.||..::.:.|      :|:.|.:.|.| :
  Rat  1234 ---CDKSSVSMGDLEKETGI------PSLTPQTKIPEPAVRSEKKRLRKPSKWLLEYTEEYDQIF 1289

  Fly   587 VPHRDVRLVCQASEKVWEDVHAASRDF--------IQKYVEKYSVDRPMVQCTRGQSMTTESNGT 643
            .|.:       ..:||.|.||..|...        .:...:...||...:..|:.:....|....
  Rat  1290 APKK-------KQKKVQEQVHKVSSRCEDESLLARCRSSAQNKQVDENSLISTKEEPPVLEREAP 1347

  Fly   644 WLYARVIDIDCSL----------------------------VLMQFEGDKNHTEWIYRGSLRLGP 680
            :|...::..|..:                            :|::..|:       |.|..:..|
  Rat  1348 FLEGPLVQSDLGVAHAELPQLTLSVPVAPEVSPRPTLESEELLVKTPGN-------YEGKRQRKP 1405

  Fly   681 VFRETQNNMNSSSAQQLRVPR-----------------------RTEPFIR-----YTKEMESSS 717
                |:..:.|:......:|:                       |..|.::     .|:..:...
  Rat  1406 ----TKKLLESNDLDPGFMPKKGDLGLSRKCCESGHLENGVGDSRATPHLKEFGGGTTRIFDKPR 1466

  Fly   718 KVNQQMRAFAR------------KSSASAQNNALAAASSAATP----AGGRTNAGGVSTSNSASA 766
            |..:|....||            :.:.|:....|....:||:|    ..|..:..|:|.|.....
  Rat  1467 KRKRQRHGTARVHYKRVKKEDSARETPSSAEGELMIHRTAASPKEILEEGIEHDPGMSASKRLQG 1531

  Fly   767 VR----HLNNSTIYVDDENRPK-GHVVYFTAK------------RNLPPKMYKCHECSPNCLFKI 814
            .|    .|..:..    :|..| |.::...|:            ..:|...:.|:||...     
  Rat  1532 ERGGGAALKENVC----QNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNECRTG----- 1587

  Fly   815 VHRLDSYSPLAKPLLSGWERLVMRQK---TKKSVVYKGP-CGKSLRSLAEVHRYLRATENVLNVD 875
            :|..                .|.:|.   .|:.::   | |||...... |.:|....     |.
  Rat  1588 IHTC----------------FVCKQSGEDVKRCLL---PLCGKFYHEEC-VQKYPPTV-----VQ 1627

  Fly   876 NFDFTPDLK-CLAEYSIDPSIVKDTDISKGQEKMAIPL-VNYYDNTLPPPCTYAKQRI------- 931
            |..|...|. |:..::.:|:   :...|||:....:.. |.|:.|..   |..|..:|       
  Rat  1628 NKGFRCPLHICITCHAANPA---NVSASKGRLMRCVRCPVAYHANDF---CLAAGSKILASNSII 1686

  Fly   932 ------PTEGV----HLNLDEEF-------LLCCD---------------------CEDDCSDKS 958
                  |..|.    |:|:...|       |||||                     | :||....
  Rat  1687 CPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEGNWYC-NDCKAGK 1750

  Fly   959 KCACWQLTVAGV-RY-------CNP---------------------------------------- 975
            |....::....| ||       |:|                                        
  Rat  1751 KPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFGSNDYLWTHQARVFPYME 1815

  Fly   976 -----------------KKPIEEIGYQYKRL---------------------HEHV----PTG-- 996
                             ||.::|...:::.|                     ::|:    |.|  
  Rat  1816 GDVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQEDRKNDKKPPPYKHIKVNRPIGRV 1880

  Fly   997 -IYECN----SRCKCK----------KNCLNRVVQFSLE--------------------MKLQVF 1026
             |:..:    .||.||          ..|:||::.:...                    ..:::|
  Rat  1881 QIFTADLSEIPRCNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIF 1945

  Fly  1027 KTSNRGWGLRCVNDIPKGAFICIYAGHLLTETMANEGGQDAGDEYFADLDYIEVAEQLKEGYESE 1091
            :|..||||||...||.||.|:..|.|.|:                                    
  Rat  1946 RTLQRGWGLRTKTDIKKGEFVNEYVGELI------------------------------------ 1974

  Fly  1092 VDHSDPDAEEDNGGPDAEDDDDFRPNYHYQRKIKRSSRSGSTQNSSTQSSELDSQERAVINFNPN 1156
                              |:::.|....|                        :||..:.||. .
  Rat  1975 ------------------DEEECRARIRY------------------------AQEHDITNFY-M 1996

  Fly  1157 ADLDETVRENSVRRLFGKDEAPYIMDAKTTGNLGRYFNHSCSPNLFVQNVFVDTHDLRFPWVAFF 1221
            ..||             ||.   |:||...||..|:.||.|.||...|...|: .|.|   |..|
  Rat  1997 LTLD-------------KDR---IIDAGPKGNYARFMNHCCQPNCETQKWSVN-GDTR---VGLF 2041

  Fly  1222 SAAHIRSGTELTWNYNYEVGVVPGKVLYCQCGAPNC 1257
            :.:.|::|||||:|||.|. :..||.: |:||||||
  Rat  2042 ALSDIKAGTELTFNYNLEC-LGNGKTV-CKCGAPNC 2075

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
eggNP_611966.3 HMT_MBD 823..879 CDD:238689 11/59 (19%)
Pre-SET 901..1013 CDD:282838 41/264 (16%)
SET 1022..1239 CDD:214614 53/216 (25%)
Nsd1XP_006253682.1 MSH6_like 319..429 CDD:99898
PHD1_NSD1_2 1543..1585 CDD:277118 7/45 (16%)
PHD2_NSD1 1590..1636 CDD:277120 12/70 (17%)
PHD3_NSD1 1637..1690 CDD:277123 11/58 (19%)
PHD4_NSD1 1707..1746 CDD:277126 7/39 (18%)
WHSC1_related 1752..1846 CDD:99899 8/93 (9%)
AWS 1889..1939 CDD:197795 7/49 (14%)
SET 1940..2063 CDD:214614 54/222 (24%)
PHD5_NSD1 2118..2160 CDD:277129
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 1 0.930 - - C166351958
Domainoid 00.000 Not matched by this tool.
eggNOG 00.000 Not matched by this tool.
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 00.000 Not matched by this tool.
TreeFam 00.000 Not matched by this tool.
21.840

Return to query results.
Submit another query.