DRSC/TRiP Functional Genomics Resources



Protein Alignment egg and Nsd1

DIOPT Version: 9

Sequence 1: NP_611966.3  Gene: egg / 37962  FlyBaseID: FBgn0086908  Length: 1262  Species: Drosophila melanogaster
Sequence 2: NP_032765.3  Gene: Nsd1 / 18193  MGIID: 1276545  Length: 2691  Species: Mus musculus


Alignment Length: 1547  Identity: 286/1547 (18%)
Similarity: 464/1547 (29%)  Gaps: 615/1547 (39%)
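
The percentages in the summary can be reproduced from the raw counts. The sketch below assumes they are truncated to whole numbers rather than rounded. That is an inference from the figures (464/1547 is about 29.99% but is shown as 29%), not documented DIOPT behavior, and the helper name `truncated_percent` is hypothetical.

```python
# Counts copied from the alignment summary.
ALIGNMENT_LENGTH = 1547
counts = {"identity": 286, "similarity": 464, "gaps": 615}


def truncated_percent(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumed convention)."""
    return (100 * count) // total


percents = {name: truncated_percent(n, ALIGNMENT_LENGTH) for name, n in counts.items()}
print(percents)  # {'identity': 18, 'similarity': 29, 'gaps': 39}
```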


- Green bases have known domain annotations that are detailed below.


  Fly    62 SEAIAATGSTRKQCPYGGKAPDEPGK-LADESEDRKGENTKAI--------ASSPVLVAV----- 112
            |.|:|.|.:|.:.......:.|..|. ||:.|:..|||..|.:        .||.:..||     
Mouse   796 SPAVAETSATSEDLSLKCCSSDTNGSPLANISKSGKGEGLKLLNNMHEKTRDSSDIETAVVKHVL 860

  Fly   113 --------------DSDSSVELIESPVKFSSANESEKDPPKPD-----------AVNEAAAKEAE 152
                          .|||.......|:.||||:.....|.:||           .::::..||..
Mouse   861 SELKELSYRSLSEDVSDSGTAKASKPLLFSSASSQNHIPIEPDYKFSTLLMMLKDMHDSKTKEQR 925

  Fly   153 EMTDSSISS-------PTSESFPEKDEKT----NKENEQEPPGMEVDQDVEESISRPAEEYKIEN 206
            .||..:::|       ..|...|....|.    :.....|.||......|.:|   |...   ::
Mouse   926 LMTAQNLASYRTPDRGDCSSGSPVGTSKVLVLGSSTPNSEKPGDSTQDSVHQS---PGGG---DS 984

  Fly   207 TLKGHKRISLTEIEEHKIVDKKDDVLEVELEKGTAP-----KAAEDEKLNALLSDGDVFYDKECV 266
            .|.|....||:.:..    ||::.....::.....|     :|....||...:|...|   |..|
Mouse   985 ALSGELSSSLSSLAS----DKRELPACGKIRSNCIPRRNCGRAKPSSKLRETISAQMV---KPSV 1042

  Fly   267 NCNCTKLHKQYVLANMATLNFYQVLRKSSKQQFLCMGCHDTAMDLYEEYAGQLMA---------- 321
            |....|..::               ||.|:...:.:    .|..|..:.:|.:..          
Mouse  1043 NPKALKTERK---------------RKFSRLPAVTL----AANRLGNKESGSVNGPSRGGAEDPG 1088

  Fly   322 -KQPL----LLKD----FHQDHADFVALDSSDEEEEEKQPEKSDFSKNKLQLIEDEL---DDAIK 374
             ::||    ||::    |...|.|..|..|..::..||:|   .|...|...:..|:   :|.:.
Mouse  1089 KEEPLQQMDLLRNEDTHFSDVHFDSKAKQSDPDKNLEKEP---SFENRKGPELGSEMNTENDELH 1150

  Fly   375 NVLNKVDFTAQLSWSKTILQAKADHLERQFALADVELEK----VQTTADKMHCALYNSCPVAHKH 435
            .| |:|  ..:..|.: :.|.:....:|.....:.|..:    |...||.:..|           
Mouse  1151 GV-NQV--VPKKRWQR-LNQRRPKPGKRANRFREKENSEGAFGVLLPADAVQKA----------- 1200

  Fly   436 LPTLDIEPSDYVHEVPPPGEIVRPPIQLGETYYAVKNKAIASWVSIKVIEFTESTAIN-GNTMKS 499
                   ..||:.:..||..  :|.....:..:...::::|..:::     .|.:::. |:..|.
Mouse  1201 -------REDYLEQRAPPTS--KPEDSAADPNHGSHSESVAPRLNV-----CEKSSVGMGDVEKE 1251

  Fly   500 YKIRYLNTPYQMIKTVTAKHIAYFEPPPVRLTIGTRVIAYFDGTTLSRGKDKGVVQSAFYPGIIA 564
            ..|     |..|.:|...:                                         |.|.:
Mouse  1252 TGI-----PSLMPQTKLPE-----------------------------------------PAIRS 1270

  Fly   565 EPLKQANRYRYLIFYDDGYTQ-YVPHRDVRLVCQASEKVWEDVHAASRDF--------IQKYVEK 620
            |..:.....::|:.|.:.|.| :.|.:       ..:||.|.||..|...        .|...:.
Mouse  1271 EKKRLRKPSKWLLEYTEEYDQIFAPKK-------KQKKVQEQVHKVSSRCEDESLLARCQPSAQN 1328

  Fly   621 YSVDRPMVQCTRGQSMTTESNGTWLYARVIDIDCSLVLMQFEGDKNHTEWIYRGSLRLG-PVFRE 684
            ..||...:..|:.:....|....:|...:...|..:         .|.|   ...|.|. ||..|
Mouse  1329 KQVDENSLISTKEEPPVLEREAPFLEGPLAQSDLGV---------THAE---LPQLTLSVPVAPE 1381

  Fly   685 TQNNMNSSSAQQL-RVP------RRTEPFIRYTKEMESSSKVNQQMRAFARKSSASAQNNALAAA 742
            ........|.:.| :.|      |:.:|    ||::..|:.::...  ..:|...........|:
Mouse  1382 ASPRPALESEELLVKTPGNYESKRQRKP----TKKLLESNDLDPGF--MPKKGDLGLSRKCFEAS 1440

  Fly   743 SSAATPAGGRTNAGGVSTSNSASAVRHLNNSTIYVDDENRPKGHVVYFTAKRNLPPKMYKCHECS 807
            .|          ..|:..|.:.|.::..:..|..:.|:.|.:......||:.:     ||  :..
Mouse  1441 RS----------GNGIVESRATSHLKEFSGGTTKIFDKPRKRKRQRLVTARVH-----YK--KVK 1488

  Fly   808 PNCLFK---------IVHRLDSYSPLAKPLL-SGWERLVMRQKTKKSVVYKG------------- 849
            ...|.|         ::|| .:.||  |.:| .|.|.......:||..|.:|             
Mouse  1489 KEDLTKDTPSSEGELLIHR-TAASP--KEILEEGVEHDPGMSASKKLQVERGGGAALKENVCQNC 1550

  Fly   850 ---------------------------PCGKSLRSLAEVHRYL-------RATENVLN------- 873
                                       |.||.:.:  |.|..:       ::.|:|..       
Mouse  1551 EKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICN--ECHTGIHTCFVCKQSGEDVKRCLLPLCG 1613

  Fly   874 ---------------VDNFDFTPDLK-CLAEYSIDPSIVKDTDISKGQEKMAIPL-VNYYDNTLP 921
                           ..|..|...|. |:..::.:|:   :...|||:....:.. |.|:.|.. 
Mouse  1614 KFYHEECVQKYPPTVTQNKGFRCPLHICITCHAANPA---NVSASKGRLMRCVRCPVAYHANDF- 1674

  Fly   922 PPCTYAKQRI-------------PTEGV----HLNLDEEF-------LLCCD-CE---------- 951
              |..|..:|             |..|.    |:|:...|       ||||| |.          
Mouse  1675 --CLAAGSKILASNSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI 1737

  Fly   952 ---------DDCSDKSKCACWQLTVAGV-RY-------CNP------------------------ 975
                     :||....|....::....| ||       |:|                        
Mouse  1738 DIPEGNWYCNDCKAGKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFGS 1802

  Fly   976 ---------------------------------KKPIEEIGYQYKRL------------------ 989
                                             ||.::|...:::.|                  
Mouse  1803 NDYLWTHQARVFPYMEGDVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQEDRKNDKK 1867

  Fly   990 ---HEHV----PTG---IYECN----SRCKCK----------KNCLNRVVQFSLE---------- 1020
               ::|:    |.|   |:..:    .||.||          ..|:||::.:...          
Mouse  1868 PPPYKHIKVNRPIGRVQIFTADLSEIPRCNCKATDENPCGIDSECINRMLLYECHPTVCPAGVRC 1932

  Fly  1021 ----------MKLQVFKTSNRGWGLRCVNDIPKGAFICIYAGHLLTETMANEGGQDAGDEYFADL 1075
                      ..:::|:|..||||||...||.||.|:..|.|.|:                    
Mouse  1933 QNQCFSKRQYPDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELI-------------------- 1977

  Fly  1076 DYIEVAEQLKEGYESEVDHSDPDAEEDNGGPDAEDDDDFRPNYHYQRKIKRSSRSGSTQNSSTQS 1140
                                              |:::.|....|                    
Mouse  1978 ----------------------------------DEEECRARIRY-------------------- 1988

  Fly  1141 SELDSQERAVINFNPNADLDETVRENSVRRLFGKDEAPYIMDAKTTGNLGRYFNHSCSPNLFVQN 1205
                :||..:.||. ...||             ||.   |:||...||..|:.||.|.||...|.
Mouse  1989 ----AQEHDITNFY-MLTLD-------------KDR---IIDAGPKGNYARFMNHCCQPNCETQK 2032

  Fly  1206 VFVDTHDLRFPWVAFFSAAHIRSGTELTWNYNYEVGVVPGKVLYCQCGAPNC 1257
            ..|: .|.|   |..|:.:.|::|||||:|||.|. :..||.: |:||||||
Mouse  2033 WSVN-GDTR---VGLFALSDIKAGTELTFNYNLEC-LGNGKTV-CKCGAPNC 2078

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence     Domain         Region       External ID  Identity
egg   NP_611966.3  HMT_MBD        823..879     CDD:238689   17/125 (14%)
                   Pre-SET        901..1013    CDD:282838   41/263 (16%)
                   SET            1022..1239   CDD:214614   53/216 (25%)
Nsd1  NP_032765.3  MSH6_like      320..430     CDD:99898
                   TNG2           <1453..1587  CDD:227367   25/145 (17%)
                   PHD1_NSD1_2    1546..1587   CDD:277118   3/42 (7%)
                   PHD2_NSD1      1593..1639   CDD:277120   4/45 (9%)
                   PHD3_NSD1      1640..1693   CDD:277123   11/58 (19%)
                   PHD4_NSD1      1710..1749   CDD:277126   7/38 (18%)
                   WHSC1_related  1755..1849   CDD:99899    8/93 (9%)
                   AWS            1902..1940   CDD:375420   3/37 (8%)
                   SET_NSD1       1942..2083   CDD:380987   63/238 (26%)
                   PHD5_NSD1      2121..2163   CDD:277129
                   C5HCH          2162..2211   CDD:375464
                   PHA03307       2255..>2576  CDD:223039
Blue background indicates that the domain is not in the aligned region.
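
As a consistency check on the Identity column, the sketch below recomputes each percentage from its match count and aligned length. It assumes the values are rounded to the nearest whole percent. That convention is inferred from the numbers, not stated on the page, and `rounded_percent` is a hypothetical helper.

```python
# (matches, aligned length, percent shown) copied from the Identity column.
domain_identities = {
    "HMT_MBD": (17, 125, 14),
    "Pre-SET": (41, 263, 16),
    "SET": (53, 216, 25),
    "TNG2": (25, 145, 17),
    "PHD1_NSD1_2": (3, 42, 7),
    "PHD2_NSD1": (4, 45, 9),
    "PHD3_NSD1": (11, 58, 19),
    "PHD4_NSD1": (7, 38, 18),
    "WHSC1_related": (8, 93, 9),
    "AWS": (3, 37, 8),
    "SET_NSD1": (63, 238, 26),
}


def rounded_percent(matches: int, length: int) -> int:
    """Nearest whole percent (assumed convention)."""
    return round(100 * matches / length)


# Collect any domain whose recomputed percentage differs from the one shown.
mismatches = {
    name: (m, n, shown)
    for name, (m, n, shown) in domain_identities.items()
    if rounded_percent(m, n) != shown
}
print(mismatches)  # {}
```

Domains listed without an Identity value are left out because there is no count to recompute.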


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information (BLAST Result, Score, Score Type, Cluster ID)
Compara         1             0.930           - - C167848345
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
Total           2             1.840
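
The final row is consistent with a plain sum over the per-tool scores: two tools report a match, and their weighted scores add up to 1.840. A minimal sketch of that arithmetic, assuming the totals are simple sums (the page does not say how they are derived):

```python
from decimal import Decimal

# (simple score, weighted score) for the two tools that report a match;
# every other tool contributes 0 and 0.000.
matched = {
    "Compara": (1, Decimal("0.930")),
    "Phylome": (1, Decimal("0.910")),
}

simple_total = sum(simple for simple, _ in matched.values())
weighted_total = sum(weighted for _, weighted in matched.values())
print(simple_total, weighted_total)  # 2 1.840
```

`Decimal` is used here only to keep the three-decimal formatting exact.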
