DRSC/TRiP Functional Genomics Resources



Protein Alignment Set8 and Nsd1

DIOPT Version: 10

Sequence 1: NP_650354.1  Gene: Set8 / 41743  FlyBaseID: FBgn0011474  Length: 691  Species: Drosophila melanogaster
Sequence 2: NP_001388465.1  Gene: Nsd1 / 306764  RGDID: 1307748  Length: 2689  Species: Rattus norvegicus


Alignment Length: 951  Identity: 174/951 (18%)
Similarity: 290/951 (30%)  Gaps: 336/951 (35%)
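
The percentages in the summary are each count divided by the alignment length. The sketch below reproduces them, assuming the values are rounded to the nearest whole percent (an assumption, although it also matches the per-domain identities listed under Known Domains, e.g. 39/141 shown as 28%). The function name `percent` is illustrative and not part of DIOPT.

```python
def percent(count: int, length: int) -> int:
    """Return count/length as a whole-number percentage (rounded to nearest)."""
    return round(100 * count / length)


ALIGNMENT_LENGTH = 951  # alignment columns, including gap columns

# Counts copied from the alignment summary above.
stats = {"Identity": 174, "Similarity": 290, "Gaps": 336}

for name, count in stats.items():
    print(f"{name}: {count}/{ALIGNMENT_LENGTH} ({percent(count, ALIGNMENT_LENGTH)}%)")
```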


- Green residues have known domain annotations that are detailed below.


  Fly     5 RRRQRPAKEAASSSSGGASSGS------GIPV--------DQALPLNVAGNLLEDQYFASPK--- 52
            :||.:|.|.|:.......|.|:      |.||        :|..|   ..::|||. .|.|.   
  Rat  1162 QRRPKPGKRASRFREKENSEGAFGVLLPGDPVQKGRDDYLEQRAP---PTSILEDS-AADPNHVS 1222

  Fly    53 ----------------------RKDCRLMKVTQNGQLPEATMMAHNKD-NKAGRTIGVPLATRSQ 94
                                  .|:..:..:|...::||..:.:..|. .|..:.:      ...
  Rat  1223 HSESVGPRLNVCDKSSVSMGDLEKETGIPSLTPQTKIPEPAVRSEKKRLRKPSKWL------LEY 1281

  Fly    95 TRTIENFFKANAAAKDSQKTIHTEEQLNLGNQELKLDDEEL------NGQIKLDDEVLKLADKQ- 152
            |...:..|    |.|..||.:  :||::  ....:.:||.|      :.|.|..||...::.|: 
  Rat  1282 TEEYDQIF----APKKKQKKV--QEQVH--KVSSRCEDESLLARCRSSAQNKQVDENSLISTKEE 1338

  Fly   153 ---INENLPFADEVDAKAEQKLMDEELQQVV-----------------EELL------FDGS-SR 190
               :....||.:....:::..:...||.|:.                 ||||      ::|. .|
  Rat  1339 PPVLEREAPFLEGPLVQSDLGVAHAELPQLTLSVPVAPEVSPRPTLESEELLVKTPGNYEGKRQR 1403

  Fly   191 ASSNSPFYQHDMD---------------------VMQEIQQTPEIPHIKKVTEPLEGLGSLADFQ 234
            ..:......:|:|                     :...:..:...||:|:.     |.|:...|.
  Rat  1404 KPTKKLLESNDLDPGFMPKKGDLGLSRKCCESGHLENGVGDSRATPHLKEF-----GGGTTRIFD 1463

  Fly   235 THRSALRDSHSS----------------THSSSTDNIFLQ------EPVLTLDIDRTPTKASSIK 277
            ..|...|..|.:                |.||:...:.:.      :.:|...|:..|..::|.:
  Rat  1464 KPRKRKRQRHGTARVHYKRVKKEDSARETPSSAEGELMIHRTAASPKEILEEGIEHDPGMSASKR 1528

  Fly   278 INRSFELAGAVFSSPPSVLNACLN-GRFNQIVSLNGQ------KEALDLPHFDLDQHDSSSCDSG 335
            : :.....||....     |.|.| .:..:::....|      .|.|.|......:...:.|.:|
  Rat  1529 L-QGERGGGAALKE-----NVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNECRTG 1587

  Fly   336 V-------------------ACG----------------------------LTANTESPA----- 348
            :                   .||                            :|.:..:||     
  Rat  1588 IHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYPPTVVQNKGFRCPLHICITCHAANPANVSAS 1652

  Fly   349 -GQPRR--RKPATPH----------RILCPSPIKTALKVTGGICKVGSADPLSPRKSPRK----- 395
             |:..|  |.|...|          :||..:.|         ||    .:..:||:..|.     
  Rat  1653 KGRLMRCVRCPVAYHANDFCLAAGSKILASNSI---------IC----PNHFTPRRGCRNHEHVN 1704

  Fly   396 -----LPTTTAAVAACKS------RRRLNQPKPQA------------PYQPQL------------ 425
                 :.:...::..|.|      |..||...|:.            |:..::            
  Rat  1705 VSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEGNWYCNDCKAGKKPHYREIVWVKVGRYRWWP 1769

  Fly   426 ------QKPPSQQQQQQQD--DIVVVLDDDDD-------------EGD-----------DEDDVR 458
                  :..||...:.:.|  :..|:....:|             |||           |....:
  Rat  1770 AEICHPRAVPSNIDKMRHDVGEFPVLFFGSNDYLWTHQARVFPYMEGDVSSKDKMGKGVDGTYKK 1834

  Fly   459 ALIKAAEERENQNKAPATANSNKAGMKTMLKPAPVKSKTKSKGP-------TKGQPPLPLAATNG 516
            ||.:|| .|..:.||.......:...|...||.|.| ..|...|       |.....:|..    
  Rat  1835 ALQEAA-ARFEELKAQKELRQLQEDRKNDKKPPPYK-HIKVNRPIGRVQIFTADLSEIPRC---- 1893

  Fly   517 NREMTDFFPV--------RRSVRKTKTAVKEEWMRGLEQAVLEERCDGLQVRHFMGKGRGVVADR 573
            |.:.||..|.        |..:.:....|.....|...|...:.:...:::...:.:|.|:....
  Rat  1894 NCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIFRTLQRGWGLRTKT 1958

  Fly   574 PFKRNEFVVEYVGDLISIGEAAEREKRYALDEN-AGCYMYYFKHKSQQYCIDATVDTGKLG---R 634
            ..|:.|||.||||:||...|...| .|||.:.: ...||.....       |..:|.|..|   |
  Rat  1959 DIKKGEFVNEYVGELIDEEECRAR-IRYAQEHDITNFYMLTLDK-------DRIIDAGPKGNYAR 2015

  Fly   635 LINHSRAGNLMTKVVLIKQRPHLVLLAKDDIEPGEELTYDY 675
            .:||....|..|:...:.....:.|.|..||:.|.|||::|
  Rat  2016 FMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNY 2056
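
As a sanity check on how to read the middle line of each block, the sketch below counts identical residues in the final block (Fly 635..675 against Rat 2016..2056) and compares that with the number of `|` characters in the match line. It assumes `|` marks an identical pair, which this block is consistent with; the meaning of `:` and `.` is not stated on this page and is not interpreted here. The helper name `count_identities` is illustrative.

```python
# Rows copied from the last alignment block above (no gaps in this block).
fly = "LINHSRAGNLMTKVVLIKQRPHLVLLAKDDIEPGEELTYDY"
rat = "FMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNY"
match = ".:||....|..|:...:.....:.|.|..||:.|.|||::|"


def count_identities(a: str, b: str) -> int:
    """Count aligned columns where both rows carry the same residue (gaps excluded)."""
    return sum(1 for x, y in zip(a, b) if x == y and x != "-")


identical = count_identities(fly, rat)
print(f"columns: {len(fly)}, identical: {identical}, '|' marks: {match.count('|')}")
```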

Known Domains:


Indicated by green residues in alignment.

Gene  Sequence        Domain          Region       External ID  Identity
Set8  NP_650354.1     SET_SETD8       539..678     CDD:380926   39/141 (28%)
Nsd1  NP_001388465.1  PWWP_NSD1_rpt1  318..433     CDD:438989
                      TNG2            <1452..1585  CDD:227367   23/143 (16%)
                      PHD1_NSD1_2     1543..1585   CDD:277118   6/41 (15%)
                      PHD2_NSD1       1590..1636   CDD:277120   2/45 (4%)
                      PHD3_NSD1       1637..1690   CDD:277123   13/65 (20%)
                      PHD4_NSD1       1707..1746   CDD:277126   6/38 (16%)
                      PWWP_NSD1_rpt2  1753..1848   CDD:438992   14/95 (15%)
                      AWS             1899..1937   CDD:465559   6/37 (16%)
                      SET_NSD1        1939..2080   CDD:380987   37/126 (29%)
                      PHD5_NSD1       2118..2160   CDD:277129
                      C5HCH           2159..2208   CDD:465605
                      PHA03247        <2221..2593  CDD:223021
Blue background indicates that the domain is not in the aligned region.
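
Because the background color does not survive in plain text, the same information can be recovered from the coordinates. The sketch below flags Nsd1 domains that do not overlap the aligned rat region, taken here as residues 1162..2056 (the first and last rat positions shown in the alignment). The `<` prefix on some regions is dropped for the comparison, and the names `ALIGNED`, `DOMAINS`, and `overlaps` are illustrative.

```python
ALIGNED = (1162, 2056)  # first and last rat residue shown in the alignment

# (domain, start, end) for Nsd1, copied from the Known Domains table.
DOMAINS = [
    ("PWWP_NSD1_rpt1", 318, 433),
    ("TNG2", 1452, 1585),
    ("PHD1_NSD1_2", 1543, 1585),
    ("PHD2_NSD1", 1590, 1636),
    ("PHD3_NSD1", 1637, 1690),
    ("PHD4_NSD1", 1707, 1746),
    ("PWWP_NSD1_rpt2", 1753, 1848),
    ("AWS", 1899, 1937),
    ("SET_NSD1", 1939, 2080),
    ("PHD5_NSD1", 2118, 2160),
    ("C5HCH", 2159, 2208),
    ("PHA03247", 2221, 2593),
]


def overlaps(start: int, end: int, region: tuple[int, int]) -> bool:
    """True if the inclusive interval [start, end] shares any residue with region."""
    return start <= region[1] and end >= region[0]


outside = [name for name, start, end in DOMAINS if not overlaps(start, end, ALIGNED)]
print("Not in aligned region:", ", ".join(outside))
```

The flagged domains are the same four rows that carry no identity value in the table.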
