DRSC/TRiP Functional Genomics Resources

Protein Alignment NSD and CLF

DIOPT Version: 10

Sequence 1: NP_733239.1  Gene: NSD / 43351  FlyBaseID: FBgn0039559  Length: 1427  Species: Drosophila melanogaster
Sequence 2: NP_179919.1  Gene: CLF / 816870  AraportID: AT2G23380  Length: 902  Species: Arabidopsis thaliana


Alignment Length: 1050  Identity: 197/1050 (18%)
Similarity: 336/1050 (32%)  Gaps: 367/1050 (34%)
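
The percentages in the summary follow from the counts and the alignment length. The sketch below recomputes them. It assumes the page truncates to a whole percent rather than rounding, which is an inference from the figures shown (197/1050 is about 18.8% but is reported as 18%) and not documented DIOPT behavior. The helper name `percent_truncated` is hypothetical.

```python
def percent_truncated(count: int, total: int) -> int:
    """Whole-number percentage, truncated toward zero (assumed convention)."""
    return (100 * count) // total


# Counts copied from the alignment summary above.
alignment_length = 1050
stats = {"Identity": 197, "Similarity": 336, "Gaps": 367}

for name, count in stats.items():
    pct = percent_truncated(count, alignment_length)
    print(f"{name}: {count}/{alignment_length} ({pct}%)")
# Prints:
# Identity: 197/1050 (18%)
# Similarity: 336/1050 (32%)
# Gaps: 367/1050 (34%)
```

Ordinary rounding would give 19% identity and 35% gaps, so truncation is the simpler explanation for the reported values.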


- Green residues have known domain annotations, which are detailed under Known Domains below.


  Fly   484 PKSAKYRQMVPKRTKVVIWRQAIEEAQAMTQIPYSDRLEKFYQTYENVVTLNR---------QKR 539
            |.|:..|...||            ::.|..:.|.|..:.:..::.:..:..:|         :.:
plant     7 PSSSATRSEPPK------------DSPAEERGPASKEVSEVIESLKKKLAADRCISIKKRIDENK 59

  Fly   540 KR----TKYMMQDTSDVGSSLYDSTDNLHNKQGTQLLAVKRERSESPFSPAFSPVKSKNEKRAKR 600
            |.    |:..|:.:.:.|.|..|.:|          |.|||:| :||      .:||..::....
plant    60 KNLFAITQSFMRSSMERGGSCKDGSD----------LLVKRQR-DSP------GMKSGIDESNNN 107

  Fly   601 RKLSNGTEADTGSNSMAVTPSQTETTVDSSAYENPEFRQLLSAVMEYVMMNRSDEKVEKVLLSVV 665
            |.:.:|.     ::|..|..|.....:.....:.|:.:: ||....:|.::|:....|       
plant   108 RYVEDGP-----ASSGMVQGSSVPVKISLRPIKMPDIKR-LSPYTTWVFLDRNQRMTE------- 159

  Fly   666 SNIWSLKQIQLRELERDLASGEIEEPLGSSVVGRGSGVGTIKRLSNRLMTMMVRRSMTPVVTPST 730
                                       ..||||                    ||.:....|...
plant   160 ---------------------------DQSVVG--------------------RRRIYYDQTGGE 177

  Fly   731 TPAPSEPDRRLSEPPKTKKPVNRPIEEVIEDIL-QLDSKYLFRGLSREPICKYCYQAGSDLVRCS 794
            ....|:.:....:..:.|:....|.:.:|...| ||       |||...:.:.            
plant   178 ALICSDSEEEAIDDEEEKRDFLEPEDYIIRMTLEQL-------GLSDSVLAEL------------ 223

  Fly   795 RTCSSWLHADCLERKVTGAPMPKIGSRKALVIPPTSKSPSPDEDH----VTADAKEVVAVGTSLV 855
                    |..|.|..:     :|.:|..:::.....|.|.|...    :..|.:..:....:|.
plant   224 --------ASFLSRSTS-----EIKARHGVLMKEKEVSESGDNQAESSLLNKDMEGALDSFDNLF 275

  Fly   856 CHECNVGE--PEGCV--ICHQVESPAVPSTPRKEDSSSHTPIEDKLLTCSQPMCGK------RF- 909
            |..|.|.:  ..||.  :....|.|| |..|         |: |:.|||. ..|.|      || 
plant   276 CRRCLVFDCRLHGCSQDLIFPAEKPA-PWCP---------PV-DENLTCG-ANCYKTLLKSGRFP 328

  Fly   910 ---------HTSC----CKYWPQASSSKHSARCPR----------HVCHTCVSDDPSGKFQQLGS 951
                     .||.    .|..|...|||.:.|.|:          ..|....||..:|..|...|
plant   329 GYGTIEGKTGTSSDGAGTKTTPTKFSSKLNGRKPKTFPSESASSNEKCALETSDSENGLQQDTNS 393

  Fly   952 SKLAKCVRCPAT---------YHQLSKCIPAGTQMLNTTNIICPRHNIAKADAHVNVLWCYICVK 1007
            .|::...:...:         .:::::.:|..||...           .|.:|           .
plant   394 DKVSSSPKVKGSGRRVGRKRNKNRVAERVPRKTQKRQ-----------KKTEA-----------S 436

  Fly  1008 GGELVCCETCPIAVHAHCRNIPIKTNESYICEECESGRLPLYGEIVWAKFNNFRWWPAIILPPTE 1072
            ..:.:...:|..:...|..|....::.....:...||:....|                  .|.|
plant   437 DSDSIASGSCSPSDAKHKDNEDATSSSQKHVKSGNSGKSRKNG------------------TPAE 483

  Fly  1073 VPSNILK---------------KAHGENDFVVR--FFG-THDHGWISRRRVYLYIEGDTGD---- 1115
            |.:|.:|               .|.|.::.:.:  |.| |...|.::..:::..:|....|    
plant   484 VSNNSVKDDVPVCQSNEVASELDAPGSDESLRKEEFMGETVSRGRLATNKLWRPLEKSLFDKGVE 548

  Fly  1116 ---------------GHKTKSQLFRNYTTGVEEASRF------------------LPIIKARRQE 1147
                           |.|:..::|:..|....:||.|                  :...:.||:.
plant   549 IFGMNSCLIARNLLSGFKSCWEVFQYMTCSENKASFFGGDGLNPDGSSKFDINGNMVNNQVRRRS 613

  Fly  1148 QDMERQSGNKLHPPPYV----------KIKTNKAVPPLRFSQNLEDLSTCNCLPVDEHPCGPEAG 1202
            :.:.|:  .|:....|.          |..|.|...|.|      ..:.|||    :..||.|..
plant   614 RFLRRR--GKVRRLKYTWKSAAYHSIRKRITEKKDQPCR------QFNPCNC----KIACGKECP 666

  Fly  1203 CL-----------------NRM-------------------LFNECNPEYCK-------AGSL-- 1222
            ||                 ||.                   ...||:|:.|:       .|||  
plant   667 CLLNGTCCEKYCGCPKSCKNRFRGCHCAKSQCRSRQCPCFAADRECDPDVCRNCWVIGGDGSLGV 731

  Fly  1223 ---------CENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRME 1278
                     |.|.....::..|:.:...:..|:|...:..::..:::.||.||:|:|.|..:|  
plant   732 PSQRGDNYECRNMKLLLKQQQRVLLGISDVSGWGAFLKNSVSKHEYLGEYTGELISHKEADKR-- 794

  Fly  1279 QKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVN 1343
            .|..||:...:...:...|::||..||:..:|.|||.||||..:...|...|||||||.:.|...
plant   795 GKIYDRENCSFLFNLNDQFVLDAYRKGDKLKFANHSPEPNCYAKVIMVAGDHRVGIFAKERILAG 859

  Fly  1344 SELTFNYLWD 1353
            .||.::|.::
plant   860 EELFYDYRYE 869
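
Each block above is three rows: the fly sequence, a match line, and the plant sequence. The sketch below tallies one such block. It assumes the EMBOSS-style markup, where `|` marks an identical pair, `:` a similar pair, `.` a dissimilar pair, and `-` in either sequence a gap. It also assumes that similarity counts include identities and that all three rows start the alignment at the same column. The function name `count_block` is hypothetical and not part of DIOPT.

```python
import re


def count_block(top: str, match: str, bottom: str) -> dict:
    """Tally one three-row alignment block (label, start position, residues, end)."""
    # The residues begin right after "<label> <start position> " on the top row.
    prefix = re.match(r"\s*\S+\s+\d+\s", top)
    if prefix is None:
        raise ValueError("top row does not look like '<label> <start> <residues>'")
    start = prefix.end()
    seq1 = top[start:].split()[0]          # residues of the first sequence
    width = len(seq1)
    seq2 = bottom[start:start + width]     # same columns in the second sequence
    # Pad in case trailing spaces were stripped from the match line.
    marks = match[start:start + width].ljust(width)
    identical = marks.count("|")
    return {
        "columns": width,
        "identical": identical,
        "similar": identical + marks.count(":"),  # assumed to include identities
        "gaps": sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-"),
    }


# The final block of the alignment above.
top = "  Fly  1344 SELTFNYLWD 1353"
mid = "            .||.::|.::"
bottom = "plant   860 EELFYDYRYE 869"
print(count_block(top, mid, bottom))
# {'columns': 10, 'identical': 3, 'similar': 7, 'gaps': 0}
```

Summing these tallies over all blocks should, under the same assumptions, reproduce the identity, similarity, and gap counts in the summary.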

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence     Domain         Region      External ID  Identity
NSD   NP_733239.1  PWWP_NSD_rpt1  395..514    CDD:438972   6/29 (21%)
NSD   NP_733239.1  PHD2_NSD       867..932    CDD:277040   24/96 (25%)
NSD   NP_733239.1  PHD3_NSD       933..988    CDD:277041   10/63 (16%)
NSD   NP_733239.1  PHD4_NSD       1001..1041  CDD:277042   3/39 (8%)
NSD   NP_733239.1  PWWP_NSD_rpt2  1048..1143  CDD:438963   21/149 (14%)
NSD   NP_733239.1  AWS            1183..1233  CDD:197795   19/103 (18%)
NSD   NP_733239.1  SET_NSD        1233..1375  CDD:380950   39/120 (33%)
CLF   NP_179919.1  preSET_CXC     690..721    CDD:408079   4/30 (13%)
CLF   NP_179919.1  SET_EZH        752..868    CDD:380917   39/117 (33%)
