DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment NSD and SDG4

DIOPT Version :10

Sequence 1:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster
Sequence 2:NP_567859.1 Gene:SDG4 / 829210 AraportID:AT4G30860 Length:497 Species:Arabidopsis thaliana


Alignment Length:693 Identity:155/693 - (22%)
Similarity:239/693 - (34%) Gaps:251/693 - (36%)


- Green bases have known domain annotations that are detailed below.


  Fly   713 LMTMMVRRSMTPVVTPSTTPAPSEPD--RRLSEPPKTKKPVNR------PIEEVIEDILQLDSKY 769
            |..|.:..|:.....||..||.|.|:  :.::.|.......|.      |.||.::||...:...
plant     4 LGNMSMSASVALTCCPSFLPAASGPELAKSINSPENLAGECNGKHLPMIPPEEEVKDIKIANGVT 68

  Fly   770 LFRGLSREPICKYCYQAGSDLVRCSRTCSSWLHADCLERKVTGAPMPKIGSRKALVIPPTSKSPS 834
            .|   :|:       |..||.|                             :|..|:        
plant    69 AF---TRK-------QNPSDRV-----------------------------KKGFVL-------- 86

  Fly   835 PDEDHVTADAKEVVAVGTS-LVCH-ECNVGEPE--GCVICHQVESPAVPSTPRKEDSSSHTPIED 895
              :|||....|..||.|.| ..|. ...||..:  .|::||:   |..|.             ||
plant    87 --DDHVKDWVKRRVASGVSESTCFLPFLVGAKKMVDCLVCHK---PVYPG-------------ED 133

  Fly   896 KLLTCSQPMCGKRFHTSCCKYWPQASSSKHSA-RCPRHVCHTCVSDDPSGKFQQLGSSKLAKCVR 959
              |:||...|...:|:.|.|  .....||.|. :||:|.|..|     ..:.|.       :||:
plant   134 --LSCSVRGCQGAYHSLCAK--ESLGFSKSSKFKCPQHECFVC-----KQRTQW-------RCVK 182

  Fly   960 CPATYHQLSKCIPAGTQMLNTTN----IICPRH--------NIAKADAHVNVLWCYICVKGGELV 1012
            ||...|  .|..|...::|:..:    .:|.||        ..|.|.:.:..::|          
plant   183 CPMAAH--DKHSPWSKEILHLKDQPGRAVCWRHPTDWRLDTKHAVAQSEIEEVFC---------- 235

  Fly  1013 CCETCPIAVHAHCRNIPIKTNESYICEECESGRLPLYGEIVWAKFNNFRWWPAIILPPTEVPSNI 1077
                                            :|||               |.:           
plant   236 --------------------------------QLPL---------------PYV----------- 242

  Fly  1078 LKKAHGENDFVVRFFGTHDHGWISRRRVYLYIEGDTGDGHKTKSQLFRNYTTGVEEASRFLPIIK 1142
                  |.:|.:      |..|                                           
plant   243 ------EEEFKI------DLAW------------------------------------------- 252

  Fly  1143 ARRQEQDMERQSGNKLHPPPYVKIKTNKAVPPLRFSQNLEDLSTCNCLPVDEHPCGPEAGCLNRM 1207
                     :.|..|..||.||.|:.|..:...:.....:.:...||.|..:..|.....|::  
plant   253 ---------KDSVVKEDPPSYVHIRRNIYLVKKKRDNANDGVGCTNCGPNCDRSCVCRVQCIS-- 306

  Fly  1208 LFNECNPEYCKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAE 1272
                |: :.|.....|.||.|  ||..::::|.....|:|:...|.|...||::||:||||:.|:
plant   307 ----CS-KGCSCPESCGNRPF--RKEKKIKIVKTEHCGWGVEAAESINKEDFIVEYIGEVISDAQ 364

  Fly  1273 FQRRMEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAI 1337
            .::|:...:....:::|...::|||.|||..|||.:||:||||.|||..:||.|....|||:||.
plant   365 CEQRLWDMKHKGMKDFYMCEIQKDFTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVFAA 429

  Fly  1338 KDIPVNSELTFNYLWDDLMNNSKKACFCGAKRCSGEIGGKLKD 1380
            :.|.....||::|.:  :....:..|.||::.|.|.:|.|.|:
plant   430 RQIEAGEPLTYDYRF--VQFGPEVKCNCGSENCQGYLGTKRKE 470

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
NSDNP_733239.1 PWWP_NSD_rpt1 395..514 CDD:438972
PHD2_NSD 867..932 CDD:277040 19/65 (29%)
PHD3_NSD 933..988 CDD:277041 12/58 (21%)
PHD4_NSD 1001..1041 CDD:277042 1/39 (3%)
PWWP_NSD_rpt2 1048..1143 CDD:438963 6/94 (6%)
AWS 1183..1233 CDD:197795 12/49 (24%)
SET_NSD 1233..1375 CDD:380950 51/141 (36%)
SDG4NP_567859.1 PHD2_NSD 121..167 CDD:277040 19/65 (29%)
AWS 286..326 CDD:197795 13/48 (27%)
SET_ASHR3-like 327..465 CDD:380952 51/139 (37%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.