DRSC/TRiP Functional Genomics Resources



Protein Alignment NSD and setd1a

DIOPT Version: 10

Sequence 1:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster
Sequence 2:XP_021329647.2 Gene:setd1a / 556535 ZFINID:ZDB-GENE-080521-4 Length:2253 Species:Danio rerio


Alignment Length:1472 Identity:274/1472 - (18%)
Similarity:450/1472 - (30%) Gaps:526/1472 - (35%)
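
The percentages in the summary above can be reproduced from the raw counts. The sketch below is only an illustration: the helper name `truncated_percent` is hypothetical, and truncation (rather than rounding) is an assumption inferred from the fact that 274/1472 is about 18.6% but is reported as 18%. It is not a statement about how DIOPT computes these values.

```python
# Reproduce the summary percentages from the counts reported in the alignment header.
# Assumption: percentages are truncated to an integer (18.6% is shown as 18%).

def truncated_percent(count: int, length: int) -> int:
    """Return count/length as an integer percentage, truncated toward zero."""
    return count * 100 // length

ALIGNMENT_LENGTH = 1472
SUMMARY = {"Identity": 274, "Similarity": 450, "Gaps": 526}

for label, count in SUMMARY.items():
    pct = truncated_percent(count, ALIGNMENT_LENGTH)
    print(f"{label}: {count}/{ALIGNMENT_LENGTH} ({pct}%)")
```

The similarity count (450) is larger than the identity count (274); in reports of this kind, similar positions normally include the identical ones.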


- Green bases have known domain annotations that are detailed below.


  Fly    69 GQNPVQNGAQP--AAEESELESQRQTPVQKQQQQRVSM-------VNRKRDLINLQSALSPKYIG 124
            |.||.:...|.  ||...|:::..|..:.::..:.|:.       ..::|.....|:|:.    |
Zfish  1141 GHNPCEAVVQQVLAALIEEMKNIMQRDLNRKMVENVAFGTFDEWWDRKERKAKPFQTAMR----G 1201

  Fly   125 YANANSPTPLSDSDDTIRTTRRRVNQAAALNNSSAGETLAHDNASPRTP---------GGGGGGG 180
            .|.........:...|..||..|                      ||.|         .||..|.
Zfish  1202 VAVVREEEKKEEKTTTTMTTSNR----------------------PREPLMSLVDWAKSGGMEGF 1244

  Fly   181 GDDSANQLLS-KTYMSPIEKL-----LIKNGASSP----NSTGFEAGSEDLGI-----RPIVRKH 230
            ....|.:|.| |......::|     |.:...|:|    :...::..|.|...     |.:..:.
Zfish  1245 SLRGALRLPSFKVKRKEPQELVEGDELKRARPSTPPDEEDEDSYQGKSADAAAGRTEERRVAERG 1309

  Fly   231 VKRKMKRVPKAKVTLELDEKNQQEVDEKSVKTEPIDEEVDRTDEAP------------------- 276
            ..|:..|. ||:...|||.:.::..|..|.:.|  ||:.|:.||:.                   
Zfish  1310 AARRRSRA-KARKPYELDSEGEETSDGSSSEKE--DEDSDKVDESEDEALSADSDDESVSSSSSE 1371

  Fly   277 --------------------TQEAQTTAISIKSETEAEHKAAVDVHIKQEDTIRLDIVNNPVEST 321
                                .|.|::.::....|:..:..||||....:.|...||       ..
Zfish  1372 SSSSSSASSSSSDEEDEEEGEQAAESESLDTMDESTMDSVAAVDTEKDERDKASLD-------QP 1429

  Fly   322 SIVITEEPKDLEKSTEELAFALPLASSTEVDLKSPPDLSSTALATSIKSPS--------SVSIDS 378
            |:...|:.::.|||      |.|:..|:.....|.|.|....|....|:.|        ..|:.|
Zfish  1430 SVTTPEDKQEEEKS------ATPVPPSSPYPRPSSPILLLPPLKKRRKTVSFSMEESEVKPSVQS 1488

  Fly   379 AKGLSIVTDPGWPTYQVGDLFWGKVFSYCFWPCMVCPDPLGQIVGNMPSHPQRSSLDNANVPIQV 443
            :...|  ..|. |..|..|:                |..:..:....|:.|..||     .|:.:
Zfish  1489 STASS--PSPS-PVSQASDV----------------PPSVSPVSTPTPAAPSASS-----KPVPM 1529

  Fly   444 HVRFFADNGRRNWI----KPENLLTFAGLKAFDDMR-EELRIKHGPKSAKYRQMVPKRTKVVIWR 503
            .:.|.:.....:.:    .|..:||..  .....:| :|.:...||..:.  |..|.:......:
Zfish  1530 LLPFASRPSESSALTSPASPSAILTVP--PPVRSLRPDEPKKSPGPPPSP--QTPPPKNSTKRGK 1590

  Fly   504 QAIEEAQA-----MTQIPYSDRLEKFYQTYENVVTLNRQKRKRTKYMMQDTSDVGSSLYDSTDNL 563
            .:.....|     :..:|. |........||:.|.|...::.|.:...:..|...|:     .:|
Zfish  1591 DSPRTPPAPVCLTVQNLPL-DHASMVKMAYEDPVPLPTTQKGRARGRPRTLSQSASA-----SHL 1649

  Fly   564 HNKQGTQLLAVKRERSESPFSPAFSPVKSKNEKRAKRR---------KLSNGTEADTGSNSMAVT 619
            |.       |::.|..:          :.:.|:|.|.|         :|::..:.|....:....
Zfish  1650 HP-------ALEEEDED----------EEELEQRLKLREQLGVSSLLQLASAPKPDLSVLADVAL 1697

  Fly   620 PSQTETTVDSSAYENPEFRQLLSAVMEYVMMNRSDEKVEKVLLSVVSNIWSLKQIQLRELERDLA 684
            ....|..:||...|.                  |||..|.               :|.|.|.|..
Zfish  1698 KMDPEADIDSEETET------------------SDEAEEH---------------KLEEEEGDFF 1729

  Fly   685 SGEIEEPLGSSVVGRGSGVGTIKRLSNRLMTMMVRRSMTPVVTPSTTPAP---SEPDRRLSEPPK 746
            :....:||                |....:.:|...:.      |.||||   :.|.::..:...
Zfish  1730 APHPRQPL----------------LDPEGLFVMQEHNY------SKTPAPQHITPPQKKTKQDST 1772

  Fly   747 TKKP-------VNRPIEEVIEDIL--QLDSKYLFRGLSREPICKYCYQAGSDLVRCSRTCSSWLH 802
            ...|       |....||||.:.|  :.::..|:..|              ||:...|..:.   
Zfish  1773 VLLPADLNQHGVQEAPEEVIGEALAARAEAPELYGDL--------------DLLCDGREAAE--- 1820

  Fly   803 ADCLERKVTGAPMPKIGS-RKALVIPPTSKSPSPDEDHVTADAKEVVAVGTSLVCHECNV----- 861
               .:.|...:|..:.|| .|.:.:....|..:........:.:|   :.||....|..|     
Zfish  1821 ---TQTKTLSSPYKRTGSISKEVELEERGKGKNKKRSRKDKENEE---LQTSKKQKEKQVKKQKK 1879

  Fly   862 ---GEPEGCVICHQVESPAVPSTPRKEDSSSHTPIEDKLLTCSQPMCGKRFHTSCCKYWPQASSS 923
               .|.|..|...|:||..:.|      |:|.:...|..|..|.....:....|...:..:|..:
Zfish  1880 RKLEEFEEDVDVEQLESGELSS------STSDSGDSDFGLERSLEFEKEEVRKSERLFLQEAGLT 1938

  Fly   924 KHSARCPRHVCHTCVSDDPSGK----FQQL-------------GSSKLAKCVRCPATYHQLSKCI 971
            ..:.|.|:|...:.....||.|    |:|:             ...:|.|     :||.:|.: .
Zfish  1939 PSTQRRPKHTPESLPIPRPSYKNRSEFEQMTILYDIWNSGLDQEDMRLLK-----STYEKLLQ-D 1997

  Fly   972 PAGTQMLNTTNIICPRHNIAKADAHVNVLWCYICVKGGELVCCETCPIAVHAHCRNIPIKTNESY 1036
            ..||..||.|:                                                      
Zfish  1998 DHGTDWLNDTH------------------------------------------------------ 2008

  Fly  1037 ICEECESGRLPLYGEIVWAKFNNFRWWPAIILPPTEVPSNILKKAHGENDFVVRFFGTHDHGWIS 1101
                                     |.|..|   |.:|:...||                     
Zfish  2009 -------------------------WVPHTI---TNIPNPRRKK--------------------- 2024

  Fly  1102 RRRVYLYIEGDTGDGHKTKSQLFRNYTTGVEEASRFLPIIKARRQEQDMERQSGNKLHPPPYVKI 1166
                      .|.||.      .|.:.||...:..:..|   .|:|:|:            |:::
Zfish  2025 ----------KTADGQ------LREHVTGCARSEGYYAI---SRKEKDV------------YLEL 2058

  Fly  1167 KTNKAVPPLRFSQNLEDLSTCNCLPVDEHPCGPEAGCLNRMLFNECNPEYCKAGSLCENRMFEQR 1231
            ..                      ||.....|                :|..|||   ||:..:|
Zfish  2059 DQ----------------------PVTLREIG----------------DYDTAGS---NRVLSER 2082

  Fly  1232 KSP--RL------------EVVYMNERGF---------------GLVNREPIAVGDFVIEYVGEV 1267
            :|.  ||            :::.:|:..|               ||...||||..:.||||||:.
Zfish  2083 RSEQRRLLSAIGTPAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQS 2147

  Fly  1268 INHAEFQRRMEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRV 1332
            |.......|.::..::...:.|...|:.|.||||...||||||:||.|.|||..:..|:....::
Zfish  2148 IRQMVADNREKRYAQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKI 2212

  Fly  1333 GIFAIKDIPVNSELTFNYLWDDLMNNSKKACFCGAKRCSGEI 1374
            .|::.:.|.||.|:|::|.:.  :..:|..|.||.:.|.|.:
Zfish  2213 VIYSKQPIGVNEEITYDYKFP--IEENKIPCLCGTESCRGTL 2252

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence        Domain         Region      External ID  Identity
NSD     NP_733239.1     PWWP_NSD_rpt1  395..514    CDD:438972   18/128 (14%)
                        PHD2_NSD       867..932    CDD:277040   14/64 (22%)
                        PHD3_NSD       933..988    CDD:277041   15/71 (21%)
                        PHD4_NSD       1001..1041  CDD:277042   0/39 (0%)
                        PWWP_NSD_rpt2  1048..1143  CDD:438963   14/94 (15%)
                        AWS            1183..1233  CDD:197795   10/49 (20%)
                        SET_NSD        1233..1375  CDD:380950   52/171 (30%)
setd1a  XP_021329647.2  None
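
As a cross-check on the domain table, the following sketch recomputes each domain's residue span and identity percentage from the listed values. The tuple layout and the names `DOMAINS` and `rounded_percent` are hypothetical. Two assumptions are made: the identity denominator is taken to be the number of alignment columns covering the domain (it differs from the residue span because of gaps), and the percentages are rounded to the nearest integer, which is what the listed values match (14/64 is 21.875% and is shown as 22%).

```python
# Recompute per-domain identity percentages from the domain table.
# Each entry: (domain name, start residue, end residue, identical positions, compared positions)
DOMAINS = [
    ("PWWP_NSD_rpt1", 395, 514, 18, 128),
    ("PHD2_NSD", 867, 932, 14, 64),
    ("PHD3_NSD", 933, 988, 15, 71),
    ("PHD4_NSD", 1001, 1041, 0, 39),
    ("PWWP_NSD_rpt2", 1048, 1143, 14, 94),
    ("AWS", 1183, 1233, 10, 49),
    ("SET_NSD", 1233, 1375, 52, 171),
]

def rounded_percent(count: int, total: int) -> int:
    """Integer percentage rounded half up, using integer arithmetic only."""
    return (count * 100 + total // 2) // total

for name, start, end, identical, compared in DOMAINS:
    span = end - start + 1  # residues covered in the fly sequence (inclusive range)
    pct = rounded_percent(identical, compared)
    print(f"{name}: {span} residues, {identical}/{compared} ({pct}%)")
```

The SET domain has the highest identity of the annotated regions (52/171), while no identical positions are reported for PHD4_NSD, which falls in a stretch where the zebrafish sequence is almost entirely gapped.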
