DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment NSD and EFS

DIOPT Version :10

Sequence 1:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster
Sequence 2:NP_177854.6 Gene:EFS / 844066 AraportID:AT1G77300 Length:1805 Species:Arabidopsis thaliana


Alignment Length:1512 Identity:315/1512 - (20%)
Similarity:528/1512 - (34%) Gaps:485/1512 - (32%)


- Green bases have known domain annotations that are detailed below.


  Fly    13 GDAAHGNVLCNSASDSLTA-TDEVAAGND--ESVATEGDDVEIPRDTNNSTPVRLLDKPGQNPVQ 74
            |||:..|:..||.:.:|.. |:|     |  |.:::.|.:::         .|..|:.|      
plant     9 GDASGCNIDANSLASNLAMNTNE-----DFYEKLSSRGQNLD---------SVSSLEIP------ 53

  Fly    75 NGAQPAAEESE-LESQRQTPVQKQQQQRVSMVNRKRDLINLQSALSPKYIGYANANSPTPLSDSD 138
               |.|:..:. :|.||:...:.:|                        :||.|:||.....::|
plant    54 ---QTASSVNHTIEGQRKCFTEIEQ------------------------MGYGNSNSQEDAGNTD 91

  Fly   139 DTIRTTRRRVNQAAALNNSSAGETLAHDNASPRTPGGGGGGGGDDSANQLLSKTYMSPIEKLLIK 203
            |.:....                     ||.. |...|...|..:.:.:|:..|      .||: 
plant    92 DDLYVCY---------------------NADD-TQEQGVVSGELEQSQELICDT------DLLV- 127

  Fly   204 NGASSPNSTGFEAG--SEDLGIRPIVRKHVKRKMKRVPKAK-------VTLELDEKNQQEVDEKS 259
                  |....:.|  |:|..:..:.......:.|..|:||       .||.:....   :|.:|
plant   128 ------NCNKLDDGKESQDTNVSLVSIFSGSMQEKEAPQAKEDEGYGGTTLPIGGSG---IDTES 183

  Fly   260 --VKTEPID-EEVDRTDEAPTQEAQTTAISIKSE-----------TEAEHKAAVDVHIKQ----- 305
              |...|.. |.::.|......|.::..||.:.:           ::.:..::.|:.:.|     
plant   184 TFVNDAPEQFESLETTKHIKPDEVESDGISYRFDDGGKEGRNGPSSDLDTGSSDDISLSQSFSFP 248

  Fly   306 ------------------EDTIRLDIVNNPVESTSIVITEE---------PKDLEK--STEELAF 341
                              ||.|.::.....|.|.|:.|||.         ..||.|  .||.:..
plant   249 DSLLDSSVFGCSATESYLEDAIDIEGNGTIVVSPSLAITEMLNNDDGGLCSHDLNKITVTETINP 313

  Fly   342 ALPLASSTEVD----------LKS-PPDLSSTALATSIKSPSSVSID-----------------S 378
            .|.|.....:|          ||: ..|.||.:...::...:.::.|                 .
plant   314 DLKLVREDRLDTDLSVMNEKMLKNHVGDSSSESAVAALSMNNGMAADLRAENFSQSSPIDEKTLD 378

  Fly   379 AKGLSIVTDPGWPTYQVGDLFWGKVFSYCFWPCMVCPDPLGQIVGNMPSHPQRSSLDNANVPIQV 443
            .:..|.:||        ..|.|....::......||             :|     :||..|:  
plant   379 MEANSPITD--------SSLIWNFPLNFGSGGIEVC-------------NP-----ENAVEPL-- 415

  Fly   444 HVRFFADNGRRNWIKPENLLTFAGLKAFDDMREELRIKHGPKSAKYRQMVPKRTKVVIWRQAIEE 508
              |...||||   |..| :.:.:|    .|..|.     |..|::.:....|:.|||   |....
plant   416 --RIVDDNGR---IGGE-VASASG----SDFCEA-----GMSSSRRKARDGKQCKVV---QTKTS 462

  Fly   509 AQAMTQIPYSDRLEKFYQTYENVVTLNRQKR----------------KRTKYMMQDTSDVGSSLY 557
            |:.:.:   |.|.::..:..|::...::|||                |.|:..:|.    .:..|
plant   463 ARHLRK---SSRKKQSERDIESIFKCSKQKRSSLLKTSRSSEWGLPSKTTEIFLQS----NNIPY 520

  Fly   558 D---------STDNL----HNKQGTQLLAVKRERSESPFSPAFSPVKSKNEKRAKRRKLSNGTEA 609
            |         |..||    ||:...........|:....|.:...:|.|..|...:..| |.|.:
plant   521 DGPPHHEPQRSQGNLNNGEHNRSSHNGNVEGSNRNIQASSGSCLRLKVKFGKSGGQNPL-NITVS 584

  Fly   610 DTGSNSM---AVTPSQTETTVDSSAYENPEFRQLLSAVMEYVMMNRSDEKVEKVLLSVVSNIWSL 671
            ....||:   .:..:.|...:..||:...:..|.:....:.|..:...||        ||.:.|.
plant   585 KVSGNSLPGNGIVKAGTCLELPGSAHFGEDKMQTVETKEDLVEKSNPVEK--------VSYLQSS 641

  Fly   672 KQIQLRELERDLASGEIEEPLGSSVVGRGSGVGTIKRLSNRLMTMMVRRSMTPVVTPSTTPAPSE 736
            ..::.::..:|  :|.:...:|..|:.....:.:|:         ||........|.| ..|.:.
plant   642 DSMRDKKYNQD--AGGLCRKVGGDVLDDDPHLSSIR---------MVEECERATGTQS-LDAETS 694

  Fly   737 PDRRLSEPPKTKKPVNRPIEEVIEDILQLDSKY-LFRGLSREPICKYCYQAGSDLVRCSRTCSSW 800
            ||..:             |..|.:.|:.::.|. |..|....|         .|:|:.:|.    
plant   695 PDSEV-------------INSVPDSIVNIEHKEGLHHGFFSTP---------EDVVKKNRV---- 733

  Fly   801 LHADCLERKVTGAPMPKIGSRKALVIPPTSKSPSPDEDHVTADAKEVVAVGTSLVCHECNVGEPE 865
                          :.|....:|      |||||.:..|:..:||:.....:.      :.|..:
plant   734 --------------LEKEDELRA------SKSPSENGSHLIPNAKKAKHPKSK------SNGTKK 772

  Fly   866 GCVICHQVESPAVPSTPRKEDSSSHTPIEDKLLTCSQPMCGKRFHTSCCKYWPQASSSKHSARCP 930
            |       :|....|......:.||..:|.:          |..:||..:   ..|......|..
plant   773 G-------KSKFSESAKDGRKNESHEGVEQR----------KSLNTSMGR---DDSDYPEVGRIE 817

  Fly   931 RHVCHTCVSDDPSGKFQQLGSSKLAKCVRCPATYHQLSKCIPAGTQMLNTTNIICPRHNIAKADA 995
            .|.....:.|...||              ..|||..:|..:..|..:::.|          ..|:
plant   818 SHKTTGALLDADIGK--------------TSATYGTISSDVTHGEMVVDVT----------IEDS 858

  Fly   996 HVNVLWCYICVKGGELVCCETCPIAVHAHCRNIPIKTNESYI-CEECESGRLPLYGEIVWAKFNN 1059
            :                                  .|..::: |::|                  
plant   859 Y----------------------------------STESAWVRCDDC------------------ 871

  Fly  1060 FRWWPAIILPPTEVPSNILKKAHGENDFVVRFFGTHDHG--WISRRRVYLYIEGDTGDGHKTKSQ 1122
            |:|        ..:|::::              |:.|..  ||...      ..|......:|||
plant   872 FKW--------RRIPASVV--------------GSIDESSRWICMN------NSDKRFADCSKSQ 908

  Fly  1123 LFRNYTTGVEEASRFLPI-------------IKARRQEQDMERQSGNKLHPPPYVKIKTNKAVPP 1174
            ...|     ||.:..|.|             .:.:.:||..:|.:|.:  ...:..||||:.:..
plant   909 EMSN-----EEINEELGIGQDEADAYDCDAAKRGKEKEQKSKRLTGKQ--KACFKAIKTNQFLHR 966

  Fly  1175 LRFSQNLEDLSTCNCLPVDEH--PCGPEAGCLNRMLFNECNPEYCKAGSLCENRMFEQRKSPRLE 1237
            .|.||.::::..|:|.|..:.  .||.|  ||||||..||....|.||.||.|:.|::||..:.|
plant   967 NRKSQTIDEIMVCHCKPSPDGRLGCGEE--CLNRMLNIECLQGTCPAGDLCSNQQFQKRKYVKFE 1029

  Fly  1238 VVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRMEQKQRDRDENYYFLGVEKDFIIDAG 1302
            .....::|:||...|.:..|.|:|||||||::...::.|.::......:::||:.:..:.:||||
plant  1030 RFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEVIDAG 1094

  Fly  1303 PKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSELTFNYLWDDLMNNSKKACFCGA 1367
            .||||.||:||||||||.|:||.||....||||:::|:....||||:|.:..:...:.|.|:||:
plant  1095 AKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFGAAAKKCYCGS 1159

  Fly  1368 KRCSGEIGG-KLKDDAV 1383
            ..|.|.||| .|..|.:
plant  1160 SHCRGYIGGDPLNGDVI 1176

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
NSDNP_733239.1 PWWP_NSD_rpt1 395..514 CDD:438972 26/118 (22%)
PHD2_NSD 867..932 CDD:277040 10/64 (16%)
PHD3_NSD 933..988 CDD:277041 9/54 (17%)
PHD4_NSD 1001..1041 CDD:277042 2/40 (5%)
PWWP_NSD_rpt2 1048..1143 CDD:438963 16/109 (15%)
AWS 1183..1233 CDD:197795 22/51 (43%)
SET_NSD 1233..1375 CDD:380950 56/141 (40%)
EFSNP_177854.6 zf-CW 865..910 CDD:462181 13/90 (14%)
AWS 975..1025 CDD:197795 22/51 (43%)
SET_SETD2 1025..1167 CDD:380949 56/141 (40%)

Return to query results.
Submit another query.