DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set2 and NSD

DIOPT Version :9

Sequence 1:NP_572888.2 Gene:Set2 / 32301 FlyBaseID:FBgn0030486 Length:2362 Species:Drosophila melanogaster
Sequence 2:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster


Alignment Length:1845 Identity:356/1845 - (19%)
Similarity:597/1845 - (32%) Gaps:647/1845 - (35%)


- Green bases have known domain annotations that are detailed below.


  Fly     9 NPSPVASRGRGRGRPPKVALSALGNTP------PHINPSLKHADAEASPTAPEDQDSGQSECRRS 67
            |...||:    .|...::......:||      |..||....|...|..:..|.|.....:.::.
  Fly    39 NDESVAT----EGDDVEIPRDTNNSTPVRLLDKPGQNPVQNGAQPAAEESELESQRQTPVQKQQQ 99

  Fly    68 SRKKIIKFDVRDLLNKNRKAHKIQIEARI--------DSNPSTGHSQSGTTAASTSMSTATASAA 124
            .|..::          |||...|.:::.:        ::|..|..|.|..|..:|......|:|.
  Fly   100 QRVSMV----------NRKRDLINLQSALSPKYIGYANANSPTPLSDSDDTIRTTRRRVNQAAAL 154

  Fly   125 SASSAATVSRLFSMFEMSHQSLPPPPP--------PPTALEIFAKP--RPTQSLIVAQVTSEPSA 179
            :.|||...        ::|.:..|..|        ..:|.::.:|.  .|.:.|::....|.|::
  Fly   155 NNSSAGET--------LAHDNASPRTPGGGGGGGGDDSANQLLSKTYMSPIEKLLIKNGASSPNS 211

  Fly   180 VG---GAHPVQTMAGLPPVTPRKRGRPRKSQLADAAIIPTVIVPSCSDSDTNSTSTTTSNMSSDS 241
            .|   |:..:    |:.|:..:...|..|                                    
  Fly   212 TGFEAGSEDL----GIRPIVRKHVKRKMK------------------------------------ 236

  Fly   242 GELPGFPIQKPKSKLRVSLKRLKLGGRLESSDSGNSPSSSSPEVEPPALQDE-NAMDERPKQEQN 305
                    :.||:|:.:.|            |..|........|:...:.:| :..||.|.||  
  Fly   237 --------RVPKAKVTLEL------------DEKNQQEVDEKSVKTEPIDEEVDRTDEAPTQE-- 279

  Fly   306 LSRMVDAEENSDSDSQIIFIEIETESPKGEEEQEEGRPVEVEPQDLIDIDMELAKQEPT----PD 366
                          :|...|.|::|:   |.|.:....|.::.:|.|.:|:.....|.|    .:
  Fly   280 --------------AQTTAISIKSET---EAEHKAAVDVHIKQEDTIRLDIVNNPVESTSIVITE 327

  Fly   367 PEEDLDEIMVEVLSGPPSLWSADDEAEEEEDATVQRATPPGKEPAADSCSSAPRRSRRSAP---- 427
            ..:||::...|:....|...|.:.:.:...|.:........|.|::.|..||...|..:.|    
  Fly   328 EPKDLEKSTEELAFALPLASSTEVDLKSPPDLSSTALATSIKSPSSVSIDSAKGLSIVTDPGWPT 392

  Fly   428 -LSGSSRQGKTLEETF--AEIAAESSKQILEAEESQDQ----EEQHILIDLIEDTLSESEVTSSV 485
             ..|....||.....|  ..:..:...||:....|..|    :..::.|.:.....:::...:.:
  Fly   393 YQVGDLFWGKVFSYCFWPCMVCPDPLGQIVGNMPSHPQRSSLDNANVPIQVHVRFFADNGRRNWI 457

  Fly   486 SPTIEHMVVEEVVVEENQLVDEADEILDS-KQEFVIKKVFSESDNIAASLNKDIFEPKVETKATC 549
            .|             ||.|.....:..|. ::|..||                 ..||   .|..
  Fly   458 KP-------------ENLLTFAGLKAFDDMREELRIK-----------------HGPK---SAKY 489

  Fly   550 GEVVP-RPEMVTEDVYITEGIAAT-------LEK-----SAVVT------KPTTEMIAETKLSDE 595
            .::|| |.::|.....|.|..|.|       |||     ..|||      |.|..|:.:|  || 
  Fly   490 RQMVPKRTKVVIWRQAIEEAQAMTQIPYSDRLEKFYQTYENVVTLNRQKRKRTKYMMQDT--SD- 551

  Fly   596 VVIEPPLKDESDPKQTEVELPESKPAVNIPKSERILSAEVETTSSPLVPPECCTLESVSGPVLLE 660
              :...|.|.:|....:             :..::|:.:.|.:.||..|                
  Fly   552 --VGSSLYDSTDNLHNK-------------QGTQLLAVKRERSESPFSP---------------- 585

  Fly   661 TSLSTEEKSNENVETTPLKTEAAKEDSPPAAPEEEASNSSEEPNFLLEDYESNQEQVAEDEMMKC 725
                         ..:|:|::..|.     |...:.||.:|      .|..||.           
  Fly   586 -------------AFSPVKSKNEKR-----AKRRKLSNGTE------ADTGSNS----------- 615

  Fly   726 NNQKGQKQTPLPEMKEPEKPVAETVSKKEKAMENPARSSP-------AIVDKKV--RAGEMEKKV 781
                                :|.|.|:.|..:::.|..:|       |:::..:  |:.|..:||
  Fly   616 --------------------MAVTPSQTETTVDSSAYENPEFRQLLSAVMEYVMMNRSDEKVEKV 660

  Fly   782 VKSTKGTVPEKKMDSKKSCAAVTPAKQKESGKSAKEAILKKETEKEKSSAKLDSSSPNTLDKKGK 846
            :.|....:...|.                        |..:|.|::.:|.:::....:::..:|.
  Fly   661 LLSVVSNIWSLKQ------------------------IQLRELERDLASGEIEEPLGSSVVGRGS 701

  Fly   847 DTA---QWSPQLQTLPKSSTKPPQESAPSVISKTTSNQPAPKEEQHAAKKGLSDNSPPSVLKAKE 908
            ...   :.|.:|.|:....:..|      |:  |.|..|||.|..    :.||:  ||...|...
  Fly   702 GVGTIKRLSNRLMTMMVRRSMTP------VV--TPSTTPAPSEPD----RRLSE--PPKTKKPVN 752

  Fly   909 KAVSGFVECDAMFKAMDLANAQLRLDEKNK-KKLKKVP--------------------TKVEAPP 952
            :.:...:|        |:    |:||.|.. :.|.:.|                    :.:.|..
  Fly   753 RPIEEVIE--------DI----LQLDSKYLFRGLSREPICKYCYQAGSDLVRCSRTCSSWLHADC 805

  Fly   953 KVEPPTAVPVP--GQKKSLSGKTSLRRNTVYEDSPNLERNSSPSSDSAQANT-SAGKLKPSKVKK 1014
            .....|..|:|  |.:|:|.             .|...::.||..|...|:. ....:..|.|..
  Fly   806 LERKVTGAPMPKIGSRKALV-------------IPPTSKSPSPDEDHVTADAKEVVAVGTSLVCH 857

  Fly  1015 KINPRRSTICEAAKDLRSSSSSSTPTREVAASSPVSTSSDSSSKRNGSKRTTSDLDGGSKLDQRR 1079
            :.|......|.....:.|.:..|||.:|           ||||                      
  Fly   858 ECNVGEPEGCVICHQVESPAVPSTPRKE-----------DSSS---------------------- 889

  Fly  1080 YTICEDRQPETAIPVPLTKRRF-----SMHPKASAN------PLH----------DTLLQTAGKK 1123
            :|..||:.  .....|:..:||     ...|:||::      |.|          ....|..|..
  Fly   890 HTPIEDKL--LTCSQPMCGKRFHTSCCKYWPQASSSKHSARCPRHVCHTCVSDDPSGKFQQLGSS 952

  Fly  1124 RGRK----------------EGKESLSRQNSLDSSSSASQGAPKKKAL------KSAEILSAAL- 1165
            :..|                .|.:.|:..|.:....:.::.......|      |..|::.... 
  Fly   953 KLAKCVRCPATYHQLSKCIPAGTQMLNTTNIICPRHNIAKADAHVNVLWCYICVKGGELVCCETC 1017

  Fly  1166 ------------LETESS---ESTSSG----------SKMSR---WDVQTSPELEAANPFGDIAK 1202
                        ::|..|   |...||          :|.:.   |.....|..|..:       
  Fly  1018 PIAVHAHCRNIPIKTNESYICEECESGRLPLYGEIVWAKFNNFRWWPAIILPPTEVPS------- 1075

  Fly  1203 FIEDGVNLLKRDKVDEDQRKEGQDE--VKREADPEEDEFAQRVANMETPATTPTPSPTQS----- 1260
                  |:||        :..|:::  |:.....:....::|...:.....|.....|:|     
  Fly  1076 ------NILK--------KAHGENDFVVRFFGTHDHGWISRRRVYLYIEGDTGDGHKTKSQLFRN 1126

  Fly  1261 ---NPEDSASTTTVLKELETGGGVRRSHRIKQKPQGPRASQGRGV---ASVALAPISMDEQLAEL 1319
               ..|:::....::|       .||..:..::..|.:......|   .:.|:.|:...:.|.:|
  Fly  1127 YTTGVEEASRFLPIIK-------ARRQEQDMERQSGNKLHPPPYVKIKTNKAVPPLRFSQNLEDL 1184

  Fly  1320 ANIEAINEQFLRSEGLNTFQLLKENFYRCARQVSQENAEMQCDCFLTGDEEAQGHLSCG--AGCI 1382
            :.                                       |:| |..||.     .||  |||:
  Fly  1185 ST---------------------------------------CNC-LPVDEH-----PCGPEAGCL 1204

  Fly  1383 NRMLMIECGP-LCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVID 1446
            ||||..||.| .|..|:.|.|:.|:|.:.....|....::|.|:.....|..|:|::|||||||:
  Fly  1205 NRMLFNECNPEYCKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVIN 1269

  Fly  1447 SEEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGF 1511
            ..||:||.....:||:.:|||:.:..:.:|||..|||::|::||||:||.||||||||...|:|.
  Fly  1270 HAEFQRRMEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGI 1334

  Fly  1512 FSVKPIQPGEEITFDYQYLRYGRDAQR-CYCEAANCRGWIGGEPDSDEGEQLDEESDSDAEMDEE 1575
            |::|.|....|:||:|.:.....:::: |:|.|..|.|.|||                       
  Fly  1335 FAIKDIPVNSELTFNYLWDDLMNNSKKACFCGAKRCSGEIGG----------------------- 1376

  Fly  1576 ELEAEPEEGQPRKSAKAKAKSKLKAKLPLATGRKRKEQTKPKDREYKAGRWLKPSATGSSSSAE- 1639
                                                   |.||...||...||......:|:.. 
  Fly  1377 ---------------------------------------KLKDDAVKAHAKLKQMRRAKASAVRI 1402

  Fly  1640 --KPPKKPKVNKFQAMLEDPDVVEE 1662
              ||.|.|||....|..|..|..:|
  Fly  1403 HVKPKKTPKVKHISADDEPMDAKDE 1427

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set2NP_572888.2 AWS 1358..1410 CDD:197795 23/54 (43%)
SET 1414..1533 CDD:214614 52/118 (44%)
PostSET 1535..1551 CDD:214703 5/16 (31%)
WW 2014..2043 CDD:278809
SRI 2270..2348 CDD:285448
NSDNP_733239.1 MSH6_like 391..508 CDD:99898 25/149 (17%)
PHD2_NSD 867..932 CDD:277040 20/99 (20%)
PHD3_NSD 933..988 CDD:277041 6/54 (11%)
PHD4_NSD 1001..1041 CDD:277042 5/39 (13%)
WHSC1_related 1047..1141 CDD:99899 14/114 (12%)
AWS 1183..1233 CDD:197795 24/94 (26%)
SET 1234..1354 CDD:214614 52/119 (44%)
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 1 1.000 109 1.000 Domainoid score I3988
eggNOG 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D507784at2759
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 1 1.100 - - P PTHR22884
Phylome 1 0.910 - -
RoundUp 1 1.030 - avgDist Average_Evolutionary_Distance R4489
SonicParanoid 00.000 Not matched by this tool.
55.050

Return to query results.
Submit another query.