DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment NSD and Setd1a

DIOPT Version :10

Sequence 1:NP_733239.1 Gene:NSD / 43351 FlyBaseID:FBgn0039559 Length:1427 Species:Drosophila melanogaster
Sequence 2:NP_001414023.1 Gene:Setd1a / 309001 RGDID:1311624 Length:1717 Species:Rattus norvegicus


Alignment Length:1583 Identity:283/1583 - (17%)
Similarity:454/1583 - (28%) Gaps:617/1583 - (38%)


- Green bases have known domain annotations that are detailed below.


  Fly     6 DAHSEIEGDAAHGNVL-----CNSASDSLTATDEVAAG---------NDESVATEGDDVEIPRDT 56
            ||..|:...|.||...     .|....:.|.:.|..|.         |..|..:.|:|:||..|.
  Rat   537 DAGGEVPSGAGHGPCTPPPAPANFEDVAPTGSGEPGAARESPKANGQNQASPCSSGEDMEISDDD 601

  Fly    57 NNSTPVRLLDKPGQNPVQNGAQPAAEESELESQRQTPVQKQQQQRVSMVNRKRD----------- 110
            ...:|......|.|.|......|......|.|   .|:.....|...::....|           
  Rat   602 RGGSPPPAPTPPQQPPPPPPPPPPPPPPYLAS---LPLGYPPHQPAYLLPPHPDGPPPPEYPPPP 663

  Fly   111 -----------LINLQSALSPKYIGYANANSPTPLSDSDDTIRTTR-RRVNQAAALNNSSAGETL 163
                       .:.|...|..::.|       .|:|....|...|| .::.|...|..:|||   
  Rat   664 PPPPHIYDFVNSLELMDRLGAQWGG-------MPMSFQMQTQMLTRLHQLRQGKGLTAASAG--- 718

  Fly   164 AHDNASPRTPGGGGGGG---------------------GDDSANQLLSKTYMSPIEKLLIKNGAS 207
                    .|||..|..                     |.:.......:.|..|:..      |:
  Rat   719 --------PPGGAFGEAFLPFPPPQEAAYGLPYALYTQGQEGRGAYSREAYHLPLPM------AA 769

  Fly   208 SPNSTGFEAGSEDLGIRPIVRKHVK-RKMKRVPKA------KVTLELDEKN--QQEVDEKSVKTE 263
            .|..:...:|.|   .|...|:..: .:.|.:|.|      ..||..:.|:  |::::.|.|:..
  Rat   770 EPLPSSSVSGEE---ARLPHREEAEIAESKALPSAGTVGRVLATLVQEMKSIMQRDLNRKMVENV 831

  Fly   264 PIDEEVDRTDEAPTQEAQTTAISIKSETEAEHKAAVDVHIKQEDTIRLDIVNNPVESTSIVITEE 328
            ... ..|:..|:..::|:....:.|.:.:.|.|.    .:|.::...|.:|              
  Rat   832 AFG-AFDQWWESKEEKAKPFQNAAKQQAKEEDKE----KMKLKEPGMLSLV-------------- 877

  Fly   329 PKDLEKS-----TEELAFALPLAS-----STEVDLKSPPDLSSTALATSIKSPSSVSIDSAKGLS 383
              |..||     .|..||...|..     |.:|..|.|.::|.   |:..|.|.           
  Rat   878 --DWAKSGGITGIEAFAFGSGLRGALRLPSFKVKRKEPSEISE---ASEEKRPR----------- 926

  Fly   384 IVTDPGWPTYQVGDLFWGKVFSYCFWPCMVCPDPLGQIVGNMPSHPQRSSLDNANVPIQVHVRFF 448
                |..|..:..|                              .|:|..  .|..|        
  Rat   927 ----PSTPAEEDED------------------------------DPEREK--EAGEP-------- 947

  Fly   449 ADNGRRNWIKPENLLTFAGLKAFDDMREELRIKHGPKSAKYRQMVPKRTKVVIWRQAIEEAQAMT 513
               ||.....|:.          |:.|.:.:.||             |....:..:..|.:|..:
  Rat   948 ---GRPGTKPPKR----------DEERGKTQGKH-------------RKSFALDSEGEEASQESS 986

  Fly   514 QIPYSDRLEKFYQTYENVVTLNRQKRKRTKYMMQDTSDVGS---SLYDSTDNLHNKQGTQLLAVK 575
            .....|..|:..:..|....::..|::......:|.....|   |||..:|.             
  Rat   987 SEKDEDDDEEDEEDEEREEAVDATKKEAEASDGEDEDGDSSSQCSLYADSDG------------- 1038

  Fly   576 RERSESPFSPAFSPVKSKNEKRAKRRKLSNGTEADTGSNSMAVTPSQTETTVDSSAYENPEFRQL 640
                                        .:|:.:|:.|.|.:.:||.:.::..||:.|:....:.
  Rat  1039 ----------------------------EDGSTSDSESGSSSSSPSSSSSSSSSSSSESSSEEEE 1075

  Fly   641 LSAVMEYVM---------MNRSDEKVEKVLLSVVSNIWSLKQIQLRELERDLASGEIEEPLGSSV 696
            .|||:....         :...|||.|...: |.|.:..|.:   :|......:|..|||..:  
  Rat  1076 QSAVIPSASPPPREVPEPLPAPDEKPETDRV-VDSPVMPLPE---KETTSTQPAGPAEEPPPN-- 1134

  Fly   697 VGRGSGVGTIKRLSNRLMTMMVRRSMTPVVTPSTTPAPSEPDRRLSE----------PPKTKK-- 749
                                      .|...|.....|.:|..||.|          |||.::  
  Rat  1135 --------------------------IPQPPPEPPAGPPDPPPRLDERPSSPIPLLPPPKKRRKT 1173

  Fly   750 --------------------------PVNRPIEEVIEDI---LQLDSKYLFR------------- 772
                                      ||:|.:..|:|..   |.||...|.:             
  Rat  1174 VSFSATEEAPVPEPSTASPLQAKSPGPVSRKVPRVVERTIRNLPLDHASLVKSWPEEVARGGRNR 1238

  Fly   773 --GLSREPICKYCYQAGSD----------LVRCSRTCSSWLHADCLERKVTG----APMPKIG-- 819
              |..|....:...::|::          |....|..::....|..|...|.    .|.|.:.  
  Rat  1239 AGGRVRSTEEEEATESGTEVDLAVLADLALTPARRGLAAIPTGDDSEATETSDEAERPSPLLSHI 1303

  Fly   820 ---SRKALVI--PPTSKSPSPDEDH------VTADAKEVVAVGTSLVCHECNVGEPEGCVICHQV 873
               ...||.|  |||:.:|.|.|..      .::.|.||:..             ||  |:..:.
  Rat  1304 LLEHNYALAIKPPPTTPAPRPLEPAPALAALFSSPADEVLEA-------------PE--VVVAEA 1353

  Fly   874 ESPAVP----STPRKEDSSSHTPIEDKLLTCSQPMCGKRFHTSCCKYWPQASSSKHSARCPRHVC 934
            |.|..|    ..|.:|........|:                       ::.||:.|:       
  Rat  1354 EEPKQPLQQQQHPEQEGEDEEEDEEE-----------------------ESESSESSS------- 1388

  Fly   935 HTCVSDDPSGKFQQLGSSKLAKCVRCPATYHQLSKCIPAGTQMLNTTNIICPRHNIAKADAHVNV 999
             :..|.|..|..::.                                                  
  Rat  1389 -SSSSSDEEGAIRRR-------------------------------------------------- 1402

  Fly  1000 LWCYICVKGGELVCCETCPIAVHAHCRNIPIKTNESYICEECESGRLPLYGEIVWAKFNNFRWWP 1064
                                ::.:|.|                                  |..|
  Rat  1403 --------------------SLRSHTR----------------------------------RRRP 1413

  Fly  1065 AIILPPTEVPSNILKKAHGENDFVVRFFGTHDHGWISRRRVYLYIEGDTGDGHKTKSQLFR---- 1125
            .:..||...||   .:...|.:.:...:...:.|.......||.:         |..:|.:    
  Rat  1414 PLPPPPPPPPS---FEPRSEFEQMTILYDIWNSGLDLEDMSYLRL---------TYERLLQQTSG 1466

  Fly  1126 ----NYTTGVEEASRFLPIIKARRQEQD--MERQSGNKLHPPPYVKIKTNKAVPPLRFSQNLEDL 1184
                |.|..|......|...|.:|:.||  .|.|:|:......|          |:...:..:.|
  Rat  1467 ADWLNDTHWVHHTITNLSTPKRKRRPQDGPREHQTGSARSEGYY----------PISKKEKDKYL 1521

  Fly  1185 STCNCLPVD-EHPCGPEAGCLNRMLFNECNPEYCKAGSLCENRMFEQRKSPRL------------ 1236
            ..|   ||. ..|.|.:....||:|               ..|..|||   ||            
  Rat  1522 DVC---PVSARQPEGVDTQGTNRVL---------------SERRSEQR---RLLSAIGTSAIMDS 1565

  Fly  1237 EVVYMNERGF---------------GLVNREPIAVGDFVIEYVGEVINHAEFQRRMEQKQRDRDE 1286
            :::.:|:..|               ||...||||..:.||||||:.|.......|.::..::...
  Rat  1566 DLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIG 1630

  Fly  1287 NYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSELTFNYL 1351
            :.|...|:.|.||||...||||||:||.|.|||..:..|:....::.|::.:.|.|:.|:|::|.
  Rat  1631 SSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1695

  Fly  1352 WDDLMNNSKKACFCGAKRCSGEI 1374
            :.  :.::|..|.||.:.|.|.:
  Rat  1696 FP--LEDNKIPCLCGTESCRGSL 1716

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
NSDNP_733239.1 PWWP_NSD_rpt1 395..514 CDD:438972 15/118 (13%)
PHD2_NSD 867..932 CDD:277040 10/68 (15%)
PHD3_NSD 933..988 CDD:277041 3/54 (6%)
PHD4_NSD 1001..1041 CDD:277042 2/39 (5%)
PWWP_NSD_rpt2 1048..1143 CDD:438963 16/102 (16%)
AWS 1183..1233 CDD:197795 13/50 (26%)
SET_NSD 1233..1375 CDD:380950 50/169 (30%)
Setd1aNP_001414023.1 None

Return to query results.
Submit another query.