DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set1 and nsd3

DIOPT Version :10

Sequence 1:NP_001015221.1 Gene:Set1 / 3354971 FlyBaseID:FBgn0040022 Length:1641 Species:Drosophila melanogaster
Sequence 2:NP_001362267.1 Gene:nsd3 / 100216035 XenbaseID:XB-GENE-5992933 Length:1451 Species:Xenopus tropicalis


Alignment Length:1470 Identity:285/1470 - (19%)
Similarity:455/1470 - (30%) Gaps:498/1470 - (33%)


- Green bases have known domain annotations that are detailed below.


  Fly   449 TAFQPDFHPVQPPPPPPEEI-----------------------DNWDEEEHDKNSIVPTHY---- 486
            :..||||   |.|.|..|::                       .|.....:...:.....|    
 Frog    49 SGLQPDF---QYPTPTTEDLPPLTNGYQPSLGYEPQGKYYSQFPNGSANGYGTRNFTALEYYHLE 110

  Fly   487 --------GCMAKLQPPVPSNVNFATKLQ----SVTQPNSDPGTVDLDTRIALIFKGKTFGNAPP 539
                    | |:|..||.|::.  .|.||    |.|.|....|:.::..||.     ||..|...
 Frog   111 TAALRPVEG-MSKASPPQPTSP--PTLLQAPPLSTTTPQKKTGSPEIKLRIT-----KTIQNGRE 167

  Fly   540 FLQMDSSDSETDQGKPEVFSDVNSDSNNSENKKRSCEKNNKVLHQPNEASDISSDEELIGKKDK- 603
            ..:.........:     |..........|.:|...||..|..|..:........||:..::.: 
 Frog   168 MFESSLCGDLLHE-----FHASEMTKKKHERRKERREKRKKSRHHHHHHHHHHHREEVKAEEQQP 227

  Fly   604 ---SKLSLICEKEVNDDNMSLSSLSSQ---------EDPIQTKEGA-EYKSIMSS---YMYSHSN 652
               .||....|:|....::.:|.||.:         .|.:.:|.|. .:...|.|   .::.||.
 Frog   228 PKMKKLEPPPEEERVQKHLPVSPLSPEVRTGVWFHVGDLVWSKVGTFPWWPCMVSCDPQLHVHSK 292

  Fly   653 QN---PFYYHASGYGHYLSGIPSESASRLFSNGAYVHSEYLKAVASFNFDSFSKPYDYNKGALSD 714
            .|   ...||.    .:.|..|..         |:||.:.:|        .:.....|.:....|
 Frog   293 INTRGAREYHV----QFFSSEPER---------AWVHEKRMK--------EYRGEQQYEQLVAED 336

  Fly   715 ------QNDGI--------RQKVKQVIG----------------------YIVEELKQILKRDVN 743
                  |::.:        |::|:..||                      |:.||.....|:.||
 Frog   337 SAKACNQSEKLKMRKPRPQRERVQWEIGIAHAEKALRMTREERIEQYTFIYVDEEKPPEPKKPVN 401

  Fly   744 ----------------KRMIEI-----------TAFKHFETWWD---EHTSKARSKPLFEKADST 778
                            .|:.|:           :...|...  |   ||..:::.||..|:....
 Frog   402 IGKKSRRSTSSSEPESNRLSEVLPSPPRQPSAPSPISHNNK--DSPAEHRRQSQRKPPHEEEHPP 464

  Fly   779 VNTPLNCIKDTSYNEKNPDINLLINAHREVADFQSYSSIGLRAAMPKLPSFRRIRKHPSPIPTKR 843
            ...|......|:...|:...::|:  |:...|.|                    :.:.||:....
 Frog   465 QPPPAKTPWKTAAARKSLPASVLL--HKGTLDLQ--------------------KCNMSPVVKIE 507

  Fly   844 NFLERDLSDQEEMVQRSDSDKEDSN-VEISDTARSKIKGPVPIQESDSKS-------HTSGLNSK 900
            |.|....:..|...|...:.||..| .||:..|..:...|.|.|:|:..|       .|||..  
 Frog   508 NVLALQNATSEGFEQFIYTTKEVGNKTEITVKAPERAFAPSPNQKSEKSSAQHTPAPETSGAT-- 570

  Fly   901 RKGSASSFFSSSSSSTSSEAEYEAIDCVEKARTSE---EDSPRGYGQRNLNQ-RTTTIRNRNLVG 961
              |.:.......|:.|.||:| :::|.:.|.:..:   ||.|....|..|.: .:.:...|.::.
 Frog   571 --GPSEKKHQRRSNRTRSESE-KSLDSMPKKKVKKEQVEDIPLTAVQTGLQKGASISAEPRKIIN 632

  Fly   962 TMDVINVRNLCSGSNEFKKENVTKRTKKNIYSDTDEDNDRTLFPALKEKNIS------------- 1013
            ......:.:.|....:..:.:.......:.|.|..:.:.|    .|.:..||             
 Frog   633 LNGASEISDACKPLKKRSRASTDVELANSSYRDASDSDSR----GLNDTQISFEKQTDSPSAAAD 693

  Fly  1014 TILSDLEEIS---------KDS--------------------------CIGLD------------ 1031
            ...||::.:.         :||                          |:|:|            
 Frog   694 ADASDVQSVDSSVSHRSSRRDSVCQICESYGESLVSCDGECSRLFHLECLGMDTLPDGKFICMEC 758

  Fly  1032 ENGIEPTILRKIPNTPKLNEECRRSLTPVPPPGYNEEEIKKKVDCKQKPSFEYDRIYSDSE---- 1092
            :.|:......|||:|     ..:|...|.....|:|       .|.:    :|.....||:    
 Frog   759 KTGLHTCFSCKIPST-----SVKRCSAPCCGKFYHE-------TCAR----QYSATIFDSKGFRC 807

  Fly  1093 ----------EEKEYQERRKRNTEYMAQMER---------EFLEEQEKRIEKSLDKNLQSPNNIV 1138
                      ::..|:..:.|    |....|         ..:......:..||         |:
 Frog   808 PHHCCSSCTADKDPYRACKGR----MLSCVRCPVAYHNGDSCIAAGSLHLSSSL---------II 859

  Fly  1139 KNNNSPRNKNDETRKTAISQTRSCFESASKV---DTTLVNIISVENDINEFGPHEEGDVLTNGCN 1200
            .:|:|.|:.:    .::......||..|..:   |.:||:.:|.::..          :||:   
 Frog   860 CSNHSKRSGH----ASSSVNVGFCFVCARGLIVQDYSLVSSMSFKSHF----------LLTD--- 907

  Fly  1201 KMYTNSKGKTKRTQ----------SPVY----SEGGSSQASQASQVALEH--CYSL--PPHSVSL 1247
                     :||.:          ||.|    .:||.......|..|..|  |.:|  |..|.|.
 Frog   908 ---------SKRAELMKIPMIPSSSPAYIKKSEKGGGKLLCCDSCPAAFHPSCLNLEMPQGSWSC 963

  Fly  1248 GDYPSGKVNETKNILKREAENIAIVSQMTRTGPG---RPRKDPICIQKKKRDLAPRMSNVKSKMT 1309
            .|..|||....|.|:..:..|.       |..|.   .||..|:.||..|.|:            
 Frog   964 NDCRSGKKLHYKQIVWVKLGNY-------RWWPAEICNPRSVPLNIQGLKHDI------------ 1009

  Fly  1310 PNGDEWPDLAHKNVHFVPCDMYKTRDQNEEMVILYTFLTKGIDAEDINFIKMSYLDHLHKEPYAM 1374
              ||                                                          :.:
 Frog  1010 --GD----------------------------------------------------------FPV 1014

  Fly  1375 FLNNTH---WVDHCTTDRAFWPPPSKKRRKDDELIRHKTGCARTEGFYKLDVREKAKHKYHYAKA 1436
            |...:|   ||..   .|.|......|...|.:...:||       |.|  ..|:|..::...||
 Frog  1015 FFFGSHDYYWVHQ---GRVFPYVEGDKSFADGQTSINKT-------FKK--ALEEAARRFQELKA 1067

  Fly  1437 NTEDSFN---EDRSDEPTALTNHHHNKLISKMQ----GISREARSNQR---------------RL 1479
            ..|....   |..|.:|....:...||.:.|:|    .:|...|.|.|               |:
 Frog  1068 QRESKEALEIEQNSRKPPPYKHIKSNKPVGKVQIQLADLSEIPRCNCRPTDDTPCGLDTECLNRM 1132

  Fly  1480 L-----TAFGSMGESELLKFNQLKFRKKQLKFAKSAIHD---WGLFAMEPIAADEMVIEYVGQMI 1536
            |     ......||..|   || .|.|:....|:..|.:   |||.....|...|.|.||||::|
 Frog  1133 LLYECHPQICPAGEHCL---NQ-NFTKRLYPEAEIFITERRGWGLRTKRDIRKGEFVNEYVGELI 1193

  Fly  1537 RPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIV 1601
            .......|..:....|..:.||..:..:.||||...||.:||:||||||||..:..|:..:.::.
 Frog  1194 DEEECRQRMKRAHENGTTNFYLLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVG 1258

  Fly  1602 IYSKQPIGINEEITYDYKFP-LEDEKIPCLCGAQGCRGTL 1640
            :::...|....|:|::|... |.:.:..|.|||:.|.|.|
 Frog  1259 LFALCDIPKGAELTFNYNLDCLGNGRRECHCGAENCSGFL 1298

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set1NP_001015221.1 RRM_Set1 99..191 CDD:409745
U2AF_lg 255..>386 CDD:273727
N-SET 1352..1496 CDD:463344 33/173 (19%)
SET_SETD1 1490..1637 CDD:380946 48/150 (32%)
nsd3NP_001362267.1 PWWP_NSD3_rpt1 261..389 CDD:438991 24/148 (16%)
PHD1_NSD3 717..759 CDD:277119 3/41 (7%)
PHD_SF 764..810 CDD:473978 12/61 (20%)
PHD_SF 811..863 CDD:473978 7/64 (11%)
PHD_SF 934..966 CDD:473978 9/31 (29%)
PWWP_NSD3_rpt2 973..1067 CDD:438994 28/184 (15%)
AWS 1118..1156 CDD:465559 8/41 (20%)
SET_NSD3 1158..1299 CDD:380989 46/141 (33%)
PHD5_NSD3 1337..1379 CDD:277131
C5HCH 1378..1427 CDD:465605
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.