DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set1 and ehmt1b

DIOPT Version :10

Sequence 1:NP_001015221.1 Gene:Set1 / 3354971 FlyBaseID:FBgn0040022 Length:1641 Species:Drosophila melanogaster
Sequence 2:XP_021324462.1 Gene:ehmt1b / 402830 ZFINID:ZDB-GENE-080515-3 Length:1295 Species:Danio rerio


Alignment Length:1225 Identity:229/1225 - (18%)
Similarity:389/1225 - (31%) Gaps:391/1225 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly   728 GYIVEELKQILKRDVNKRMIEITAFKHFETWWDEHTSKARSKPLFEKADSTVNTPLNCIKDTSYN 792
            |||:.::|:           :..:..|...|     |......|   ...|.::|.:.::.....
Zfish   112 GYILSKVKE-----------DTVSAPHRTNW-----SPTGGTTL---GHGTKSSPPSVLQSLDQG 157

  Fly   793 EKNPDINLLINAHREVADFQSYSSIGLRAAMPKLPSFRRIRKHPSPIPTKRNFLERDLSDQEEMV 857
            ..|...:...:.....|:.::.:.|..:...|.:...|:....|:...|.: .|.|:  ::|.:|
Zfish   158 SSNSGASQGPSLTERRAETETKNGIISKTPPPTVHRARKTMSRPASNQTLK-LLNRE--NKEPVV 219

  Fly   858 QRSDSDKEDSNVEISDTARSKIKGPVPIQESDSKSHTSGLNSKRKGSASSFFSSSSSSTSSEAEY 922
            .:.||..:...|:   ...|.|:..:|..::|:.|..:...:|.:..    ||..:|:.......
Zfish   220 VKDDSAGKPEAVQ---PQPSLIQNQLPQSQTDATSVPNTTPAKPQIE----FSLYASTVEPRPVS 277

  Fly   923 EAIDCVEKARTSEEDSPRGYGQRNL--NQRTTTIRNRNLVGTMDVIN-----------VRNLCSG 974
            .::..|.:.:      .|..|..:|  .::...::.|.::.....|:           ||     
Zfish   278 PSVAAVSRKK------KRKMGTYSLVPKKKNKVLKQRTVLEMFQQISQSPPNPKPKEIVR----- 331

  Fly   975 SNEFKKENVTKRTKKNIYSDTDEDNDRTLFPALK----EKNISTILSDLEEISKDS--CIGLDEN 1033
            .|..|.:|.:...:.....||:::.:.......|    |..|.:|...|||.|::|  |.| :|.
Zfish   332 VNGEKMDNESDEEESEEGEDTEDEEELVTEDGAKASHEEPRIPSISQPLEEESEESQDCEG-EEE 395

  Fly  1034 GIEPTILRKIPNTPKLNEECRRSLTPVPPPGYNEEEIKKKVDCKQKPSFEYDRIYSDSEEE---- 1094
            |.|..:..:.....|..::.:.....:.|....:.:.|.|.|   |.|.....|.|..|.:    
Zfish   396 GDESDLSSESSLKKKWKKKAKGDHAWLRPSRKRKRKWKAKTD---KVSAPVTEIQSRPESQSAPS 457

  Fly  1095 ------KEYQE--------------RRKRNTEYMAQME----------------------REFLE 1117
                  |||:|              ...:||.....::                      ||.|.
Zfish   458 VPTAHRKEYKEVPLDSLNLAAQEALLTSQNTAASGSLQSTDADMVQELPLCSCRMETPKSREILT 522

  Fly  1118 EQEKR--IEKSLDKNLQSPNNIVKNNNSPRNKND-------ETRKTAISQTRSC----------- 1162
            ..:::  ..:|:|..|....:.|..:...|..|.       |..:|.:.:.:.|           
Zfish   523 LADRKCMATESIDGQLSRCQSAVLKHEMMRPSNSVQLLVLCEDHRTGMVKHQCCPGCGFFCRAGT 587

  Fly  1163 -FESASKVD------TTLVNIISVENDINEFGPH------------------------------- 1189
             .|...:|:      .|..::::.::    |.||                               
Zfish   588 FMECQPEVNISHRFHRTCASVLNGQS----FCPHCGEEVSKAKEVTIAKADTTSTVPITHSSCIP 648

  Fly  1190 --EEG--DVLTNGCNKMYTNSKGKTKRT-QSPVYSEGGSSQ--ASQASQVALEHCYSLP----PH 1243
              .||  |..|.|..::....:||...| ..|..|:..|..  .|:|..||:.....||    |.
Zfish   649 STSEGKADTTTGGSTRLSVLGEGKANSTLPKPSESQDTSLSLVGSKARSVAMAAAPGLPLPPGPP 713

  Fly  1244 SVSLGD--------------------YPSGKVNETKNIL-----------KREAEN------IA- 1270
            ..:|..                    |.|.|..|.:.:|           |.|::|      :| 
Zfish   714 KETLQSVLLALDAEKPKKLRFHPKQLYISAKQGELQKVLLMLVDGIDPNFKMESQNKRTPLHVAA 778

  Fly  1271 ------IVSQMTRTGPGRPRKDPICIQKKKRDLAPRMSN--------------VKSKMTPNGDEW 1315
                  :...:.:.|...    .:|.:.::..|.....|              :.|.....|...
Zfish   779 EAGHQDVCHMLVQAGANL----DMCDEDQRTPLMEACENNHLETVRYLLRAGAIVSHKDVEGSTC 839

  Fly  1316 PDLAHKNVHF-------------VPCD------------MYKTRDQ---------------NEEM 1340
            ..||.||.||             :.|.            .||..||               .||.
Zfish   840 LHLAAKNGHFSIVQHLLSAGLVDINCQDDGGWTAMIWATEYKHVDQVKLLLSKGADINIRDKEEN 904

  Fly  1341 VILY------------TFLTKGIDAEDINFIKMSYLDHLHKEP----YAMFLNNTHWV------- 1382
            :.|:            .|||...|...:|....|.|....:|.    ..:||:....|       
Zfish   905 ICLHWAAFSGCVEIAEIFLTAKCDLNTMNIHGDSPLHIASREGRLDCVNLFLSRGADVNLKNKEG 969

  Fly  1383 ----DHCTTDRAFWP--PPSKKRRKDDELIRHKTGCAR-------TEGFYKLDV-------REKA 1427
                :.|:.....|.  ..:||:|:    ...|.|...       ..|:.|:.|       .|..
Zfish   970 ETPMECCSHSSKVWNALQANKKQRE----ANRKAGATEKLLNKDIARGYEKVPVPCVNAVDSEPC 1030

  Fly  1428 KHKYHYAKANTEDS-FNEDRSDEPTALTNHHHNKLISKMQGISREARSNQ----------RRLLT 1481
            ...|.|...:...| .|.|::     :|  |....:.|....|......|          .|||.
Zfish  1031 PDNYKYVPDSCVTSPLNIDKN-----IT--HLQYCVCKDDCSSASCMCGQLSLRCWYDKESRLLP 1088

  Fly  1482 AFGSMGESELLKF------------------NQLKFRKKQLKFAKSAIHDWGLFAMEPIAADEMV 1528
            .|.:  |...|.|                  |.|:.|   |:..|:.:..||:..::.|.....|
Zfish  1089 EFSN--EEPPLIFECNHACSCWRTCKNRVVQNGLRTR---LQLFKTQMMGWGVKTLQDIPQGTFV 1148

  Fly  1529 IEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMET----IIDATKCGNLARFINHSCNPNCY- 1588
            .||||::|....||:||        ..||||.:|.:.    .:||...||::|||||.|.||.. 
Zfish  1149 CEYVGEIISDAEADVRE--------NDSYLFSLDSKVGDMYCVDARFYGNISRFINHHCEPNLLP 1205

  Fly  1589 AKVITIESEKK---IVIYSKQPIGINEEITYDYKFPLEDEK---IPCLCGAQGCR 1637
            .:|.|...:.:   |..::.:.|...:|:.:||.....|.|   ..|.||:..|:
Zfish  1206 CRVFTSHQDLRFPHIAFFACKNISAGDELGFDYGDHFWDVKGKLFNCKCGSSKCK 1260

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set1NP_001015221.1 RRM_Set1 99..191 CDD:409745
U2AF_lg 255..>386 CDD:273727
N-SET 1352..1496 CDD:463344 36/203 (18%)
SET_SETD1 1490..1637 CDD:380946 49/175 (28%)
ehmt1bXP_021324462.1 EHMT_ZBD 503..632 CDD:411018 18/132 (14%)
ANKYR 720..973 CDD:440430 40/256 (16%)
ANK repeat 739..765 CDD:293786 5/25 (20%)
ANK repeat 767..800 CDD:293786 3/36 (8%)
ANK repeat 802..833 CDD:293786 3/30 (10%)
ANK repeat 835..867 CDD:293786 8/31 (26%)
ANK repeat 869..900 CDD:293786 4/30 (13%)
ANK repeat 902..933 CDD:293786 7/30 (23%)
ANK repeat 935..966 CDD:293786 6/30 (20%)
SET 1033..1262 CDD:394802 65/248 (26%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.