DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Scm and Sfmbt2

DIOPT Version :10

Sequence 1:NP_731385.1 Gene:Scm / 41168 FlyBaseID:FBgn0003334 Length:877 Species:Drosophila melanogaster
Sequence 2:NP_001100834.2 Gene:Sfmbt2 / 307106 RGDID:1305027 Length:890 Species:Rattus norvegicus


Alignment Length:954 Identity:207/954 - (21%)
Similarity:325/954 - (34%) Gaps:319/954 - (33%)


- Green bases have known domain annotations that are detailed below.


  Fly    64 TWCGEGKLPLQYVLPTQTGKKEFCSETCIAEFRKAYSKGACTQCDNVIR--DGAPNK-----EFC 121
            |.||: .|.|:|....:..:.:|..:..||:.   :..|.|||.:.|:|  |....|     ||.
  Rat    97 TTCGQ-LLLLRYCGYGEDRRADFWCDVIIADL---HPVGWCTQNNKVLRPPDAIKEKYADWEEFL 157

  Fly   122 SIMCMNKHQKKNCSTRHSG---GSASGKG------------LAESERKLLASGAPAPTGPFQY-- 169
            .      |:.....|..:.   |...|||            |.:|:            .||||  
  Rat   158 I------HELTGSRTAPASLLEGPLRGKGPIDLITVDSLIELQDSQ------------NPFQYWI 204

  Fly   170 ---------------------ESFH----------------------------------VFDWDA 179
                                 ||:.                                  .|:|..
  Rat   205 VSVIENVGGRLRLRYVGLEHTESYDRWLFYLDYRLRPIGWCQEKKYRMDPPSELYYLKLPFEWKC 269

  Fly   180 YLEETGSEAA----PAKCFKQAQNPPNNDFKIGMKLEAL---DPRNVTSTCIATVVGVLGSR-LR 236
            .||:....||    |.:.||...:..::.|.:||:||.|   ||.::..   |||..|..|: .:
  Rat   270 ALEKALLAAAECPLPMEVFKDHADLGSHFFTVGMRLETLHINDPFHIYP---ATVTKVFNSQFFQ 331

  Fly   237 LRLDGSDSQNDFWRLV---DSTEIHAIGHCEKNGGMLQPPLGFRMNASSWPGYLCKILNNAMVAP 298
            :.:|...::.:...::   ||..|..:..|.|||..|.||.|:.....:|..|          ..
  Rat   332 VAIDDLRAETNGLTMLCHADSLGILPVQWCLKNGVNLAPPKGYSGQDFNWVDY----------HK 386

  Fly   299 EEIFQPEPPEPEENLFKVG----QKLEAVDKKNPQLICCATVDAIKDDQIHVTFDGWRGAF-DYW 358
            :...:..||...:|.|..|    .|||||:.|||..:|.|||.:::...:.:..:|..... :..
  Rat   387 QREAEGAPPYCFKNTFARGFAKNMKLEAVNPKNPGEVCVATVISVRGSLLWLRLEGVETPMPEII 451

  Fly   359 CNYRSRDIFPAGWCARSCHPMQPPGHKSRMDSSSSKQRCPRPRYTVVAESEAMVPA--------- 414
            .:..|.||||.|||..:.:|:..| :||   ||.||::    ..||..|..::.|.         
  Rat   452 VDIDSMDIFPVGWCEANSYPLTTP-YKS---SSKSKKK----PITVPPEKPSLPPVPVENIPQEL 508

  Fly   415 --SPAT---------AHFHP------NCKGGPFINN---SKLPCMVTGPTYQTLAKLCLQEVLAA 459
              .|.|         .::.|      .|..|||:|.   |:||..| ||   ....|.|:|:|..
  Rat   509 CLPPQTDTAAGAANEKYYCPQLFVNHRCFSGPFLNKGRISELPQSV-GP---GKCVLVLKEILTM 569

  Fly   460 ST-----------DTQQLSKLLFALEGDVHIVTAAGKNF--TVKIPSPMRMKDDESLAQFIETLC 511
            .|           :.|.:....:..:.:|......||.:  .|||     ::..:.:..|.:.:|
  Rat   570 ITNAAYKPGRVLRELQLVEDPEWDFQEEVLKAKYRGKIYRAVVKI-----VRTADQVTSFCQQVC 629

  Fly   512 TTCRACANLISLVHETEEC-KKCANSRKRQLTQSATPPSSPVLADKRNRQSNSATTSPSEKIIKQ 575
            .....|.||.|.:..:|.| :.|:...|.:.|         ....||.|            |||.
  Rat   630 AKLECCPNLFSPLLISENCPENCSIRTKTKYT---------YYYGKRKR------------IIKP 673

  Fly   576 ELAVKSPVESKSKTSTNNGKEPASQQNSNHSLNNNNNSASKSSNKVVIKSEPNGANAQTSSTTQA 640
            .|.                             .:|..||.|.:.:                    
  Rat   674 PLG-----------------------------ESNTESAPKPTRR-------------------- 689

  Fly   641 LRKVRFQHHANTNTNSSATNGNQDTSQTTHVSTSHCSSSSTSSSTSLAGGSANTSTIGKYLAPLV 705
             ||.|...:......|||......::|.|.                                   
  Rat   690 -RKRRKSIYVQKKRKSSAVAMPATSAQETE----------------------------------- 718

  Fly   706 AEVHPEQANVKPSNSYYKSPTTLSSSASLPTSVSTPFTGCQSASSTALAAGGVTAAKAATAPAGA 770
             |..||..:.....:.|    .:.:..:....|..|||..|.|......:|.|     ...|...
  Rat   719 -EDDPEALDTASEETGY----VVQNYKTQHVPVEMPFTRSQRAMILRKHSGAV-----KHPPVER 773

  Fly   771 AATAGASPSYTAITSPVSTPTSALANSH-----------LRSQPIDWTIEEVIQYIESNDNSLAV 824
            |....:..|..:..:....|...:..|.           |.|.|::|::.:|:::|:..|  .|.
  Rat   774 ARRVRSVHSTASSNNRNKVPLVRIEKSEDPRQSEEEKLILESNPLEWSVTDVVRFIKLTD--CAP 836

  Fly   825 HGDLFRKHEIDGKALLLLNSEMMMKYMGLKLGPALKICNLVNKV 868
            ...:|::.:|||:|||||....:.:.|.||||||:|:|:.:.:|
  Rat   837 LARIFQEQDIDGQALLLLTLPTVQECMELKLGPAIKLCHQIERV 880

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
ScmNP_731385.1 zf-FCS 58..95 CDD:428958 9/30 (30%)
MBT_dScm_rpt1 196..294 CDD:439097 27/104 (26%)
MBT_dScm_rpt2 312..383 CDD:439100 25/75 (33%)
SLED 418..522 CDD:463469 30/134 (22%)
SAM_Scm 800..870 CDD:188977 26/69 (38%)
Sfmbt2NP_001100834.2 MBT_SFMBT2_rpt1 70..163 CDD:439102 20/75 (27%)
MBT 182..269 CDD:459242 10/98 (10%)
MBT_SFMBT2_rpt3 297..389 CDD:439106 27/104 (26%)
MBT 405..493 CDD:459242 32/95 (34%)
SLED 527..641 CDD:463469 29/122 (24%)
SAM_Scm-like-4MBT1,2 807..890 CDD:188980 27/76 (36%)

Return to query results.
Submit another query.