DRSC/TRiP Functional Genomics Resources


Protein Alignment Spps and Sp1

DIOPT Version: 10

Sequence 1: NP_001262902.1  Gene: Spps / 42882  FlyBaseID: FBgn0039169  Length: 985  Species: Drosophila melanogaster
Sequence 2: NP_036787.2     Gene: Sp1 / 24790   RGDID: 3738             Length: 786  Species: Rattus norvegicus


Alignment Length: 997
Identity:   276/997 (27%)
Similarity: 363/997 (36%)
Gaps:       352/997 (35%)
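The percentages in this summary follow from dividing each count by the alignment length of 997 columns. The sketch below reproduces them from the figures reported above. The helper name `percent` is hypothetical, and the use of truncation rather than rounding is an assumption inferred from the numbers themselves (276/997 is about 27.7%, yet the report shows 27%); the page does not document how it rounds.

```python
def percent(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage, truncated toward zero.

    Truncation is an assumption: it matches the summary figures on this page
    (276/997 is about 27.7% and is displayed as 27%).
    """
    if total <= 0:
        raise ValueError("total must be positive")
    return count * 100 // total


# Counts copied from the alignment summary above.
alignment_length = 997
summary = {"Identity": 276, "Similarity": 363, "Gaps": 352}

for name, count in summary.items():
    print(f"{name}: {count}/{alignment_length} ({percent(count, alignment_length)}%)")
```

The per-domain identities in the Known Domains table appear to be rounded to the nearest whole number instead (for example, 30/137 is about 21.9% and is listed as 22%), so this helper should not be assumed to reproduce those values.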


- Green bases have known domain annotations that are detailed below.


  Fly    23 GGSSKATIPPNGASAAASQTQVHGAPGTPTMQQIINIHQMPPQFAGGAVAGNGQNAGMPQNMFQI 87
            |||.      ||..||.|||:......:.:              :||   |.||.:         
  Rat    25 GGSG------NGGGAAFSQTRSSSTGSSSS--------------SGG---GGGQES--------- 57

  Fly    88 VQPMPMQTVNIDGQEAIFIPNLNAQ----------------LATAQAVNFNGQQAFITPNGQILR 136
             ||.|:..:...... |..||.|:.                .||..:...||.|...:.:|....
  Rat    58 -QPSPLALLAATCSR-IESPNENSNNSQGPSQSGGTGELDLTATQLSQGANGWQIISSSSGATPT 120

  Fly   137 APQMAANPAASNCIQLQQLNGL-GQEQTQLITIPGTNIQIPVTNLIQQQQ------------QAQ 188
            :.:.:.|          ..||. |.|.::..|:.|....:..|..:|.||            |.|
  Rat   121 SKEQSGN----------STNGSNGSESSKNRTVSGGQYVVAATPNLQNQQVLTGLPGVMPNIQYQ 175

  Fly   189 QVHQ-GTVQ-QQAQGTNASGANGSGVTNSGTAGQLPGSITIPGTNLQIPTSVAAANGLLGNISNI 251
            .:.| .||. ||.|    ..|.|:.|...| :||:.   .|||.|.||                |
  Rat   176 VIPQFQTVDGQQLQ----FAATGAQVQQDG-SGQIQ---IIPGANQQI----------------I 216

  Fly   252 SNLLGGGQSIKLENGQLQMRPQLVQFPAPAMPQ-QQQTVAVQ-IPVQTANGQTIYQTVHVPV--- 311
            :|...||..|                  .|||. .||.|.:| :.....:|||.|.| :|||   
  Rat   217 TNRGSGGNII------------------AAMPNLLQQAVPLQGLANNVLSGQTQYVT-NVPVALN 262

  Fly   312 --------------------QAAATSSGGLQ----------NLMQAQSLQMPSASQMQIIPQFSQ 346
                                ||...||.|.|          ..:.:.||....||........:.
  Rat   263 GNITLLPVNSVSAATLTPSSQAGTISSSGSQESGSQPVTSGTAISSASLVSSQASSSSFFTNANS 327

  Fly   347 IAQIVT-------------PNGQIQQVQLAMPYPQLPPNANIIHIQNPHQQQQQQVQQQQQQQQA 398
            .:...|             .:|...|.|.:.....| ..::.::||          |.|......
  Rat   328 YSTTTTTSNMGIMNFTSSGSSGTSSQGQTSQRVGGL-QGSDSLNIQ----------QNQTSGGSL 381

  Fly   399 QQQQQAQQQQAQQQQQQQVQAQHQ-----QLLQAISDASAGGQLPPNQPITITNAQGQQLTVI-- 456
            |..||.:.:|:||.||||:..|.|     |.|||:..|...||....|.|:....|..||..:  
  Rat   382 QGSQQKEGEQSQQTQQQQILIQPQLVQGGQALQALQAAPLSGQTFTTQAISQETLQNLQLQAVQN 446

  Fly   457 --PAQLRPNAPTAPTPAPAGVPTPMQMPNLQALPIQNIPGLGQVQIIHANQLPPNLPANFQQVLT 519
              |..:|     .||..|.|      ..:.|.|.:||:    |||                    
  Rat   447 SGPIIIR-----TPTVGPNG------QVSWQTLQLQNL----QVQ-------------------- 476

  Fly   520 QLPMSHPQVQTQGQVQVMPKQEPQSPTQMITSIKQEPPD--TFGPISATGNPPAPASTPNTAS-- 580
                 :||.||   :.:.|.|.        .|:.|....  |..||::..:.||...|.|.|.  
  Rat   477 -----NPQAQT---ITLAPMQG--------VSLGQTSSSNTTLTPIASAASIPAGTVTVNAAQLS 525

  Fly   581 --PQQQQIKFLHTESNSLSSLSIPASIQITALPQQATNTPNTPATTQPIPVSLPARSKVNAVTTS 643
              |..|.|        :||:|. .:.||:..||.......|||.                   ..
  Rat   526 SMPGLQTI--------NLSALG-TSGIQVHQLPGLPLAIANTPG-------------------DH 562

  Fly   644 STQITIAPTGGQVVSVTTQARGATASIRSTNTSTTTITTPSQSHLNMNISVASVGGAATGGGGGT 708
            ..|:.:...||..:...|                                        .||..|.
  Rat   563 GAQLGLHGPGGDGIHDET----------------------------------------AGGEEGE 587

  Fly   709 ATGEPKP----RLKRVACTCPNCTDGEKHSD----KKRQHICHITGCHKVYGKTSHLRAHLRWHT 765
            .:.:|:|    |.:|.|||||.|.|.|....    ||:||||||.||.|||||||||||||||||
  Rat   588 NSPDPQPQAGRRTRREACTCPYCKDSEGRGSGDPGKKKQHICHIQGCGKVYGKTSHLRAHLRWHT 652

  Fly   766 GERPFVCSWAFCGKRFTRSDELQRHRRTHTGEKRFQCQECNKKFMRSDHLSKHIKTHFKSRSGVE 830
            |||||:|:|::||||||||||||||:|||||||:|.|.||.|:||||||||||||||        
  Rat   653 GERPFMCNWSYCGKRFTRSDELQRHKRTHTGEKKFACPECPKRFMRSDHLSKHIKTH-------- 709

  Fly   831 LIELSIKQETKGGNAPKSISTVNGIVTIEIPGGGSAAAGSGASSVAAT---VAGSTVTPGGAT-- 890
                   |..|||   ..::...|.:.:: .|.||..:|:...|...|   ||...:.|.|..  
  Rat   710 -------QNKKGG---PGVALSVGTLPLD-SGAGSEGSGTATPSALITTNMVAMEAICPEGIARL 763

  Fly   891 ---------IVQLPTVEASGGG 903
                     :.:|.::..||.|
  Rat   764 ANSGINVMQVTELQSINISGNG 785
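The middle line of each Fly/Rat block marks how the two residues in a column compare. The sketch below counts columns in one block by those marks, using the first 15 columns of the Fly 709 / Rat 588 block above as input. It assumes the common convention that `|` marks an identical pair and that `:` and `.` mark similar but non-identical pairs; whether DIOPT counts both `:` and `.` toward its reported Similarity figure is not stated on this page. The names `ColumnCounts` and `count_columns` are hypothetical.

```python
from typing import NamedTuple


class ColumnCounts(NamedTuple):
    length: int     # number of alignment columns
    identical: int  # columns marked '|'
    similar: int    # columns marked ':' or '.' (assumed to mean "similar")
    gaps: int       # columns where either sequence has '-'


def count_columns(seq1: str, match: str, seq2: str) -> ColumnCounts:
    """Count identical, similar, and gapped columns in one alignment block."""
    if len(seq1) != len(seq2):
        raise ValueError("aligned sequences must have the same length")
    # Trailing spaces on the match line are often lost when text is copied,
    # so pad it back out to the block width.
    match = match.ljust(len(seq1))
    if len(match) != len(seq1):
        raise ValueError("match line is longer than the sequences")
    identical = match.count("|")
    similar = match.count(":") + match.count(".")
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")
    return ColumnCounts(len(seq1), identical, similar, gaps)


# First 15 columns of the Fly 709 / Rat 588 block.
fly = "ATGEPKP----RLKR"
mid = ".:.:|:|    |.:|"
rat = "NSPDPQPQAGRRTRR"

counts = count_columns(fly, mid, rat)
print(counts)
```

Summing these counts over every block gives per-alignment totals that can be compared with the Identity, Similarity, and Gaps figures in the summary, keeping in mind that the reported Similarity count (363) includes the identical columns (276).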

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence        Domain                                                            Region      External ID Identity
Spps  NP_001262902.1  SP1-4_N                                                           <281..>371  CDD:425404  30/137 (22%)
                      Atrophin-1                                                        284..>630   CDD:460830  97/408 (24%)
                      SP1-4_N                                                           <713..741   CDD:425404  15/35 (43%)
                      C2H2 Zn finger                                                    742..764    CDD:275368  19/21 (90%)
                      zf-H2C2_2                                                         756..783    CDD:463886  22/26 (85%)
                      C2H2 Zn finger                                                    772..794    CDD:275368  17/21 (81%)
                      zf-H2C2_2                                                         786..809    CDD:463886  17/22 (77%)
                      zf-C2H2                                                           800..822    CDD:395048  17/21 (81%)
                      C2H2 Zn finger                                                    802..822    CDD:275368  16/19 (84%)
Sp1   NP_036787.2     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                 1..94                   22/102 (22%)
                      Repressor domain. /evidence=ECO:0000250                           2..83                   22/91 (24%)
                      SP1_N                                                             55..628     CDD:411775  172/766 (22%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                 110..143                6/42 (14%)
                      Transactivation domain A (Gln-rich). /evidence=ECO:0000250        147..252                38/146 (26%)
                      Transactivation domain B (Gln-rich). /evidence=ECO:0000250        262..496                63/295 (21%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                 281..303                6/21 (29%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                 332..397                15/75 (20%)
                      9aaTAD. /evidence=ECO:0000250|UniProtKB:P08047                    463..471                2/7 (29%)
                      Transactivation domain C (highly charged). /evidence=ECO:0000250  497..611                37/181 (20%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                 562..600                9/77 (12%)
                      VZV IE62-binding. /evidence=ECO:0000250                           620..786                93/185 (50%)
                      zf-C2H2                                                           627..651    CDD:395048  21/23 (91%)
                      C2H2 Zn finger                                                    629..651    CDD:275368  19/21 (90%)
                      zf-H2C2_2                                                         643..670    CDD:463886  22/26 (85%)
                      C2H2 Zn finger                                                    659..681    CDD:275368  17/21 (81%)
                      zf-H2C2_2                                                         673..696    CDD:463886  17/22 (77%)
                      zf-C2H2                                                           687..709    CDD:395048  17/21 (81%)
                      C2H2 Zn finger                                                    689..709    CDD:275368  16/19 (84%)
                      Domain D. /evidence=ECO:0000250                                   709..786                20/96 (21%)
