DRSC/TRiP Functional Genomics Resources


Protein Alignment Clk and Sim1

DIOPT Version: 10

Sequence 1: NP_001014576.1  Gene: Clk / 38872  FlyBaseID: FBgn0023076  Length: 1027  Species: Drosophila melanogaster
Sequence 2: XP_006512689.1  Gene: Sim1 / 20464  MGIID: 98306  Length: 801  Species: Mus musculus


Alignment Length: 926  Identity: 180/926 (19%)
Similarity: 319/926 (34%)  Gaps: 279/926 (30%)
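
The summary figures above are counts of alignment columns divided by the alignment length. The following is a minimal sketch, not DIOPT's own code, of how the identity and gap counts can be recomputed from two aligned rows. The names `alignment_stats` and `percent` are hypothetical, and rounding to the nearest whole percent is an assumption that happens to reproduce the reported values. Similarity is not recomputed here because it depends on the scoring matrix, which this page does not state.

```python
def alignment_stats(row1, row2):
    """Return (length, identical, gapped) column counts for two aligned rows.

    Both rows must already be aligned, i.e. equal in length, with '-'
    marking a gap. This is an illustrative helper, not part of DIOPT.
    """
    if len(row1) != len(row2):
        raise ValueError("aligned rows must have the same length")
    length = len(row1)
    # A column is identical when both residues match and neither is a gap.
    identical = sum(1 for a, b in zip(row1, row2) if a == b and a != "-")
    # A column is gapped when either row has a gap character.
    gapped = sum(1 for a, b in zip(row1, row2) if a == "-" or b == "-")
    return length, identical, gapped


def percent(count, length):
    """Whole-number percentage (assumed rounding: nearest integer)."""
    return round(100 * count / length)


# First block of the Clk / Sim1 alignment (Fly 18..79, Mouse 4..66).
fly = "KSRNLSEKKRRDQFNSLVNDLSALI---STSSRKMDKSTVLKSTIAFLKNHNEATDRSKVFEIQQ"
mouse = "KSKN-AARTRREKENSEFYELAKLLPLPSAITSQLDKASIIRLTTSYLKMRVVFPEVRCKFRMQ-"

length, identical, gapped = alignment_stats(fly, mouse)
print(length, identical, gapped)

# Percentages for the whole alignment, from the counts reported above.
print(percent(180, 926), percent(319, 926), percent(279, 926))
```

Running the helper over every block and summing the counts should, under the same assumptions, give the whole-alignment identity and gap totals.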


- Green residues have known domain annotations, which are detailed below.


  Fly    18 KSRNLSEKKRRDQFNSLVNDLSALI---STSSRKMDKSTVLKSTIAFLKNHNEATDRSKVFEIQQ 79
            ||:| :.:.||::.||...:|:.|:   |..:.::||:::::.|.::||......:....|.:| 
Mouse     4 KSKN-AARTRREKENSEFYELAKLLPLPSAITSQLDKASIIRLTTSYLKMRVVFPEVRCKFRMQ- 66

  Fly    80 DWKPAFL-----------------SNDEYTHL-----------------------MLESLDGFMM 104
              .|.||                 :|.:|..|                       :|::||||:.
Mouse    67 --TPGFLRYSLMGPGISRRKETDSNNSDYLGLGEAWGHTSRTSPLDNVGRELGSHLLQTLDGFIF 129

  Fly   105 VFSSMGSIFYASESITSQLGYLPQDLYNMTIYDLAYEMDHEALL--------------------- 148
            |.:..|.|.|.||:.:..||....:|...:||:..:..||:.:.                     
Mouse   130 VVAPDGKIMYISETASVHLGLSQVELTGNSIYEYIHPADHDEMTAVLTAHQPYHSHFVQEYEIER 194

  Fly   149 NIFMNPTPVIEPRQTDISSSNQITFYTHLRRGGMEKVDANAYELVKFVGYFRNDTNTSTGSSSEV 213
            :.|:....|:..|...::...    |..:...|..|:...:.::..|.|.::|....:.|.|   
Mouse   195 SFFLRMKCVLAKRNAGLTCGG----YKVIHCSGYLKIRQYSLDMSPFDGCYQNVGLVAVGHS--- 252

  Fly   214 SNGSNGQPAVLPRIFQQNPNAEVDKKLVFVGTGRVQNPQLIREMSIIDPTSNEFTSKHSMEWKFL 278
                      ||      |:|..:.||                      .||.|..:.|::.|.:
Mouse   253 ----------LP------PSAVTEIKL----------------------HSNMFMFRASLDMKLI 279

  Fly   279 FLDHRAPPIIGYMPFEVLGTSGYDYYHFDDLDSIVACHEELRQTGEGKSCYYRFLTKGQQWIWLQ 343
            |||.|...:.||.|.:::..:.|.:.|..|...:...|..|...|:..:.|||||.|...|:|:|
Mouse   280 FLDSRVAELTGYEPQDLIEKTLYHHVHGCDTFHLRCAHHLLLVKGQVTTKYYRFLAKQGGWVWVQ 344

  Fly   344 TDYYVSYHQFNSKPDYVVCTHKVVSYAEVLKDSRKEGQKSGNSNSITNNGSSKVIASTGTSSKSA 408
            :...:.::..:|:|      |.:||...||.|:..:|.:.         ...::.||..|.|.::
Mouse   345 SYATIVHNSRSSRP------HCIVSVNYVLTDTEYKGLQL---------SLDQISASKPTFSYTS 394

  Fly   409 SATTTLRDFELSSQNLDSTLLGNSLAS-------LGTETAATSPAVDSSPMWSASAVQPSGSCQI 466
            |:|.|:.|....:::..|:....|..|       ..||.:.:    |....|..|.:..:.|.|:
Mouse   395 SSTPTISDNRKGAKSRLSSSKSKSRTSPYPQYSGFHTERSES----DHDSQWGGSPLTDTASPQL 455

  Fly   467 NPLKTSRPASSYGNISSTGISPKAKRKCY--------------FYNN------------------ 499
              |...||.|.:....:....|.....||              |:..                  
Mouse   456 --LDPERPGSQHELSCAYRQFPDRSSLCYGFALDHSRLVEDRHFHTQACEGGRCEAGRYFLGAPP 518

  Fly   500 RGNDS--------DSTSMSTDSVTSRQSMMTHVSSQSQRQRSHHREHHRENHHNQS---HHHMQQ 553
            .|.|.        ..|..|.:|..:.::.|.|::|   ..|.|.|.|..|:....|   ....:.
Mouse   519 TGRDPWWGSRAALPLTKASPESREAYENSMPHITS---IHRIHGRGHWDEDSVVSSPDPGSASES 580

  Fly   554 QQQHQNQQQQHQQHQQLQQQLQHTVGTPKMVPLLPIASTQIM---AGNACQFPQ-PAYPLAS--- 611
            ..:::.:|.|:..|:            |..:..| |.:||.|   ..|..|..: |...|||   
Mouse   581 GDRYRTEQYQNSPHE------------PSKIETL-IRATQQMIKEEENRLQLRKAPPDQLASING 632

  Fly   612 ----PQLVAPTFLEPPQYLTAIPMQPVIAPFPVAPVLSPLPVQSQTDMLPDTVVMTPTQSQLQDQ 672
                ..|....:.:||               |...|.....:.|.:..               |.
Mouse   633 AGKKHSLCFANYQQPP---------------PTGEVCHSSALASTSPC---------------DH 667

  Fly   673 LQRKHDELQKLILQQQNEL----------------RIVSEQLLLSRYTYLQPMMSMGFAPGNMTA 721
            :|::..   |::...:|:.                ||....|:|:: .||...||.....|:..|
Mouse   668 IQQREG---KMLSPHENDYDNSPTALSRISSPSSDRITKSSLILAK-DYLHSDMSPHQTAGDHPA 728

  Fly   722 AAVGNLGASGQRGLNFTGSNAVQPQFNQYGFALNSEQMLNQQDQQMMMQQQQNLHTQHQHNLQQQ 786
            .:....|:..|             .|:::.:.|....:.:..|.:.:.......:..|     ..
Mouse   729 ISPNCFGSHRQ-------------YFDKHAYTLTGYALEHLYDSETIRNYSLGCNGSH-----FD 775

  Fly   787 HQSHSQLQQHTQQQHQ 802
            ..||.::|....|.|:
Mouse   776 VTSHLRMQPDPAQGHK 791

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence        Domain           Region     External ID  Identity
Clk   NP_001014576.1  bHLH-PAS_dCLOCK  11..86     CDD:381578   18/70 (26%)
                      PAS              93..153    CDD:214512   20/103 (19%)
                      PAS_11           265..368   CDD:464214   29/102 (28%)
Sim1  XP_006512689.1  bHLH-PAS_SIM1    1..107     CDD:381581   23/106 (22%)
                      PAS              124..>194  CDD:238075   17/69 (25%)
                      PAS_3            279..365   CDD:430001   27/91 (30%)
                      SIM_C            395..704   CDD:461963   63/363 (17%)
