DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment sim and Sim1

DIOPT Version :10

Sequence 1:NP_524340.2 Gene:sim / 41612 FlyBaseID:FBgn0004666 Length:688 Species:Drosophila melanogaster
Sequence 2:XP_006512689.1 Gene:Sim1 / 20464 MGIID:98306 Length:801 Species:Mus musculus


Alignment Length:773 Identity:318/773 - (41%)
Similarity:387/773 - (50%) Gaps:211/773 - (27%)


- Green bases have known domain annotations that are detailed below.


  Fly    25 MKEKSKNAARTRREKENTEFCELAKLLPLPAAITSQLDKASVIRLTTSYLKMRQVFPD------- 82
            |||||||||||||||||:||.|||||||||:||||||||||:|||||||||||.|||:       
Mouse     1 MKEKSKNAARTRREKENSEFYELAKLLPLPSAITSQLDKASIIRLTTSYLKMRVVFPEVRCKFRM 65

  Fly    83 -----------------------------GLGEAWG----SSPAMQRGATIKELGSHLLQTLDGF 114
                                         |||||||    :||....|   :|||||||||||||
Mouse    66 QTPGFLRYSLMGPGISRRKETDSNNSDYLGLGEAWGHTSRTSPLDNVG---RELGSHLLQTLDGF 127

  Fly   115 IFVVAPDGKIMYISETASVHLGLSQVELTGNSIFEYIHNYDQDEMNAILSLHPHINQHPLAQTHT 179
            |||||||||||||||||||||||||||||||||:||||..|.|||.|:|:.|...:.|.:.:   
Mouse   128 IFVVAPDGKIMYISETASVHLGLSQVELTGNSIYEYIHPADHDEMTAVLTAHQPYHSHFVQE--- 189

  Fly   180 PIGSPNGVQHPSAYDHDRGSHTIEIEKTFFLRMKCVLAKRNAGLTTSGFKVIHCSGYLKARIYP- 243
                                  .|||::|||||||||||||||||..|:||||||||||.|.|. 
Mouse   190 ----------------------YEIERSFFLRMKCVLAKRNAGLTCGGYKVIHCSGYLKIRQYSL 232

  Fly   244 DRGDGQGSLIQNLGLVAVGHSLPSSAITEIKLHQNMFMFRAKLDMKLIFFDARVSQLTGYEPQDL 308
            |.....| ..||:||||||||||.||:||||||.|||||||.|||||||.|:||::|||||||||
Mouse   233 DMSPFDG-CYQNVGLVAVGHSLPPSAVTEIKLHSNMFMFRASLDMKLIFLDSRVAELTGYEPQDL 296

  Fly   309 IEKTLYQYIHAADIMAMRCSHQILLYKGQVTTKYYRFLTKGGGWVWVQSYATLVHNSRSSREVFI 373
            ||||||.::|..|...:||:|.:||.||||||||||||.|.|||||||||||:||||||||...|
Mouse   297 IEKTLYHHVHGCDTFHLRCAHHLLLVKGQVTTKYYRFLAKQGGWVWVQSYATIVHNSRSSRPHCI 361

  Fly   374 VSVNYVLSEREVKDLVLNEIQTGVVK---------REPISPAAQAAQAAQAAQAAQA-------- 421
            |||||||::.|.|.|.|:..|....|         ...||...:.|::..::..:::        
Mouse   362 VSVNYVLTDTEYKGLQLSLDQISASKPTFSYTSSSTPTISDNRKGAKSRLSSSKSKSRTSPYPQY 426

  Fly   422 ----AQAAQAAHVAQAVQAQVVVVPQQSVVVQPQCAGATGQPVGPGTPVSLA------------- 469
                .:.:::.|.:|...:.:      :....||..    .|..||:...|:             
Mouse   427 SGFHTERSESDHDSQWGGSPL------TDTASPQLL----DPERPGSQHELSCAYRQFPDRSSLC 481

  Fly   470 --------------------------------LSASPK-LDPYF--EPELPLQPAVTPVPPTNNS 499
                                            |.|.|. .||::  ...|||..|        :.
Mouse   482 YGFALDHSRLVEDRHFHTQACEGGRCEAGRYFLGAPPTGRDPWWGSRAALPLTKA--------SP 538

  Fly   500 SSSSNNNNGVWHHHHVQQQQQSGSMDHDSLSYTQLYPPLNDLVVSSSSSVGGGTASSAGGGSSAS 564
            .|.....|.:.|...:.:....|..|.||             ||||...   |:||.:|..    
Mouse   539 ESREAYENSMPHITSIHRIHGRGHWDEDS-------------VVSSPDP---GSASESGDR---- 583

  Fly   565 ASSSGVYSTEMQYPDTTTGNLYYNNNNHYYYDYDATVDVATSMIRPFSANSNSCSSSSESERQLS 629
                  |.||.           |.|:.|.....:..:.....||:   ...|..........||:
Mouse   584 ------YRTEQ-----------YQNSPHEPSKIETLIRATQQMIK---EEENRLQLRKAPPDQLA 628

  Fly   630 TGNAS-------IVNETSPSQTTYSDLSHNFELSYFSDNSSQQHQHQQQQQHLMEQQH 680
            :.|.:       ..|...|..|  .::.|:..|:     |:....|.||::..|...|
Mouse   629 SINGAGKKHSLCFANYQQPPPT--GEVCHSSALA-----STSPCDHIQQREGKMLSPH 679

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
simNP_524340.2 bHLH-PAS_dSIM_like 25..86 CDD:381583 54/96 (56%)
PAS 102..215 CDD:395786 67/112 (60%)
PAS_3 291..377 CDD:430001 62/85 (73%)
Sim1XP_006512689.1 bHLH-PAS_SIM1 1..107 CDD:381581 59/105 (56%)
PAS 124..>194 CDD:238075 51/94 (54%)
PAS_3 279..365 CDD:430001 62/85 (73%)
SIM_C 395..704 CDD:461963 58/350 (17%)

Return to query results.
Submit another query.