DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Sos and Sos2

DIOPT Version :9

Sequence 1:NP_476597.2 Gene:Sos / 34790 FlyBaseID:FBgn0001965 Length:1596 Species:Drosophila melanogaster
Sequence 2:XP_006515698.1 Gene:Sos2 / 20663 MGIID:98355 Length:1333 Species:Mus musculus


Alignment Length:1395 Identity:608/1395 - (43%)
Similarity:836/1395 - (59%) Gaps:150/1395 - (10%)


- Green bases have known domain annotations that are detailed below.


  Fly    58 YDFTKCENAARWRGLFTPSLKKVLEQVHPRVTAKEDALLYVEKLCLRLLAMLCAKPLPHSVQDVE 122
            |:|...||:.:||||..|:|:||.|||||.::|.|::|.|:|:|..:||..||... |.:|||||
Mouse     8 YEFFSEENSPKWRGLLVPALRKVQEQVHPTLSANEESLYYIEELIFQLLNKLCMAQ-PRTVQDVE 71

  Fly   123 EKVNKSFPAPIDQWALNEAKEVINSKKRKS--VLPTEKVHTLLQKDVLQYKIDSSVSAFLVAVLE 185
            |:|.|:||.|||:||:.:|:..|..:||::  :||.:|:|..| |:||.||:|..||.::|||||
Mouse    72 ERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSL-KEVLGYKVDYHVSLYIVAVLE 135

  Fly   186 YISADILKMAGDYVIKIAHCEITKEDIEVVMNADRVLMDMLNQSEAHILPSPLSL----PAQRAS 246
            ||||||||:||:||..|.|.||:::||:|.|.||:|||||.:|.:...|   :||    |.....
Mouse   136 YISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDDDIGL---VSLCEDEPCSSGE 197

  Fly   247 ATYEETVKELIHDEKQYQRDLHMIIRVFREE--LVKIVSDPRELEPIFSNIMDIYEVTVTLLGSL 309
            ..|.:.|:..|.:|:||.|:|:|||:||||.  |.:.:..|.|:|.|||||.||:|:||.|||.:
Mouse   198 LNYYDLVRTEIAEERQYLRELNMIIKVFREAFLLDRKLFKPSEIEKIFSNISDIHELTVKLLGLI 262

  Fly   310 EDVIEMSQEQSA-PCVGSCFEELAEAEEFDVYKKYAYDVTSQASRDALNNLLSKPG-ASSLTTAG 372
            ||.:||:.|.|. |..|||||:|||.:.||.|:..:.|:.:....|..:.|:::|. |....:..
Mouse   263 EDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDILAPEFNDHFSKLMARPAVALHFQSIA 327

  Fly   373 HGFRDAVKYYLPKLLLVPICHAFVYFDYIKHLKDLSSSQDDIESFEQVQGLLHPLHCDLEKVMAS 437
            .||::||:|.||:|:|||:.|.:.||:.:|.||..|..|:|.|...|....|..|...::::...
Mouse   328 DGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKACSEEQEDKECLNQAITALMNLQGSMDRIYKQ 392

  Fly   438 LSKERQV--PVSGRVRRQ-----LAIERTRELQMKVEHWEDKDVGQNCNEFIREDSLSKLGSGKR 495
            .|..|:.  ||.....||     |||::..|:|..::.||.||:||.|||||.|..|:::|:.  
Mouse   393 HSPRRRPGDPVCLFYNRQLRSKHLAIKKMNEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAK-- 455

  Fly   496 IWSERKVFLFDGLMVLCKANTKKQTPSAGATAYDYRLKEKYFMRRVDINDRPDSDDLKNSFELAP 560
              .||.:|||||||:.||.| ..||...|.::.:||||||:.||::.|.|:.|:.:.:::|||..
Mouse   456 --HERHIFLFDGLMISCKPN-HGQTRLPGYSSAEYRLKEKFVMRKIQICDKEDACEYRHAFELVS 517

  Fly   561 RMQPPIVLTAKNAQHKHDWMADLLMVITKSMLDRHLDSILQDIERKHPLRMPSPEIYKFAVPDSG 625
            :.:..::..||:|:.|::|||.|:.:..:|.|||.|||:|...|.:.|||:|||::|:|.|.||.
Mouse   518 KDENSVIFAAKSAEEKNNWMAALISLHYRSTLDRMLDSVLLKEENEQPLRLPSPDMYRFVVTDSE 582

  Fly   626 DNIVLEE--RESAGVPMIKGATLCKLIERLTYHIYADPTFVRTFLTTYRYFCSPQQLLQLLVERF 688
            :|||.|:  :..:|:|:|||.|:.|||||||||:||||.||||||||||.||.||:||.||:|||
Mouse   583 ENIVFEDNLQSRSGIPIIKGGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLNLLIERF 647

  Fly   689 NIPDPSLVYQDTGTAGAGGMGGVGGDKEHKNSHREDWKRYRKEYVQPVQFRVLNVLRHWVDHHFY 753
            .||:|.....|......|           :.....|.||:|||||||||.|||||.||||:||:|
Mouse   648 EIPEPEPTEADKLALEKG-----------EQPISADLKRFRKEYVQPVQLRVLNVFRHWVEHHYY 701

  Fly   754 DFEKDPMLLEKLLNFLEHVNGKSMRKWVDSVLKIVQRKNEQEKSNKKIVYAYGHDPPPIEHHLSV 818
            |||:|..|||:|.:|:..|.||:|:|||:|:.||::||.:.:.:.......:...|||:|.|:|.
Mouse   702 DFERDLELLERLESFISSVRGKAMKKWVESIAKIIKRKKQAQANGISHNITFESSPPPVEWHISR 766

  Fly   819 PN--DEITLLTLHPLELARQLTLLEFEMYKNVKPSELVGSPWTKKDKEVKSPNLLKIMKHTTNVT 881
            ..  :...|:||||:|:||||||||.::|:.|:|||||||.|||:|||:.||||||:::||||:|
Mouse   767 TGQFETFDLMTLHPIEIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLT 831

  Fly   882 RWIEKSITEAENYEERLAIMQRAIEVMMVMLELNNFNGILSIVAAMGTASVYRLRWTFQGLPERY 946
            .|.||.|.||||:|||:|::.|.:|::.|..:||||||:|.||:|:.:.|||||..||:.|.||.
Mouse   832 LWFEKCIVEAENFEERVAVLSRIVEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERK 896

  Fly   947 RKFLEECRELSDDHLKKYQERLRSINPPCVPFFGRYLTNILHLEEGNPDLL--ANTELINFSKRR 1009
            |:.|::..|||.||.|||..:|:|||||||||||.||||||..||||.|.|  ...:||||||||
Mouse   897 RRILDDAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNSDFLKRKGKDLINFSKRR 961

  Fly  1010 KVAEIIGEIQQYQNQPYCLNEESTIRQFFEQLDPFNGLSDKQMSDYLYNESLRIEPRGCKTVPKF 1074
            |||||.|||||||||||||..|..:|:|||.|:|...||:|:.:|||:|:||.||||.||..|:|
Mouse   962 KVAEITGEIQQYQNQPYCLRTEPEMRRFFENLNPMGILSEKEFTDYLFNKSLEIEPRNCKQPPRF 1026

  Fly  1075 PRKWPHIPLKSPGIKPRRQNQTNSSSKLSN------------STSSVAAAAAAS--STATSIATA 1125
            ||| ....||||||:|......::|..|..            |.|.:|.....|  |..||..|.
Mouse  1027 PRK-STFSLKSPGIRPNAGRHGSTSGTLRGHPTPLEREPYKISFSRIAETELESTVSAPTSPNTP 1090

  Fly  1126 SAPSLHASS----IMDAPTAAAANAGSGTLAGEQSPQHNPHAFSVFAPVIIPERNTSSWSGTPQH 1186
            |.|.:.|||    .:|..  ..::.||.|               :||||::|...|...|....|
Mouse  1091 STPPVSASSDHSVFLDVD--LNSSCGSNT---------------IFAPVLLPHSKTFFSSCGSLH 1138

  Fly  1187 TRTDQNNGEVSVPAPHLPKKPGAHVWANNNSTLASASAMDVVFSPALPEHLPPQSLPDSNP---- 1247
            ..::    |..:|.|..|:|...|...|:...:.|...     .||:|...||.  |...|    
Mouse  1139 KLSE----EPLIPPPLPPRKKFDHDALNSKGAVKSDDD-----PPAIPPRQPPP--PKVKPRVPV 1192

  Fly  1248 ----FASDTEAPPSPLPKLVVSPRHETGN----RSPFH-----GRMQNSPT-HSTASTVTLTGMS 1298
                |.....:||.|.|:   .|..:|..    |.|.|     ..:|..|. |.......|..:|
Mouse  1193 LMGTFDGPVPSPPPPPPR---DPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHPHRDPDWLRDVS 1254

  Fly  1299 TSGGEEFCAGGFYFNSAHQGQPGAVPISPHVNVPMATNM------EYRAVP-PPLPPRRKERTES 1356
            |      |.     ||     |...|.:|...:|.:.::      ....:| ||:|||       
Mouse  1255 T------CP-----NS-----PSTPPTTPSPRIPRSCHLLSSSHSSLAHLPAPPVPPR------- 1296

  Fly  1357 CADMAQKRQAPDAPTLPPR--DGELSPPPI 1384
                  :..:|..|.|||:  ..|||.||:
Mouse  1297 ------QNSSPLLPKLPPKTYKRELSHPPL 1320

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
SosNP_476597.2 Histone 90..216 CDD:278551 67/127 (53%)
H2A 155..>196 CDD:305064 25/40 (63%)
RhoGEF 249..432 CDD:238091 79/186 (42%)
PH_SOS 476..586 CDD:269963 46/109 (42%)
PH 482..587 CDD:278594 41/104 (39%)
RasGEFN 637..791 CDD:214571 88/153 (58%)
RasGEF 825..1062 CDD:238087 150/238 (63%)
Sos2XP_006515698.1 Histone 70..169 CDD:333859 54/99 (55%)
RhoGEF 200..387 CDD:238091 79/186 (42%)
PH_SOS 438..544 CDD:269963 46/110 (42%)
RasGEFN 596..740 CDD:214571 89/154 (58%)
RasGEF 775..1019 CDD:214539 154/243 (63%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 1 1.000 239 1.000 Domainoid score I2269
eggNOG 1 0.900 - - E1_KOG3417
Hieranoid 1 1.000 - -
Homologene 00.000 Not matched by this tool.
Inparanoid 1 1.050 1027 1.000 Inparanoid score I277
Isobase 00.000 Not matched by this tool.
OMA 1 1.010 - - QHG45786
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 1 1.000 - - FOG0002651
OrthoInspector 1 1.000 - - otm44267
orthoMCL 1 0.900 - - OOG6_101209
Panther 1 1.100 - - LDO PTHR23113
Phylome 1 0.910 - -
RoundUp 1 1.030 - avgDist Average_Evolutionary_Distance R2063
SonicParanoid 1 1.000 - - X2024
SwiftOrtho 00.000 Not matched by this tool.
TreeFam 1 0.960 - -
1312.860

Return to query results.
Submit another query.