DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG5592 and Slc22a1

DIOPT Version :10

Sequence 1:NP_647996.1 Gene:CG5592 / 38662 FlyBaseID:FBgn0035645 Length:540 Species:Drosophila melanogaster
Sequence 2:NP_033228.2 Gene:Slc22a1 / 20517 MGIID:108111 Length:556 Species:Mus musculus


Alignment Length:560 Identity:163/560 - (29%)
Similarity:268/560 - (47%) Gaps:52/560 - (9%)


- Green bases have known domain annotations that are detailed below.


  Fly    10 ENFLLFIGDFGPFQKR--LLFWMMPAAFLFAFTYFGQIFMILLPKNHWCR---VRELE---GLPE 66
            ::.|..:|:||.|||:  ||..::.|:  .|..|.|.:|:...|.:| ||   |.||.   |...
Mouse     5 DDVLEHVGEFGWFQKQAFLLLCLISAS--LAPIYVGIVFLGFTPDHH-CRSPGVAELSQRCGWSP 66

  Fly    67 SEQKNRGIP---KKQDGSF-ENCFMYNVPYDSNATN--------DANQTRSPI-PCNNGWIYDRK 118
            :|:.|..:|   ...:.|| ..|..|.|.::.:..:        .||::..|: ||.:||:||  
Mouse    67 AEELNYTVPGLGSAGEASFLSQCMKYEVDWNQSTLDCVDPLSSLAANRSHLPLSPCEHGWVYD-- 129

  Fly   119 EVPYESIATEYNWVCDKRDFGTYSVVVYFVGCI-----VGCLCFGFITDHSGRLPALFLANSCSM 178
             .|..||.||:|.||     |....|..|..|:     :|.|..|:|.|..||...|.:....:.
Mouse   130 -TPGSSIVTEFNLVC-----GDAWKVDLFQSCVNLGFFLGSLVVGYIADRFGRKLCLLVTTLVTS 188

  Fly   179 IGGCVSVVCKDFPCFAASRFVAGLSMNYCFVPIYILTLENVGIKYRTLVGNLALTFFFTLGACLL 243
            :.|.::.|..|:......|.:.|:.....:|..|.|..|.||..||.... :.....||:|...|
Mouse   189 LSGVLTAVAPDYTSMLLFRLLQGMVSKGSWVSGYTLITEFVGSGYRRTTA-ILYQVAFTVGLVGL 252

  Fly   244 PWLAYVISNWRHYAMVVALPIVFMILTSLLAPESPSWLMSVGKVDRCIEVMKEAAKANGKIISEE 308
            ..:||.|.:||...:.|:||....:|.....||||.||:|..:..:.:.:|::.|:.|.|:...:
Mouse   253 AGVAYAIPDWRWLQLAVSLPTFLFLLYYWFVPESPRWLLSQKRTTQAVRIMEQIAQKNRKVPPAD 317

  Fly   309 VWSEMRECYELKFANEQLGKQYTSLDLFKTFPRLVVLT-ILIVTWMTVALAYDAHVRVVEILDTD 372
            :   ...|.| :.|:|:....:.  |||:| |.|...| ||:..|.:.|:.|...:..|.....:
Mouse   318 L---KMMCLE-EDASERRSPSFA--DLFRT-PSLRKHTLILMYLWFSCAVLYQGLIMHVGATGAN 375

  Fly   373 IFITFSLSSLVEIPAGIVPMLLLDRIGRKPMMSAVMLLCAASSLFVGIL--KGHWNASTAAIAAR 435
            :::.|..|||||.||..:.::.:|||||...::|..|:..|:.|.:..:  :.||...|.|...|
Mouse   376 LYLDFFYSSLVEFPAAFIILVTIDRIGRIYPIAASNLVAGAACLLMIFIPHELHWLNVTLACLGR 440

  Fly   436 FFATMAYNVGQQWASEILPTVLRGQGLAIINIMGQMGALLSP-LVLSTHRYYRPLPMFIITLVSV 499
            ..||:...:.....:|:.||.:|..|:.:.:.:..:|.:.:| :|......::.||:.:..::.:
Mouse   441 MGATIVLQMVCLVNAELYPTFIRNLGMMVCSALCDLGGIFTPFMVFRLMEVWQALPLILFGVLGL 505

  Fly   500 IGALIILFLPETKGATMPQTLDEAEKRWTLRCRDRKINED 539
            ....:.|.||||||..:|:|::|||   .|..|..|..|:
Mouse   506 SAGAVTLLLPETKGVALPETIEEAE---NLGRRKSKAKEN 542

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG5592NP_647996.1 MFS_SLC22 113..509 CDD:340875 112/404 (28%)
Slc22a1NP_033228.2 2A0119 12..525 CDD:273328 154/531 (29%)
Proline-rich sequence. /evidence=ECO:0000250|UniProtKB:O15245 284..288 3/3 (100%)

Return to query results.
Submit another query.