DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG7333 and Slc22a2

DIOPT Version :10

Sequence 1:NP_650814.2 Gene:CG7333 / 42334 FlyBaseID:FBgn0038715 Length:541 Species:Drosophila melanogaster
Sequence 2:NP_038695.1 Gene:Slc22a2 / 20518 MGIID:1335072 Length:553 Species:Mus musculus


Alignment Length:581 Identity:152/581 - (26%)
Similarity:239/581 - (41%) Gaps:94/581 - (16%)


- Green bases have known domain annotations that are detailed below.


  Fly     6 VLDKCGNYGRFQ-----VMILLLYGYTNILGSLHYFSQTLITFTPEHWCFHADLNGLS------- 58
            :|:..|.:..||     ::.||...:|.|     |.....:.|||.|.|....:..||       
Mouse     7 ILEHIGEFHLFQKQTFFLLALLSGAFTPI-----YVGIVFLGFTPNHHCRSPGVAELSQRCGWSP 66

  Fly    59 -------VEGIRSVYENISASSC--------TPLLGVVNGTGVVSTNRK------CRN-WIFNRE 101
                   |.|:.|..|....|.|        ...|..|:....::.||.      |.: |:::..
Mouse    67 AEELNYTVPGLGSAGEVSFLSQCMRYEVDWNQSTLDCVDPLSSLAANRSHLPLSPCEHGWVYDTP 131

  Fly   102 SGYESITTELKWVCDKSHHPAVGQSFFFMGSVVGTIIFGYLSDQVGRLPSLLMATLCGATGDFIT 166
            .  .||.||...||..|....:.||...:|..:|.:..|||:|:.||...||:..|..|....:.
Mouse   132 G--SSIVTEFNLVCAHSWMLDLFQSLVNVGFFIGAVGIGYLADRFGRKFCLLVTILINAISGVLM 194

  Fly   167 SFVHTLPWFAFSRFMSGLSTDTMYYLMYILVFEYLS-PKSRTFGLNIILAVFYCFGLMTSPWAAI 230
            :......|....||:.||.:...:.:.|||:.|::. ...||.|  |...:.:..||:.....|.
Mouse   195 AISPNYAWMLVFRFLQGLVSKAGWLIGYILITEFVGLGYRRTVG--ICYQIAFTVGLLILAGVAY 257

  Fly   231 WIGNWRRYLWLASLPALGVLIYPFLICESAQWLLTKRKYDDAVICLKKVAKFNRRHVEESVFDEF 295
            .:.|||...:..:||....|:|.:.|.||.:||:::.|...|:..:|.:||.|.:.|..|:    
Mouse   258 ALPNWRWLQFAVTLPNFCFLLYFWCIPESPRWLISQNKNAKAMKIIKHIAKKNGKSVPVSL---- 318

  Fly   296 VKYYRERELQDYKLNSHEDT-------FLAMFLTPRLRRFTLTLLVKSVIITLSCDVINRNMEGL 353
                       ..|.:.|||       ||.:..||::|:.||.|:......::....:..:|...
Mouse   319 -----------QSLTADEDTGMKLNPSFLDLVRTPQIRKHTLILMYNWFTSSVLYQGLIMHMGLA 372

  Fly   354 GTSPFKLFSFTSIVYLPAGVAILLLQNKIGRKGMACTALFVGGLITTATGFMIAHLDPTENALLL 418
            |.:.:..|.::::|..||...|:|..::|||:.....:..|.|....|:.|:     |.:...|.
Mouse   373 GDNIYLDFFYSALVEFPAAFIIILTIDRIGRRYPWAVSNMVAGAACLASVFI-----PDDLQWLK 432

  Fly   419 AIMVGLGRFGATVSYDAEIQYAAEIIPTSVRGQAVSNIHVIGLASSSLA--------FYVIYLAQ 475
            ..:..|||.|.|::|:......||:.||.:|..||       |..||:.        |.|..|..
Mouse   433 ITVACLGRMGITIAYEMVCLVNAELYPTYIRNLAV-------LVCSSMCDIGGIVTPFLVYRLTD 490

  Fly   476 YYKPLPSIFISCLMFFGAGLCLTLPETLNKKLPETLADGEKFALNESFLYFPCFSRKEKNV 536
            .:...|.:..:.:.....||.|.||||..|.||||:.|.||.....        .:|||.:
Mouse   491 IWLEFPLVVFAVVGLVAGGLVLLLPETKGKALPETIEDAEKMQRPR--------KKKEKRI 543

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG7333NP_650814.2 2A0119 11..510 CDD:273328 143/548 (26%)
Slc22a2NP_038695.1 2A0119 12..525 CDD:273328 143/548 (26%)
Proline-rich sequence. /evidence=ECO:0000250|UniProtKB:O15244 284..288 2/3 (67%)

Return to query results.
Submit another query.