DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG5849 and SP1029

DIOPT Version :10

Sequence 1:NP_650979.1 Gene:CG5849 / 42556 FlyBaseID:FBgn0038897 Length:968 Species:Drosophila melanogaster
Sequence 2:NP_652618.1 Gene:SP1029 / 53473 FlyBaseID:FBgn0263236 Length:932 Species:Drosophila melanogaster


Alignment Length:945 Identity:257/945 - (27%)
Similarity:433/945 - (45%) Gaps:129/945 - (13%)


- Green bases have known domain annotations that are detailed below.


  Fly     9 VVLVTLGLLVSGSMGERERS----LRLPNATYPLFYQLHISSDIHKGQ-LLFSGNATIDVAIRQS 68
            |..:.|||..:.|..:...:    .|||.:..|..|.|.|.:.:...: |.|||:..|.:...::
  Fly     8 VAALGLGLATADSSADSIETTYNYYRLPTSLRPQKYHLRILTLLENPEDLRFSGSVKILIEALEN 72

  Fly    69 TNEIVLHAKNLT--DIQITVHRLMAEGSEIVDDLTHTLHPTAALLIIHPIENYQAFEEGQQYRLE 131
            |..:.||:||||  :.|||:.::..||.:.....:..::|:....|::..:...|   |..|.|.
  Fly    73 TKNVTLHSKNLTIDESQITLRQIGGEGKKENCVSSTAVNPSHDFYILNTCQELLA---GNTYELY 134

  Fly   132 ILYTAIMASRPAGLYYMDYRDEENNHTVYVAATQCEPTYGRLIFPCYDEPGFKSNFSIKITHGSS 196
            :.:.|.:..:..|.|...|:|...|.|.:::.||.||...||.|||:|||.||:.|.:.:.:...
  Fly   135 MPFAADLNRQLEGYYRSSYKDPVANLTKWISVTQFEPASARLAFPCFDEPDFKAPFVVTLGYHKK 199

  Fly   197 HSAISNMPVKEVLAH---GDLKTTSFHTTPPISTYLVAFVISDFGSISET------YRGITQSIY 252
            ::||||||.||...|   .|.....|..:.|:||||||:.::||.....|      :|     .:
  Fly   200 YTAISNMPEKETKPHETLADYIWCEFQESVPMSTYLVAYSVNDFSHKPSTLPNSALFR-----TW 259

  Fly   253 TSPTSKEKGQVALKNAVRTVAALEDYFGVSYPLPKLDHVALKKNYGAAMENWGLITYKDVNLLKN 317
            ..|.:.::...|.:...:.:...|.:||:.:||||:|.:|:......|||||||:||:::.||.:
  Fly   260 ARPNAIDQCDYAAQFGPKVLQYYEQFFGIKFPLPKIDQIAVPDFSAGAMENWGLVTYREIALLYS 324

  Fly   318 I--SSDGQKRKLDLITQNHEIAHQWFGNLVSPEWWTYTWMNEGFATYFSYVITDLIYPNDKMMDM 380
            .  ||...|:::..:.. ||:|||||||||:.:|||..|:|||||||.:.:..:.|.|..:.|:.
  Fly   325 AAHSSLADKQRVASVVA-HELAHQWFGNLVTMKWWTDLWLNEGFATYVASLGVENINPEWRSMEQ 388

  Fly   381 FMTHEADSAYSYNSFFDVHPMSHYVEGEKDIMGVFDIISYKRGACVIKMFHHAFRQKLFVRGISH 445
            .......:.:..::....||:|..::...:|...||.|||::|:.|::|.|....::.|..|:..
  Fly   389 ESLSNLLTIFRRDALESSHPISRPIQMVSEISESFDQISYQKGSTVLRMMHLFLGEESFRSGLQA 453

  Fly   446 FLEKYRYSVANELNLFDALHSELQDDEYFSHQPWASRIREIMLSWTHSEWLPILVVTRNYENNTI 510
            :|:|:.|..|.:.||:::|   .|....:...|.:..|:.||.|||.....|::.|||:|...|.
  Fly   454 YLQKFSYKNAEQDNLWESL---TQAAHKYRSLPKSYDIKSIMDSWTLQTGYPVINVTRDYAARTA 515

  Fly   511 TFTQRSVHMKDEL--------WWIPINFATTQSPNFEDTQVDMFMP--------PQPQYTVSLED 559
            ...|....:..::        ||:|:::.|....:|.:|....:|.        |:     :::|
  Fly   516 KLNQERYLLNTQVARAYRGGCWWVPLSYTTQAVQDFNNTAPKAWMECGKNGESLPK-----TIQD 575

  Fly   560 LNIHVSGRD-WIMVNKQHTGFYLVRYDTDNLMAIARQLQTNHSV--IHPINRLGLFRDLGPLIEH 621
            |    .|.| |::.|.|.:..|.|.||..|...:...| ||...  ||.|||..|..|...|...
  Fly   576 L----PGPDQWVIFNTQLSTLYKVNYDAQNWKLLIETL-TNGDFERIHVINRAQLIDDALYLAWT 635

  Fly   622 NEIEQVEVVFELLKYLELEEDVLTWNQLQDTIECLTRNLHGTSSQSLFNEFVRRLVGPTFRRVYV 686
            .| :..|:...|::||:.|.:.|.|....:.::.:.|.:..|.....|..::::|:.|.:..:  
  Fly   636 GE-QDYEIAMRLIEYLQREREYLPWKSAFENLKRVGRIVRQTPDFEFFKRYMKKLILPIYEHL-- 697

  Fly   687 EQGVNLAEDGMFRGI------------LEIACSAD----LPECLEYTRRLAKEHIIDKIYFKDGS 735
             .|:|    ..|..|            :..||...    :|:.|.|.|....|...|:   |:..
  Fly   698 -NGIN----DTFSAIPQQDQVLLKTMVVNWACQYQVGDCVPQALAYYRNWRAEANPDE---KNPV 754

  Fly   736 DYHAIIDSVLCMGVRYLSDQDFQRIIDMMQEIDRASVYYDDIIYALRCTQSHRHLLYYL------ 794
            ..: :..:|.|..:::.||.|::.:....:: ...:.....|:.||.|::....|..||      
  Fly   755 PIN-VRSTVYCTSIKHGSDSDWEFLWTRYKK-SNVAAEKRTILTALGCSREVWLLQRYLELTFDP 817

  Fly   795 -EGLMGENSTHMVLS-EFEDLMYLL----------YIYK---------SNLASRPVIWQYIERNY 838
             |.:..::|.....: .|.::.:||          :|||         |.|.| |:..|.|.   
  Fly   818 KEAIRKQDSMWAFQAVAFNEVGFLLAKKYFMDNVDFIYKFYHPLTKDMSRLLS-PLSEQVIT--- 878

  Fly   839 KVLCRAPNFLEHFKQLAGFVPRHQRSHFERLRQTI 873
                     |..|.:...|| .:.|...:.|.|.|
  Fly   879 ---------LSDFNEFKDFV-NNSRQSLKGLEQAI 903

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG5849NP_650979.1 M1_APN-Q_like 38..492 CDD:341064 149/467 (32%)
ERAP1_C 569..841 CDD:463368 71/316 (22%)
SP1029NP_652618.1 M1_APN-Q_like 42..497 CDD:341064 149/466 (32%)
ERAP1_C 582..910 CDD:463368 79/350 (23%)

Return to query results.
Submit another query.