DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG9698 and P4ha3

DIOPT Version :10

Sequence 1:NP_651806.2 Gene:CG9698 / 43629 FlyBaseID:FBgn0039784 Length:547 Species:Drosophila melanogaster
Sequence 2:XP_006508007.1 Gene:P4ha3 / 320452 MGIID:2444049 Length:551 Species:Mus musculus


Alignment Length:568 Identity:160/568 - (28%)
Similarity:247/568 - (43%) Gaps:118/568 - (20%)


- Green bases have known domain annotations that are detailed below.


  Fly    26 FSSIHEMTKVFGYEQKMVLHMQKFLSDNQDKLDFLKARLREFENERNEAREWGPSYFE------- 83
            ||::..:.:....|::::..::::|...:       ||||:.....::..    |..|       
Mouse    29 FSALTSVARALAPERRLLGTLRRYLRGEE-------ARLRDLTRFYDKVL----SLHEDLKIPVV 82

  Fly    84 SPINKYLLNKRLTVDWQRV-------ENLMATSTGEKPLTRLRKFRNRETMPDKKELEGAIDGLL 141
            :|:..:.|.|||..||:.|       ||:.|...|.:.:        .:.:|..::||||...|:
Mouse    83 NPLLAFTLIKRLQSDWRNVVHSLEATENIRALKDGYEKV--------EQDLPAFEDLEGAARALM 139

  Fly   142 RLQYVYRLKAKDLARGILDGVDYGT-----------QLNSEHCVDIARLALRDQHPRLAHS---- 191
            |||.||.|..|.||||:...|...:           .|.::.|..:.:.:...:..::|:.    
Mouse   140 RLQDVYMLNVKGLARGVFQRVTGSSITDLYSPRQLFSLTADDCFQVGKPSCGSRSLQVAYDTGDY 204

  Fly   192 -----WLIEANDRLTGGEKEEQLKPQILALLVQAKKE--LEDFRGLNDTYQEL------IGIQSA 243
                 ||.||                 ::|..:|..|  .||...|.|....|      :|..|.
Mouse   205 YHAIPWLEEA-----------------VSLFRRAHGEWKTEDEASLEDALDYLAFACFQVGNVSC 252

  Fly   244 SEEHAKNYETF--------LNALSEKALLNES-----------KPILEHAPIPEEGEPVGEFQAY 289
            :...::.:..:        .|.|..:.||.|:           :|.:.|....:..|.:.:....
Mouse   253 ALSLSREFLVYSPDNKRMARNVLKYERLLAENGHQMAAETAIQRPNVPHLQTRDTYEGLCQTLGS 317

  Fly   290 SLTCSGHWRLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLT 354
            ..|   |:::.     .|.|.|.|.:.|:|.:.|.:.|.:...||:.||||.:...|...||:|.
Mouse   318 QPT---HYQIP-----SLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELA 374

  Fly   355 ENRLMRATITSHNESVVSNVRTSQFTFIPVTAHKVLSTIDQRVADMTNLNMK--YAEDHQFANYG 417
            |..|.|:.:.|..:.:....|.|:..::..|...:|.|:|.|:|.:|.|:::  |||..|..|||
Mouse   375 EPWLQRSVVASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 439

  Fly   418 IGGHYGQHMDWFYQTTFDAGLVSSP----EMGNRIATVLFYLSDVAQGGGTAFPQLRTLLKPKKY 478
            |||||..|.|       .|...|||    :.|||:||.:.|||.|..||.|||......:...|.
Mouse   440 IGGHYEPHFD-------HATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPVVKN 497

  Fly   479 AAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIRENDQSDRRPC 526
            ||.||.|||.||.||..|.|..||::.|.|||.|:||.|..|..||||
Mouse   498 AALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRRPC 545

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG9698NP_651806.2 P4Ha_N 28..161 CDD:462433 37/146 (25%)
P4Hc 345..516 CDD:214780 72/176 (41%)
P4ha3XP_006508007.1 P4Ha_N 31..158 CDD:462433 37/145 (26%)
P4Hc 363..535 CDD:214780 72/178 (40%)

Return to query results.
Submit another query.