DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG32201 and P4HA2

DIOPT Version :10

Sequence 1:NP_730346.2 Gene:CG32201 / 317911 FlyBaseID:FBgn0052201 Length:520 Species:Drosophila melanogaster
Sequence 2:NP_004190.1 Gene:P4HA2 / 8974 HGNCID:8547 Length:535 Species:Homo sapiens


Alignment Length:575 Identity:158/575 - (27%)
Similarity:255/575 - (44%) Gaps:99/575 - (17%)


- Green bases have known domain annotations that are detailed below.


  Fly     1 MRKYHSLLPVLLGFAVLN-------SFVGCVEDI--EEKERYSSSTVGLLKLLKVEEKFTDNLLN 56
            |:.:.|.| ::..|.||:       :.:|.:.|:  .|||...|    |.:.:.|||.    .|:
Human     1 MKLWVSAL-LMAWFGVLSCVQAEFFTSIGHMTDLIYAEKELVQS----LKEYILVEEA----KLS 56

  Fly    57 QVDQLGEKFEALRMYLSSVGYELHRSLNEKVQYVSNPINAFSLLRRTHEDLPKWHEYFKEAIGEG 121
            ::.....|.|||          ..:|..:...|:::|:||:.|::|.:.|.|...:...:....|
Human    57 KIKSWANKMEAL----------TSKSAADAEGYLAHPVNAYKLVKRLNTDWPALEDLVLQDSAAG 111

  Fly   122 NQSILVDLVKMVPNDVDMLSAMHGIQRIEKIYDLKIDDLAQGVLQGVQYNVQLTYRDLIAMGNSM 186
            ..:.|....:..|.|.|.:.|...:.|::..|.|....:::|.|.|.:|...|:..|...||.|.
Human   112 FIANLSVQRQFFPTDEDEIGAAKALMRLQDTYRLDPGTISRGELPGTKYQAMLSVDDCFGMGRSA 176

  Fly   187 YQQSDYQTAAKWYRIACKRELENPEQLFI---QI----------LGDPSEHLHR--QYIKSLFKY 236
            |.:.||.....|.....| :|:..|:...   |:          |||    |||  :..:.|...
Human   177 YNEGDYYHTVLWMEQVLK-QLDAGEEATTTKSQVLDYLSYAVFQLGD----LHRALELTRRLLSL 236

  Fly   237 GSTTSEPSKSIEEAFIMVQASQEELDNIMSDLNE-----PQN------DVEVEKDLYQVKRSPSN 290
            ..:......::.   ...|..:||.:..:::..|     |:.      |...|:|:|        
Human   237 DPSHERAGGNLR---YFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVY-------- 290

  Fly   291 CELGCRG-----LYRQKTNLVCRY-KSTANTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELK 349
             |..|||     ..|::..|.||| .......|.:||.|.|:....|.:..|::|:.|.||..:|
Human   291 -ESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIK 354

  Fly   350 GQSMNMVNGYASQRNGTEIRD------TVVRYDWWSNTSLVRE-------RINQRIIDMTGFNFL 401
            ..:       ..:.....:||      ||..|. .|.:|.:.|       |:|:|:..:||....
Human   355 EIA-------KPKLARATVRDPKTGVLTVASYR-VSKSSWLEEDDDPVVARVNRRMQHITGLTVK 411

  Fly   402 KDEKLQIANYGLGTYFQPHFDYS-SDGFETPNITTLGDRLASILFYASEVPQGGATVFPEINVTV 465
            ..|.||:||||:|..::||||:| :|..:|......|:|:|:.|.|.|:|..|||||||::...:
Human   412 TAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAI 476

  Fly   466 FPQKGSMLYWFNLHDDGKPDIRSLHSVCPVLNGDRWTLTKWVPMFPQMFSFPCKS 520
            :|:||:.::|:||...|:.|.|:.|:.||||.|.:|...||.....|.|..||.|
Human   477 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRPCGS 531

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG32201NP_730346.2 P4Ha_N 36..167 CDD:462433 29/130 (22%)
P4Hc 341..507 CDD:214780 63/179 (35%)
P4HA2NP_004190.1 P4Ha_N 26..157 CDD:462433 35/148 (24%)
BamD 159..>272 CDD:443281 25/120 (21%)
TPR 207..240 8/36 (22%)
P4Hc 346..519 CDD:214780 64/180 (36%)

Return to query results.
Submit another query.