DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG32199 and P4ha3

DIOPT Version :10

Sequence 1:NP_730347.1 Gene:CG32199 / 40028 FlyBaseID:FBgn0052199 Length:509 Species:Drosophila melanogaster
Sequence 2:XP_006508007.1 Gene:P4ha3 / 320452 MGIID:2444049 Length:551 Species:Mus musculus


Alignment Length:558 Identity:141/558 - (25%)
Similarity:250/558 - (44%) Gaps:97/558 - (17%)


- Green bases have known domain annotations that are detailed below.


  Fly     5 TSVMSMMKLLEMEESLKTNLDIYVQEMQSKL-DLIKLYQDSLKRVTLTTLEEKEEYVSNPLNAFP 68
            :::.|:.:.|..|..|...|..|::..:::| ||.:.|..     .|:..|:.:..|.|||.||.
Mouse    30 SALTSVARALAPERRLLGTLRRYLRGEEARLRDLTRFYDK-----VLSLHEDLKIPVVNPLLAFT 89

  Fly    69 MLRRLNQDWPKWLRYIKLAIASKKIKE-MEVQLKSAPIDDDLKVALKGMTRIEKFHNLHAEDLTK 132
            :::||..||...:..::.....:.:|: .|...:..|..:||:.|.:.:.|::..:.|:.:.|.:
Mouse    90 LIKRLQSDWRNVVHSLEATENIRALKDGYEKVEQDLPAFEDLEGAARALMRLQDVYMLNVKGLAR 154

  Fly   133 GV---LMGKKLNS--------HLTAPDCVAL----------------GDYYYNQTQFSGSTHWYR 170
            ||   :.|..:..        .|||.||..:                ||||:       :..|..
Mouse   155 GVFQRVTGSSITDLYSPRQLFSLTADDCFQVGKPSCGSRSLQVAYDTGDYYH-------AIPWLE 212

  Fly   171 MALRVHTHPRGMIYAKVLGLKRKRIYKKYAKALLKE-------------------SLNSDKSKPT 216
            .|:.:.....|.             :|...:|.|::                   ||:.:....:
Mouse   213 EAVSLFRRAHGE-------------WKTEDEASLEDALDYLAFACFQVGNVSCALSLSREFLVYS 264

  Fly   217 PTEKSEWNRLAKEVTREDNYDNVKKLIDEYLSGDEKIFQEVAARLKRKPTKLERGCR--GEWPKK 279
            |..|    |:|:.|.:   |:.:.......::.:..|.:.....|:.:.| .|..|:  |..|..
Mouse   265 PDNK----RMARNVLK---YERLLAENGHQMAAETAIQRPNVPHLQTRDT-YEGLCQTLGSQPTH 321

  Fly   280 -SSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEFKSMRDIAMYNGSMIDGWTY 343
             ..|.|.|.|..::|.:|.|.|.:.|.:.::|:|.||||.:.:.|.:.:|::|       :.|..
Mouse   322 YQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELA-------EPWLQ 379

  Fly   344 VDFDKKGNPKQQ--DRVVKMIAFQGTTAPFTLSINRRMADMSGLEMRDNMVLYL--TNYGLGGHF 404
            ......|..:.|  .|:.|....:.|..|..::::.|:|.::||:::.....||  .|||:|||:
Mouse   380 RSVVASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHY 444

  Fly   405 GKHVDYVELAKRPPDFFADFGGDRIATALIYASDIPLGGTTVFTKLKIAVQPKKGSALIWFNLNH 469
            ..|.|:......|  .:....|:|:||.:||.|.:..||.|.|.....:|...|.:||.|:||:.
Mouse   445 EPHFDHATSPSSP--LYRMKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHR 507

  Fly   470 AGEPDPLTEHSVCPVVLGSRWIISKWIHERQQVFKKPC 507
            :||.|..|.|:.|||::|.:|:.:|||||..|.|::||
Mouse   508 SGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRRPC 545

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG32199NP_730347.1 P4Ha_N 6..137 CDD:462433 33/135 (24%)
P4Hc <369..497 CDD:214780 46/129 (36%)
P4ha3XP_006508007.1 P4Ha_N 31..158 CDD:462433 33/131 (25%)
P4Hc 363..535 CDD:214780 55/180 (31%)

Return to query results.
Submit another query.