DRSC/TRiP Functional Genomics Resources




Protein Alignment CG18749 and P4ha3

DIOPT Version: 10

Sequence 1: NP_001027154.1  Gene: CG18749 / 3771984  FlyBaseID: FBgn0042182  Length: 491  Species: Drosophila melanogaster
Sequence 2: XP_006508007.1  Gene: P4ha3 / 320452  MGIID: 2444049  Length: 551  Species: Mus musculus


Alignment Length: 555
Identity: 155/555 (27%)
Similarity: 240/555 (43%)
Gaps: 143/555 (25%)


- Residues shown in green on the original page have known domain annotations, which are detailed under Known Domains below.


  Fly    31 SYAASTMELMKLLEVEDELVDNLKGYVKTLKMKFNLMERSLIDMSR---ENMEMKSDYESYLGNP 92
            :::|.| .:.:.|..|..|:..|:.|::.       .|..|.|::|   :.:.:..|.:..:.||
Mouse    28 TFSALT-SVARALAPERRLLGTLRRYLRG-------EEARLRDLTRFYDKVLSLHEDLKIPVVNP 84

  Fly    93 LNSFRLIHRLHTSWRKWYQYAIKVENNALGHIENARLMR-------KMLPTSSDLQQACRGIHDL 150
            |.:|.||.||.:.||...        ::|...||.|.::       :.||...||:.|.|.:..|
Mouse    85 LLAFTLIKRLQSDWRNVV--------HSLEATENIRALKDGYEKVEQDLPAFEDLEGAARALMRL 141

  Fly   151 MYFYDLKPEELAAGNLAGYSQPGTG--------------LTAYDCLALG---------------- 185
            ...|.|..:.||    .|..|..||              |||.||..:|                
Mouse   142 QDVYMLNVKGLA----RGVFQRVTGSSITDLYSPRQLFSLTADDCFQVGKPSCGSRSLQVAYDTG 202

  Fly   186 ---------------------------EFGVQNQKDDLAEAWYN-------LSLTRF------DN 210
                                       |..:::..|.||.|.:.       |||:|.      ||
Mouse   203 DYYHAIPWLEEAVSLFRRAHGEWKTEDEASLEDALDYLAFACFQVGNVSCALSLSREFLVYSPDN 267

  Fly   211 -----IIEKYQVHKAWALLLAKNKQLTEAFQHFENKPEGIVASNEVIH------FEGVLATTQNC 264
                 .:.||:      .|||:|.....|        |..:....|.|      :||:..|..:.
Mouse   268 KRMARNVLKYE------RLLAENGHQMAA--------ETAIQRPNVPHLQTRDTYEGLCQTLGSQ 318

  Fly   265 TAVVQKPSKKLHCRYNTSTTPFTRIAPLKMEELGLDPYMVVFHDVIYDTEIDGMLNSSDFGLSES 329
            ....|.||  |:|.|.|:::|:..:.|.:.|.:.|.|.:.::||.:.|.|...:...::..|..|
Mouse   319 PTHYQIPS--LYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRS 381

  Fly   330 V--SG---LKSEVRTSKDSHIVDA-----KTLNERVTDMTGLSME--MSDPFSLINYGLGGHFIL 382
            |  ||   |:.|.|.||.:.:.|.     .||:.|:..:|||.::  .::...::|||:|||:..
Mouse   382 VVASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEP 446

  Fly   383 HHDFHEYTNTT---RLKQGDRIATVLFYLREVDSGGATVFPMLNITVMPKKGSAVFWYNLHNSGA 444
            |.| |..:.::   |:|.|:|:||.:.||..|::||||.|...|.:|...|.:|:||:|||.||.
Mouse   447 HFD-HATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGE 510

  Fly   445 VNSKTLHTACPVISGSKYVLTKWINELPQMFVTPC 479
            .:..|||..|||:.|.|:|..|||:|..|.|..||
Mouse   511 GDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRRPC 545

Known Domains:


Indicated by green residues in the alignment on the original page.

Gene     Sequence         Domain   Region     External ID   Identity
CG18749  NP_001027154.1   P4Ha_N   35..168    CDD:462433    36/142 (25%)
                          P4Hc     314..468   CDD:214780    62/168 (37%)
P4ha3    XP_006508007.1   P4Ha_N   31..158    CDD:462433    38/146 (26%)
                          P4Hc     363..535   CDD:214780    64/172 (37%)
