DRSC/TRiP Functional Genomics Resources

Protein Alignment CG31371 and P4ha3

DIOPT Version: 10

Sequence 1: NP_733375.2     Gene: CG31371 / 43626   FlyBaseID: FBgn0051371   Length: 507   Species: Drosophila melanogaster
Sequence 2: XP_006508007.1  Gene: P4ha3 / 320452    MGIID: 2444049           Length: 551   Species: Mus musculus


Alignment Length: 557   Identity: 148/557 (26%)
Similarity: 227/557 (40%)   Gaps: 127/557 (22%)
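
The percentages in the summary can be reproduced from the raw counts. The sketch below assumes the values are truncated to whole percents rather than rounded. That assumption matches all three displayed figures, but the page does not state how they are computed. The helper name `pct_floor` is illustrative and not part of DIOPT.

```python
def pct_floor(count: int, total: int) -> int:
    """Return count/total as a whole percentage, truncated (assumed behaviour)."""
    return count * 100 // total


# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 557
summary = {"Identity": 148, "Similarity": 227, "Gaps": 127}

for label, count in summary.items():
    print(f"{label}: {count}/{ALIGNMENT_LENGTH} ({pct_floor(count, ALIGNMENT_LENGTH)}%)")
```

With ordinary rounding the same counts would give 27%, 41% and 23%, so the truncation assumption matters when comparing these figures with values from other tools.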


- Residues shown in green in the alignment have known domain annotations, which are detailed under Known Domains below.


  Fly     8 LWLLQLFLLVESVAGANFARGEEQLQAL------LDTETQLIDGLRDYIERLERQLEEIRRETSA 66
            |.||.|..|....|.|  ...|:...||      |..|.:|:..||.|:...|.:|.::.|....
Mouse     7 LALLALLALGGDPAAA--TGREDTFSALTSVARALAPERRLLGTLRRYLRGEEARLRDLTRFYDK 69

  Fly    67 IEEIHSQVDSVEEYMGNPLNVFGILKRFESVWPGLEQKANATLEMVFGERLSDR----QLTLPSE 127
            :..:|   :.::..:.|||..|.::||.:|.|..:.....||..:   ..|.|.    :..||:.
Mouse    70 VLSLH---EDLKIPVVNPLLAFTLIKRLQSDWRNVVHSLEATENI---RALKDGYEKVEQDLPAF 128

  Fly   128 EDYEESLNHLLHLQSVYELDSNSLSLGV---VNGFKLGS--------SMSWGDCLEVARKS---- 177
            ||.|.:...|:.||.||.|:...|:.||   |.|..:..        |::..||.:|.:.|    
Mouse   129 EDLEGAARALMRLQDVYMLNVKGLARGVFQRVTGSSITDLYSPRQLFSLTADDCFQVGKPSCGSR 193

  Fly   178 ---------DFPVARFWLESALEKLPSA-SENSTESQRERESGRVHILEATLNIEYRAGELSRAL 232
                     |:..|..|||.|:.....| .|..||.:...|....::..|.    ::.|.:|.||
Mouse   194 SLQVAYDTGDYYHAIPWLEEAVSLFRRAHGEWKTEDEASLEDALDYLAFAC----FQVGNVSCAL 254

  Fly   233 ATAEELLLLLPMNQGIQKAKRKIEKAMAKKELPKGRGQKSKAKKQISK-STEQLLIEEICRGAKQ 296
            :.:.|.|:..|.|:.:.:...|.|:.:|:      .|.:..|:..|.: :...|...:...|..|
Mouse   255 SLSREFLVYSPDNKRMARNVLKYERLLAE------NGHQMAAETAIQRPNVPHLQTRDTYEGLCQ 313

  Fly   297 QVTTGSRFNHCQL--------DGSSPWLLLQPSRLEPVSSDPYIVLHHDVLTPKESNELLQL--- 350
              |.||:..|.|:        ..|||:|||||:|.|.|...|.|.|:||.::.:|:.::.:|   
Mouse   314 --TLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEP 376

  Fly   351 ------IDEEEDTKGVSYQSLKLSKLAQKK----------LGRISRLLGLEILELDPWT------ 393
                  :...|....|.|   ::||.|..|          ..||:.|.||:|  ..|:.      
Mouse   377 WLQRSVVASGEKQLQVEY---RISKSAWLKDTVDPMLVTLDHRIAALTGLDI--QPPYAEYLQVV 436

  Fly   394 -----GRRHGH-EHITKLEHSSEL------KHVARLMLNLQAPGMGGAVVFPQLELAVNVPRGSL 446
                 |....| :|.|  ..||.|      ..||..|:.|.:...|||..|.....:|.|.:.:.
Mouse   437 NYGIGGHYEPHFDHAT--SPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAA 499

  Fly   447 LHWRTRFAGGSSSEWD-YRSGQ-------AICPVLLG 475
            |.|           |: :|||:       |.||||:|
Mouse   500 LFW-----------WNLHRSGEGDGDTLHAGCPVLVG 525

Known Domains:


Indicated by green residues in the alignment.

Gene     Sequence        Domain  Region    External ID  Identity
CG31371  NP_733375.2     P4Ha_N  30..158   CDD:462433   39/140 (28%)
P4ha3    XP_006508007.1  P4Ha_N  31..158   CDD:462433   38/132 (29%)
P4ha3    XP_006508007.1  P4Hc    363..535  CDD:214780   45/181 (25%)
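
To put the domain regions in context, the sketch below converts each `start..end` region into a residue count and expresses it as a fraction of the full sequence length. It assumes the regions are 1-based, inclusive residue coordinates on the respective protein, which the page does not state explicitly. The function name `region_length` is illustrative.

```python
def region_length(region: str) -> int:
    """Length of a 'start..end' region, assuming 1-based inclusive coordinates."""
    start, end = (int(part) for part in region.split(".."))
    return end - start + 1


# (gene, domain, region, full sequence length) taken from the report above.
domains = [
    ("CG31371", "P4Ha_N", "30..158", 507),
    ("P4ha3", "P4Ha_N", "31..158", 551),
    ("P4ha3", "P4Hc", "363..535", 551),
]

for gene, domain, region, seq_len in domains:
    n = region_length(region)
    print(f"{gene} {domain}: {n} residues ({n / seq_len:.1%} of {seq_len})")
```

The denominators in the Identity column (140, 132, 181) are larger than these residue counts, presumably because they count aligned columns including gaps.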
