DRSC/TRiP Functional Genomics Resources

Protein Alignment CG31013 and P4ha2

DIOPT Version: 10

Sequence 1: NP_733394.3     Gene: CG31013 / 326111   FlyBaseID: FBgn0051013   Length: 534   Species: Drosophila melanogaster
Sequence 2: XP_006532515.1  Gene: P4ha2 / 18452      MGIID: 894286            Length: 557   Species: Mus musculus


Alignment Length: 558
Identity:   159/558 (28%)
Similarity: 249/558 (44%)
Gaps:        99/558 (17%)
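
The summary figures above can be recomputed from any alignment block. The following is a minimal Python sketch, not DIOPT's own code: the function names are hypothetical, and it assumes that `|` in the match line marks an identical pair and `:` a similar pair, with `.` counted as neither. It also assumes percentages are truncated to whole numbers, which is consistent with the values shown (249/558 is about 44.6% and is reported as 44%). The example input is the final alignment block of this page (Fly 486..523, Mouse 514..551).

```python
def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated rather than rounded (assumption)."""
    return count * 100 // total


def alignment_stats(seq1: str, match: str, seq2: str, similar_marks: str = "|:") -> dict:
    """Count identities, similar pairs, and gap columns in one alignment block.

    seq1 / seq2 are the aligned rows (gaps written as '-'); match is the
    line between them. Which marks count as "similar" is an assumption.
    """
    if not (len(seq1) == len(match) == len(seq2)):
        raise ValueError("aligned rows and match line must have equal length")
    length = len(seq1)
    # Identity: same residue in both rows, and not a gap.
    identity = sum(1 for a, b in zip(seq1, seq2) if a == b and a != "-")
    # Similarity: read from the match line, identities included.
    similarity = sum(1 for m in match if m in similar_marks)
    # Gap columns: a '-' in either row.
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")
    return {
        "length": length,
        "identity": identity,
        "similarity": similarity,
        "gaps": gaps,
        "identity_pct": percent_floor(identity, length),
        "similarity_pct": percent_floor(similarity, length),
        "gaps_pct": percent_floor(gaps, length),
    }


# Final alignment block, copied from the page.
FLY = "DGNGDPRTRHAACPVIVGSKWVMTEWIREKRQIFIRPC"
MATCH = ".|.||.|||||||||:||.|||..:|..|:.|.|:|||"
MOUSE = "SGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRPC"

stats = alignment_stats(FLY, MATCH, MOUSE)
print(stats)

# Whole-alignment figures from the summary: 28, 44, 17.
print(percent_floor(159, 558), percent_floor(249, 558), percent_floor(99, 558))
```

Identity is taken from the residues themselves rather than from the match line, so a mis-copied match line affects only the similarity count.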


- Green residues have known domain annotations, which are detailed below.


  Fly    31 SLVTMVPLLELEKKLIDNLENYTNALEQKLEIIRSQLLVIRAENEKGRRNAISYLSNPLNGFSII 95
            |:..|..|:..||.|:.:|:.|....|.||..|:|....:.|...:...:...||::|:|.:.::
Mouse    28 SIGHMTDLIYAEKDLVQSLKEYILVEEAKLAKIKSWASKMEALTSRSAADPEGYLAHPVNAYKLV 92

  Fly    96 RRLHQDWINWRKYMEQPVGIWQLKAFYSWKDELPTERDLWDACEGIARIQSTYDLKVGDFINGNI 160
            :||:.||......:.|......:......:...||:.|...|...:.|:|.||.|.......|.:
Mouse    93 KRLNTDWPALGDLVLQDASAGFVANLSVQRQFFPTDEDESGAARALMRLQDTYKLDPDTISRGEL 157

  Fly   161 NGKQYNDSMSTADILSVGAYLFMKNRPSDAIQWLQEVPQRLQEELLIQPRHLPIKEVDA------ 219
            .|.:|...:|..|...:|...:.:......:.|:::|                :|::||      
Mouse   158 PGTKYQAMLSVDDCFGLGRSAYNEGDYYHTVLWMEQV----------------LKQLDAGEEATV 206

  Fly   220 -----LRLLAEAQIKDQNYSEALPLLHNCLKLQPHDARV---LRLWKKTTD-----FIENQPD-- 269
                 |..|:.|..:..:...|:.|....|.|.|...|.   ||.:::..:     .:.||.|  
Mouse   207 TKSLVLDYLSYAVFQLGDLHRAVELTRRLLSLDPSHERAGGNLRYFERLLEEERGKSLSNQTDAG 271

  Fly   270 --------QSPTENKKRNVPIANAFKLSCNGPLESST-----RLHCFYNF-TTTPFLRLAPLKTE 320
                    :.||:    .:|..:.::..|.|.....|     :|.|.|:. ...|.|.:||.|.|
Mouse   272 LATQENLYERPTD----YLPERDVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEE 332

  Fly   321 QIGLDPYVVLYHEVLSAREISMLIGKAAQNMKNTKIHKERAVPKKNRG---------------RT 370
            .....|::|.|::|:|..||..:              ||.|.||..|.               |.
Mouse   333 DEWDSPHIVRYYDVMSDEEIERI--------------KEIAKPKLARATVRDPKTGVLTVASYRV 383

  Fly   371 AKGFWLKKESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDY----FDFASS--- 428
            :|..||:::.:.:..|:.||:..:||..:..:|..||.|||:||.|..|.|:    ||....   
Mouse   384 SKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEG 448

  Fly   429 -------NHTDTRSRYS-IDLGDRIATVLFYLTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDT 485
                   |::|.:..:. :..|:|:||.|.|::|||.||||||.|:|..:.|:.|||:|||||..
Mouse   449 NRLATFLNYSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLR 513

  Fly   486 DGNGDPRTRHAACPVIVGSKWVMTEWIREKRQIFIRPC 523
            .|.||.|||||||||:||.|||..:|..|:.|.|:|||
Mouse   514 SGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRPC 551

Known Domains:


Indicated by green residues in the alignment.

Gene     Sequence        Domain      Region     External ID  Identity
CG31013  NP_733394.3     P4Ha_N      31..162    CDD:462433   33/130 (25%)
                         NrfG        <181..256  CDD:443378   15/88 (17%)
                         TPR repeat  211..246   CDD:276809   9/45 (20%)
                         P4Hc        336..513   CDD:214780   77/206 (37%)
P4ha2    XP_006532515.1  P4Ha_N      28..159    CDD:462433   33/130 (25%)
                         P4Hc        348..541   CDD:214780   77/206 (37%)
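
The Region values in the domain table can be parsed into numeric coordinates. The following is a minimal Python sketch with hypothetical names (`Region`, `parse_region`). It assumes the `<` and `>` prefixes follow the GenBank-style location convention, in which they mark a boundary that extends beyond the stated position (a partial domain hit); the page itself does not define them.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    start: int           # first residue, 1-based
    end: int             # last residue, inclusive
    start_partial: bool  # True if written as "<start" (assumed: truncated hit)
    end_partial: bool    # True if written as ">end" (assumed: truncated hit)

    @property
    def span(self) -> int:
        """Number of residues covered, counting both endpoints."""
        return self.end - self.start + 1


_REGION_RE = re.compile(r"^(<?)(\d+)\.\.(>?)(\d+)$")


def parse_region(text: str) -> Region:
    """Parse strings such as '31..162' or '<181..256'."""
    m = _REGION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognized region: {text!r}")
    lt, start, gt, end = m.groups()
    return Region(int(start), int(end), lt == "<", gt == ">")


full = parse_region("31..162")       # P4Ha_N in CG31013
partial = parse_region("<181..256")  # NrfG in CG31013
print(full, full.span)
print(partial, partial.span)
```

The residue span of a region (132 for 31..162) need not equal the denominator in the Identity column (130 for the same row), which presumably counts aligned positions within the domain rather than residues.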
