DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG31524 and P4ha2

DIOPT Version :10

Sequence 1:NP_733379.2 Gene:CG31524 / 318781 FlyBaseID:FBgn0051524 Length:536 Species:Drosophila melanogaster
Sequence 2:XP_006532515.1 Gene:P4ha2 / 18452 MGIID:894286 Length:557 Species:Mus musculus


Alignment Length:578 Identity:172/578 - (29%)
Similarity:265/578 - (45%) Gaps:78/578 - (13%)


- Green bases have known domain annotations that are detailed below.


  Fly     4 LKLLFVVIFFLSLSMGQIEATQPRFARSVVNMDDLLNMEDDLVSKLEGYAEKLSYKANTIRWGIQ 68
            :||..:|:..|....|.:...|..|..|:.:|.||:..|.|||..|:.|......|...|:....
Mouse     1 MKLQVLVLVLLMSWFGVLSWVQAEFFTSIGHMTDLIYAEKDLVQSLKEYILVEEAKLAKIKSWAS 65

  Fly    69 QMREQLDKSKKEQSFDL---FNRYSFIRHMQADWLMWKQYLDKPVIRD----ELNYKQMDNLRM- 125
            :|.....:|..:....|   .|.|..::.:..||         |.:.|    :.:...:.||.: 
Mouse    66 KMEALTSRSAADPEGYLAHPVNAYKLVKRLNTDW---------PALGDLVLQDASAGFVANLSVQ 121

  Fly   126 ----PQELDLFDASEAIRRMQATYAMLSNDIAEGFLDGVQYTSKLSPIDCLAMGRHLMNQSRWTI 186
                |.:.|...|:.|:.|:|.||.:..:.|:.|.|.|.:|.:.||..||..:||...|:..:..
Mouse   122 RQFFPTDEDESGAARALMRLQDTYKLDPDTISRGELPGTKYQAMLSVDDCFGLGRSAYNEGDYYH 186

  Fly   187 AEQWILAGIKAQDRKGPQTEMILLRGPTKAELFRTLGKVRFERRNEEGALKAYQAALKHSP-HD- 249
            ...|:...:|..| .|.:..:      ||:.:...|....|:..:...|::..:..|...| |: 
Mouse   187 TVLWMEQVLKQLD-AGEEATV------TKSLVLDYLSYAVFQLGDLHRAVELTRRLLSLDPSHER 244

  Fly   250 ----LEIFQEYQNLKR-RVLT------LSPSEPIREEPNDDIEEMEL-PPCCSGRCEG----PRK 298
                |..|:.....:| :.|:      |:..|.:.|.|.|.:.|.:: ...|.|  ||    ||:
Mouse   245 AGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRG--EGVKLTPRR 307

  Fly   299 LNRLYCVY---NCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSETVN 360
            ..:|:|.|   |.|  |.|.:||.|.|.....|.::..:|::|.:|...|:..:|.: |...||.
Mouse   308 QKKLFCRYHHGNRV--PQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPK-LARATVR 369

  Fly   361 --AANEFEIAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGGVFESHFD 423
              ......:|.:|.|||.|.:.|.:....::.:|:...|||.:|.:|..||.|||:||.:|.|||
Mouse   370 DPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFD 434

  Fly   424 TS--------------LA-------DEDRFVN-GYIDRLATTLFYLNDVPQGGATHFPGLNITVF 466
            .|              ||       ::|.|.. |..:|:||.|.|::||..||||.||.|...::
Mouse   435 FSRRPFDSGLKTEGNRLATFLNYSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 499

  Fly   467 PKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQEFRRPCLRSRLD 524
            ||.||.:.||||...|....||.|..|||:||.|||.:||..::||||.|||..:.:|
Mouse   500 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRPCGTTEVD 557

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG31524NP_733379.2 P4Ha_N 31..159 CDD:462433 34/139 (24%)
BepA 172..>256 CDD:443813 17/89 (19%)
P4Hc 340..507 CDD:214780 71/190 (37%)
P4ha2XP_006532515.1 P4Ha_N 28..159 CDD:462433 34/139 (24%)
P4Hc 348..541 CDD:214780 73/193 (38%)

Return to query results.
Submit another query.