DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG10232 and Habp2

DIOPT Version :10

Sequence 1:NP_651175.4 Gene:CG10232 / 42800 FlyBaseID:FBgn0039108 Length:509 Species:Drosophila melanogaster
Sequence 2:NP_001316864.1 Gene:Habp2 / 226243 MGIID:1196378 Length:554 Species:Mus musculus


Alignment Length:610 Identity:146/610 - (23%)
Similarity:217/610 - (35%) Gaps:229/610 - (37%)


- Green bases have known domain annotations that are detailed below.


  Fly    27 YNSWKKCDVNEDCTHLKACPPLKYFLQSADISWAREAILRDKQCGYNSYCCRKEGYERRSNEQCV 91
            |.|:::...:||.:..:..|      ::.|  |..|   .|..|..|.  |...|       .|:
Mouse    39 YYSYEQSSPDEDPSVTQTTP------ENPD--WYYE---DDDPCQSNP--CEHGG-------DCI 83

  Fly    92 MLND-----CP-QFNGRKWTSDILKSIINKI---------CYIDHFQPLVR------------RR 129
            :..|     || .|:|.:     .::..||.         |.|....|..|            .:
Mouse    84 IRGDTFSCSCPAPFSGSR-----CQTAQNKCKDNPCVHGDCLITQKHPYYRCACKYPYTGPDCSK 143

  Fly   130 VYVNCQRQEIFPCQYDEIC-----RSRDSCPFLKLKRKKAQNILMSRQCGINTYCCPKQ---EFP 186
            |...|:..   |||...:|     |||.:|                        .||.|   :|.
Mouse   144 VLPACRPN---PCQNGGVCSRHRRRSRFTC------------------------ACPDQYKGKFC 181

  Fly   187 DCPADE--------------KCIRLDKCL---------RIHNTTMEDGA--NLMDNRQCAIDTRR 226
            :...|:              |.:..:.||         ..:|..|||..  .:.::..|    |.
Mouse   182 EIGPDDCYVGDGYSYRGKVSKTVNQNPCLYWNSHLLLQETYNMFMEDAETHGIAEHNFC----RN 242

  Fly   227 IDSDKRHY-------------IC----CPEPGNVLPT--------------SCGQAP----PLYR 256
            .|.|.:.:             .|    ||.|....|.              |||:..    .:.|
Mouse   243 PDGDHKPWCFVKVNSEKVKWEYCDVTVCPVPDTPNPVESLLEPVMELPGFESCGKTEVAEHAVKR 307

  Fly   257 MAYGTAARPNEYPWMAMLIYENRRLSTMTNN--CSGSLINKRYVLTAAHCVVKDKMVNTDLVLR- 318
            :..|..:...::||...|.......::|...  |.|:||:..:|||||||        ||:..: 
Mouse   308 IYGGFKSTAGKHPWQVSLQTSLPLTTSMPQGHFCGGALIHPCWVLTAAHC--------TDINTKH 364

  Fly   319 -RVRLGEHDITTNPDCDFTGNCAAPFVEIGIEYFNVHEQYFNTSRF-------------ESDIAL 369
             :|.||:.|:.....                     |||.|...:.             .:||||
Mouse   365 LKVVLGDQDLKKTES---------------------HEQTFRVEKILKYSQYNERDEIPHNDIAL 408

  Fly   370 VRLQTPV--------RYTHEILPICVPKDPIPLHNHPLQIAGWGYTKNREYSQVLLH-------N 419
            ::|: ||        ||   :..:|:|.||.| ......|:|||.|:..|.|:.||.       |
Mouse   409 LKLK-PVGGHCALESRY---VKTVCLPSDPFP-SGTECHISGWGVTETGEGSRQLLDAKVKLIAN 468

  Fly   420 TVYENRYYCQDKISFFRNESQICASGIR--GEDSCEGDSGGPLMLTLNNDYQDIVYLAGIVSYGS 482
            .:..:|......|    ::|.|||..::  |.|:|:|||||||....:..|    |:.||||:|.
Mouse   469 PLCNSRQLYDHTI----DDSMICAGNLQKPGSDTCQGDSGGPLTCEKDGTY----YVYGIVSWGQ 525

  Fly   483 ENCGDRKPGVYTKTGAFFSWIKANL 507
            | || :||||||:...|.:|||..:
Mouse   526 E-CG-KKPGVYTQVTKFLNWIKTTM 548

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG10232NP_651175.4 Tryp_SPc 260..506 CDD:238113 86/279 (31%)
Habp2NP_001316864.1 EGF_CA 67..103 CDD:238011 11/49 (22%)
EGF_CA 150..182 CDD:238011 12/58 (21%)
KR 187..271 CDD:238056 12/87 (14%)
Tryp_SPc 308..547 CDD:238113 86/282 (30%)

Return to query results.
Submit another query.