DRSC/TRiP Functional Genomics Resources


Protein Alignment CG42342 and COL23A1

DIOPT Version: 10

Sequence 1: NP_001189234.1  Gene: CG42342 / 7354466  FlyBaseID: FBgn0259244  Length: 831  Species: Drosophila melanogaster
Sequence 2: XP_006714996.1  Gene: COL23A1 / 91522  HGNCID: 22990  Length: 566  Species: Homo sapiens


Alignment Length: 795  Identity: 264/795 (33%)
Similarity: 320/795 (40%)  Gaps: 282/795 (35%)
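
The summary percentages above are each a count divided by the alignment length of 795 columns. The following is a minimal sketch of that arithmetic, not DIOPT's own code: the function names `alignment_stats` and `percent` are hypothetical, and rounding to the nearest integer is an assumption that happens to agree with the figures reported on this page. The two strings are the final block of the alignment (Fly 761-772, Human 551-564).

```python
def alignment_stats(row1, row2, gap="-"):
    """Return (identical columns, gapped columns, total columns) for two aligned rows."""
    if len(row1) != len(row2):
        raise ValueError("aligned rows must have equal length")
    # A column is an identity when both rows carry the same residue (not a gap).
    identities = sum(1 for a, b in zip(row1, row2) if a == b and a != gap)
    # A column is gapped when either row has the gap character.
    gaps = sum(1 for a, b in zip(row1, row2) if a == gap or b == gap)
    return identities, gaps, len(row1)


def percent(count, length):
    """Express count/length as a whole-number percentage (assumed rounding)."""
    return round(100 * count / length)


# Final alignment block, copied from this report.
fly = "---GADGLPLPGCGW"
human = "CPAGPDGLPVPGC-W"
print(alignment_stats(fly, human))  # (9, 4, 15)

# Whole-alignment totals as reported: identity, similarity, gaps.
print(percent(264, 795), percent(320, 795), percent(282, 795))  # 33 40 35
```

Similarity is not recomputed here because it depends on the substitution scoring behind the `:` and `.` marks in the match line, which this page does not specify.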


- Residues shown in green have known domain annotations, which are detailed below.


  Fly    18 DNSQSEPSGGNGESPAATTAAAASVEAPQQSLLLGHNAADASAAAVASRLAPPPCQHPINNSNNN 82
            |..:...:||.|...:||||.:.:|.|    |.|                               
Human    12 DAGKGNAAGGGGGGRSATTAGSRAVSA----LCL------------------------------- 41

  Fly    83 SNISNNSSNSSSSKERPRPTVRFISLLHVASYVLCLCAFSFALYGNVRQTRLEQRMQRLQQLDAR 147
                                     ||.|.|...||      |.| |:...|:.|:..|:     
Human    42 -------------------------LLSVGSAAACL------LLG-VQAAALQGRVAALE----- 69

  Fly   148 IVELELRLEQQQLLHWPAEQTQVLASHPSDRDSSNSNNGSQHLELHVRRELHRLRRDVSHLQLTR 212
                    |:::||.        .|..|...|:.    ...|||..:|.:|..|.:         
Human    70 --------EERELLR--------RAGPPGALDAW----AEPHLERLLREKLDGLAK--------- 105

  Fly   213 RQQRRQAAEAAAAAASGEGGSGGGQCQCQPGPPGPPGPPGKRGKRGKKGDSGEKGDPGLNGISGE 277
               .|.|.||.:            :|.|.|||||..|.||:||..|..|.||..|.||..|:.|:
Human   106 ---IRTAREAPS------------ECVCPPGPPGRRGKPGRRGDPGPPGQSGRDGYPGPLGLDGK 155

  Fly   278 KGAAGKPGDKGQKGDVGHPGMDVFQTVKGLKRSVTTLHGGTLGYAEIVAVKDLQEAGVNVSASTV 342
            .|..|..|:||..||.|..|                                  :.|.:      
Human   156 PGLPGPKGEKGAPGDFGPRG----------------------------------DQGQD------ 180

  Fly   343 IKLKGEPGEPGPPGPPGEAGQPGAPGERGPPGEIGAQGPQGEAGQP---GVAGPPGVAGAPGTKG 404
                |..|.||||||||..|.||..|:.||.|..|..||:||.||.   |..||||..|.||..|
Human   181 ----GAAGPPGPPGPPGARGPPGDTGKDGPRGAQGPAGPKGEPGQDGEMGPKGPPGPKGEPGVPG 241

  Fly   405 DKGDRGDRGLTTTIKGDEFPTGIIEGPPGPAGPPGPPGEPGA---RGEPGPIGPAGPPGEKGPRG 466
            .|||                    :|.|...|||||.||||:   |||.|..|..||.||.|.||
Human   242 KKGD--------------------DGTPSQPGPPGPKGEPGSMGPRGENGVDGAPGPKGEPGHRG 286

  Fly   467 KRGKRIFGPGGTKIDEDYDDPPVTLLRGPPGPPGIAGKDG------RDGR--DGSKGEPGEPGEP 523
            ..|                      ..||.|.||:.|:.|      .|||  |..||.||..|.|
Human   287 TDG----------------------AAGPRGAPGLKGEQGDTVVIDYDGRILDALKGPPGPQGPP 329

  Fly   524 GSLGPRGLDGLPGEPGIEGPPGLPGYQGPPGEKGDRGDIGPPGLMGPPGLPGPPGYPGVKGDKGD 588
               ||.|:.|..||.|:.|.||:.|.:||.|:|||.|:.||.||.|..|..|..|.||..|.||:
Human   330 ---GPPGIPGAKGELGLPGAPGIDGEKGPKGQKGDPGEPGPAGLKGEAGEMGLSGLPGADGLKGE 391

  Fly   589 RGDSYRKMRRRQDDGMSDAPHMPTIEYLYGPPGPPGPMGPPGHTGSQGERGLDGRKGDPGEKGHK 653
            :|:|       ..|.:.::.....:|  .||||||||.||.|..|.||.:||||.|   ||||..
Human   392 KGES-------ASDSLQESLAQLIVE--PGPPGPPGPPGPMGLQGIQGPKGLDGAK---GEKGAS 444

  Fly   654 GDQGPMGLPGPMGMRGESGPSGPSGKAGIPGPPGLDGMKGAQGETGHKGERGDPGLPGTDGIPGQ 718
            |::||.|||||:                  |||||.|:.|.      |||:|.||.||.||.||.
Human   445 GERGPSGLPGPV------------------GPPGLIGLPGT------KGEKGRPGEPGLDGFPGP 485

  Fly   719 EGPRGEQGSRGDAGPPGKRGRKGDRGDKGEQGVPGLDAPCPL----------------------- 760
            .|.:|::..||:.|..|..||||.:|.|||.|.||||.|||:                       
Human   486 RGEKGDRSERGEKGERGVPGRKGVKGQKGEPGPPGLDQPCPVENPTCGGRRGAPGWRGPARGNGP 550

  Fly   761 ---GADGLPLPGCGW 772
               |.||||:||| |
Human   551 CPAGPDGLPVPGC-W 564

Known Domains:


Indicated by green residues in the alignment.

Gene     Sequence        Domain         Region       External ID  Identity
CG42342  NP_001189234.1  DUF2046        <133..210    CDD:401633   15/76 (20%)
CG42342  NP_001189234.1  gly_rich_SclB  <249..>413   CDD:468478   60/166 (36%)
CG42342  NP_001189234.1  gly_rich_SclB  <368..>560   CDD:468478   81/205 (40%)
CG42342  NP_001189234.1  gly_rich_SclB  <527..>757   CDD:468478   104/229 (45%)
COL23A1  XP_006714996.1  gly_rich_SclB  <134..>384   CDD:468478   122/338 (36%)
COL23A1  XP_006714996.1  gly_rich_SclB  <265..>524   CDD:468478   134/319 (42%)
