DRSC/TRiP Functional Genomics Resources




Protein Alignment CG16812 and C1h19orf47

DIOPT Version: 10

Sequence 1:NP_609619.1 Gene:CG16812 / 34722 FlyBaseID:FBgn0032488 Length:455 Species:Drosophila melanogaster
Sequence 2:NP_001103134.1 Gene:C1h19orf47 / 292739 RGDID:1307554 Length:413 Species:Rattus norvegicus


Alignment Length: 465
Identity: 117/465 (25%)
Similarity: 174/465 (37%)
Gaps: 160/465 (34%)
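
The percentages in the summary above can be reproduced from the raw counts. The following is a minimal sketch, not DIOPT's own code: the helper name `percent` is hypothetical, and rounding to the nearest whole number is an assumption (truncation would give the same values for these counts).

```python
def percent(count: int, alignment_length: int) -> int:
    """Return count as a whole-number percentage of the alignment length.

    Rounding to the nearest integer is an assumption; the report does not
    state how its percentages are derived.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100 * count / alignment_length)


# Counts copied from the alignment summary in this report.
ALIGNMENT_LENGTH = 465
summary = {"Identity": 117, "Similarity": 174, "Gaps": 160}

for label, count in summary.items():
    pct = percent(count, ALIGNMENT_LENGTH)
    print(f"{label}: {count}/{ALIGNMENT_LENGTH} ({pct}%)")
```

All three values share the alignment length (465 columns, gaps included) as the denominator, not the length of either protein (455 or 413 residues).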


- Green bases have known domain annotations that are detailed below.


  Fly     5 SAARWVKFFNAAGIPSPAAAGYAHVFVENRIQDDMLLDLNKEYLREMGITLMGDIIAILRHSKTV 69
            :.:.|::||..||||...|..||.:||:||||..||||||||.:.|:|:|::|||||||:|:|.|
  Rat    33 ATSEWIQFFKEAGIPPGPAVNYAVMFVDNRIQKSMLLDLNKEIMNELGVTVVGDIIAILKHAKVV 97

  Fly    70 -----CEQNAREQVLTTEVVQSVVR--------------------PVSPKPKAPVATPKAKVYPP 109
                 |:........|...:|..:|                    |.:|..::..:|.|..|...
  Rat    98 HRQDMCKAATASVPCTPSPLQGELRRGASSAASRMIANSLNHDSPPHTPARRSDNSTSKISVTVS 162

  Fly   110 AKPA-----------------------RRVLPEHEGKYKVTLPSGTTERSKQILAKREKLYSDRV 151
            .|.|                       |||..|.||||.:.:|.|||.|:::||.:::     ..
  Rat   163 NKVAVKNAKAAALAHREEESLVVPTKRRRVTAEMEGKYIIHMPKGTTPRTRKILEQQQ-----AA 222

  Fly   152 SSSKKSDIFTRLHAEDEAQEGIVSSSGESSVRVHVAGASKTAATSSNNSVFARLGGKQGALPE-- 214
            ....::.:|.||.||.:|.                     |...:....||:||    ||.||  
  Rat   223 KGLHRTSVFDRLGAETKAD---------------------TTTGTKPTGVFSRL----GATPEMD 262

  Fly   215 -----------EFTSTREIKSILKNTHRTVGGGGGGGGVTKNSP----IVKAKKVAAVTQQKVML 264
                       ..:|..:...:||..         |.|.||.||    .||||..::.|      
  Rat   263 EELAWDSDNDSSSSSVLQYAGVLKKL---------GRGPTKASPQPALTVKAKATSSAT------ 312

  Fly   265 VHKVPLKKGDDDDDSMDDFESDEEDYMSSGEDFDMGAGEKIVKFASTAEVREI-VPESAYKNRQG 328
                                                      ..|||.::|.: :|......::.
  Rat   313 ------------------------------------------TLASTPKLRRLALPSRPGPEKKP 335

  Fly   329 KSFAN-NIKSRLG---MVSKLHATKKTYNLKASPARKPQARLSPVKGKTIRMRSDELFSRQD--- 386
            :|... :|..|||   :||:...::.|.....|......|....:.|......|:.|.::.|   
  Rat   336 ESLPKVSILKRLGKAAVVSEAQDSQVTSTKSKSSTEVKFAIKRTLVGPRGSSSSESLGAQMDHAG 400

  Fly   387 TVPVHRRLGK 396
            ||.|.:|||:
  Rat   401 TVSVFKRLGR 410

Known Domains:


Indicated by green bases in alignment.

Gene        Sequence        Domain                                             Region       External ID  Identity
CG16812     NP_609619.1     SAM_CS047                                          6..70        CDD:188930   37/68 (54%)
                            DUF5577                                            <109..>215   CDD:407613   32/141 (23%)
C1h19orf47  NP_001103134.1  SAM_4                                              27..110      CDD:407856   38/76 (50%)
                            DUF5577                                            119..409     CDD:407613   75/376 (20%)
                            Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  135..156                  3/20 (15%)
                            Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  289..316                  11/74 (15%)
