DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG4069 and Hcfc1

DIOPT Version :10

Sequence 1:NP_648590.1 Gene:CG4069 / 39437 FlyBaseID:FBgn0036301 Length:509 Species:Drosophila melanogaster
Sequence 2:XP_006229655.1 Gene:Hcfc1 / 363519 RGDID:1563804 Length:2091 Species:Rattus norvegicus


Alignment Length:491 Identity:106/491 - (21%)
Similarity:173/491 - (35%) Gaps:145/491 - (29%)


- Green bases have known domain annotations that are detailed below.


  Fly    32 MLEKLGEANIADIIQLLEAKEGKIEAISESVCPPPTPRSNFTLVCHPEKEELIMFGGELYTGTKT 96
            |...:..||:..:  ||:.:..::...|.   |.|.||.....|.  .||.:::|||    |.:.
  Rat     1 MASAVSPANLPAV--LLQPRWKRVVGWSG---PVPRPRHGHRAVA--IKELIVVFGG----GNEG 54

  Fly    97 TVYNDLFFYNTKTVEWRQLKSPSGPTPRSGHQMVAVASNGGELWMFGGEHASPSQLQFHHY-KDL 160
            .| ::|..|||.|.:| .:.:..|..| .|........:|..|.:|||      .:::..| .||
  Rat    55 IV-DELHVYNTATNQW-FIPAVRGDIP-PGCAAYGFVCDGTRLLVFGG------MVEYGKYSNDL 110

  Fly   161 WKFALKSRKWERIAA---PNG--PSPRSGHRMTVSKKRLFIFGGFHDN-----NQSYHYFNDVHI 215
            ::......:|:|:.|   .||  |.||.||..::...:.::|||..::     |....|.||::|
  Rat   111 YELQASRWEWKRLKAKTPKNGPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNNIPRYLNDLYI 175

  Fly   216 FSLESYQWLKA---EIGGAIVPSPRSGCCIAASPE-----GKIYVWGGYSRAAMKKEADRGVTHT 272
            ..|.....:.|   .|...::|.||.........|     .|:.::||.|          |....
  Rat   176 LELRPGSGVVAWDIPITYGVLPPPRESHTAVVYTEKDNKKSKLVIYGGMS----------GCRLG 230

  Fly   273 DMFVLSQDKNAGDADNKYKWAPVKPGGYKPKPRSSVGCTVAANGKAYTFGGVMDVDEDDEDV--- 334
            |::.|..:        ...|......|..|.|||....|...| |.|.|||.:.:..||..|   
  Rat   231 DLWTLDIE--------TLTWNKPSLSGVAPLPRSLHSATTIGN-KMYVFGGWVPLVMDDVKVATH 286

  Fly   335 --HGQFGDDLLAFDLTSQTW--------------------------------------------- 352
              ..:..:.|...:|.:..|                                             
  Rat   287 EKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRARAGHCAVAINTRLYIWSGRDGYRKAWNNQV 351

  Fly   353 -----------------RLQEIQTKSSSAEKK----DSEESKDVEMSAVDKPVTTTTDGIFTVTV 396
                             |:|.::..::|.|..    .:.:|..:::...|.|.|..|        
  Rat   352 CCKDLWYLETEKPPPPARVQLVRANTNSLEVSWGAVATADSYLLQLQKYDIPATAAT-------- 408

  Fly   397 GGPSTSTTPFVSKIPSLFAKPKPTNVP---SPRMNP 429
               :||.||  :.:||:.|.|..:..|   :|.:.|
  Rat   409 ---ATSPTP--NPVPSVPANPPKSPAPAAAAPAVQP 439

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG4069NP_648590.1 NanM 64..353 CDD:442289 80/374 (21%)
KELCH repeat 69..120 CDD:276965 15/50 (30%)
KELCH repeat 124..179 CDD:276965 14/60 (23%)
KELCH repeat 305..353 CDD:276965 15/114 (13%)
Kelch_5 423..463 CDD:433528 3/10 (30%)
KELCH repeat 426..470 CDD:276965 1/4 (25%)
Hcfc1XP_006229655.1 NanM 25..344 CDD:442289 81/355 (23%)
KELCH repeat 33..80 CDD:276965 17/55 (31%)
KELCH repeat 84..134 CDD:276965 13/55 (24%)
KELCH repeat 137..196 CDD:276965 14/58 (24%)
KELCH repeat 200..253 CDD:276965 11/70 (16%)
KELCH repeat 255..319 CDD:276965 15/64 (23%)
KELCH repeat 322..367 CDD:276965 0/44 (0%)
COG4625 444..>950 CDD:443664
PRK10263 <1473..>1647 CDD:236669
FN3 <1912..1941 CDD:238020
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.