DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG8170 and Hgfac

DIOPT Version :10

Sequence 1:NP_610441.2 Gene:CG8170 / 35908 FlyBaseID:FBgn0033365 Length:855 Species:Drosophila melanogaster
Sequence 2:NP_445772.1 Gene:Hgfac / 58947 RGDID:70909 Length:653 Species:Rattus norvegicus


Alignment Length:636 Identity:158/636 - (24%)
Similarity:219/636 - (34%) Gaps:199/636 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly   384 RPQRGFYFGDTEFRTGPPAP---VRQFGP-----QKNFQEYV-----GPSEYQGTRKSRYYPYKS 435
            :||.|      ...|.||.|   ..:..|     ..||...:     |.:|.:|.:..||. ..|
  Rat    31 QPQAG------RNHTEPPGPNVTATRMTPMIPATSGNFSTSIGSDPEGEAETEGPQSERYL-LSS 88

  Fly   436 SRSP---RVV--------FP-----------------------TNDNV-----------GTTGPS 455
            |.||   :|:        ||                       |..|.           .|....
  Rat    89 SSSPLGGQVLTESGQPCRFPFRYGGRMLHSCSSEGSAYRKWCATTHNYDRDRAWGYCAEATLPVE 153

  Fly   456 GPA----GSSGPSGNGVYFSDN-----------IAFRDQNFGINELAAVQDVRNDY-SLQD-LDS 503
            |||    .:|||..||...|..           :||..::.|..:  ...:.|.:| .:.| ...
  Rat   154 GPAVLDPCASGPCLNGGTCSSTQDHVSYHCACPLAFTGKDCGTEK--CFDETRYEYFEVGDHWAR 216

  Fly   504 ASEATSSPQSASTFKEKVDITTDTECQH----RGGTCEFFLG----------------------- 541
            .||......|....:.:.:.|..|.|..    .||||...:|                       
  Rat   217 VSEGHVEQCSCIEGQARCEDTHHTACLSSPCLNGGTCHLIVGTGTSICACPLGYAGRFCNIVPTE 281

  Fly   542 -CWLSG-----GLIQGTCDGLL---------------------------------------RGCC 561
             |:|..     |:...:..||.                                       |..|
  Rat   282 RCFLGNGTEYRGVASTSASGLSCLAWNSDLLYQELHVDSVGAAALLGLGPHAYCRNPDKDERPWC 346

  Fly   562 HRTAKSA------NLGSSDFVGNAVD-----LTDLPQKNYGPVNNEPSCGISLAKQTAQR-RIVG 614
            :....||      .|.:.:.:.....     |..||:..   ....|:||....|:|..| ||:|
  Rat   347 YVVKDSALSWEYCRLAACESLARVHSRIPEVLATLPEST---STARPTCGKRHKKRTFLRPRIIG 408

  Fly   615 GDDAGFGSFPWQAYIRIGSSRCGGSLISRRHVVTAGHCVARATPR-QVHVTLGDYVINS------ 672
            |..:..||.||.|.|.||:..|.|||:....||:|.||.:.:.|| .:.|.||.:..|.      
  Rat   409 GSSSLPGSHPWLAAIYIGNGFCAGSLVHTCWVVSAAHCFSSSPPRDSITVVLGQHFFNRTTDVTQ 473

  Fly   673 --AVEPLPAYTFGVRRIDVHPYFKFTPQADRFDISVLTL----ERTVHFMPHIAPICLPEKNEDF 731
              |:|....||.         |..|.|  :..|:.::.|    ||.......:.||||||....|
  Rat   474 TFAIEKYVPYTL---------YSVFNP--NDHDLVLIRLKKKGERCAVRSQFVQPICLPEAGSSF 527

  Fly   732 -LGKFGWAAGWGALNPGSRLRPKTLQAVDVPVIENRICERWHRQNGINVVIYQEMLCAGYRNGGK 795
             .|.....||||.::........:|....||::.:..|.   ........|...||||||.:...
  Rat   528 PTGHKCQIAGWGHMDENVSGYSNSLLEALVPLVADHKCS---SPEVYGADISPNMLCAGYFDCKS 589

  Fly   796 DSCQGDSGGPLMHDKNGRWYLIGVVSAGYSCASRGQPGIYHSVSKTVDWVS 846
            |:|||||||||:.:|||..||.|::|.|..|....:||:|..||..|||::
  Rat   590 DACQGDSGGPLVCEKNGVAYLYGIISWGDGCGRLNKPGVYTRVSNYVDWIN 640

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG8170NP_610441.2 Tryp_SPc 612..846 CDD:238113 87/247 (35%)
HgfacNP_445772.1 FN2 99..145 CDD:128373 4/45 (9%)
EGF_CA 159..194 CDD:238011 8/34 (24%)
FN1 197..237 CDD:238018 6/41 (15%)
EGF 242..274 CDD:394967 6/31 (19%)
Kringle 283..364 CDD:395005 10/80 (13%)
Tryp_SPc 406..641 CDD:238113 87/249 (35%)

Return to query results.
Submit another query.