DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG8170 and Hgfac

DIOPT Version :10

Sequence 1:NP_610441.2 Gene:CG8170 / 35908 FlyBaseID:FBgn0033365 Length:855 Species:Drosophila melanogaster
Sequence 2:XP_006504094.1 Gene:Hgfac / 54426 MGIID:1859281 Length:658 Species:Mus musculus


Alignment Length:643 Identity:164/643 - (25%)
Similarity:226/643 - (35%) Gaps:213/643 - (33%)


- Green bases have known domain annotations that are detailed below.


  Fly   375 GP-ITALPQQRPQRGFYFGDTEFRTGPPAPVRQFGPQKNFQEYVGPSEYQGTRKSRYYPYKSSRS 438
            || :||.|                ..|..||.. |......|....:|.:|.:..||.|..||..
Mouse    45 GPNVTATP----------------VTPTIPVIS-GNVSTSTESAPAAETEGPQSERYPPPSSSSP 92

  Fly   439 P--RVV--------FP-----------------------TNDNVG-----------TTGPSGPA- 458
            |  :|:        ||                       |..|..           |....||| 
Mouse    93 PGGQVLTESGQPCRFPFRYGGRMLHSCTSEGSAYRKWCATTHNYDRDRAWGYCAEVTLPVEGPAI 157

  Fly   459 ---GSSGPSGNG-----------VYFSDNIAFRDQNFGINELAAVQDVRNDY-SLQD-LDSASEA 507
               .:|||..||           .:.|..:||..::.|..:  ...:.|.:| .:.| ....||.
Mouse   158 LDPCASGPCLNGGTCSSTHDHGSYHCSCPLAFTGKDCGTEK--CFDETRYEYFEVGDHWARVSEG 220

  Fly   508 TSSPQSASTFKEKVDITTDTECQH----RGGTCEFFLG------------------------CWL 544
            ..........:.:.:.|..|.|..    .||||...:|                        |:|
Mouse   221 HVEQCGCMEGQARCEDTHHTACLSSPCLNGGTCHLIVGTGTSVCTCPLGYAGRFCNIVPTEHCFL 285

  Fly   545 SGGLIQGTCDGLLRGCCHRTAKSANLGSS---------------DFVGNAVDLTDL--------P 586
            ..|.       ..||    .|.:|..|.|               |.|..|| |..|        |
Mouse   286 GNGT-------EYRG----VASTAASGLSCLAWNSDLLYQELHVDSVAAAV-LLGLGPHAYCRNP 338

  Fly   587 QKNYGP-------------------------VNNE----------------PSCGISLAKQTAQR 610
            .|:..|                         |:::                |:||....|:|..|
Mouse   339 DKDERPWCYVVKDNALSWEYCRLTACESLARVHSQTPEILAALPESAPAVRPTCGKRHKKRTFLR 403

  Fly   611 -RIVGGDDAGFGSFPWQAYIRIGSSRCGGSLISRRHVVTAGHCVARATPR-QVHVTLGDYVINSA 673
             ||:||..:..||.||.|.|.||:|.|.|||:....||:|.||.|.:.|| .:.|.||.:..|..
Mouse   404 PRIIGGSSSLPGSHPWLAAIYIGNSFCAGSLVHTCWVVSAAHCFANSPPRDSITVVLGQHFFNRT 468

  Fly   674 VEPLPAYTFGVRR-IDVHPYFKFTPQADRFDISVLTL----ERTVHFMPHIAPICLPEKNEDF-L 732
            .:  ...|||:.: :....|..|.|  :..|:.::.|    ||.......:.||||||....| .
Mouse   469 TD--VTQTFGIEKYVPYTLYSVFNP--NNHDLVLIRLKKKGERCAVRSQFVQPICLPEAGSSFPT 529

  Fly   733 GKFGWAAGWGALNPGSRLRPKTLQAVDVPVIENRICERW------HRQNGINVV---IYQEMLCA 788
            |.....||||.::       :...:.||....|.:.|..      |:.:...|.   |...||||
Mouse   530 GHKCQIAGWGHMD-------EMQSSTDVSSYSNSLLEALVPLVADHKCSSPEVYGADISPNMLCA 587

  Fly   789 GYRNGGKDSCQGDSGGPLMHDKNGRWYLIGVVSAGYSCASRGQPGIYHSVSKTVDWVS 846
            ||.:...|:|||||||||:.:|||..||.|::|.|..|....:||:|..|:..|||::
Mouse   588 GYFDCKSDACQGDSGGPLVCEKNGVAYLYGIISWGDGCGRLNKPGVYTRVANYVDWIN 645

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG8170NP_610441.2 Tryp_SPc 612..846 CDD:238113 89/249 (36%)
HgfacXP_006504094.1 FN2 99..145 CDD:128373 4/45 (9%)
EGF_CA 159..194 CDD:238011 8/34 (24%)
FN1 197..237 CDD:238018 5/41 (12%)
EGF_CA 242..275 CDD:238011 6/32 (19%)
Kringle 283..364 CDD:395005 18/92 (20%)
Tryp_SPc 406..646 CDD:238113 89/251 (35%)

Return to query results.
Submit another query.