DRSC/TRiP Functional Genomics Resources



Protein Alignment Hml and muc5.2

DIOPT Version: 9

Sequence 1: NP_524060.2  Gene: Hml / 39529  FlyBaseID: FBgn0029167  Length: 3843  Species: Drosophila melanogaster
Sequence 2: XP_021326453.1  Gene: muc5.2 / 572172  ZFINID: ZDB-GENE-121108-1  Length: 4086  Species: Danio rerio


Alignment Length: 1369  Identity: 396/1369 (28%)
Similarity: 604/1369 (44%)  Gaps: 211/1369 (15%)
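
The percentages in the summary above follow from the raw counts divided by the alignment length. The sketch below reproduces them in Python, assuming the page truncates rather than rounds (396/1369 is about 28.9%, yet 28% is reported). The helper name `percent_truncated` is illustrative and not part of DIOPT.

```python
# Counts taken from the alignment summary.
ALIGNMENT_LENGTH = 1369
counts = {"identity": 396, "similarity": 604, "gaps": 211}


def percent_truncated(count, total):
    """Return count/total as a whole-number percentage, truncated (assumed behaviour)."""
    return count * 100 // total


percentages = {
    name: percent_truncated(count, ALIGNMENT_LENGTH)
    for name, count in counts.items()
}
print(percentages)  # {'identity': 28, 'similarity': 44, 'gaps': 15}
```

Rounding to the nearest integer would give 29% identity instead, which is why truncation is assumed here.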


- Green bases have known domain annotations that are detailed below.


  Fly   451 WWSSEETVVTMSSYSLYQSNNLDIVIDKT-PRP----ALCTTWGGINMKTFDGLVFKAPLSCSHT 510
            |..:.....||::          :::.|. |:|    .:|:|||..:.|||||..|:.|.:|::.
Zfish    46 WLPTTTVSPTMTT----------VMVTKVEPKPDHQSRICSTWGNFHFKTFDGHFFQLPDTCNYV 100

  Fly   511 L--ITDKVSGTFDIILKACPYGSGYGCAHTLKILWQSVLYTFENLNGTMQLTTPIKKL------- 566
            |  :.|.....|:|.::.                        :.:||::...:.|.||       
Zfish   101 LAVMCDAAISDFNIQMQR------------------------DTVNGSISFGSVIVKLDGTVIKI 141

  Fly   567 ------------PMPVQVMGMKVMPVAQHVQIDLESVGLKLDWDHRQYVSVQAGPQMWGKVGGLC 619
                        .:|:...|:|:  ......|.:...|:.:.|:....::::...:..|...|||
Zfish   142 TDSGIMMDDQVVSVPINQKGIKI--EGSPTSIKISRYGMTIFWEEDNSIAIELAEKYKGLTCGLC 204

  Fly   620 GTLDGDPNTDLTSRTGKKLATVKAFADAWRVEDRSELCQVENSAEMEFGMDSCEQSKLQKAVSVC 684
            |..:||.:.|:.          ::....|::...:|.|:    ..:....|.|:|:.:     ||
Zfish   205 GNYNGDKDDDMP----------ESGPATWKISTPTESCE----DVILPPKDQCDQNTM-----VC 250

  Fly   685 ERLLANEKLGDCIKPFNYDALIRTCMADYCNCANREHPESCNCDAIAMLAKECAFKGIKLEHG-- 747
            ::.|::.....|....:.....:.|::|.|.|..   ...|.|:.:..::::|...|     |  
Zfish   251 QQYLSSPGFSGCYDVMDMKIFEKACVSDMCQCYG---SHDCLCNTLTEISRQCTHAG-----GQP 307

  Fly   748 --WRNLEICPISCGFGRVYQACGPNVEPTCDSDLALPASKGACNEGCFCPEGTVQ---YKEACIT 807
              ||..::||..|.....|..||...:.||....|....|..|.:|||||||||:   .:..|:.
Zfish   308 GTWRTEQLCPKMCPINLQYMECGGPCKSTCSDPTAHLMCKDHCVDGCFCPEGTVEDDIGQGGCVP 372

  Fly   808 RELCPCSLRGKEFKPESTVKKNCNTCTCKNGQWRCTEDKCGARCGAVGDPHYQTFDGKRYDFMGK 872
            ...|||...|..:|...:.::.|..|.|..|.|.||...|...|..||..|..|||||.:.|.|.
Zfish   373 VNECPCVHDGTVYKSGESYQQACKKCFCAAGHWTCTYLDCPGTCSVVGGSHVTTFDGKSFTFSGN 437

  Fly   873 CSYHLLKTQNTS---VEAENVACSGAVSESMNFAAPDDPSCTKAVTIRFILRDGTPSVIKLDQGL 934
            |.|.|.|..|.|   |......|..|.::          :|..:||   ::..||......| |:
Zfish   438 CDYILTKHSNDSDFAVVGNLAKCEPARTD----------TCLNSVT---LVISGTTIGFTSD-GV 488

  Fly   935 TTIVNDKPIAKLPKMLGLGEVLIRRASSTFLTVEFAD-GIRVWWDGVSRVYIDAPPSLRGQTQGL 998
            .|:..:.|. .||.:  :|.|.|.:.|.:|:..:... .:.:....|.::||.|....:|:..||
Zfish   489 VTLNGNSPF-NLPAV--IGPVSIFQPSLSFIIADLNSLRLEIQLAPVMQLYIVASTEEKGKMTGL 550

  Fly   999 CGTFNSNTQDDFLTPEGDVETAVEPFADKW-RTKDTCQFKAETHQGPHPCTLNPEKKAQAEKFCD 1062
            ||.:|....|||.|..|..|.....||:.| :|..:|.....|..  :||:|:.:.:..|:.:|.
Zfish   551 CGNYNDVQSDDFKTDLGITEGTAISFANFWKKTPYSCPDLENTFD--NPCSLSVDTEKLAKDWCS 613

  Fly  1063 WILQD--IFQDCHFLVEPEQFYEDCLYDTCACKDEMSKCFCPILSAYGTECMRQG-VKTGWRMSV 1124
            .:...  .|..||..:.|:.:||.|:||||.|.| :.||.|..:|.|...|..:| :..|| |..
Zfish   614 RLTNQSGAFSACHSEICPKIYYERCVYDTCKCAD-IRKCICAAVSTYAHACAARGIILQGW-MDS 676

  Fly  1125 KECAVKCPLGQVFDECGDGCALSCDDLP-SKGSCKREC--VEGCRCPHGEYVNEDGECVPKKMCH 1186
            ..|..:|...........||.|:|..|. .:.:|:...  |:||.|..|.|:||:|.|||...|.
Zfish   677 DPCESECSENMKHSYGMTGCGLTCRSLSGQENTCQGSFTPVDGCVCSEGTYLNEEGICVPADQCP 741

  Fly  1187 CNFDGMSFRPGYKEVRPGE-KFLD--LCTCTDGVWDCQDAEPGDKDKYPPSSELRSKCAKQPYAE 1248
            |       ..|.:..:|.| ..:|  .|||..|...|.:.|               .|. .|...
Zfish   742 C-------YSGDQVTKPSEVSHVDGLTCTCKLGKLHCSNPE---------------TCV-APMVL 783

  Fly  1249 FTKCAPKEP--------KTCKNMDKYVADSSDCLPGCVCMEGYVYDTSRLACVLPANCSCHHAGK 1305
            | ||:..||        :||:..|.....|:.|:.||:|.:..:.| .:..||....|.|.|.|.
Zfish   784 F-KCSNYEPGKKGTECQRTCQKQDINNCVSTGCVSGCMCPDNLLAD-GQGGCVEREKCPCVHNGV 846

  Fly  1306 SYDDGEKIKEDCNLCECRAGNWKCSKNGCESTCSVWGDSHFTTFDGHDFDFQGACDYVLAKGVFD 1370
            :|..||:::||||.|.||.|.|.|::..|..||:::|:.||.||||..:.|.|.|:.:|... :.
Zfish   847 TYSSGEQVQEDCNTCTCRNGMWDCTEKECYGTCTIYGEGHFMTFDGKKYSFHGDCEQILVHD-YC 910

  Fly  1371 NGD----GFSITIQNVLCGTMGVTCSKSLEIALTGHAEESLLLSADSAYSTDPNKTPIKKLRDSV 1431
            |.|    ...:..:|:.|||.|..||||:.:..   ....|:||.:.....             |
Zfish   911 NTDQSPYSLRLVTENIPCGTSGTICSKSINLFF---GRYKLILSEEKEIQV-------------V 959

  Fly  1432 NSKGHN-AFHIYKAGVFVVVEVIPLKLQVKWDEGTRVYVKLGNEWRQKVSGLCGNYNGNSLDDMQ 1495
            .|.|.: .:.|:.||::.|:||..| |.:.||..|.|.::|..:::.||.|||||::||:.:|..
Zfish   960 ESNGTDYQYQIHTAGIYNVIEVKGL-LNLIWDSKTSVMLQLHPKFKGKVCGLCGNFDGNANNDFM 1023

  Fly  1496 TPSMGLETSPMLFGHAWKLQPHCSAPVAPIDACKKHPERETWAQLKCGALKSDLFKECHAEVPLE 1560
            .......|.|::||::||..|.|......::.|:|:|.|..||..:|..:.|.:|.:||:.|...
Zfish  1024 KHDGEEVTDPVVFGNSWKSNPGCPDVSNVMNPCEKNPHRSAWAIKQCSIITSPVFSDCHSRVDSG 1088

  Fly  1561 RFWKRCIFDTCACDQGGDCECLCTAVAAYADACAQKGINIRWRSQHFCPMQCD------PHCS-D 1618
            .::..|:.||||||.||||:|.||||||||..|.:.|..:.|||...||:.||      ..|. .
Zfish  1089 PYYDACVKDTCACDSGGDCDCFCTAVAAYAAECRKTGACVAWRSPSICPLFCDYYNNPPSECEWH 1153

  Fly  1619 YKACTPACAVETCDNFLDQGIAERMCNRENCLEGCHIKPCEDGFIYLNDTYRDCVPKAECKPVCM 1683
            ||.|...| ::||.|  ..|...   |:...|||| ...|.....:|.:..|.||.:|||..:| 
Zfish  1154 YKTCGSTC-MKTCRN--PSGTCS---NQIPLLEGC-FPQCPSERPFLREDNRKCVTEAECLSLC- 1210

  Fly  1684 VRDGKTFYEGDITF--TD---SCATCRCSKRKEICSGV-KCDVPATTGLPAPLVEGTTLPTPLAT 1742
            ..|||.:..||:.:  ||   :|.|..|....||...: ||   .||..|.........|||.:.
Zfish  1211 TYDGKIYTTGDVMYDTTDGNGTCFTAVCGSNGEIIRSINKC---MTTTTPFTFTTPPQTPTPSSP 1272

  Fly  1743 QNQT 1746
            :..|
Zfish  1273 ETYT 1276

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence        Domain       Region       External ID  Identity
Hml     NP_524060.2     VWD          485..636     CDD:295339   35/171 (20%)
                        C8           680..754     CDD:285899   15/77 (19%)
                        TIL          758..811     CDD:280072   19/55 (35%)
                        VWD          840..1015    CDD:214566   54/178 (30%)
                        C8           1054..1121   CDD:214843   23/69 (33%)
                        TIL          1131..1185   CDD:280072   19/56 (34%)
                        TIL          1245..1298   CDD:280072   17/60 (28%)
                        VWD          1327..1498   CDD:214566   56/175 (32%)
                        C8           1535..1609   CDD:214843   33/73 (45%)
                        Mucin2_WxxW  1751..1837   CDD:290069
                        TIL          1938..2005   CDD:280072
                        FA58C        2089..2223   CDD:238014
                        FA58C        2104..2225   CDD:214572
                        FA58C        <2299..2404  CDD:214572
                        FA58C        <2299..2403  CDD:238014
                        VWD          2703..2858   CDD:295339
                        C8           2893..2970   CDD:285899
                        TIL          2974..3030   CDD:280072
                        VWD          3035..3198   CDD:295339
                        C8           3257..3313   CDD:285899
                        VWC          3397..3451   CDD:302663
                        GHB_like     <3755..3813  CDD:304424
muc5.2  XP_021326453.1  VWD          68..217      CDD:214566   36/184 (20%)
                        C8           249..316     CDD:312319   15/74 (20%)
                        TIL          320..376     CDD:307783   19/55 (35%)
                        VWD          416..564     CDD:306577   49/164 (30%)
                        C8           605..679     CDD:214843   25/75 (33%)
                        TIL          683..740     CDD:307783   19/56 (34%)
                        TIL          795..839     CDD:307783   11/44 (25%)
                        VWC_out      841..>892    CDD:214565   24/50 (48%)
                        VWD          868..1027    CDD:214566   56/176 (32%)
                        C8           1065..1137   CDD:214843   33/71 (46%)
                        TIL          1153..1206   CDD:307783   19/59 (32%)
                        Atrophin-1   <1269..1639  CDD:331285   2/8 (25%)
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       1             1.000           92 1.000 Domainoid score I7571
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         1             1.010           - - D12226at2759
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        1             0.900           - - OOG6_100854
Panther         1             1.100           - - O PTHR11339
Phylome         1             0.910           - -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   1             1.000           - - X2291
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
ZFIN            0             0.000           Not matched by this tool.
Total           6             5.920
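
The closing figure of the table combines a simple-score total of 6 with a weighted-score total of 5.920. The Python sketch below checks that reading by summing the per-tool values listed for the six tools that reported a match. It assumes the totals are plain sums, and the variable names are illustrative rather than anything defined by DIOPT.

```python
from decimal import Decimal

# Weighted scores of the tools that matched this pair (simple score 1 each).
weighted_scores = {
    "Domainoid": "1.000",
    "OrthoDB": "1.010",
    "orthoMCL": "0.900",
    "Panther": "1.100",
    "Phylome": "0.910",
    "SonicParanoid": "1.000",
}

# Each matching tool contributes 1 to the simple score.
simple_total = len(weighted_scores)

# Decimal keeps the three-decimal figures exact, avoiding float drift.
weighted_total = sum(
    (Decimal(score) for score in weighted_scores.values()), Decimal("0")
)

print(simple_total, weighted_total)  # 6 5.920
```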
