DRSC/TRiP Functional Genomics Resources


Protein Alignment Hml and MUC5B

DIOPT Version: 9

Sequence 1:NP_524060.2 Gene:Hml / 39529 FlyBaseID:FBgn0029167 Length:3843 Species:Drosophila melanogaster
Sequence 2:NP_002449.2 Gene:MUC5B / 727897 HGNCID:7516 Length:5762 Species:Homo sapiens


Alignment Length:1906 Identity:503/1906 - (26%)
Similarity:763/1906 - (40%) Gaps:404/1906 - (21%)
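The percentages in the summary above follow directly from the raw counts over the alignment length. A minimal sketch of that arithmetic, assuming the page rounds to the nearest whole percent (the function and variable names here are illustrative, not part of DIOPT):

```python
# Recompute the summary percentages from the counts reported above.
# Assumption: DIOPT rounds each ratio to the nearest whole percent.

def percent(count: int, length: int) -> int:
    """Return count/length as a whole-number percentage."""
    return round(100 * count / length)

ALIGNMENT_LENGTH = 1906
counts = {"identity": 503, "similarity": 763, "gaps": 404}

percentages = {name: percent(n, ALIGNMENT_LENGTH) for name, n in counts.items()}
print(percentages)
```

Note that all three figures use the full alignment length (1906 columns, gaps included) as the denominator, not the length of either protein.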


- Green bases have known domain annotations that are detailed below.


  Fly   484 LCTTWGGINMKTFDGLVFKAPLSCSHTLITDKVSGT---FDIILKACPYGSGYGCAHTLKILWQS 545
            :|:|||..:.|||||.||:.|..|:: :.::.....   |::.|:....||.   ....:::.::
Human    76 VCSTWGDFHYKTFDGDVFRFPGLCNY-VFSEHCRAAYEDFNVQLRRGLVGSR---PVVTRVVIKA 136

  Fly   546 VLYTFENLNGTMQLTTPIKKLPMPVQVMGMKVMPVAQHVQIDLESVGLKLDWDHRQYVSVQAGPQ 610
            .....|..||::.:....::|  |....|:.|.....::::.:..| |...|:......::..|:
Human   137 QGLVLEASNGSVLINGQREEL--PYSRTGLLVEQSGDYIKVSIRLV-LTFLWNGEDSALLELDPK 198

  Fly   611 MWGKVGGLCGTLDGDPNTDLTSRTGKKLATVKAFADAWRVEDRSELCQVENSAEMEFGMDSCEQS 675
            ...:..||||..:|.|..:.......:|..:: |.:..:::..:|.|    ...:.....:|...
Human   199 YANQTCGLCGDFNGLPAFNEFYAHNARLTPLQ-FGNLQKLDGPTEQC----PDPLPLPAGNCTDE 258

  Fly   676 KLQKAVSVCERLLANEKLGDCIKPFNYDALIRTCMADYCNCANREHPESCNCDAIAMLAKECAFK 740
            :     .:|.|.|......:|....:..|.:..|..|.|.|      .:|.|......:::||..
Human   259 E-----GICHRTLLGPAFAECHALVDSTAYLAACAQDLCRC------PTCPCATFVEYSRQCAHA 312

  Fly   741 GIKLEHGWRNLEICPISCGFGRVYQACGPNVEPTCDSDLALPASKGACNEGCFCPEGTVQ---YK 802
            |.: ...||..|:||.:|.....:|.||.....||.:.......:..|.:|||||.|||.   ..
Human   313 GGQ-PRNWRCPELCPRTCPLNMQHQECGSPCTDTCSNPQRAQLCEDHCVDGCFCPPGTVLDDITH 376

  Fly   803 EACITRELCPCSLRGKEFKPESTVKKNCNTCTCKNGQWRCTEDKCGARCGAVGDPHYQTFDGKRY 867
            ..|:....|||:..|:.:.|.::....|::|||..|.|:|.:..|...|...|..|..|:|.|.|
Human   377 SGCLPLGQCPCTHGGRTYSPGTSFNTTCSSCTCSGGLWQCQDLPCPGTCSVQGGAHISTYDEKLY 441

  Fly   868 DFMGKCSYHLLK---TQNTSVEAENVACSGAVSESMNFAAPDDPSCTKAVTIRFILRDGTPSVIK 929
            |..|.|||.|.|   ..:.:|.||...|          ...|:.:|.||||:..   ||..:.|:
Human   442 DLHGDCSYVLSKKCADSSFTVLAELRKC----------GLTDNENCLKAVTLSL---DGGDTAIR 493

  Fly   930 L--DQG--LTTIVNDKPIAKLPKMLGLGEVLIRRASSTFLTVEFADGIR--VWWDGVSRVYIDAP 988
            :  |.|  |.:|....|       |....:.:...||.|:.|:...|::  |....:.:|::...
Human   494 VQADGGVFLNSIYTQLP-------LSAANITLFTPSSFFIVVQTGLGLQLLVQLVPLMQVFVRLD 551

  Fly   989 PSLRGQTQGLCGTFNSNTQDDFLTPEGDVETAVEPFADKWRTKDTCQFKAETHQGPHPCTLNPEK 1053
            |:.:||..||||.||.|..|||....|.||.....||:.|:.:..|.....:.:  .||:|:.|.
Human   552 PAHQGQMCGLCGNFNQNQADDFTALSGVVEATGAAFANTWKAQAACANARNSFE--DPCSLSVEN 614

  Fly  1054 KAQAEKFCDWIL--QDIFQDCHFLVEPEQFYEDCLYDTCACKDEMSKCFCPILSAYGTECMRQGV 1116
            :..|..:|..:.  ...|..||.::.|:.|:.:|::|||.| :....|.|..||:|...|..:||
Human   615 ENYARHWCSRLTDPNSAFSRCHSIINPKPFHSNCMFDTCNC-ERSEDCLCAALSSYVHACAAKGV 678

  Fly  1117 K-TGWRMSVKECA---VKCPLGQVFDECGDGCALSCDDLPSKG-SCKREC--VEGCRCPHGEYVN 1174
            : :.||..|  |.   ..||..|.:....|.|..:|..|.... :|....  |:||.||.|.::|
Human   679 QLSDWRDGV--CTKYMQNCPKSQRYAYVVDACQPTCRGLSEADVTCSVSFVPVDGCTCPAGTFLN 741

  Fly  1175 EDGECVPKKMCHCNFDGMSFRPGYKEVRPGEKFLDLCTCTDGVWDCQDAEPGDKDKYPPSSELRS 1239
            :.|.|||.:.|.|...|....||  ||...|.  .:|:||.|...|..|          |.:..:
Human   742 DAGACVPAQECPCYAHGTVLAPG--EVVHDEG--AVCSCTGGKLSCLGA----------SLQKST 792

  Fly  1240 KC-AKQPYAEFTKCAPKEP-----KTCKNMDKYVADSSDCLPGCVCMEGYVYDTSRLACVLPANC 1298
            .| |...|.:.:..:...|     ::|..:| ....|:.|:.||||..|.|.|.|. .|:...:|
Human   793 GCAAPMVYLDCSNSSAGTPGAECLRSCHTLD-VGCFSTHCVSGCVCPPGLVSDGSG-GCIAEEDC 855

  Fly  1299 SCHHAGKSYDDGEKIKEDCNLCECRAGNWKCSKNGCESTCSVWGDSHFTTFDGHDFDFQGACDYV 1363
            .|.|...:|..||.|:.|||.|.||...|:||...|..||..:||.||.||||..:.|:|:|:|:
Human   856 PCVHNEATYKPGETIRVDCNTCTCRNRRWECSHRLCLGTCVAYGDGHFITFDGDRYSFEGSCEYI 920

  Fly  1364 LAKGVFDNGDG-----FSITIQNVLCGTMGVTCSKSLEIALTGHAEESLLLSADS--AYSTDPNK 1421
            ||:...  ||.     |.|..:|:.|||.|.||||::::.:..:   .|:|...:  |.:..|..
Human   921 LAQDYC--GDNTTHGTFRIVTENIPCGTTGTTCSKAIKLFVESY---ELILQEGTFKAVARGPGG 980

  Fly  1422 TPIKKLRDSVNSKGHNAFHIYKAGVFVVVEVIPLKLQVKWDEGTRVYVKLGNEWRQKVSGLCGNY 1486
            .|..|:|              ..|:|:|:|.  ..:.|.||..|.|:::|..:::.:|.|||||:
Human   981 DPPYKIR--------------YMGIFLVIET--HGMAVSWDRKTSVFIRLHQDYKGRVCGLCGNF 1029

  Fly  1487 NGNSLDDMQTPSMGLETSPMLFGHAWKLQPHCSAPVAPIDACKKHPERETWAQLKCGALKSDLFK 1551
            :.|:::|..|.|..:....:.||::|||.|.|...:||.|.|..:|.|::|||.:|..|....|.
Human  1030 DDNAINDFATRSRSVVGDALEFGNSWKLSPSCPDALAPKDPCTANPFRKSWAQKQCSILHGPTFA 1094

  Fly  1552 ECHAEVPLERFWKRCIFDTCACDQGGDCECLCTAVAAYADACAQKGINIRWRSQHFCPMQCD--- 1613
            .|.::|...::::.|:.|.||||.||||||.||||||||.||...|:.:.||:...||:.||   
Human  1095 ACRSQVDSTKYYEACVNDACACDSGGDCECFCTAVAAYAQACHDAGLCVSWRTPDTCPLFCDFYN 1159

  Fly  1614 PH--CS-DYKACTPACAVETCDNFLDQGIAERMCNRENC------LEGCHIKPCEDGFIYLNDTY 1669
            ||  |. .|:.|...| ::||.|  ..|         :|      ||||:.| |.....:.|:..
Human  1160 PHGGCEWHYQPCGAPC-LKTCRN--PSG---------HCLVDLPGLEGCYPK-CPPSQPFFNEDQ 1211

  Fly  1670 RDCVPKAECKPVCMVRDGKTFYEG-DITFTDSCATCRCSKRKEICSGVKC--------------- 1718
            ..||  |:|.  |..:||..:..| .:...::|.:|.|:.     ||::|               
Human  1212 MKCV--AQCG--CYDKDGNYYDVGARVPTAENCQSCNCTP-----SGIQCAHSLEACTCTYEDRT 1267

  Fly  1719 ----DV--PATTGLPAPLV----------------EGTTLPTPL--------------ATQNQTK 1747
                ||  ..|.||.|.|:                .||...||.              |....|.
Human  1268 YSYQDVIYNTTDGLGACLIAICGSNGTIIRKAVACPGTPATTPFTFTTAWVPHSTTSPALPVSTV 1332

  Fly  1748 CVKGWTRWCDKDRDTSDKSVRLNDEEKVP-----RYDRMENV----YGTCLKQYMTKVECRVKDT 1803
            ||:...||          |...|.....|     .::..||:    |..|  ..:..:|||....
Human  1333 CVREVCRW----------SSWYNGHRPEPGLGGGDFETFENLRQRGYQVC--PVLADIECRAAQL 1385

  Fly  1804 HEAP-EQMDENVVCSLEEGLRCIGK------CHDYELRAFCQCD-------------------EE 1842
            .:.| |::.:.|.|....||.|...      |||||||..| |:                   ..
Human  1386 PDMPLEELGQQVDCDRMRGLMCANSQQSPPLCHDYELRVLC-CEYVPCGPSPAPGTSPQPSLSAS 1449

  Fly  1843 LEPELPKP-----TEKPQLGL---------------ACDAAVVEYKEFPGDCHKFLHCQPKGVEG 1887
            .||.:|.|     |||..|.:               :....|......||.    ..|||:....
Human  1450 TEPAVPTPTQTTATEKTTLWVTPSIRSTAALTSQTGSSSGPVTVTPSAPGT----TTCQPRCQWT 1510

  Fly  1888 GWI---Y---------VE-----KTCGEYMMFNPTMLICDHIATVTEIKPN-----CGLKPEPEP 1930
            .|.   |         ||     :..|.::...|..:.|.     .|..||     .|.|...:.
Human  1511 EWFDEDYPKSEQLGGDVESYDKIRAAGGHLCQQPKDIECQ-----AESFPNWTLAQVGQKVHCDV 1570

  Fly  1931 EFEPIKQCPPGKIKSECANQCEN--------TCHYYGSILKKRGLCQVGEHCK-----PGCVDEL 1982
            .|..:               |.|        .|:.|    :.|.||...:||:     |....||
Human  1571 HFGLV---------------CRNWEQEGVFKMCYNY----RIRVLCCSDDHCRGRATTPPPTTEL 1616

  Fly  1983 RPDCPKLGK--FWRDEDTCVHADECPCMDKAEHYVQPHKPVLGEF----EVCQCIDNAFTCVPNK 2041
            ........:  |...:.|     ..|.:.:|........|.|.|.    .....:..|.|..|..
Human  1617 ETATTTTTQALFSTPQPT-----SSPGLTRAPPASTTAVPTLSEGLTSPRYTSTLGTATTGGPTT 1676

  Fly  2042 P----EP-VPKDEDDDLDLVSVV-----------PIYPVTLTPPLQ-------------CSPERL 2077
            |    || ||......|...|.:           |..|.||.|...             .|.|.|
Human  1677 PAGSTEPTVPGVATSTLPTRSALPGTTGSLGTWRPSQPPTLAPTTMATSRARPTGTASTASKEPL 1741

  Fly  2078 IPKIENPAHSLPDSIFNASSQLA-PEHGPKMARLTKE---------QPRGSWSPSINDQMQYLEL 2132
            ...:   |.:|...:..:.::.: |.....|:.||..         ||:..|:       ::.::
Human  1742 TTSL---APTLTSELSTSQAETSTPRTETTMSPLTNTTTSQGTTRCQPKCEWT-------EWFDV 1796

  Fly  2133 NFAKPEPFYGVVMAGSPEFDN 2153
            :|    |..||.......|:|
Human  1797 DF----PTSGVAGGDMETFEN 1813

Known Domains:


Indicated by green bases in alignment.

Gene Sequence Domain Region External ID Identity
Hml NP_524060.2 VWD 485..636 CDD:295339 34/153 (22%)
C8 680..754 CDD:285899 17/73 (23%)
TIL 758..811 CDD:280072 16/55 (29%)
VWD 840..1015 CDD:214566 56/183 (31%)
C8 1054..1121 CDD:214843 20/69 (29%)
TIL 1131..1185 CDD:280072 19/56 (34%)
TIL 1245..1298 CDD:280072 15/57 (26%)
VWD 1327..1498 CDD:214566 59/177 (33%)
C8 1535..1609 CDD:214843 33/73 (45%)
Mucin2_WxxW 1751..1837 CDD:290069 26/101 (26%)
TIL 1938..2005 CDD:280072 14/81 (17%)
FA58C 2089..2223 CDD:238014 13/75 (17%)
FA58C 2104..2225 CDD:214572 12/59 (20%)
FA58C <2299..2404 CDD:214572
FA58C <2299..2403 CDD:238014
VWD 2703..2858 CDD:295339
C8 2893..2970 CDD:285899
TIL 2974..3030 CDD:280072
VWD 3035..3198 CDD:295339
C8 3257..3313 CDD:285899
VWC 3397..3451 CDD:302663
GHB_like <3755..3813 CDD:304424
MUC5B NP_002449.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 27..50
VWD 70..223 CDD:214566 34/153 (22%)
C8 270..325 CDD:285899 14/61 (23%)
TIL 329..385 CDD:280072 16/55 (29%)
VWC_out 387..>432 CDD:214565 13/44 (30%)
VWD 414..578 CDD:214566 56/183 (31%)
C8 615..685 CDD:214843 21/70 (30%)
TIL 695..752 CDD:280072 19/56 (34%)
TIL 794..855 CDD:280072 17/62 (27%)
VWD 884..1041 CDD:214566 59/177 (33%)
C8 1078..1152 CDD:214843 33/73 (45%)
7 X Cys-rich subdomain repeats 1333..4228 109/541 (20%)
Mucin2_WxxW 1340..1426 CDD:290069 25/97 (26%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1437..1462 4/24 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1480..1502 3/25 (12%)
Mucin2_WxxW 1508..1597 CDD:290069 18/112 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1607..1783 34/183 (19%)
Mucin2_WxxW 1789..1878 CDD:290069 7/36 (19%)
11 X approximate tandem repeats, Ser/Thr-rich 1890..2199
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1890..2019
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2031..2100
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2114..2211
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2242..2302
Mucin2_WxxW 2320..2408 CDD:290069
11 X approximate tandem repeats, Ser/Thr-rich 2419..2756
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2443..2462
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2473..2522
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2556..2861
PARM 2675..>2860 CDD:293666
Mucin2_WxxW 2877..2965 CDD:290069
17 X approximate tandem repeats, Ser/Thr-rich 2976..3456
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3001..3049
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3256..3357
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3371..3469
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3481..3561
Mucin2_WxxW 3577..3665 CDD:290069
11 X approximate tandem repeats, Ser/Thr-rich 3676..4013
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3699..3779
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3813..3917
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3956..4118
Mucin2_WxxW 4134..4222 CDD:290069
23 X approximate tandem repeats, Ser/Thr-rich 4233..4879
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4259..4389
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4428..4447
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4458..4527
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4541..4750
VWD 5064..5237 CDD:214566
C8 5290..5351 CDD:285899
VWC 5523..5586 CDD:214564
CT 5660..5742 CDD:214482
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 0 0.000 Not matched by this tool.
Domainoid 1 1.000 97 1.000 Domainoid score I7243
eggNOG 1 0.900 - - E1_KOG1216
Hieranoid 0 0.000 Not matched by this tool.
Homologene 0 0.000 Not matched by this tool.
Inparanoid 1 1.050 807 1.000 Inparanoid score I494
Isobase 0 0.000 Not matched by this tool.
OMA 0 0.000 Not matched by this tool.
OrthoDB 1 1.010 - - D12226at2759
OrthoFinder 0 0.000 Not matched by this tool.
OrthoInspector 1 1.000 - - otm42152
orthoMCL 1 0.900 - - OOG6_100854
Panther 1 1.100 - - O PTHR11339
Phylome 1 0.910 - -
RoundUp 0 0.000 Not matched by this tool.
SonicParanoid 1 1.000 - - X2291
SwiftOrtho 0 0.000 Not matched by this tool.
TreeFam 0 0.000 Not matched by this tool.
User_Submission 0 0.000 Not matched by this tool.
Total 9 8.870
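The closing row of the table is consistent with a simple-score total of 9 and a weighted-score total of 8.870. A minimal sketch that re-derives both totals from the per-tool rows, assuming the totals are plain sums (the dictionary below is transcribed from the table; tools that did not match contribute zero and are omitted):

```python
from decimal import Decimal

# Weighted scores of the tools that reported a match, copied from the table.
# Decimal is used so the three-decimal values sum exactly.
weighted_scores = {
    "Domainoid": Decimal("1.000"),
    "eggNOG": Decimal("0.900"),
    "Inparanoid": Decimal("1.050"),
    "OrthoDB": Decimal("1.010"),
    "OrthoInspector": Decimal("1.000"),
    "orthoMCL": Decimal("0.900"),
    "Panther": Decimal("1.100"),
    "Phylome": Decimal("0.910"),
    "SonicParanoid": Decimal("1.000"),
}

# Each matching tool has a simple score of 1, so the simple total is a count.
simple_total = len(weighted_scores)
weighted_total = sum(weighted_scores.values(), Decimal("0"))

print(simple_total, weighted_total)
```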
