DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Hml and MUC5B

DIOPT Version :10

Sequence 1:NP_524060.2 Gene:Hml / 39529 FlyBaseID:FBgn0029167 Length:3843 Species:Drosophila melanogaster
Sequence 2:NP_002449.2 Gene:MUC5B / 727897 HGNCID:7516 Length:5762 Species:Homo sapiens


Alignment Length:1906 Identity:503/1906 - (26%)
Similarity:763/1906 - (40%) Gaps:404/1906 - (21%)


- Green bases have known domain annotations that are detailed below.


  Fly   484 LCTTWGGINMKTFDGLVFKAPLSCSHTLITDKVSGT---FDIILKACPYGSGYGCAHTLKILWQS 545
            :|:|||..:.|||||.||:.|..|:: :.::.....   |::.|:....||.   ....:::.::
Human    76 VCSTWGDFHYKTFDGDVFRFPGLCNY-VFSEHCRAAYEDFNVQLRRGLVGSR---PVVTRVVIKA 136

  Fly   546 VLYTFENLNGTMQLTTPIKKLPMPVQVMGMKVMPVAQHVQIDLESVGLKLDWDHRQYVSVQAGPQ 610
            .....|..||::.:....::|  |....|:.|.....::::.:..| |...|:......::..|:
Human   137 QGLVLEASNGSVLINGQREEL--PYSRTGLLVEQSGDYIKVSIRLV-LTFLWNGEDSALLELDPK 198

  Fly   611 MWGKVGGLCGTLDGDPNTDLTSRTGKKLATVKAFADAWRVEDRSELCQVENSAEMEFGMDSCEQS 675
            ...:..||||..:|.|..:.......:|..:: |.:..:::..:|.|    ...:.....:|...
Human   199 YANQTCGLCGDFNGLPAFNEFYAHNARLTPLQ-FGNLQKLDGPTEQC----PDPLPLPAGNCTDE 258

  Fly   676 KLQKAVSVCERLLANEKLGDCIKPFNYDALIRTCMADYCNCANREHPESCNCDAIAMLAKECAFK 740
            :     .:|.|.|......:|....:..|.:..|..|.|.|      .:|.|......:::||..
Human   259 E-----GICHRTLLGPAFAECHALVDSTAYLAACAQDLCRC------PTCPCATFVEYSRQCAHA 312

  Fly   741 GIKLEHGWRNLEICPISCGFGRVYQACGPNVEPTCDSDLALPASKGACNEGCFCPEGTVQ---YK 802
            |.: ...||..|:||.:|.....:|.||.....||.:.......:..|.:|||||.|||.   ..
Human   313 GGQ-PRNWRCPELCPRTCPLNMQHQECGSPCTDTCSNPQRAQLCEDHCVDGCFCPPGTVLDDITH 376

  Fly   803 EACITRELCPCSLRGKEFKPESTVKKNCNTCTCKNGQWRCTEDKCGARCGAVGDPHYQTFDGKRY 867
            ..|:....|||:..|:.:.|.::....|::|||..|.|:|.:..|...|...|..|..|:|.|.|
Human   377 SGCLPLGQCPCTHGGRTYSPGTSFNTTCSSCTCSGGLWQCQDLPCPGTCSVQGGAHISTYDEKLY 441

  Fly   868 DFMGKCSYHLLK---TQNTSVEAENVACSGAVSESMNFAAPDDPSCTKAVTIRFILRDGTPSVIK 929
            |..|.|||.|.|   ..:.:|.||...|          ...|:.:|.||||:..   ||..:.|:
Human   442 DLHGDCSYVLSKKCADSSFTVLAELRKC----------GLTDNENCLKAVTLSL---DGGDTAIR 493

  Fly   930 L--DQG--LTTIVNDKPIAKLPKMLGLGEVLIRRASSTFLTVEFADGIR--VWWDGVSRVYIDAP 988
            :  |.|  |.:|....|       |....:.:...||.|:.|:...|::  |....:.:|::...
Human   494 VQADGGVFLNSIYTQLP-------LSAANITLFTPSSFFIVVQTGLGLQLLVQLVPLMQVFVRLD 551

  Fly   989 PSLRGQTQGLCGTFNSNTQDDFLTPEGDVETAVEPFADKWRTKDTCQFKAETHQGPHPCTLNPEK 1053
            |:.:||..||||.||.|..|||....|.||.....||:.|:.:..|.....:.:  .||:|:.|.
Human   552 PAHQGQMCGLCGNFNQNQADDFTALSGVVEATGAAFANTWKAQAACANARNSFE--DPCSLSVEN 614

  Fly  1054 KAQAEKFCDWIL--QDIFQDCHFLVEPEQFYEDCLYDTCACKDEMSKCFCPILSAYGTECMRQGV 1116
            :..|..:|..:.  ...|..||.::.|:.|:.:|::|||.| :....|.|..||:|...|..:||
Human   615 ENYARHWCSRLTDPNSAFSRCHSIINPKPFHSNCMFDTCNC-ERSEDCLCAALSSYVHACAAKGV 678

  Fly  1117 K-TGWRMSVKECA---VKCPLGQVFDECGDGCALSCDDLPSKG-SCKREC--VEGCRCPHGEYVN 1174
            : :.||..|  |.   ..||..|.:....|.|..:|..|.... :|....  |:||.||.|.::|
Human   679 QLSDWRDGV--CTKYMQNCPKSQRYAYVVDACQPTCRGLSEADVTCSVSFVPVDGCTCPAGTFLN 741

  Fly  1175 EDGECVPKKMCHCNFDGMSFRPGYKEVRPGEKFLDLCTCTDGVWDCQDAEPGDKDKYPPSSELRS 1239
            :.|.|||.:.|.|...|....||  ||...|.  .:|:||.|...|..|          |.:..:
Human   742 DAGACVPAQECPCYAHGTVLAPG--EVVHDEG--AVCSCTGGKLSCLGA----------SLQKST 792

  Fly  1240 KC-AKQPYAEFTKCAPKEP-----KTCKNMDKYVADSSDCLPGCVCMEGYVYDTSRLACVLPANC 1298
            .| |...|.:.:..:...|     ::|..:| ....|:.|:.||||..|.|.|.|. .|:...:|
Human   793 GCAAPMVYLDCSNSSAGTPGAECLRSCHTLD-VGCFSTHCVSGCVCPPGLVSDGSG-GCIAEEDC 855

  Fly  1299 SCHHAGKSYDDGEKIKEDCNLCECRAGNWKCSKNGCESTCSVWGDSHFTTFDGHDFDFQGACDYV 1363
            .|.|...:|..||.|:.|||.|.||...|:||...|..||..:||.||.||||..:.|:|:|:|:
Human   856 PCVHNEATYKPGETIRVDCNTCTCRNRRWECSHRLCLGTCVAYGDGHFITFDGDRYSFEGSCEYI 920

  Fly  1364 LAKGVFDNGDG-----FSITIQNVLCGTMGVTCSKSLEIALTGHAEESLLLSADS--AYSTDPNK 1421
            ||:...  ||.     |.|..:|:.|||.|.||||::::.:..:   .|:|...:  |.:..|..
Human   921 LAQDYC--GDNTTHGTFRIVTENIPCGTTGTTCSKAIKLFVESY---ELILQEGTFKAVARGPGG 980

  Fly  1422 TPIKKLRDSVNSKGHNAFHIYKAGVFVVVEVIPLKLQVKWDEGTRVYVKLGNEWRQKVSGLCGNY 1486
            .|..|:|              ..|:|:|:|.  ..:.|.||..|.|:::|..:::.:|.|||||:
Human   981 DPPYKIR--------------YMGIFLVIET--HGMAVSWDRKTSVFIRLHQDYKGRVCGLCGNF 1029

  Fly  1487 NGNSLDDMQTPSMGLETSPMLFGHAWKLQPHCSAPVAPIDACKKHPERETWAQLKCGALKSDLFK 1551
            :.|:::|..|.|..:....:.||::|||.|.|...:||.|.|..:|.|::|||.:|..|....|.
Human  1030 DDNAINDFATRSRSVVGDALEFGNSWKLSPSCPDALAPKDPCTANPFRKSWAQKQCSILHGPTFA 1094

  Fly  1552 ECHAEVPLERFWKRCIFDTCACDQGGDCECLCTAVAAYADACAQKGINIRWRSQHFCPMQCD--- 1613
            .|.::|...::::.|:.|.||||.||||||.||||||||.||...|:.:.||:...||:.||   
Human  1095 ACRSQVDSTKYYEACVNDACACDSGGDCECFCTAVAAYAQACHDAGLCVSWRTPDTCPLFCDFYN 1159

  Fly  1614 PH--CS-DYKACTPACAVETCDNFLDQGIAERMCNRENC------LEGCHIKPCEDGFIYLNDTY 1669
            ||  |. .|:.|...| ::||.|  ..|         :|      ||||:.| |.....:.|:..
Human  1160 PHGGCEWHYQPCGAPC-LKTCRN--PSG---------HCLVDLPGLEGCYPK-CPPSQPFFNEDQ 1211

  Fly  1670 RDCVPKAECKPVCMVRDGKTFYEG-DITFTDSCATCRCSKRKEICSGVKC--------------- 1718
            ..||  |:|.  |..:||..:..| .:...::|.:|.|:.     ||::|               
Human  1212 MKCV--AQCG--CYDKDGNYYDVGARVPTAENCQSCNCTP-----SGIQCAHSLEACTCTYEDRT 1267

  Fly  1719 ----DV--PATTGLPAPLV----------------EGTTLPTPL--------------ATQNQTK 1747
                ||  ..|.||.|.|:                .||...||.              |....|.
Human  1268 YSYQDVIYNTTDGLGACLIAICGSNGTIIRKAVACPGTPATTPFTFTTAWVPHSTTSPALPVSTV 1332

  Fly  1748 CVKGWTRWCDKDRDTSDKSVRLNDEEKVP-----RYDRMENV----YGTCLKQYMTKVECRVKDT 1803
            ||:...||          |...|.....|     .::..||:    |..|  ..:..:|||....
Human  1333 CVREVCRW----------SSWYNGHRPEPGLGGGDFETFENLRQRGYQVC--PVLADIECRAAQL 1385

  Fly  1804 HEAP-EQMDENVVCSLEEGLRCIGK------CHDYELRAFCQCD-------------------EE 1842
            .:.| |::.:.|.|....||.|...      |||||||..| |:                   ..
Human  1386 PDMPLEELGQQVDCDRMRGLMCANSQQSPPLCHDYELRVLC-CEYVPCGPSPAPGTSPQPSLSAS 1449

  Fly  1843 LEPELPKP-----TEKPQLGL---------------ACDAAVVEYKEFPGDCHKFLHCQPKGVEG 1887
            .||.:|.|     |||..|.:               :....|......||.    ..|||:....
Human  1450 TEPAVPTPTQTTATEKTTLWVTPSIRSTAALTSQTGSSSGPVTVTPSAPGT----TTCQPRCQWT 1510

  Fly  1888 GWI---Y---------VE-----KTCGEYMMFNPTMLICDHIATVTEIKPN-----CGLKPEPEP 1930
            .|.   |         ||     :..|.::...|..:.|.     .|..||     .|.|...:.
Human  1511 EWFDEDYPKSEQLGGDVESYDKIRAAGGHLCQQPKDIECQ-----AESFPNWTLAQVGQKVHCDV 1570

  Fly  1931 EFEPIKQCPPGKIKSECANQCEN--------TCHYYGSILKKRGLCQVGEHCK-----PGCVDEL 1982
            .|..:               |.|        .|:.|    :.|.||...:||:     |....||
Human  1571 HFGLV---------------CRNWEQEGVFKMCYNY----RIRVLCCSDDHCRGRATTPPPTTEL 1616

  Fly  1983 RPDCPKLGK--FWRDEDTCVHADECPCMDKAEHYVQPHKPVLGEF----EVCQCIDNAFTCVPNK 2041
            ........:  |...:.|     ..|.:.:|........|.|.|.    .....:..|.|..|..
Human  1617 ETATTTTTQALFSTPQPT-----SSPGLTRAPPASTTAVPTLSEGLTSPRYTSTLGTATTGGPTT 1676

  Fly  2042 P----EP-VPKDEDDDLDLVSVV-----------PIYPVTLTPPLQ-------------CSPERL 2077
            |    || ||......|...|.:           |..|.||.|...             .|.|.|
Human  1677 PAGSTEPTVPGVATSTLPTRSALPGTTGSLGTWRPSQPPTLAPTTMATSRARPTGTASTASKEPL 1741

  Fly  2078 IPKIENPAHSLPDSIFNASSQLA-PEHGPKMARLTKE---------QPRGSWSPSINDQMQYLEL 2132
            ...:   |.:|...:..:.::.: |.....|:.||..         ||:..|:       ::.::
Human  1742 TTSL---APTLTSELSTSQAETSTPRTETTMSPLTNTTTSQGTTRCQPKCEWT-------EWFDV 1796

  Fly  2133 NFAKPEPFYGVVMAGSPEFDN 2153
            :|    |..||.......|:|
Human  1797 DF----PTSGVAGGDMETFEN 1813

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
HmlNP_524060.2 VWD 485..636 CDD:459671 34/153 (22%)
C8 684..754 CDD:462584 17/69 (25%)
TIL 758..811 CDD:410995 16/55 (29%)
VWD 840..1015 CDD:214566 56/183 (31%)
C8 1054..1121 CDD:214843 20/69 (29%)
TIL 1131..1185 CDD:410995 19/56 (34%)
TIL 1245..1298 CDD:460351 15/57 (26%)
VWD 1327..1498 CDD:214566 59/177 (33%)
C8 1535..1609 CDD:214843 33/73 (45%)
TIL 1938..2005 CDD:473303 14/81 (17%)
FA58C 2089..2223 CDD:238014 13/75 (17%)
FA58C <2299..2403 CDD:238014
VWD 2703..2858 CDD:459671
TIL 2974..3030 CDD:460351
VWD 3035..3198 CDD:459671
C8 3248..3313 CDD:462584
VWC 3397..3451 CDD:450195
GHB_like <3755..3813 CDD:473907
MUC5BNP_002449.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3481..3561
Mucin2_WxxW 3577..3665 CDD:463846
11 X approximate tandem repeats, Ser/Thr-rich 3676..4013
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3699..3779
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3813..3917
Chi1 3878..>4080 CDD:442692
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3956..4118
Mucin2_WxxW 4134..4222 CDD:463846
23 X approximate tandem repeats, Ser/Thr-rich 4233..4879
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4259..4389
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4428..4447
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4458..4527
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4541..4750
VWD 5064..5237 CDD:214566
C8 5290..5351 CDD:462584
TIL 5359..5414 CDD:410995
VWC 5523..5586 CDD:214564
CT 5660..5742 CDD:214482
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 27..50
VWD 70..223 CDD:214566 34/153 (22%)
C8 261..325 CDD:462584 17/70 (24%)
TIL 329..385 CDD:460351 16/55 (29%)
VWC_out 387..>432 CDD:214565 13/44 (30%)
VWD 425..579 CDD:459671 53/173 (31%)
C8 615..685 CDD:214843 21/70 (30%)
TIL 695..752 CDD:410995 19/56 (34%)
TIL 794..855 CDD:460351 17/62 (27%)
VWD 884..1041 CDD:214566 59/177 (33%)
C8 1078..1152 CDD:214843 33/73 (45%)
7 X Cys-rich subdomain repeats 1333..4228 109/541 (20%)
Mucin2_WxxW 1340..1426 CDD:463846 25/97 (26%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1437..1462 4/24 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1480..1502 3/25 (12%)
Mucin2_WxxW 1509..1597 CDD:463846 18/111 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1607..1783 34/183 (19%)
DUF5585 1630..>1767 CDD:465521 27/144 (19%)
Mucin2_WxxW 1790..1878 CDD:463846 7/35 (20%)
11 X approximate tandem repeats, Ser/Thr-rich 1890..2199
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1890..2019
Herpes_BLLF1 <1928..2313 CDD:282904
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2031..2100
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2114..2211
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2242..2302
Mucin2_WxxW 2320..2408 CDD:463846
11 X approximate tandem repeats, Ser/Thr-rich 2419..2756
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2443..2462
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2473..2522
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2556..2861
Chi1 2644..>2868 CDD:442692
Mucin2_WxxW 2877..2965 CDD:463846
17 X approximate tandem repeats, Ser/Thr-rich 2976..3456
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3001..3049
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3256..3357
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3371..3469
Chi1 3375..>3573 CDD:442692
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.