| Sequence 1: | NP_524060.2 | Gene: | Hml / 39529 | FlyBaseID: | FBgn0029167 | Length: | 3843 | Species: | Drosophila melanogaster |
|---|---|---|---|---|---|---|---|---|---|
| Sequence 2: | NP_083077.2 | Gene: | Muc5b / 74180 | MGIID: | 1921430 | Length: | 4800 | Species: | Mus musculus |
| Alignment Length: | 5126 | Identity: | 1005/5126 - (19%) |
|---|---|---|---|
| Similarity: | 1520/5126 - (29%) | Gaps: | 2116/5126 - (41%) |
- Green bases have known domain annotations that are detailed below.
|
Fly 453 SSEETVVTMSSYSLY------QSNNLDIVIDKTPRPAL-----------CTTWGGINMKTFDGLV 500
Fly 501 FKAPLSCSHTLITDKVSGT---FDIILKACPYGSGYGCAHTLKILWQSVLYTFENLNGTMQLTTP 562
Fly 563 IKKLPMPVQVMGMKVMPVAQHVQIDLESVGLKLDWDHRQYVSVQAGPQMWGKVGGLCGTLDGDPN 627
Fly 628 TDLTSRTGKKLATVKAFADAWRVEDRSELCQ------VENSAEMEFGMDSCEQSKLQKAVSVCER 686
Fly 687 LLANEKLGDCIKPFNYDALIRTCMADYCNCANREHPESCNCDAIAMLAKECAFKGIKLEHGWRNL 751
Fly 752 EICPISCGFGRVYQACGPNVEPTCDSDLALPASKGACNEGCFCPEGTV---QYKEACITRELCPC 813
Fly 814 SLRGKEFKPESTVKKNCNTCTCKNGQWRCTEDKCGARCGAVGDPHYQTFDGKRYDFMGKCSYHLL 878
Fly 879 KTQNTSVEAENVACSGA----VSESMNFAAPDDPSCTKAVTIRFILRDGTPSVIKLDQGLTTIVN 939
Fly 940 DKPIAKLPKMLGLGEVLIRRASSTFLTVEFADG--IRVWWDGVSRVYIDAPPSLRGQTQGLCGTF 1002
Fly 1003 NSNTQDDFLTPEGDVETAVEPFADKWRTKDTCQFKAETHQGPHPCTLNPEKKAQAEKFCDWILQD 1067
Fly 1068 --IFQDCHFLVEPEQFYEDCLYDTCACKDEMSKCFCPILSAYGTECMRQGV-KTGWRMSVKEC-- 1127
Fly 1128 -AVKCPLGQVFDECGDGCALSCDDLPSKG---SCKRECVEGCRCPHGEYVNEDGECVPKKMCHCN 1188
Fly 1189 FDGMSFRPGYKEVRPGEKFLD---LCTCTDGVWDCQDAEPGDKDKYPPSSELRSKC-AKQPYAEF 1249
Fly 1250 ---------TKCAPKEPKTCKNMDKYVADSSDCLPGCVCMEGYVYDTSRLACVLPANCSCHHAGK 1305
Fly 1306 SYDDGEKIKEDCNLCECRAGNWKCSKNGCESTCSVWGDSHFTTFDGHDFDFQGACDYVLA----K 1366
Fly 1367 GVFDNGDGFSITIQNVLCGTMGVTCSKSLEIALTGHAEESLLLSAD-SAYSTDPNKTPIKKLRDS 1430
Fly 1431 VNSKGHNAFHIYKAGVFVVVEVIPLKLQVKWDEGTRVYVKLGNEWRQKVSGLCGNYNGNSLDDMQ 1495
Fly 1496 TPSMGLETSPMLFGHAWKLQPHC-SAPVAPIDACKKHPERETWAQLKCGALKSDLFKECHAEVPL 1559
Fly 1560 ERFWKRCIFDTCACDQGGDCECLCTAVAAYADACAQKGINIRWRSQHFCPMQCD---PH--CS-D 1618
Fly 1619 YKACTPACAVETCDNFLDQGIAERMCNRENC------LEGCHIKPCEDGFIYLNDTYRDC----- 1672
Fly 1673 --------------VPKAE------CKP---VCMVR--------DGKTFYEGDITF--TDSCATC 1704
Fly 1705 ---------RCSKRKEICSGVKCDVPATTGLPAPLVEGTTLPTPLATQN-----QTKCVK---GW 1752
Fly 1753 TRWCDKDRDTSDKSVRLNDEEKVPRYDRMENV----YGTCLKQYMTKVECR-VKDTHEAPEQMDE 1812
Fly 1813 NVVCSLEEGLRCIGK------CHDYELRAFCQCD------------------------------- 1840
Fly 1841 -------------EELEPELPK-------------------------------PT---------- 1851
Fly 1852 ----------------------------------------------------------------- 1851
Fly 1852 ------------EKPQ----------------LG--LACD--AAVVEYKEFPGD----CHKFL-- 1878
Fly 1879 -------HCQPKGVEGGWIYVEKTC---------------GEYMMFNP----------------- 1904
Fly 1905 ---------------TMLICDH-----------IATVTEIKPNCGLKPE---------------- 1927
Fly 1928 -------------PEPEFEP---------------------IKQCPPGKIKSE------------ 1946
Fly 1947 ---------CANQCENT----CHYYGSI-----------LKKRG--LCQVGEHCKPGCVDELRP- 1984
Fly 1985 -DCPKLGKFWRDEDTCVHADECPCMDKAEHYVQPHKPVLGEFEVC-------------QC----- 2030
Fly 2031 ------IDNAFTCVP----NKPEP--------VPKDEDDDLDL------------VSVVPIYPVT 2065
Fly 2066 LT-------PPLQC--SPERLIPKIENPAHSLPDSIFNASSQL---------------------- 2099
Fly 2100 ---APEHG--------------------------------PKMARL------------------- 2110
Fly 2111 -----------TKEQPRGSWSPSINDQMQYLELNFAKPEPFYGVVMAGSPEFDNYVTLFKILHSH 2164
Fly 2165 DGI------------------------------AYHYLV------------------------DE 2175
Fly 2176 TEKPQMFNG-PLDSRAPVQTLF----KIPIEASSLRIYPLKW----------------------- 2212
Fly 2213 ------HGS--------------------------------------IAMRVELLICGDKEEP-- 2231
Fly 2232 ----KPVPTVSTILPITE----------------------------------RPARLV------- 2251
Fly 2252 ------------------------------------------------DLECIDL----MGVDE- 2263
Fly 2264 GKMYQDQVQSSSLWQQPNLGKKLQ-----LLELL--------KLSTPLAWRPLANSQNEFIEFDF 2315
Fly 2316 LEPRNISGFVTKGGPDGW----------------------------------------------- 2333
Fly 2334 ----VTGYKVMFSKKKPTWNTVLSTD----GQARIFEANHDAETERRHHFKNPILTQYIK----- 2385
Fly 2386 -----IVPAYWEKNINMRIEPLG-----------------------CF----------------- 2405
Fly 2406 --------------------------LP-----------------YPEIQRQVPV---------- 2417
Fly 2418 ------------------EE--------------------SKP----------TKCNICDGVSTS 2434
Fly 2435 S-----------STTGCQCQDQLFWDGNTCVQHNLCP----CIENYVSYPIGSKFENSACEDCVC 2484
Fly 2485 VLGGHKNCKPKKCPPCLGGKLRPVITSDC------FCKCEPCPKHQRLCP--------------- 2528
Fly 2529 ------------------SSGDCIPEILW-------------------------CNGVQDCADDE 2550
Fly 2551 DASCS---------DSF--TVEPDV-------SREKNET---------EVITCPVPVC----PPQ 2584
Fly 2585 MKIRITE----------------------------KKSRKMSKMFTF-SKQVSIVDDGTT-ITKT 2619
Fly 2620 KF-----------------------------------------ISSKEQILAMPNRELDFQLEE- 2642
Fly 2643 ---------------------------------QCDEFTCVPIPSKQV----------------- 2657
Fly 2658 ----DKNETVTCTEPKCPEKYDVELDMSASKV--GDCLRYSCVLRPNKDDVCE------------ 2704
Fly 2705 -------------------------ISGKSF---------------------TTFDGTVFKYG-- 2721
Fly 2722 ------------PCSHILARDIHSSSW--SISVHQQCSDETRKVCHKV-------------ITIQ 2759
Fly 2760 DTE------------AG--NELILLPHLKLKFNGYEFTVQQLINS--PICKASFVVSQPGKTLLA 2808
Fly 2809 VSTKYGFWVQLDD---------------------------------------IGIVKVG-ISSKF 2833
Fly 2834 IRTVDGL-CGYYNGNQKDDKR-------------------------------------------- 2853
Fly 2854 -------SPDGQIIPNTE-----KFGDS----------WYDKRIPK------------------- 2877
Fly 2878 ---DQCGDLKC--------PREMQAKALQLCNIIHHPTFARCHKAVNYK---QFLNNY-----CL 2923
Fly 2924 EAACNC-----------------------------------------------------MMANNG 2935
Fly 2936 DPAACKCNILES------FVKKCLSVNPLVQLT------------TWRAVAQCEINCPSP----- 2977
Fly 2978 ---------LVH-------------------TDCYKRRCEPSCDNVHGDDCPVLPDACFPGC--Y 3012
Fly 3013 CPEGTVRKG---PNC----------------------------VPIS-----------ECKDCVC 3035
Fly 3036 NSLGASKYMTYDRKSFSFNGNCTYLLSRDVVLPGVHTFQVYVSMDDCKKLGQPTPVEGGSCAKSL 3100
Fly 3101 HI-LNGDHVIHVQRVPQKPKSLQVLVDGFEVKKIPYKDSWISLRQVVGKELV---------LSLP 3155
Fly 3156 ESHVELTASFEDLIFSLGVPSIKYGSKMEGLCGDCNGNAGNDL-QPNPAKKKAGVDVIQSWQADE 3219
Fly 3220 PKLGLV-----EECLSEDVPKEHCIPLPP-----EKDPC-----LQFYNAELFGKCPLAVDPIAY 3269
Fly 3270 VSACQQDICKPGNTQQGVCVALAAYAKECNQHGICTNWRRPQ--LCPYECPSDMVYEPCGCAKNC 3332
Fly 3333 DTIKALSEFDAVSLKNEAVVHTVKTDEMCLSSER---FEGCFCPPGKVMDGGQ---CVPEIACTK 3391
Fly 3392 CDDGLHLPDEKWKKDKCTECQCD-SKGKTTCVEKKCQVEEN--ICAE-GYRPETIVSVDE-CCPR 3451
Fly 3452 YRCV-PETKDPSKLCLAPLVPICGPGQFKKEKKDVNGCSQYICECIPKDQCEIIELRELLPGEII 3515
Fly 3516 VNVEEGCCPTQKIECKPETCPKAPVNCQERFYEVKTIKEPGMCCSKHSCVPPKDLCIVQYELDEA 3580
Fly 3581 TKFTKTVGDKWTHAKEVCKQETCSYGPDGNAQVVSTLEQCLTDCAPGFSYQNLDKTKCCGKCVQT 3645
Fly 3646 SCIF-EQKLYEVNALWKSA--DNCTTYSCLKKDGQFLVTTSREVCPDVGSCLSHLLYQDGCCKRC 3707
Fly 3708 KSEPLVEDKSSC-LPVSLAESRTKEILKFPVQGHGTCVNADPIQGFTDCEGACSSGSKYNTLTDM 3771
Fly 3772 HEKFCTCCSIKSYHPISVKMICDDG----HTFTQKHEVPSNCGCSP-CSEFSDSAI 3822 |
| Gene | Sequence | Domain | Region | External ID | Identity |
|---|---|---|---|---|---|
| Hml | NP_524060.2 | VWD | 485..636 | CDD:459671 | 38/153 (25%) |
| C8 | 684..754 | CDD:462584 | 16/69 (23%) | ||
| TIL | 758..811 | CDD:410995 | 17/55 (31%) | ||
| VWD | 840..1015 | CDD:214566 | 53/180 (29%) | ||
| C8 | 1054..1121 | CDD:214843 | 24/69 (35%) | ||
| TIL | 1131..1185 | CDD:410995 | 16/56 (29%) | ||
| TIL | 1245..1298 | CDD:460351 | 15/61 (25%) | ||
| VWD | 1327..1498 | CDD:214566 | 59/175 (34%) | ||
| C8 | 1535..1609 | CDD:214843 | 35/73 (48%) | ||
| TIL | 1938..2005 | CDD:473303 | 18/106 (17%) | ||
| FA58C | 2089..2223 | CDD:238014 | 34/346 (10%) | ||
| FA58C | <2299..2403 | CDD:238014 | 25/168 (15%) | ||
| VWD | 2703..2858 | CDD:459671 | 48/349 (14%) | ||
| TIL | 2974..3030 | CDD:460351 | 17/132 (13%) | ||
| VWD | 3035..3198 | CDD:459671 | 42/172 (24%) | ||
| C8 | 3248..3313 | CDD:462584 | 21/71 (30%) | ||
| VWC | 3397..3451 | CDD:450195 | 17/58 (29%) | ||
| GHB_like | <3755..3813 | CDD:473907 | 25/61 (41%) | ||
| Muc5b | NP_083077.2 | VWD | 80..227 | CDD:214566 | 38/153 (25%) |
| C8 | 265..329 | CDD:462584 | 16/70 (23%) | ||
| TIL | 333..389 | CDD:410995 | 17/55 (31%) | ||
| VWC_out | 391..>436 | CDD:214565 | 13/44 (30%) | ||
| VWD | 429..583 | CDD:459671 | 50/170 (29%) | ||
| C8 | 619..700 | CDD:214843 | 28/83 (34%) | ||
| TIL | 699..756 | CDD:410995 | 16/56 (29%) | ||
| TIL | 798..859 | CDD:410995 | 17/66 (26%) | ||
| VWD | 888..1047 | CDD:214566 | 59/175 (34%) | ||
| C8 | 1084..1158 | CDD:214843 | 35/73 (48%) | ||
| Mucin2_WxxW | 1346..1432 | CDD:463846 | 27/95 (28%) | ||
| Mucin2_WxxW | 1575..1663 | CDD:463846 | 13/87 (15%) | ||
| PLN02217 | <1687..1796 | CDD:215130 | 6/108 (6%) | ||
| Metaviral_G | <1742..1864 | CDD:462833 | 11/121 (9%) | ||
| Mucin2_WxxW | 1871..1959 | CDD:463846 | 16/102 (16%) | ||
| Herpes_BLLF1 | <1968..2181 | CDD:282904 | 27/212 (13%) | ||
| Mucin2_WxxW | 2185..2273 | CDD:463846 | 10/87 (11%) | ||
| Herpes_BLLF1 | <2282..2495 | CDD:282904 | 25/215 (12%) | ||
| Mucin2_WxxW | 2499..2587 | CDD:463846 | 11/87 (13%) | ||
| Mucin2_WxxW | 2688..2776 | CDD:463846 | 19/101 (19%) | ||
| DUF5585 | 2827..>3082 | CDD:465521 | 33/261 (13%) | ||
| Herpes_BLLF1 | 2909..3626 | CDD:282904 | 101/726 (14%) | ||
| Mucin2_WxxW | 3067..3155 | CDD:463846 | 12/87 (14%) | ||
| Mucin2_WxxW | 3381..3469 | CDD:463846 | 11/88 (13%) | ||
| Mucin2_WxxW | 3625..3713 | CDD:463846 | 16/91 (18%) | ||
| Mucin2_WxxW | 3779..3867 | CDD:463846 | 17/92 (18%) | ||
| VWD | 4111..4283 | CDD:214566 | 47/189 (25%) | ||
| C8 | 4336..4398 | CDD:462584 | 20/62 (32%) | ||
| VWC | 4570..4631 | CDD:214564 | 24/106 (23%) | ||
| CT | 4708..4790 | CDD:214482 | 34/99 (34%) |