| Sequence 1: | NP_001260032.1 | Gene: | dpy / 318824 | FlyBaseID: | FBgn0053196 | Length: | 22949 | Species: | Drosophila melanogaster |
|---|---|---|---|---|---|---|---|---|---|
| Sequence 2: | XP_031757776.1 | Gene: | muc4 / 100485162 | XenbaseID: | XB-GENE-852599 | Length: | 5393 | Species: | Xenopus tropicalis |
| Alignment Length: | 7333 | Identity: | 1549/7333 - (21%) |
|---|---|---|---|
| Similarity: | 2157/7333 - (29%) | Gaps: | 3037/7333 - (41%) |
- Green bases have known domain annotations that are detailed below.
|
Fly 2133 ASNQCPSGHMCI----GNQCNLPC------TKTASCAVGERCYQQVCRKVCYTSNNCLAGE---- 2183
Fly 2184 ----------ICNSDRTCQPG-CDSDADCPPTELCLTGKCKCATGFIGTPFG--CSDIDECTEQP 2235
Fly 2236 CHASARCENLPGTYRCVCPEGTVGDGYSQPGCSQPRQCHKPDDCANNLACIHGKCTDPCLHTVCG 2300
Fly 2301 INANCQSEGHEALCSCPAGFLGDPNDTGVGCFKVECIDHVDCAGDRACDAETNRCIKPCDLTSCG 2365
Fly 2366 KGNCQVRDHKATCACYEGYQLVNDVC-EDINECLSQPCHSTAFCNNLPGSYSCQCPEGLIGDPLQ 2429
Fly 2430 AGCRDP-NECLSDADCPASASCQNSRCRSPCERQNACGLNANCQAQAHQAICTCPLNSRGDPTIE 2493
Fly 2494 CVHIECADNDDCSGEKACLDSKCIDPCSLPNACGALARCSVQNHIGVCSCEAGSTGDAKLGCVQL 2558
Fly 2559 QYCQQDGQCAQGSICSHGI--CSPLCSTNRDCISEQLCLQGVCQGTCKSNSSCPQFQFCSNNICT 2621
Fly 2622 KELECRSDSECGEDETCLSDAYGRAKCESVCLGRAACGRNAECVARSHAPDCLCKEGFFGDAKSG 2686
Fly 2687 CRKIECTSDDDCSNDKSCDNHMCKIACLIGQPCGENALCTTEHHQQVCHCQPGFSGDPRVRCDVI 2751
Fly 2752 DFCRDA-------PCGPGARCRNA----------------RGSYKCTCPPGLVGDPY-----NEG 2788
Fly 2789 CRSSVECETNEDCPPHAACTKTNG-VAKCRDVCAQ---LQCGPNAECVPKGHVAQCACRSGYDGQ 2849
Fly 2850 P-----ADRVAGCKPLPSPCQVTGDCPTNTYCSDSVCKPACVLDTECGAFEVC-----QGGQ--- 2901
Fly 2902 ----------------------------CFNPCLQPQACGQNAECVMQNHLKQCHCPEGFTGDSA 2938
Fly 2939 KECVRVPVACDGECGPGYTCRDSMCLPVCHNDLECASN----EKCLKGSCMLTCRVDNDCFLGHV 2999
Fly 3000 CLHNKCVYGCHVDDDCSASESCRNDKCVNPCLENPC------------GPNAACSVSNHRASCSC 3052
Fly 3053 L----ESMVPNPTPQVGCVRSPPLECRENRDCGNGLACFESVCRPLCADDAGCLTNERCQQGVCK 3113
Fly 3114 PLCRHDNECGHGELCLGLNCVPGCRSDQGCPPELSCVGQQCVDP-----CADPTACGTNAHCQTI 3173
Fly 3174 DHRKQCLCPEG------LDGNANVACKVP-RIA-CGRNEDCQ-SNQLCYAGSCQGKCR-----ND 3224
Fly 3225 QNCLADERCMRGTCRTVCNTDEACAQGQICENRMCQTGCRTD----LSCATDE--------ACVN 3277
Fly 3278 KKCQNPCRTPGQCG----QCADCLVVNH-------GVQCQCPAAFMGDGLTGCQLPPERC----- 3326
Fly 3327 --------------------------------HPDCECDENGAYCAPKCSRTEDCACGQ------ 3353
Fly 3354 --QCARGKCRNKCGP------------KRQCTVG----------------------QLCERGACI 3382
Fly 3383 ------------------------------AGCKSNGDCAADQSCVNGKCSDP-CANEKACGRNA 3416
Fly 3417 LCTVSEHRMLCYCPDGYEGEPSKECVQFECRVDTDCDSNKRCDQG-KCRNPCLEYGACGTNAQCR 3480
Fly 3481 VVGRKAQCSCPPDFFGNPTSECRPLEGGCSSKPCGENSKCTEVPGGYECACMDGCIGDAHQGCLC 3545
Fly 3546 GGPLVNACRD---QPCGLNAACHVLENNQAECYCPEDFPNGDAYVQCYLTTPKQDCRTLGC-EVG 3606
Fly 3607 GCVRQGYEYVCQQDTEQCYSDTDCPSEKSCLQGH----CSD--PCTMRGVCGLNALCKTVLHRPR 3665
Fly 3666 CSCPSCHIGRPEIECKSDPKCVAEDTDPKTKEQIPCSTDSECPETLQCGQYGQCT--DPCNNPLF 3728
Fly 3729 ICESNKKCETRRHQPVCICKSGFIVNEYGELTCAPDKRECYRDDDCASNMACSDGKCRNPCIVPL 3793
Fly 3794 GRAAICAENKSCEVQNHKPVCICMRDCQPSISICLRDAGCPASQACRKLKCVDPCEFATCAPNSP 3858
Fly 3859 CIVEDHKPICK--------FCPAGFIADAK-----NGCQKAKPGGNCTSNTDCSQAHQCGSSGKC 3910
Fly 3911 IDPCLTSCAG-GVKC---------VVSAHRVTICTCPATLTNNTDSNCTSTDITVGTTTQRIETT 3965
Fly 3966 TDFINVKYTVMQLANQTEMRTRFTDIEAENETTGPYTTTTESYKTTKQLSSNPETETPTTLPSRP 4030
Fly 4031 TTRPFTDQTTEFTSEIPTITPMEGSTPTPSHLETTVASITSESTTREVYTIKPFDRSTPTPVSPD 4095
Fly 4096 TTVPSITFETTTNIPIGTTRGQVTEQTTSSPSEKRTTIRVEESTLPSRSTDRTTPSESPETPTIL 4160
Fly 4161 PSDSTTRTYSDQTTESTRDVPTTRPFEASTPSPASLETTVPSVTLETTTNDPIGSTGGQVTEQTT 4225
Fly 4226 SSPSEVRTTIGLEESTLPSRSTDRTTPSESPETPTTLPSDFITRPHSDQTTESTRDVPTTRPFEA 4290
Fly 4291 STPSSASLETTVPSVTLETTTNVPIGSTGGQVTEQTTSSPSEVRTTIRVEESTLPSRSADRTTPS 4355
Fly 4356 ESPETPTTLPSDFTTRPHSEQTTESTRDVPTTRPFEASTPSPASLETTVPSVTLETTTNVPIGST 4420
Fly 4421 GGQVTGQTTSSPSEVRTTIRVEESTLPSRSADRTTPSESPETPTTLPSDFITRPHSEKTTESTRD 4485
Fly 4486 VPTTRPFEASTPSSASLETTVPSVTLETTTNVPIGSTGGQVTEQTTSSPSEVRTTIRVEESTLPS 4550
Fly 4551 RSADRTTLSESPETPTTLPSDFTIRPHSEQTTESTRDVPTTRPFEASTPSPASLETTVPSV---- 4611
Fly 4612 TSETTTNVPIGSTGGQVTGQTTAPPSEFRTTIRVEESTLPSRSTDRTTPSESPETPTILPSDSTT 4676
Fly 4677 RTYSDQTTESTRDVPTTRPFEASTPSPASLETTVPSVTLETTTNVPIGSTGGQVTEQTTSSPSEV 4741
Fly 4742 RTTIRVEESTLPSRSADRTTPSESPETPTTLPSDFITRPHSEKTTESTRDVPTTRPFEASTPSSA 4806
Fly 4807 SLETTVPSVTLETTTNVPIGSTGGQVTEQTTSSPSEVRTTIRVEESTLPSRSADRTTPSESPETP 4871
Fly 4872 TTLPSDFITRPHSEKTTESTRDVPTTRPFEASTPSSASLETTVPSVTLETTTNVPIGSTGGQVTE 4936
Fly 4937 QTTSSPSEVRTTIRVEESTLPSRSTDRTTPSESPETPTTLPSDFTTRPHSEQTTESTRDVPTTRP 5001
Fly 5002 FEASTPSPASLETTVPSVTLETTTNVPIGSTGGQVTEQTTSSPSEVRTTIRVEESTLPSRSADRT 5066
Fly 5067 TPSESPETPTTLPSDFITRTYSDQTTESTRDVPTTRPFEASTPSPASLETTVPSVTSETTTNVPI 5131
Fly 5132 GSTGGQVTGQTTAPPSEFRTTIRVEESTLPSRSTDRTTPSESPETPTTLPSDFTTRPHSDQTTES 5196
Fly 5197 TRDVPTTRPFEASTPSPASLETTVPSVTLETTTNVPIGSTGGQVTEQTTSSPSEVRTTIRVEEST 5261
Fly 5262 LPSRSADRTTPSESPETPTLPSDFTTRPHSEQTTESTRDVPATRPFEASTPSPASLETTVPSVTS 5326
Fly 5327 EATTNVPIGSTGGQVTEQTTSSPSEVRTTIRVEESTLPSRSTDRTSPSESPETPTTLPSDFTTRP 5391
Fly 5392 HSDQTTECTRDVPTTRPFEASTPSSASLETTVPSVTLETTTNVPIGSTGGQVTEQTTSSPSEVRT 5456
Fly 5457 TIRVEESTLPSRSADRTTPSESPETPTLPSDFTTRPHSEQTTESTRDVPTTRPFEASTPSSASLE 5521
Fly 5522 TTVPSVTLETTTNVPIGSTGGQVTEQTTSSPSEFRTTIRVEESTLPSRSADRTTPSESPETPTLP 5586
Fly 5587 SDFTTRPHSEQTTESTRDVPTTRPFEASTPSPASLETTVPSVTSETTTNVPIGSTGGQVTGQTTA 5651
Fly 5652 PPSEVRTTIRVEESTLPSRSTDRTTPSESPETPTILPSDSTTRTYSDQTTESTRDVPTTRPFEAS 5716
Fly 5717 TPSPASLETTVPSVTLETTTNVPIGSTGGQVTGQTTATPSEVRTTIGVEESTLPSRSTDRTSPSE 5781
Fly 5782 SPETPTTLPSDFTTRPHSDQTTESTRDVPTTRPFEASTPSPASLETTVPSVTSETTTNVPIGSTG 5846
Fly 5847 GQVTEQTTSSPSEVRTTIGLEESTLPSRSTDRTSPSESPETPTTLPSDFITRPHSDQTTESTRDV 5911
Fly 5912 PTTRPFEASTPSPASLETTVPSVTSETTTNVPIGSTGGQVTGQTTAPPSEVRTTIGVEESTLPSR 5976
Fly 5977 STDRTSPSESPETPTTLPSDFITRPHSEQTTESTRDVPTTRPFEASTPSPASLKTTVPSVTSEAT 6041
Fly 6042 TNVPIGSTGQRIGTTPSESPETPTTLPSDFTTRPHSEKTTESTRDVPTTRPFETSTPSPASLETT 6106
Fly 6107 VPSVTLETTTNVPIGSTGGQVTEQTTSSPSEVRTTIRVEESTLPSRSADRTTPSESPETPTLPSD 6171
Fly 6172 FTTRPHSEQTTESTRDVPTTRPFEASTPSPASLETTVPSVTSETTTNVPIGSTGGQVTGQTTAPP 6236
Fly 6237 SEVRTTIGVEESTLPSRSTDRTSPSESPETPTTLPSDFITRPHSEQTTESTRDVPTTRPFEASTP 6301
Fly 6302 SPASLKTTVPSVTSEATTNVPIGSTGGQVTEQTTSSPSEVRTTIRVEESTLPSRSTDRTTPSESP 6366
Fly 6367 ETPTTLPSDFTTRPHSEKTTESTRDVPTTRPFETSTPSPASLETTVPSVTLETTTSVPMGSTGGQ 6431
Fly 6432 VTGQTTAPPSEVRTTIRVEESTLPSRSTDRTSPSESPETPTTLPSDFITRPHSEKTTESTRDVPT 6496
Fly 6497 TRPFEASTPSSASSGNNCSISYFRNHYKCSNRFNRSADRTTPSESPETPTLPSDFTTRPHSEQTT 6561
Fly 6562 ESTRDVPTTRPFEASTPSPASLETTVPSVTSETTTNVPIGSTGGQVTGQTTAPPSEVRTTIRVEE 6626
Fly 6627 STLPSRSTDRTTPSESPETPTILPSDFTTRPHSDQTTESTRDVPTTRPFEASTPRPVTLETAVPS 6691
Fly 6692 VTLETTTNV-PIGSTGGQVTGQTTATPSEVRTTIRVEESTLPSRSTDRTTPSESPETPTTLPSDF 6755
Fly 6756 TTRPHSDQTTESTRDVPTTRP--FEASTPSPASLETTVPSVTSETTTNVPIGSTGGQVTEQTTSS 6818
Fly 6819 PSEVRTTIGLEESTLPSRSTDRTSPSESPETPTTLPSDFITRPHSDQTTESTRDVPTTRPFEAST 6883
Fly 6884 PSPASLETTVPSVTSETTTNVPIGSTGGQVTEQTTSSPSEVRTTIGLEESTLPSRSTDRTSPSES 6948
Fly 6949 PETPTTLPSDFITRPHSDQTTESTRDVPTTRPFEASTPSSASLETTVPSVTLETTTNVPIGSTGG 7013
Fly 7014 QVTEQTTSSPSEVRTTIRVEESTLPSRSTDRTTPSESPETPTTLPSDFTTRPHSDQTTESSRDVP 7078
Fly 7079 TTQPFEASTPRPVTLQTAVLPVTSETTTNVPIGSTGGQVTEQTTSSPSEVRTTIRVEESTLPSRS 7143
Fly 7144 TDRTTPSESPETPTTLPSDFTTRPHSDQTTESSRDVPTTQPFESSTPRPVTLETAVPPVTSETTT 7208
Fly 7209 NVPIGSTGGQVTEQTTPSPSEVRTTIRIEESTFPSRSTDRTTPSESPETPTTLPSDFTTRPHSDQ 7273
Fly 7274 TTESTRDVPTTRPFESSTPRPVTLEIAVPPVTSETTTNVAIGSTGGQVTEQTTSSPSEVRTTIRV 7338
Fly 7339 EESTLPSRSTDRTTPSESPETPTTLPSDFTTRPHSDQTTESTRDVPTTRPFEASTPSPASLETTV 7403
Fly 7404 PSVTL--ETTTSVPMGSTGG--QVTGQTTAPPSEVRTTIRVEESTLPSRSTDRTPPSESPETPTT 7464
Fly 7465 LPSDFTTRPHSDQTTESSRDVPTTQPFESSTPRPVTLEIAVPPVTSETTTNVPIGSTGGQVTGQT 7529
Fly 7530 TATPSEVRTTIGVEESTLPSRSTDRTTPSESPETPTTLPSDFTTRPHSDQTTESTRDVPTTRPFE 7594
Fly 7595 ASTPSPASLETTVPSVTLETTTNVPIGSTGGQVTGQTTATPSEVRTTIGVEESTLPSRSTDRTTP 7659
Fly 7660 SESPETPTTLPSDFTTRPHSDQTTESTRDVPTTRPFEASTPRPVTLETAVPSVTSETTTNVPIGS 7724
Fly 7725 TVTSE-TTTNVPIGSTGGQVAGQTTAPPSEVRTTIRVEESTLPSRSADRTTPSESPETPTTLPSD 7788
Fly 7789 FTTRPHSEQTTESTRDVPTTRPFEASTPSPASLETTVPSVTSETTTNVPIGSTGGQLTEQSTSSP 7853
Fly 7854 SEVRTTIRVEESTLPSRSTDRTFPSESPEKPTTLPSDFTTRPHLEQTTESTRDVLTTRPFETSTP 7918
Fly 7919 SPVSLETTVPSVTSETSTNVPIGSTGGQVTEQTTAPPSVRTT--ETIVKSTHPAVSPDTTIPSEI 7981
Fly 7982 PATRVPLESTTRLYTDQTIPPGSTDRTTSSERPDESTRLTSEESTETTRPVPTVSPRDALETTVT 8046
Fly 8047 SLITETTKTTSG-----------GTPRGQVTERTTKSVSELTTG------RSSDVV----TERTM 8090
Fly 8091 PSNISSTTT-------VFN-NSEPVSDNLPTTIS-ITVTDSPTT-----VPVPTCKTDYDCLDEQ 8141
Fly 8142 TCIGGQCISPCEY-FTNLCTVQNLTICRTLNHTTKCYCDTDDDVNRPDCSMKAEIGC-------- 8197
Fly 8198 -----ASSDECPSQQACINALCVDPCTFNNPCSRNED---CRVF-------NHQPLCSAEHGRTP 8247
Fly 8248 GCEHCPPGANCDPTTGA--------CIKANVTITTITTKNSTSTKIPTKPRTTANPNTGVKTTPT 8304
Fly 8305 TTRVTTRNTTTTTTTTTTSSTSTE-----SSTITSATNQTS-------------KNQKPD----- 8346
Fly 8347 --TESTT---SHTDATRRYRDGENNITDTPTPRPTIQTTTLRGE--GVMG--DSQRRSTTTPKMK 8402
Fly 8403 TTRLDTSNEVPDTTSPWPIELPTTEGTTTEVYNTMFAPVVNTTDTSLI-NPCTVDTNCAPNEHCK 8466
Fly 8467 LGHCRKKEPPGSPKTPEPCQSNNDCIESEACYMGLCQDPCEFAKICAATAKCTAKSHRPVCTC-P 8530
Fly 8531 QGHEGNPMVKCVTTQT-----------------------------SIECTDDSDCGVTEACINQL 8566
Fly 8567 CQH----------PCDVH--DPC---ATNAVCINSNHAADCSCADGFQGNGFVGCQPAR---SHV 8613
Fly 8614 CQYNEDCPPTKLC---------DRLNRRCINPCQ-EDSCGENAECIPVNHGTECRCLPGFLGNAY 8668
Fly 8669 VQCLPSQGCRSDSECDSSQACINGKCSSPCQCGAYALCDVVNHRGVCKCPPGYNGNPKVGCSPPQ 8733
Fly 8734 DPCDPNPCGLNALCELDNGN--PICYCPKGLTGNPFKNCIPEGD--------ECTP--------N 8780
Fly 8781 PCGPNSGCRRVGGNPVCF----CLPEYEGQPPSIPCELPSNPCDPSPC--GPNTQCSV------- 8832
Fly 8833 --------LSNGFSKCTCLPNYVESPN-TIRGCVEPINPCDPNPCGTG-----------AICDSS 8877
Fly 8878 RHPVCYCPDNKIGNPFRLCDKPAVTIELCQPGPCGRNAECYVAGNREECYCRS 8930 |
| Gene | Sequence | Domain | Region | External ID | Identity |
|---|---|---|---|---|---|
| dpy | NP_001260032.1 | EGF_3 | 137..166 | CDD:463759 | |
| EGF_CA | 212..247 | CDD:238011 | |||
| EGF_CA | 255..>286 | CDD:214542 | |||
| EGF_CA | 298..331 | CDD:238011 | |||
| EGF_CA | 338..373 | CDD:238011 | |||
| EGF_CA | 413..456 | CDD:238011 | |||
| EGF_CA | 457..490 | CDD:238011 | |||
| EGF_CA | 497..>529 | CDD:214542 | |||
| EGF_CA | 580..>612 | CDD:214542 | |||
| EGF_3 | 676..702 | CDD:463759 | |||
| EGF_CA | 1022..1056 | CDD:214542 | |||
| EGF_CA | 2227..2260 | CDD:238011 | 10/32 (31%) | ||
| EGF_CA | 2393..>2422 | CDD:214542 | 13/28 (46%) | ||
| Herpes_BLLF1 | <4032..4489 | CDD:282904 | 92/456 (20%) | ||
| Herpes_BLLF1 | <4334..4797 | CDD:282904 | 109/466 (23%) | ||
| PHA03255 | 4845..>5021 | CDD:165513 | 38/175 (22%) | ||
| DUF5585 | 4959..5356 | CDD:465521 | 90/396 (23%) | ||
| PHA03247 | <5170..5799 | CDD:223021 | 133/628 (21%) | ||
| PRK12495 | <5749..5877 | CDD:183558 | 29/127 (23%) | ||
| PHA03247 | <5865..6485 | CDD:223021 | 129/619 (21%) | ||
| Chi1 | 6385..>6609 | CDD:442692 | 35/223 (16%) | ||
| Herpes_BLLF1 | <6607..7102 | CDD:282904 | 115/497 (23%) | ||
| Herpes_BLLF1 | <7027..7490 | CDD:282904 | 113/466 (24%) | ||
| DUF5585 | 7550..7954 | CDD:465521 | 98/404 (24%) | ||
| DUF5585 | 7881..>8130 | CDD:465521 | 60/285 (21%) | ||
| Atrophin-1 | 18652..>19081 | CDD:460830 | |||
| PHA03247 | <18873..19514 | CDD:223021 | |||
| ZP | 22576..22811 | CDD:214579 | |||
| muc4 | XP_031757776.1 | Herpes_BLLF1 | <53..520 | CDD:282904 | 16/79 (20%) |
| EGF_CA | 555..586 | CDD:214542 | 13/36 (36%) | ||
| EGF_CA | 598..631 | CDD:238011 | 14/71 (20%) | ||
| EGF_CA | 634..668 | CDD:238011 | 9/39 (23%) | ||
| EGF_CA | 671..707 | CDD:238011 | 15/39 (38%) | ||
| EGF_CA | 710..745 | CDD:238011 | 14/108 (13%) | ||
| EGF_CA | 748..782 | CDD:238011 | 9/58 (16%) | ||
| EGF_3 | 789..824 | CDD:463759 | 11/86 (13%) | ||
| NIDO | 907..1071 | CDD:470642 | 33/206 (16%) | ||
| AMOP | 1083..1224 | CDD:470667 | 30/178 (17%) | ||
| VWD | 1232..1406 | CDD:214566 | 35/188 (19%) | ||
| EGF_CA | 1686..1727 | CDD:214542 | 12/55 (22%) | ||
| EGF_CA | 1728..1763 | CDD:238011 | 11/38 (29%) | ||
| EGF_CA | 1771..1807 | CDD:238011 | 8/52 (15%) | ||
| EGF_CA | 1808..1845 | CDD:238011 | 8/78 (10%) | ||
| EGF_3 | 1850..1885 | CDD:463759 | 9/44 (20%) | ||
| EGF_CA | 1887..1921 | CDD:214542 | 12/64 (19%) | ||
| EGF_CA | 1928..1968 | CDD:214542 | 13/44 (30%) | ||
| EGF_3 | 1973..2009 | CDD:463759 | 11/41 (27%) | ||
| NIDO | 4258..4412 | CDD:470642 | 35/184 (19%) | ||
| AMOP | 4421..4538 | CDD:470667 | 28/150 (19%) | ||
| VWD | 4556..4721 | CDD:459671 | 37/207 (18%) | ||
| EGF_3 | 4984..5015 | CDD:463759 | 10/31 (32%) | ||
| EGF_CA | 5017..5056 | CDD:214542 | 15/56 (27%) | ||
| Blue background indicates that the domain is not in the aligned region. | |||||