DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Hml and Muc6

DIOPT Version :10

Sequence 1:NP_524060.2 Gene:Hml / 39529 FlyBaseID:FBgn0029167 Length:3843 Species:Drosophila melanogaster
Sequence 2:NP_001355882.2 Gene:Muc6 / 353328 MGIID:2663233 Length:4931 Species:Mus musculus


Alignment Length:1395 Identity:409/1395 - (29%)
Similarity:609/1395 - (43%) Gaps:213/1395 - (15%)


- Green bases have known domain annotations that are detailed below.


  Fly   432 RQKLLALLAKYRDLERRSEWWSSEETVVTMSSYSLYQSNNLDIVIDKTPRPALCTTWGGINMKTF 496
            ||.||.||.:...::..:  |:.:.|           .::.:..:..:|....|:|||..:..||
Mouse     5 RQLLLLLLFRGPLIDAGA--WTGDVT-----------DSDTEDNLQSSPEKGWCSTWGAGHFSTF 56

  Fly   497 DGLVFKAPLSCSHTLIT---DKVSGTFDIILKACPYGSGYGCAHTLKILWQSVLYTFENLNGTMQ 558
            ||..:.....|::....   |.|..||.|.|:....|:   .:..:..|..||:    .:|....
Mouse    57 DGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN---ISRIIMELGASVV----TVNKETI 114

  Fly   559 LTTPIKKLPMPVQVMGMKVMPVAQHVQIDLESVGLKL--DWDHRQYVSVQAGPQMWGKVGGLCGT 621
            ....|..:.:|....|:::.|..|.||:..:.:.|:|  .|....:::||...:..||:.||||.
Mouse   115 SVRDIGVVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGN 179

  Fly   622 LDGDPNTDLTSRTGKKLATVKAFADAWRVEDRSELCQVENSAEMEFGMDSCEQSKLQKAVSVCER 686
            .||..:.:..|..||.|...| :|...:::|.:|:|..|       .:.|....|.:.| .:|.:
Mouse   180 FDGKIDNEFLSEDGKLLEAHK-YATLQKLDDPNEICAHE-------AIPSTIILKTRYA-QICNQ 235

  Fly   687 LLANEKLGDCIKPFNYDALIRTCMADYCNCANREHPESCNCDAIAMLAKECAFKGIKLEHGWRNL 751
            ||.....| |..|  .:.|:.:|.||...||....| :|:|..::..::.|:..|..:.: ||..
Mouse   236 LLTLVSPG-CDVP--KETLMLSCQADMAACARPGQP-NCSCATLSEYSRRCSMTGQPVRN-WRTP 295

  Fly   752 EICPIS-CGFGRVYQACGPNVEPTCDSDLALP--ASKGACNEGCFCPEGT----VQYKEACITRE 809
            .:||:| |...:|||.||.....||.:    |  :....|..|||||.||    :...::|:...
Mouse   296 ALCPMSQCPANQVYQECGEVCIKTCSN----PQHSCSSPCTFGCFCPHGTLLDDISGNQSCVPVN 356

  Fly   810 LCPCSLRGKEFKPESTVKKNCNTCTCKNGQWRCTEDKCGARCGAVGDPHYQTFDGKRYDFMGKCS 874
            .|||.|.|..:.|....|..|.||.|..|:|.||:..|...|...|.....|||.:.|.|.|.|:
Mouse   357 QCPCMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCT 421

  Fly   875 YHLLKTQNTSVEAENVACSGAVSESMNFAAPDDPSCTKAVTIRFILRDGTPSVIKLDQGLTTIVN 939
            |.||:    |.:..|.....||.:...::..:    |..|.|.::.:. ...||..|:   .|.|
Mouse   422 YTLLQ----SPQLPNEGTLMAVYDKSGYSHSE----TSLVAIMYLSKK-DKIVISEDE---VITN 474

  Fly   940 DKPIAKLPKMLGLGEVLIRRASSTFLTVEFADGIRVWW--DGVSRVYIDAPPSLRGQTQGLCGTF 1002
            :.....||  .....:.|.|.:||.|.:....|:.:.:  ..|.:|||...|..:|||:||||.|
Mouse   475 NGDTKLLP--YKTHNITIFRQTSTHLQMATTFGLELVFQMQPVFQVYITVGPQFKGQTRGLCGNF 537

  Fly  1003 NSNTQDDFLTPEGDVETAVEPFADKWRTKDTCQFKAETHQGPHPCTLNPEKKAQAEKFCDWILQ- 1066
            |.:|.|||.|..|..|.....|.|.||. ..|....|...  .||:::...|..||..|..:|: 
Mouse   538 NGDTTDDFTTSMGIDEGTASLFVDSWRA-GNCPAALEREM--DPCSMSQLNKVCAETHCSMLLKK 599

  Fly  1067 -DIFQDCHFLVEPEQFYEDCLYDTCACKDEMSKCFCPILSAYGTECMRQGVKT-GWRMSVKECAV 1129
             .:|:.||.:|.|:.||:.|:|..|. .:|.....|..|.||...|..:|:.. |||.||..|.|
Mouse   600 GSVFEKCHSVVNPQPFYKRCVYQACN-YEETFPHICSALGAYAHACSARGILLWGWRNSVDNCTV 663

  Fly  1130 KCPLGQVFDECGDGCALSCDDLPSKGSCKREC------VEGCRCPHGEYVNEDGECVPKKMCHCN 1188
            .|...:.|......|..:|..|..:   :.||      |:||.||.|.|:|...|||.|..|.|.
Mouse   664 PCTGNRTFSYDSQACDRTCLSLSDR---ETECHVSPVPVDGCNCPEGTYLNHKAECVHKAQCPCL 725

  Fly  1189 FDGMSFRPGYKEVRPGEKFL---DLCTCTDGVWDCQDAEPGDKDKYPPSSELRSKCAKQPYAEFT 1250
            .|      .||.|:..:..:   .:|.|.:|                     |..|.:|....|.
Mouse   726 LD------DYKFVQADQSTMINGVICHCING---------------------RLSCPRQAEMFFA 763

  Fly  1251 KCAPKEPKTCKNM-----DKY----------VADSSDCLP-----GCVCMEGYVYDTSRLACVLP 1295
            .|  .||||.::.     ||:          :|...||:|     ||||.:| :|:.|...||..
Mouse   764 SC--PEPKTFQSCSQSSEDKFGAACAPTCQMLATGIDCVPTKCESGCVCPKG-LYENSDGQCVPA 825

  Fly  1296 ANCSCHHAGKSYDDGEKIKEDCNLCECRAGNWKCS-KNGCESTCSVWGDSHFTTFDGHDFDFQGA 1359
            ..|.|.:||.||..|.::..||..|.|..|.|.|. ...|.|||.::|:.|..||||..|.|.|.
Mouse   826 EECPCDYAGVSYPGGFELHTDCKTCTCSQGRWTCQLSTQCPSTCVLYGEGHIITFDGQRFVFDGD 890

  Fly  1360 CDYVLAK---GVFDNGDGFSITIQNVLCGTMGVTCSKSLEIALTGHAEESLLLSADSAYSTDPNK 1421
            |:|:||.   |...:...|.:..:||:||..|||||::::|:|.|    ..:..|||.| |...:
Mouse   891 CEYMLATDDCGANSSQPTFKVLTENVICGKSGVTCSRAIKISLGG----LFITMADSNY-TVSGE 950

  Fly  1422 TPIKKLRDSVNSKGHNAFHIYKAGVFVVVEV-IP--LKLQVKWDEGTRVYVKLGNEWRQKVSGLC 1483
            .|:..|:            :..:.:.:|::: ||  |.|.:.|::...|.:|:....:..:.|||
Mouse   951 EPLVHLK------------VKPSPLNLVLDIDIPGRLNLTLVWNKHMSVSIKIRRATQDALCGLC 1003

  Fly  1484 GNYNGNSLDDMQTPSMGLETSPMLFGHAWKLQPHCSAPVAPIDACKKHPERETWAQLKCGALKSD 1548
            ||.|||..||.:|.|..:.::.:.|.::||..|.|......:|.|..:..|.:||:.||..:.|.
Mouse  1004 GNANGNMKDDFETRSKYVASNELEFVNSWKESPLCGDASYAVDPCSLNTFRRSWAERKCNIINSQ 1068

  Fly  1549 LFKECHAEVPLERFWKRCIFDTCACDQGGDCECLCTAVAAYADACAQKGINIRWRSQHFCPMQCD 1613
            .|..||::|....:::.|:.|.|.||.||||||||.||||||.||..||:.:.||:..|||:.||
Mouse  1069 TFAACHSKVYHLPYYEACVRDACGCDTGGDCECLCDAVAAYAKACLDKGVCVDWRTPDFCPIYCD 1133

  Fly  1614 ---PHC----SDYK-----ACT----PACAVETCDNFLDQGIAERMCNRENCLEGCHIKPCEDGF 1662
               .|.    ::|:     .||    |.....:..:|.|..           .|||:.....:.|
Mouse  1134 FYNTHTLVGENEYQYAQESNCTWHYQPCLCPGSLGSFPDTN-----------TEGCYNCSQNEYF 1187

  Fly  1663 IYLNDTYRDCVPKAECKPVCMVRDGKTFYEGDITFTDSCATCRCSKRKEICSGVKCDVPAT---- 1723
            .:...|...|.|.....|.              |.|.|..|...:...|..|....:.|..    
Mouse  1188 DHSEGTCVPCAPPTTTLPP--------------TTTGSQPTTETTISTEFHSSTSANTPVAPSYL 1238

  Fly  1724 TGLPAPLVEGTTLPTPLATQNQTKCVKGWT 1753
            .|||.|         |.:..:.|:.:..||
Mouse  1239 PGLPTP---------PPSAPSSTEELTVWT 1259

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
HmlNP_524060.2 VWD 485..636 CDD:459671 41/155 (26%)
C8 684..754 CDD:462584 19/69 (28%)
TIL 758..811 CDD:410995 18/58 (31%)
VWD 840..1015 CDD:214566 55/176 (31%)
C8 1054..1121 CDD:214843 23/69 (33%)
TIL 1131..1185 CDD:410995 19/59 (32%)
TIL 1245..1298 CDD:460351 21/72 (29%)
VWD 1327..1498 CDD:214566 59/177 (33%)
C8 1535..1609 CDD:214843 35/73 (48%)
TIL 1938..2005 CDD:473303
FA58C 2089..2223 CDD:238014
FA58C <2299..2403 CDD:238014
VWD 2703..2858 CDD:459671
TIL 2974..3030 CDD:460351
VWD 3035..3198 CDD:459671
C8 3248..3313 CDD:462584
VWC 3397..3451 CDD:450195
GHB_like <3755..3813 CDD:473907
Muc6NP_001355882.2 VWD 45..194 CDD:459671 41/155 (26%)
C8 <246..298 CDD:462584 14/55 (25%)
TIL 303..358 CDD:460351 18/58 (31%)
VWC_out 360..>404 CDD:214565 16/43 (37%)
VWD 387..550 CDD:214566 55/176 (31%)
C8 589..661 CDD:214843 26/72 (36%)
TIL 665..722 CDD:410995 19/59 (32%)
TIL 765..828 CDD:410995 20/65 (31%)
VWD 857..1018 CDD:214566 59/177 (33%)
C8 1057..1129 CDD:214843 35/71 (49%)
ROM1 1406..>1624 CDD:227709
Herpes_BLLF1 <1553..1968 CDD:282904
Herpes_BLLF1 <1858..2241 CDD:282904
Chi1 2031..>2219 CDD:442692
PHA03247 <2099..2563 CDD:223021
FhaB 2601..4221 CDD:442443
Chi1 2955..>3176 CDD:442692
Chi1 3259..>3449 CDD:442692
Chi1 3864..>4064 CDD:442692
PHA03247 <4320..4742 CDD:223021
CT 4847..4926 CDD:214482
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.