DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Hml and MUC6

DIOPT Version :10

Sequence 1:NP_524060.2 Gene:Hml / 39529 FlyBaseID:FBgn0029167 Length:3843 Species:Drosophila melanogaster
Sequence 2:XP_054224825.1 Gene:MUC6 / 4588 HGNCID:7517 Length:6114 Species:Homo sapiens


Alignment Length:1295 Identity:376/1295 - (29%)
Similarity:559/1295 - (43%) Gaps:201/1295 - (15%)


- Green bases have known domain annotations that are detailed below.


  Fly   467 YQSNNLDIVIDK---TPRPALCTTWGGINMKTFDGLVFKAPLSCSHTLIT--DKVSGTFDIILKA 526
            |.|..|..:.|.   .|....|:|||..:..|||..|:....:|::....  .....||.:.|:.
Human    43 YTSPGLQRLKDSPQTAPDKGQCSTWGAGHFSTFDHHVYDFSGTCNYIFAATCKDAFPTFSVQLRR 107

  Fly   527 CPYGSGYGCAHTLKILWQSVLYTFENLNGTMQLTTPIKKLPMPVQVMGMKVMPVAQHVQIDLESV 591
            .|.||   .:..:..|..||:...|.:.....    |..:.:|....|:::.|..|.|::..:.:
Human   108 GPDGS---ISRIIVELGASVVTVSEAIISVKD----IGVISLPYTSNGLQITPFGQSVRLVAKQL 165

  Fly   592 GLKLD--WDHRQYVSVQAGPQMWGKVGGLCGTLDGDPNTDLTSRTGKKLATVKAFADAWRVEDRS 654
            .|:|:  |....::.|....:..|::.||||..||....:..|..||.|...| ||...:::|..
Human   166 ELELEVVWGPDSHLMVLVERKYMGQMCGLCGNFDGKVTNEFVSEEGKFLEPHK-FAALQKLDDPG 229

  Fly   655 ELCQVENSAEMEFGMDSCEQSKLQKAVSVCERLLANEKLGDCIKP---FNYDALIRTCMADYCNC 716
            |:|..::       :.|....:.|.| .:|.:||.      .:.|   .:.:..:.:|.||.. .
Human   230 EICTFQD-------IPSTHVRQAQHA-RICTQLLT------LVAPECSVSKEPFVLSCQADVA-A 279

  Fly   717 ANREHPESCNCDAIAMLAKECAFKGIKLEHGWRNLEICPI-SCGFGRVYQACGPNVEPTCDSDLA 780
            |.:..|::.:|..::..:::|:..|..:.. ||:..:|.: .|...:|||.||.....||.:   
Human   280 APQPGPQNSSCATLSEYSRQCSMVGQPVRR-WRSPGLCSVGQCPANQVYQECGSACVKTCSN--- 340

  Fly   781 LP--ASKGACNEGCFCPEGTV----QYKEACITRELCPCSLRGKEFKPESTVKKNCNTCTCKNGQ 839
             |  :...:|..|||||||||    .....|:....|||.|.|..:.|.......|.||.|..|:
Human   341 -PQHSCSSSCTFGCFCPEGTVLNDLSNNHTCVPVTQCPCVLHGAMYAPGEVTIAACQTCRCTLGR 404

  Fly   840 WRCTEDKCGARCGAVGDPHYQTFDGKRYDFMGKCSYHLLKTQNTSVEAENVAC--SGAVSESMNF 902
            |.|||..|...|...|.....|||.:.|.|.|.|:|.||::.....:...:|.  ...||.|.  
Human   405 WVCTERPCPGHCSLEGGSFVTTFDARPYRFHGTCTYILLQSPQLPEDGALMAVYDKSGVSHSE-- 467

  Fly   903 AAPDDPSCTKAVTIRFILRDGTPSVIKLDQGLTTIVNDKPIAKLPKMLGLGEVLIRRASSTFLTV 967
                    |..|.:.::.|. ...||..|:   .:.|:.....||  .....:.:.|.:||.|.:
Human   468 --------TSLVAVVYLSRQ-DKIVISQDE---VVTNNGEAKWLP--YKTRNITVFRQTSTHLQM 518

  Fly   968 EFADGIR--VWWDGVSRVYIDAPPSLRGQTQGLCGTFNSNTQDDFLTPEGDVETAVEPFADKWRT 1030
            ..:.|:.  |....:.:.|:...|..||||:||||.||.:|.|||.|..|..|.....|.|.||.
Human   519 ATSFGLELVVQLRPIFQAYVTVGPQFRGQTRGLCGNFNGDTTDDFTTSMGIAEGTASLFVDSWRA 583

  Fly  1031 KDTCQFKAETHQGPHPCTLNPEKKAQAEKFCDWILQ--DIFQDCHFLVEPEQFYEDCLYDTCACK 1093
             ..|  .|...:...||:::...|..||..|..:|:  .:|:.||..|.|..||:.|:|..|. .
Human   584 -GNC--PAALERETDPCSMSQLNKVCAETHCSMLLRTGTVFERCHATVNPAPFYKRCVYQACN-Y 644

  Fly  1094 DEMSKCFCPILSAYGTECMRQGVKT-GWRMSVKECAVKCPLGQVFDECGDGCALSCDDLPSKGSC 1157
            :|.....|..|..|...|..:||.. |||.||..|.:.|.....|......|..:|..|..:.: 
Human   645 EETFPHICAALGDYVHACSLRGVLLWGWRSSVDNCTIPCTGNTTFSYNSQACERTCLSLSDRAT- 708

  Fly  1158 KREC------VEGCRCPHGEYVNEDGECVPKKMCHCNFDGMSFRPGYKEVRPGEKFL---DLCTC 1213
              ||      |:||.||.|.|:|:.||||.|..|.|..:      |||.:...:..:   ..|.|
Human   709 --ECHHSAVPVDGCNCPDGTYLNQKGECVRKAQCPCILE------GYKFILAEQSTVINGITCHC 765

  Fly  1214 TDGVWDCQDAEPGDKDKYPPSSELRSKCAKQPYAEFTKC-APKEPKTC--KNMDKYVAD------ 1269
            .:|                     |..|.::|......| |||..|:|  .:.:|:.|.      
Human   766 ING---------------------RLSCPQRPQMFLASCQAPKTFKSCSQSSENKFGAACAPTCQ 809

  Fly  1270 ---------SSDCLPGCVCMEGYVYDTSRLACVLPANCSCHHAGKSYDDGEKIKEDCNLCECRAG 1325
                     .:.|.|||||.|| :|:.:...||.|..|.|..:|.||..|.::..||..|.|..|
Human   810 MLATGVACVPTKCEPGCVCAEG-LYENADGQCVPPEECPCEFSGVSYPGGAELHTDCRTCSCSRG 873

  Fly  1326 NWKCSK-NGCESTCSVWGDSHFTTFDGHDFDFQGACDYVLAK---GVFDNGDGFSITIQNVLCGT 1386
            .|.|.: ..|.|||:::|:.|..||||..|.|.|.|:|:||.   ||.|:...|.|..:||:||.
Human   874 RWACQQGTHCPSTCTLYGEGHVITFDGQRFVFDGNCEYILATDVCGVNDSQPTFKILTENVICGN 938

  Fly  1387 MGVTCSKSLEIALTGHAEESLLLSADSAYSTDPNKTPIKKLRDSVNSKGHNAFHIYKAGVFVVVE 1451
            .|||||::::|.|.|.:    ::.||..|:.             ...:.|....:....:.:||:
Human   939 SGVTCSRAIKIFLGGLS----VVLADRNYTV-------------TGEEPHVQLGVTPGALSLVVD 986

  Fly  1452 V-IP--LKLQVKWDEGTRVYVKLGNEWRQKVSGLCGNYNGNSLDDMQTPSMGLETSPMLFGHAWK 1513
            : ||  ..|.:.|:....:.:::....:..:.|||||:|||..||.:|.|..:.:|.:...::||
Human   987 ISIPGRYNLTLIWNRHMTILIRIARASQDPLCGLCGNFNGNMKDDFETRSRYVASSELELVNSWK 1051

  Fly  1514 LQPHCSAPVAPIDACKKHPERETWAQLKCGALKSDLFKECHAEVPLERFWKRCIFDTCACDQGGD 1578
            ..|.|.......|.|..:..|.:||:.||..:.|..|..||::|....:::.|:.|.|.||.|||
Human  1052 ESPLCGDVSFVTDPCSLNAFRRSWAERKCSVINSQTFATCHSKVYHLPYYEACVRDACGCDSGGD 1116

  Fly  1579 CECLCTAVAAYADACAQKGINIRWRSQHFCPMQC---DPHCSDYKACTPACAVETCDNFLDQGIA 1640
            |||||.||||||.||..||:.:.||:..|||:.|   :.|..|                   |..
Human  1117 CECLCDAVAAYAQACLDKGVCVDWRTPAFCPIYCGFYNTHTQD-------------------GHG 1162

  Fly  1641 ERMCNRE-NCLEGCHIKPC--------------EDGFIYLNDTYRD-----CVPKAECKP 1680
            |....:| ||.  .|.:||              |..:....|.|.|     |||   |.|
Human  1163 EYQYTQEANCT--WHYQPCLCPSQPQSVPGSNIEGCYNCSQDEYFDHEEGVCVP---CMP 1217

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
HmlNP_524060.2 VWD 485..636 CDD:459671 37/154 (24%)
C8 684..754 CDD:462584 14/72 (19%)
TIL 758..811 CDD:410995 20/58 (34%)
VWD 840..1015 CDD:214566 53/178 (30%)
C8 1054..1121 CDD:214843 23/69 (33%)
TIL 1131..1185 CDD:410995 20/59 (34%)
TIL 1245..1298 CDD:460351 21/70 (30%)
VWD 1327..1498 CDD:214566 57/177 (32%)
C8 1535..1609 CDD:214843 35/73 (48%)
TIL 1938..2005 CDD:473303
FA58C 2089..2223 CDD:238014
FA58C <2299..2403 CDD:238014
VWD 2703..2858 CDD:459671
TIL 2974..3030 CDD:460351
VWD 3035..3198 CDD:459671
C8 3248..3313 CDD:462584
VWC 3397..3451 CDD:450195
GHB_like <3755..3813 CDD:473907
MUC6XP_054224825.1 VWD 59..211 CDD:214566 38/158 (24%)
C8 250..316 CDD:462584 14/73 (19%)
TIL 321..376 CDD:410995 20/58 (34%)
VWC_out 378..>422 CDD:214565 16/43 (37%)
VWD 405..568 CDD:214566 53/178 (30%)
C8 610..679 CDD:462584 24/69 (35%)
TIL 683..740 CDD:410995 20/59 (34%)
TIL 783..846 CDD:410995 20/63 (32%)
VWD 875..1036 CDD:214566 57/177 (32%)
C8 1075..1147 CDD:214843 35/71 (49%)
PHA03247 <1222..1800 CDD:223021
PHA03247 <1735..2170 CDD:223021
PHA03247 <2599..3017 CDD:223021
Chi1 2696..>2862 CDD:442692
Chi1 3265..>3494 CDD:442692
Chi1 3439..>3663 CDD:442692
Chi1 3608..>3832 CDD:442692
PHA03247 <3611..4195 CDD:223021
Chi1 4214..>4393 CDD:442692
Chi1 4613..>4749 CDD:442692
Chi1 5174..>5310 CDD:442692
Chi1 5450..>5652 CDD:442692
CT 6026..6109 CDD:214482
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.