DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Hml and Muc6

DIOPT Version :10

Sequence 1:NP_524060.2 Gene:Hml / 39529 FlyBaseID:FBgn0029167 Length:3843 Species:Drosophila melanogaster
Sequence 2:XP_038953937.1 Gene:Muc6 / 282586 RGDID:628826 Length:1449 Species:Rattus norvegicus


Alignment Length:1335 Identity:393/1335 - (29%)
Similarity:586/1335 - (43%) Gaps:216/1335 - (16%)


- Green bases have known domain annotations that are detailed below.


  Fly   476 IDKTPRPALCTTWGGINMKTFDGLVFKAPLSCSHTLIT---DKVSGTFDIILKACPYGSGYGCAH 537
            :..:|....|:.||..:..||||..:.....|::....   |.:..||.|.|:.    .|.|...
  Rat    40 LQSSPEKGWCSMWGAGHFSTFDGHQYNFKGMCNYVFTATCGDDLPTTFSIQLRR----DGDGNIS 100

  Fly   538 TLKI-LWQSVLYTFENLNGTMQLTTPIKKLPMPVQVMGMKVMPVAQHVQIDLESVGLKL--DWDH 599
            .:.| |..||:    .:|........|..:.:|....|:::.|..|.||:..:.:.|:|  .|..
  Rat   101 RIIIELGASVV----TVNKETISVRDIGVVSLPYTSNGLQITPYGQSVQLVAKQLELELVIMWGP 161

  Fly   600 RQYVSVQAGPQMWGKVGGLCGTLDGDPNTDLTSRTGKKLATVKAFADAWRVEDRSELCQVENSAE 664
            ..::.|....:..||:.||||..||..:.:..|..||.|...| :|...:::|.:|:|..|    
  Rat   162 DAHLMVLVEKKYMGKLCGLCGNFDGKIDNEFLSEDGKLLEPHK-YATLQKLDDPNEICAHE---- 221

  Fly   665 MEFGMDSCEQSKLQKAVSVCERLLANEKLGDCIKPFNYDALIRTCMADYCNCANREHPESCNCDA 729
               .:.|....|.:.| .:|.:||.....| |..|  .:.|:.:|..|..:||....| :|:|..
  Rat   222 ---AIPSPSILKTRYA-HICNQLLTLVSPG-CDVP--KETLMLSCQEDMADCAQPGQP-NCSCAT 278

  Fly   730 IAMLAKECAFKGIKLEHGWRNLEICPIS-CGFGRVYQACGPNVEPTCDSDLALP--ASKGACNEG 791
            ::..::.|:..|..:.: ||...:||:| |...:|||.||.....||.:    |  :....|..|
  Rat   279 LSEYSRRCSMAGQPVRN-WRTPSLCPMSQCPANQVYQECGEVCVKTCSN----PQHSCSSFCTFG 338

  Fly   792 CFCPEGT----VQYKEACITRELCPCSLRGKEFKPESTVKKNCNTCTCKNGQWRCTEDKCGARCG 852
            ||||.||    :....:|:....|||.|.|..:.|....:..|.||.|..|.|.||:..|...|.
  Rat   339 CFCPHGTLLDDISGNHSCVPVNQCPCMLNGMVYGPGEITRTACQTCQCTMGHWTCTKQPCPGHCS 403

  Fly   853 AVGDPHYQTFDGKRYDFMGKCSYHLLKTQNTSVEAENVACSGAVSESMNFAAPDDPSCTKAVTIR 917
            ..|.....|||.:.|.|.|.|:|.||:    |.:..|.....||.:...::..:    |...:|.
  Rat   404 LEGGSFVTTFDARPYRFHGTCTYTLLQ----SPQLPNEGTLMAVYDKSGYSHSE----TSLESII 460

  Fly   918 FILRDGTPSVIKLDQGLTTIVNDKPIAKLPKMLGLGEVLIRRASSTFLTVEFADGIRVWW--DGV 980
            ::.:. ...||..|:   .|.|:.....||  .....:.|.|.:||.|.:....|:::.:  ..|
  Rat   461 YLSKK-DKIVISEDE---VITNNGDTKLLP--YKTHNITIFRQTSTHLQMATTFGLQIMFQMQPV 519

  Fly   981 SRVYIDAPPSLRGQTQGLCGTFNSNTQDDFLTPEGDVETAVEPFADKWRTKDTCQFKAETHQGPH 1045
            .:.||...|..:|||:||||.||.:|.|||.|..|..|.....|.|.||. ..|  .|...:...
  Rat   520 FQAYITVGPQFKGQTRGLCGNFNGDTTDDFTTSMGIDEGTASLFVDSWRA-GNC--PAALERETD 581

  Fly  1046 PCTLNPEKKAQAEKFCDWILQ--DIFQDCHFLVEPEQFYEDCLYDTCACKDEMSKCFCPILSAYG 1108
            ||:::...|..||..|..:|:  .:|:.||.:|.|:.||:.|:|..|. .:|.....|..|.||.
  Rat   582 PCSMSQLNKVCAETHCSMLLKKGSVFEKCHNVVNPQPFYKRCVYQACN-YEETFPHICSALGAYA 645

  Fly  1109 TECMRQGVKT-GWRMSVKECAVKCPLGQVFDECGDGCALSCDDLPSKGSCKREC------VEGCR 1166
            ..|..:||.. |||.||..|.|.|...:.|......|..:|..|..:   :.||      |:||.
  Rat   646 HACSSRGVLLWGWRNSVDNCTVPCSGNRTFSYDSKACDRTCLSLSDR---ETECHASPVPVDGCN 707

  Fly  1167 CPHGEYVNEDGECVPKKMCHCNFDGMSFRPGYKEVRPGEKFL---DLCTCTDGVWDCQDAEPGDK 1228
            ||.|.|:|...|||.|..|.|..|      .:|.|:..:..:   .:|.|.:|            
  Rat   708 CPEGTYLNHKAECVHKTQCPCLLD------SHKFVQADQSTMINGVICYCING------------ 754

  Fly  1229 DKYPPSSELRSKCAKQPYAEFTKCAPKEPKTCKNM-----DKY----------VADSSDCLP--- 1275
                     |..|.:|....|..|  .||||.::.     ||:          :|..:.|:|   
  Rat   755 ---------RLSCPRQAEMFFASC--PEPKTFQSCSQSSEDKFGAACAPTCQMLATGTACVPTKC 808

  Fly  1276 --GCVCMEGYVYDTSRLACVLPANCSCHHAGKSYDDGEKIKEDCNLCECRAGNWKCS-KNGCEST 1337
              ||||.:| :|:.|...||....|.|.:||.||..|.::..||..|.|..|.|.|. ...|.||
  Rat   809 ESGCVCPKG-LYENSDGQCVPAEECPCDYAGTSYPGGSELHTDCKTCTCSQGRWTCQLSTQCPST 872

  Fly  1338 CSVWGDSHFTTFDGHDFDFQGACDYVLAK---GVFDNGDGFSITIQNVLCGTMGVTCSKSLEIAL 1399
            |.::|:.|..||||..|.|.|.|:|.||.   |...:...|.:..:||:||..|||||::::|:|
  Rat   873 CVLYGEGHIITFDGQRFVFDGNCEYTLATDDCGANSSQPTFKVLTENVICGKSGVTCSRAIKISL 937

  Fly  1400 TGHAEESLLLSADSAYSTDPNKTPIKKLRDSVNSKGHNAFHIYKAGVFVVVEV-IP--LKLQVKW 1461
            .|.:    :..||..| |...:.|:..|:            :..:.:.:|::: ||  |.|.:.|
  Rat   938 GGLS----ITMADGNY-TVSGEEPLVHLQ------------VKPSPLNLVLDIDIPGRLNLTLMW 985

  Fly  1462 DEGTRVYVKLGNEWRQKVSGLCGNYNGNSLDDMQTPSMGLETSPMLFGHAWKLQPHCSAPVAPID 1526
            ::...|.:|:..|.:..:.|||||.|||..||.:|.|..:.::.:.|.::||..|.|......:|
  Rat   986 NKHMSVSMKIRRETQDPLCGLCGNSNGNMKDDFETRSKYVASNELEFVNSWKESPLCGDASYAVD 1050

  Fly  1527 ACKKHPERETWAQLKCGALKSDLFKECHAEVPLERFWKRCIFDTCACDQGGDCECLCTAVAAYAD 1591
            .|..:..|.:||:.||..:.|..|..||::|....:::.|:.|.|.||.||||||||.||||||.
  Rat  1051 PCSLNTFRRSWAERKCNIINSQTFAACHSKVYHLPYYEACVHDACGCDTGGDCECLCDAVAAYAK 1115

  Fly  1592 ACAQKGINIRWRSQHFCPMQCDPHCSDYKACTPACAVETCDNFLDQG--IAERMCNRENCLEGCH 1654
            ||..||:.:.||:..|||:.||.:             .|.....|.|  ||:..|.       .|
  Rat  1116 ACLDKGVCVDWRTPDFCPIYCDFY-------------NTHTLVGDNGYQIAQHNCT-------WH 1160

  Fly  1655 IKPCEDGFIYLNDTYRDCVPKAECKPVCMVRDGKTFYEGDI-----TFTDSCATCRCSK----RK 1710
            .:||                      :|         .|.:     |.|:.|..|..::    ::
  Rat  1161 YQPC----------------------LC---------PGSLGSFPDTNTEGCYNCSQNEYFDHKE 1194

  Fly  1711 EICSGVKCDVPATTGLPAPLVEGT--TLPTPLATQ 1743
            ..|  |.|..|.||  |.|...|:  |..||::|:
  Rat  1195 GTC--VPCAPPTTT--PPPTTTGSQPTTATPISTE 1225

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
HmlNP_524060.2 VWD 485..636 CDD:459671 40/156 (26%)
C8 684..754 CDD:462584 18/69 (26%)
TIL 758..811 CDD:410995 18/58 (31%)
VWD 840..1015 CDD:214566 53/176 (30%)
C8 1054..1121 CDD:214843 24/69 (35%)
TIL 1131..1185 CDD:410995 19/59 (32%)
TIL 1245..1298 CDD:460351 20/72 (28%)
VWD 1327..1498 CDD:214566 59/177 (33%)
C8 1535..1609 CDD:214843 35/73 (48%)
TIL 1938..2005 CDD:473303
FA58C 2089..2223 CDD:238014
FA58C <2299..2403 CDD:238014
VWD 2703..2858 CDD:459671
TIL 2974..3030 CDD:460351
VWD 3035..3198 CDD:459671
C8 3248..3313 CDD:462584
VWC 3397..3451 CDD:450195
GHB_like <3755..3813 CDD:473907
Muc6XP_038953937.1 VWD 49..198 CDD:459671 40/156 (26%)
C8 <250..302 CDD:462584 13/55 (24%)
TIL 307..362 CDD:410995 18/58 (31%)
VWC 364..>408 CDD:450195 15/43 (35%)
VWD 391..554 CDD:214566 53/176 (30%)
C8 593..665 CDD:214843 27/72 (38%)
TIL 669..726 CDD:410995 19/59 (32%)
TIL 769..832 CDD:410995 19/65 (29%)
VWD 861..1022 CDD:214566 59/177 (33%)
C8 1061..1133 CDD:214843 35/71 (49%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.