DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Hml and Muc19

DIOPT Version :10

Sequence 1:NP_524060.2 Gene:Hml / 39529 FlyBaseID:FBgn0029167 Length:3843 Species:Drosophila melanogaster
Sequence 2:NP_001296418.2 Gene:Muc19 / 497227 RGDID:1624205 Length:7415 Species:Rattus norvegicus


Alignment Length:1342 Identity:366/1342 - (27%)
Similarity:573/1342 - (42%) Gaps:210/1342 - (15%)


- Green bases have known domain annotations that are detailed below.


  Fly   487 TWGGINMKTFDGLVFKAPLSCSHTLITD-KVSG-TFDIILKACPYGSGYGCAHTLKILWQSVLYT 549
            |||....|..:|.:|.....|:.|...| ..|| .|::.:|....|.    ...:|.|...|   
  Rat    59 TWGKGTYKALNGRIFSFESECTFTFCRDCAESGEDFNVEIKRHENGD----IEEIKALIDDV--- 116

  Fly   550 FENLNGTMQLTTPI----KKLPMPVQVMGMKVMPVAQHVQIDLESVGLKLDWD--------HRQY 602
                 |.:.:...|    :::.||.....:.:.....|..:......|.|.|.        :|||
  Rat   117 -----GILVVRDTISVNEERVQMPFSNKMIHIKKQGDHYVLKTRRKILSLSWGKNKLSLTLYRQY 176

  Fly   603 VSVQAGPQMWGKVGGLCGTLDGDPNTDLTSRTGKKLATVKAFADAWRVEDR-SELCQVENSAEME 666
            ::.           ||||..:..|..:   ...:.:|..|...|......| :|:|         
  Rat   177 ITC-----------GLCGDFNSVPGEE---DINEHIANSKISNDCPSTLSRNNEVC--------- 218

  Fly   667 FGMDSCEQSKLQKAVSVCERLLAN--EKLGDCIKPFNYDALIRTCMADYCNCANREHPESCNCDA 729
                       :..|..|:|::..  ||... :.|.:.: ....|:.:||.....   :...|:.
  Rat   219 -----------EDGVQYCDRIIGTYFEKCSK-VSPLSSE-YKNVCVDEYCRTGGN---KQTTCNT 267

  Fly   730 IAMLAKECAFKGIKLEHGWR--NLEIC-PISCGFGRVYQACGPNVEPTCDSDLALPASKGACNEG 791
            .:.||:.||:.|..:...||  :..:| ...|....:|:.|||:..||| |::| |.....|..|
  Rat   268 YSELARLCAYDGPGVFEHWRDDSAVVCAKQQCPGKHIYKECGPSNPPTC-SNVA-PFQDSECVSG 330

  Fly   792 CFCPEG----TVQYKEACITRELCPCSLRGKEFKPESTVKKNCNT-CTCKNGQWRCTEDKCGARC 851
            |.||||    .::.|..|:.:|.|||...||.:||....:..|.: |||:..:|.|||.:|...|
  Rat   331 CTCPEGYLLDDIREKGKCVLKEKCPCESNGKVYKPGEVREGPCGSQCTCQEAKWSCTEARCPGIC 395

  Fly   852 GAVGDPHYQTFDGKRYDFMGKCSYHLLKTQNTSVEAENVACSGAVSESMNFAAPDDPSCTKAVTI 916
            ...|. .:.|||..::...|.|.:..:.....|:..|...|...          ...||..:|.:
  Rat   396 KVEGS-SFTTFDDNKFSHPGDCHFLAVHNDEISISVEIHPCGDG----------QTGSCLTSVMV 449

  Fly   917 ---------RFIL-RDGTPSVIKLDQGLTTIVNDKPIAKLPKMLGL---GEVLIRRASSTFLTVE 968
                     ||:. |||            |:..|..|.|     |.   ..|.|..:||.::..|
  Rat   450 LLNSSTSGNRFVFNRDG------------TVTKDGVIIK-----GYYYSDGVQIFNSSSLYMQAE 497

  Fly   969 FADGIRVWWDGVSR--VYIDAPPSLRGQTQGLCGTFNSNTQDDFLTPEGDVETAVEPFADKWRTK 1031
            ....|::....|.:  :|:...|.....|.||||:||:..:|||::.:..:|:..:.||:.|...
  Rat   498 ILSHIKLQIQMVPKMQLYVSLAPDTSTDTVGLCGSFNNKAEDDFMSSQKILESTAQAFANSWEMM 562

  Fly  1032 DTCQFKAETHQGPHPCTLNPEKKAQAEKFCDWILQDI--FQDCHFLVEPEQFYEDCLYDTCACKD 1094
             .|     ....|..| ::.|.:..||..|:.:|...  |..||..|:|:.::|:|...||:|::
  Rat   563 -PC-----PKGSPSSC-VSIETEKFAESNCEILLSSSGPFAACHQTVDPKFYHEECKRYTCSCEN 620

  Fly  1095 EMSKCFCPILSAYGTECM-RQGVKTGWRMSVKECAVKCPLGQVFDECGDGCALSCDDLPSKG-SC 1157
            ... |.|.:|..|...|. ::....|||..:  |.|.||.|.||:.....|..||..|.::. ||
  Rat   621 SQD-CLCTVLGNYVKACAEKETYLVGWRDGL--CEVSCPTGLVFNYKVKTCNSSCRSLSARDRSC 682

  Fly  1158 KRE--CVEGCRCPHGEYVNEDGECVPKKMCHCNFDGMSFRPGYKEVRPGEKFL---DLCTCTDGV 1217
            ..|  .|:||.||.|.|.|.:|.||.|..|.|..:.       :.|:||:..|   :.|.|.|||
  Rat   683 DIEDILVDGCTCPDGMYQNNEGNCVQKSECDCYVED-------ETVQPGKSILIDDNKCVCQDGV 740

  Fly  1218 WDCQDAEPGDKDKYPPSSELRSKCAKQPYAEFTKCAPKEPK-------TC--KNMDKYVADSSDC 1273
            ..||          .|.......|::.  ||:..|  |:||       ||  :|:..:..| ..|
  Rat   741 LHCQ----------TPLDLTLQNCSRG--AEYIDC--KDPKAQRRTERTCATRNIPDFEGD-LPC 790

  Fly  1274 LPGCVCMEGYVYDTSRLACVLPANCSCHHAGKSYDDGEKIKEDCNLCECRAGNWKCSKNGCESTC 1338
            ..||.|..|.|.: |:..|:.|.:|.|....:.|:.|......||.|.|..|:|.|::|.|:|||
  Rat   791 KRGCYCPVGMVRN-SKGICIHPDDCPCSFGDREYEQGSVTSVGCNECTCIKGSWNCTQNECQSTC 854

  Fly  1339 SVWGDSHFTTFDGHDFDFQGACDYVLAKGVFDNGDG-FSITIQNVLCGTMGVTCSKSLEIALTGH 1402
            .|:|:.|:.||||..:.|.|.|.|...:......:| |.|..::|.|...|:|||:.:.:|.   
  Rat   855 HVYGEGHYRTFDGESYSFDGLCQYTFLEDYCGQENGTFRILTESVPCCENGLTCSRKIIVAF--- 916

  Fly  1403 AEESLLLSADSAYSTDPNKTPIKKLRDSVNSKGHNAFHIYKAGVFVVVEVIPLKLQVKWDEGTRV 1467
            .:::::|.       |...|.::........:..:::.|:..|::::|:::...:.: ||:.|:|
  Rat   917 QDQNIILQ-------DGKVTAVQTAESEDCERSASSYSIHTVGLYLIVKLLNGIILI-WDKYTKV 973

  Fly  1468 YVKLGNEWRQKVSGLCGNYNGNSLDDMQTPSMGLETSPMLFGHAWKLQPHCSAPVAPIDACKKHP 1532
            .|.|...|:.||.|||||.||:..||..|....:....:.|.::||....||..|.....|..:|
  Rat   974 SVILDPSWQNKVCGLCGNNNGDLKDDFTTRHSSVAAGALEFANSWKTSQECSDTVTQSFPCDSNP 1038

  Fly  1533 ERETWAQLKCGALKSDLFKECHAEVPLERFWKRCIFDTCACDQGGDCECLCTAVAAYADACAQKG 1597
            ..:.||:.||..::.|.|::||::|....::..||.:.|:||..|.....|||||.||:||...|
  Rat  1039 YCKAWAEKKCEIIRDDTFRDCHSKVDPTTYYNACIEEACSCDMEGKYLGFCTAVAMYAEACNAAG 1103

  Fly  1598 INIRWRSQHFCPMQCDPH-----CS-DYKACTPACAVETCDNFLDQGIAERMCNRENCLEGCHIK 1656
            :.:.||..:.||:.||.:     || .|:.|....| :||.   ||.|.::.   .:.||||:.|
  Rat  1104 VCVSWRKPNLCPVYCDYYNAPGECSWHYEPCGTVTA-KTCK---DQVIGQKF---SSLLEGCYAK 1161

  Fly  1657 PCEDGFIYLNDTYRDCVPKAECKPVCMVRDGKTFYEGDITFTDSCATCRCSKRKEICSGVKCDVP 1721
             |.:...||::....||..:||.  |...|  ....|.....|...||.|:     ...::|...
  Rat  1162 -CPENAPYLDENTMKCVQLSECS--CFYND--VIPAGGAVVDDCGRTCSCT-----AGELECSET 1216

  Fly  1722 ATTGLPAPLVEGTTLPTPLATQNQTKCVKGWTRWCDKDRDTS 1763
            .|....|..|......:.|:|...|...:..|   .:.|:|:
  Rat  1217 PTNSTSADKVSLAPTTSVLSTSAVTTGARSST---SESRETT 1255

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
HmlNP_524060.2 VWD 485..636 CDD:459671 33/162 (20%)
C8 684..754 CDD:462584 16/73 (22%)
TIL 758..811 CDD:410995 21/56 (38%)
VWD 840..1015 CDD:214566 47/189 (25%)
C8 1054..1121 CDD:214843 20/69 (29%)
TIL 1131..1185 CDD:410995 24/56 (43%)
TIL 1245..1298 CDD:460351 19/61 (31%)
VWD 1327..1498 CDD:214566 53/171 (31%)
C8 1535..1609 CDD:214843 27/73 (37%)
TIL 1938..2005 CDD:473303
FA58C 2089..2223 CDD:238014
FA58C <2299..2403 CDD:238014
VWD 2703..2858 CDD:459671
TIL 2974..3030 CDD:460351
VWD 3035..3198 CDD:459671
C8 3248..3313 CDD:462584
VWC 3397..3451 CDD:450195
GHB_like <3755..3813 CDD:473907
Muc19NP_001296418.2 VWD 52..186 CDD:214566 32/149 (21%)
C8 233..290 CDD:462584 14/61 (23%)
TIL 299..354 CDD:410995 21/56 (38%)
VWD 395..545 CDD:459671 42/177 (24%)
C8 585..651 CDD:462584 20/68 (29%)
TIL 655..712 CDD:410995 24/56 (43%)
TIL 754..814 CDD:410995 20/65 (31%)
VWD 843..1003 CDD:214566 52/170 (31%)
C8 1041..1115 CDD:214843 27/73 (37%)
Chi1 1220..>1440 CDD:442692 8/39 (21%)
FhaB 1482..3184 CDD:442443
Hia 3085..3716 CDD:444098
FhaB 3739..5397 CDD:442443
FhaB 4940..6616 CDD:442443
FhaB 6381..>7107 CDD:442443
VWC 7195..7255 CDD:214564
CT 7328..7410 CDD:214482
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.