DRSC/TRiP Functional Genomics Resources

Protein Alignment Hml and Kcp

DIOPT Version: 9

Sequence 1: NP_524060.2 Gene: Hml / 39529 FlyBaseID: FBgn0029167 Length: 3843 Species: Drosophila melanogaster
Sequence 2: XP_038964921.1 Gene: Kcp / 296952 RGDID: 1561119 Length: 1671 Species: Rattus norvegicus


Alignment Length: 1816 Identity: 378/1816 (20%)
Similarity: 566/1816 (31%) Gaps: 646/1816 (35%)
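
The summary percentages can be reproduced from the raw counts. The following is a minimal Python sketch, assuming the page truncates each percentage to a whole number rather than rounding it (378/1816 is about 20.8% but is shown as 20%). The function name `percent_truncated` is illustrative and not part of DIOPT.

```python
def percent_truncated(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage, truncated toward zero."""
    if total <= 0:
        raise ValueError("total must be positive")
    return (100 * count) // total


# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 1816
stats = {"Identity": 378, "Similarity": 566, "Gaps": 646}

# Assumption: truncation, inferred only from the three values shown on this page.
summary = {name: percent_truncated(n, ALIGNMENT_LENGTH) for name, n in stats.items()}

for name, pct in summary.items():
    print(f"{name}: {stats[name]}/{ALIGNMENT_LENGTH} ({pct}%)")
```

Integer arithmetic is used so the result does not depend on floating-point rounding.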


- Green bases have known domain annotations that are detailed below.


  Fly   195 EVPPCQAQCT-PPCQNNGICISAGVCQCPENYYGPLCQQKKSICASFPKAPKNSKVSCKNN---M 255
            ::..|:...| |.|...|.....|....|:.....:|:...:.|...|..|...  .|.:|   .
  Rat   172 QLESCKCCSTSPQCWGPGHPCPEGARWEPDACTACVCRDGTTHCGPQPNLPHCR--GCSHNGQSY 234

  Fly   256 CHAE-----------CMRGFQFPDGSGITNIECRNGQWVHTKTGLSKTP--DCAPTCAPACQNGG 307
            .|.|           |:.|.....|...:.:.|..          |.||  :|.|.|.|.|:..|
  Rat   235 GHGETFSPDACTTCRCLAGAVQCQGPSCSELNCLE----------SFTPPGECCPICRPGCEYEG 289

  Fly   308 QC----ISF----NVC-QCSKMFRGDHCQYNIDRCNVTNTNFNGNYKCAYEMDDARCTFSCPQVP 363
            |.    .||    |.| |||       |..::.||                              
  Rat   290 QLHQEGTSFLSTSNPCLQCS-------CLRSLVRC------------------------------ 317

  Fly   364 GLKIQGRIDIEYKCNYLQGQYLPAPLPKCIFPPGY---TVRSTSSMQGVTHQNGVYHRGMSGE-- 423
                     :..||.       |:|.|..:..||:   ..:::...:|.:|::........|:  
  Rat   318 ---------VPVKCQ-------PSPCPNPVLRPGHCCPVCQASGCTEGHSHRDHGQEWTTPGDPC 366

  Fly   424 -----MAYELQTERQKLLALLAKYRD---------------LERRSEWWSSEETVVTMSSYSLYQ 468
                 :...:|. ||:..|.|..|..               |..|..  ||.|.|.:....|...
  Rat   367 RICQCLEGHIQC-RQQECASLCPYPARPLPGTCCPVCDGCFLNGREH--SSGEPVGSQDPCSSCH 428

  Fly   469 SNNLDIVIDKTP-RPALCTTWG------------------GINMKTFD------------GLVFK 502
            ..|..:..:..| .||.|...|                  .:|.:..:            ...|.
  Rat   429 CANGSVQCEPLPCPPAPCRYPGRIPGQCCPVCDVPCPPVQAVNTRDMNIGARRPLPSKRMAAAFG 493

  Fly   503 APLSCSHTLITDKVSGTFDI-ILKACPYGSGYGCAHTLKILWQSVLYTFENLNGTMQLTTPIKKL 566
            ||.......:.::.:.:..: .|...|..:...|:....:|..|        :|: |:|:|:  |
  Rat   494 APARLVKYPVRNRTARSPPVSTLPPAPSSAQPVCSMVRSLLRGS--------SGS-QMTSPV--L 547

  Fly   567 PMPVQVMGMKVMPVAQHVQIDLESVGLKLDWDHRQY---------VSVQAGPQMWG--------- 613
            |:|.:.   :.:....|..:              ||         ||.|..|...|         
  Rat   548 PVPAKT---ECLCAGLHFAL--------------QYPANILPSRPVSAQPDPVFSGFDQPTLTSL 595

  Fly   614 -KVGGLCGTLDGDPNTDLTSRTGKKLATVKAFADAWRVEDRSELCQVEN---------------- 661
             .:|..|.:.:......|....|:....|.:.......||.:..|.|.|                
  Rat   596 PHLGACCPSCESCTYHGLVYSNGQNFTDVDSPCQTCYCEDGTVRCSVINCPSLTCAKPQNGPGQC 660

  Fly   662 ---------SAEM-------EFGMDSCEQSKLQKAVSVCERLLANEKLGDCIKPFNYDALIRTCM 710
                     .|:|       ....|.|::...|:..:.|:....:.  ..|..|.........|.
  Rat   661 CPKCPDCILEAQMFVDGERFPHPRDPCQECLCQEGQTHCQLRACHS--APCGHPLPSTCCRNDCK 723

  Fly   711 A------DYCNCANREHP----ESCNC--DAIAMLAKEC---------------------AFKGI 742
            .      :|.|.|:..||    ..|.|  ..:..||:.|                     |..|.
  Rat   724 GCAFGGKEYLNGADFPHPTDPCRMCRCLSGNVQCLARRCPPLACPQPVLTPGDCCPQCPDAPAGC 788

  Fly   743 ----------KLEHGWRNLEIC------------------PISCGFGRVYQACGPNVEPTCDSDL 779
                      ..||.::..:.|                  |..|...|....|     |:||..|
  Rat   789 PQSGNTVLVRHQEHFFQPGDPCSRCLCLDGSVSCQRLPCPPAPCAHPRQGACC-----PSCDGCL 848

  Fly   780 ----------ALPASKGACNEGCFCPEGTVQYK-EACI-------TRE-LCP----CS------L 815
                      ..|:....|:. |.|.||:|..: .||.       ||| .||    |.      |
  Rat   849 YHGKEFANGERFPSPSVTCHV-CLCWEGSVNCEPRACAPAQCPFPTREDCCPACDSCEYLGVSYL 912

  Fly   816 RGKEFKPESTVKKNCNTCTCKNGQWRCTEDKCGARCGAVGDPHYQTFDGKRYDFMGKCSYHLLKT 880
            ..:|| |:.  ::.||.|||..|...||...|        :|             ..||:.|:..
  Rat   913 NSQEF-PDP--REACNLCTCLGGFVTCTRRPC--------EP-------------PACSHPLILP 953

  Fly   881 QNTSVEAENVACSGAVSESMNFAAPD--DPSCTKAVTIRFILRDGTPSVIKLDQGLTTIVNDKPI 943
            ::.....:.....| ::.::....||  ||:|               |:...::| :...:.|| 
  Rat   954 KHCCPTCQGCLYHG-ITAALGETLPDPLDPTC---------------SLCTCEEG-SMRCHKKP- 1000

  Fly   944 AKLPKMLGLGEVLIRRASSTFLTVEFADGIRVWWDGVSRVYIDAPPSLRGQTQGLCGTFNSNTQD 1008
                                                       .||:|                 
  Rat  1001 -------------------------------------------CPPAL----------------- 1005

  Fly  1009 DFLTPEGDVETAVEPFADKWRTKDTCQFKAETHQGPHPCTLNPEKKAQAEKFCDWILQDIFQDCH 1073
                                     |     ||..|.||            ||     .:.:.|.
  Rat  1006 -------------------------C-----THPSPGPC------------FC-----PVCRSCL 1023

  Fly  1074 FLVEPEQFYED-------CLYDTCACKDEMSKCF---CPILSAYGTECMRQGVKTGWRMSVKECA 1128
            |..:..|..|:       |  :.|.|......|.   ||.|     .|:.|..:.|      .|.
  Rat  1024 FQGQEHQDGEEFEGPEGSC--ERCRCLAGQVSCMRLQCPPL-----PCLLQATEPG------TCC 1075

  Fly  1129 VKC----------PLGQVFDECGDGCALSCDDLPSKG--SCKR-ECVEGCRCPHGEYVNEDGECV 1180
            .:|          |.|..:......|: ||  :..||  :|.: :||..|..|.    ....:|.
  Rat  1076 PRCTGCRVRGEEHPEGSSWVPADSPCS-SC--MCHKGIVTCAQVQCVSACIWPQ----QGPSDCC 1133

  Fly  1181 PKKMCH-CNFDGMSFRPGYKEVRPGEKFLDLCTCTDGVWDCQDAEPGDKDKYPPSSELRSKCAKQ 1244
            |.  |. |..:|..:.|| :..:||:...::|.|.            .|.|.|||..    |.::
  Rat  1134 PS--CSGCEHEGRKYEPG-ESFQPGDDPCEVCICE------------LKGKGPPSLH----CRRR 1179

  Fly  1245 PYAEFTKCAPKE-----PKTC-----KNMDKYVAD--SSDCLP-----GCVCMEGYVYDTSR--- 1289
            .......|.|.:     |:.|     :.:.....|  .|:.:|     .|.|.:.....|.|   
  Rat  1180 QCPSLVGCPPSQLLPPGPQHCCPACAQALSNCTEDLLGSELVPPDPCYTCQCQDLTWLCTHRACP 1244

  Fly  1290 -LACVL------PANC---------SCHHAGKSYDDGEK-IKEDCNLCECRAGNWKCSKNGCE-- 1335
             |:|.|      |.:|         ||.|.|:....||. |.:.|..|.|.||...|....|.  
  Rat  1245 ALSCPLWEHHTTPGSCCPVCKDPTGSCSHQGRWVASGEHWIVDACTSCACVAGTVHCQSQRCRNL 1309

  Fly  1336 -----------------------STCSVWGDSHFTTFDGHDFDFQGACDYVLAKGVFDNGDGFSI 1377
                                   ::|..:||.|:.||||....|||:|.|||||..  :.:.||:
  Rat  1310 SCGRDEVPALSPGSCCPRCLPGLASCMAFGDPHYRTFDGRLLHFQGSCSYVLAKDC--HSEDFSV 1372

  Fly  1378 TIQNVLCGTMGVTCSKSLEIALTGHAEESLLLSADSAYSTDPNKTPIKKLRDS---VNSKGHNAF 1439
            .:.|...|..||..::.:.:.|   ...::.|........|.:...:..||:.   :..:||.  
  Rat  1373 HVTNDDRGRRGVAWTQEVAVLL---GTVAVRLLQGRTVMVDQHTVTLPFLREPLLYIELRGHT-- 1432

  Fly  1440 HIYKAGVFVVVEVIPLKLQVKWDEGTRVYVKLGNEWRQKVSGLCGNYNGNSLDDMQTPSMGLETS 1504
                    |::...| .|||.||..::|.|::.:.:|.:..|||||:||.:.||:|.|...|..:
  Rat  1433 --------VILHAQP-GLQVLWDGQSQVEVRVPSSYRGQTCGLCGNFNGFAQDDLQGPDGRLLPT 1488

  Fly  1505 PMLFGHAWKL-------QPHCSA--PVAPIDACKKHPERETWAQLKCGALKSDLFKECHAEVPLE 1560
            ...||::||:       :| |||  .|.|..|......||  |..:||.|||..|..|||.||.|
  Rat  1489 EAAFGNSWKVPKGLGPGRP-CSAGREVDPCRAAGYRARRE--ANARCGILKSSPFSRCHAVVPPE 1550

  Fly  1561 RFWKRCIFDTCACDQGGDCE-CLCTAVAAYADACAQKGINIRWRSQHFCPMQCDPHCS-DYKACT 1623
            .|:..|::|.|||..|...: |||.|:.|||..|.|.|:...||....|.:.|..... .:..|.
  Rat  1551 PFFAACVYDLCACGPGSSADTCLCDALEAYASHCRQAGVTPVWRGPTLCVVGCPVERGFVFDECG 1615

  Fly  1624 PACAVETCDN-FLDQGIAERMCNRENCLEGCHIKPCEDGFIYLNDTYRDCVPKAECKPVCM 1683
            |.|. .||.| .:..|.....|.|. |:.||.   |..|.:   :....|:....|..|.:
  Rat  1616 PPCP-RTCFNQHIPLGELAAHCVRP-CVPGCQ---CPAGLV---EHEGHCISPEACPSVLL 1668

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence        Domain       Region       External ID  Identity
Hml   NP_524060.2     VWD          485..636     CDD:295339   29/200 (15%)
                      C8           680..754     CDD:285899   18/116 (16%)
                      TIL          758..811     CDD:280072   19/71 (27%)
                      VWD          840..1015    CDD:214566   20/176 (11%)
                      C8           1054..1121   CDD:214843   16/76 (21%)
                      TIL          1131..1185   CDD:280072   15/66 (23%)
                      TIL          1245..1298   CDD:280072   15/79 (19%)
                      VWD          1327..1498   CDD:214566   52/198 (26%)
                      C8           1535..1609   CDD:214843   33/74 (45%)
                      Mucin2_WxxW  1751..1837   CDD:290069
                      TIL          1938..2005   CDD:280072
                      FA58C        2089..2223   CDD:238014
                      FA58C        2104..2225   CDD:214572
                      FA58C        <2299..2404  CDD:214572
                      FA58C        <2299..2403  CDD:238014
                      VWD          2703..2858   CDD:295339
                      C8           2893..2970   CDD:285899
                      TIL          2974..3030   CDD:280072
                      VWD          3035..3198   CDD:295339
                      C8           3257..3313   CDD:285899
                      VWC          3397..3451   CDD:302663
                      GHB_like     <3755..3813  CDD:304424
Kcp   XP_038964921.1  VWC          227..281     CDD:278520   13/63 (21%)
                      VWC          285..341     CDD:327433   20/108 (19%)
                      VWC          405..460     CDD:327433   13/56 (23%)
                      VWC          608..664     CDD:327433   8/55 (15%)
                      VWC          725..781     CDD:327433   10/55 (18%)
                      VWC          1022..1078   CDD:327433   15/68 (22%)
                      VWC          1139..1204   CDD:327433   18/81 (22%)
                      VWC          <1223..1264  CDD:327433   9/40 (23%)
                      VWC          1271..1328   CDD:327433   13/56 (23%)
                      VWD          1335..1484   CDD:395046   51/164 (31%)
                      C8           1527..1599   CDD:214843   33/73 (45%)
                      TIL          1610..1663   CDD:410995   15/60 (25%)
Blue background indicates that the domain is not in the aligned region.
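
For downstream use, the domain rows can be parsed into structured records. The sketch below is a hedged illustration, not DIOPT code: the names `DomainRow`, `parse_row` and `percent_half_up` are hypothetical, and the row layout (domain, region, CDD identifier, optional identity fraction) is assumed from the rows listed above. It also checks that the per-domain percentages are consistent with rounding half up, which is an inference from the listed values only.

```python
import re
from typing import NamedTuple, Optional


class DomainRow(NamedTuple):
    domain: str
    region: str                # kept as text, e.g. "485..636" or "<1223..1264"
    external_id: str
    identical: Optional[int]   # None when the row lists no identity value
    compared: Optional[int]
    shown_pct: Optional[int]


# Assumed layout: DOMAIN REGION CDD:ID [IDENT/SPAN (PCT%)]
ROW_RE = re.compile(
    r"(\S+)\s+(<?\d+\.\.\d+)\s+(CDD:\d+)(?:\s+(\d+)/(\d+)\s+\((\d+)%\))?"
)


def parse_row(line: str) -> DomainRow:
    """Parse one domain row; raise ValueError if it does not fit the assumed layout."""
    m = ROW_RE.fullmatch(line.strip())
    if m is None:
        raise ValueError(f"unrecognized row: {line!r}")
    domain, region, ext_id, ident, span, pct = m.groups()
    as_int = lambda s: int(s) if s is not None else None
    return DomainRow(domain, region, ext_id, as_int(ident), as_int(span), as_int(pct))


def percent_half_up(num: int, den: int) -> int:
    """Whole-number percentage, rounded half up, using integer arithmetic."""
    return (200 * num + den) // (2 * den)


# Sample rows copied from the table above.
rows = [parse_row(s) for s in (
    "VWD 485..636 CDD:295339 29/200 (15%)",
    "C8 1535..1609 CDD:214843 33/74 (45%)",
    "VWC <1223..1264 CDD:327433 9/40 (23%)",
    "TIL 1938..2005 CDD:280072",
)]

for r in rows:
    if r.identical is None:
        print(f"{r.domain} {r.region}: no identity listed")
    else:
        ok = percent_half_up(r.identical, r.compared) == r.shown_pct
        print(f"{r.domain} {r.region}: {r.identical}/{r.compared} -> {r.shown_pct}% (consistent: {ok})")
```

Python's built-in `round` uses round-half-to-even, so it would turn 14.5 into 14; the integer formula avoids that mismatch for rows such as 29/200.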


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           - - E1_KOG1216
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           2             1.900
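
The final row reads as a simple score of 2 and a weighted score of 1.900, which matches the column sums. A minimal Python sketch of that arithmetic, assuming the final row is the plain sum of the per-tool values (the dictionary name `tool_scores` is illustrative):

```python
from decimal import Decimal

# Only the tools that reported a match; every other tool contributes 0 and 0.000.
tool_scores = {
    "eggNOG": (1, Decimal("0.900")),
    "SwiftOrtho": (1, Decimal("1.000")),
}

# Assumption: totals are plain sums of the per-tool columns.
simple_total = sum(simple for simple, _ in tool_scores.values())
weighted_total = sum((weighted for _, weighted in tool_scores.values()), Decimal("0"))

print(f"Simple score: {simple_total}, weighted score: {weighted_total}")
```

`Decimal` is used so the weighted total keeps the three decimal places shown on the page.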
