DRSC/TRiP Functional Genomics Resources



Protein Alignment mesh and Muc4

DIOPT Version: 10

Sequence 1: NP_001163782.1  Gene: mesh / 43688  FlyBaseID: FBgn0051004  Length: 1454  Species: Drosophila melanogaster
Sequence 2: XP_063126583.1  Gene: Muc4 / 303887  RGDID: 621331  Length: 3363  Species: Rattus norvegicus


Alignment Length: 956  Identity: 210/956 (21%)
Similarity: 324/956 (33%)  Gaps: 284/956 (29%)
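
The summary percentages appear to be truncated rather than rounded: 210/956 is about 21.97%, yet it is reported as 21%. The sketch below reproduces that arithmetic under the assumption of truncation. The function name `truncated_percent` is illustrative and not part of DIOPT.

```python
def truncated_percent(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumed behaviour, not confirmed by DIOPT)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return count * 100 // total


# Counts taken from the summary lines of this alignment.
alignment_length = 956
stats = {"Identity": 210, "Similarity": 324, "Gaps": 284}

for name, count in stats.items():
    pct = truncated_percent(count, alignment_length)
    print(f"{name}: {count}/{alignment_length} ({pct}%)")
```

With the counts from this page, the sketch gives 21%, 33% and 29%, matching the reported values.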


- Green bases have known domain annotations that are detailed below.


  Fly   261 PSDIDQRTPGVYFRVERDLMGRTDRFGVEVRERTMWDIRQGVVGADTFIPKHVVIAT-------- 317
            ||...:|||.|         ..:|.|       |..|...|..|.......|.||.:        
  Rat  2108 PSTSSRRTPSV---------ATSDIF-------TTTDSTSGNAGHTLLTGSHSVITSRVASTTLG 2156

  Fly   318 ----------------------WKNVSFAGGIDNSLYTTNTFQMVLATDEVYTYIIFNYAVLNWL 360
                                  .:::..:...:.||.|..|.:...|:....|       |....
  Rat  2157 RLSTVAHSKSTQRSSTHSQSYLTESMGASSTSETSLLTEATTEKQFASSPGPT-------VTETF 2214

  Fly   361 SH-TEAGGDTTKGEGG-----------VPAYVGFNAGNGT---------QAYEYNPYSQNMVIRD 404
            |. |.:.|.|||.:..           :||.....|...|         |.....||..:..:||
  Rat  2215 SRGTSSSGLTTKTDNDRSTALSATSLTLPAPSTSTASRSTVPPAPLPPDQGISLFPYGSSSEVRD 2279

  Fly   405 LANRGWAN-------------GFP-GRHI-----FRVDEQILIGSCNKDIDAALLPLTFAPESGN 450
              .:.:|.             ||| |..:     |..:.||:....:.|:.:...|    |:.| 
  Rat  2280 --KQLFARTVDFTSPIFKIQIGFPLGSSLRDSFYFTDNGQIIFPESDYDVFSYPNP----PQRG- 2337

  Fly   451 MLGGQVVNITGPCFDPAIRVTCHFDTESVLGT---------YVDRNRVICVQPYLKAEGYIRFQI 506
            ..|.:.|.|..|.:..|       |..|..||         |.:::::|     .|.|..|....
  Rat  2338 FTGRERVAIVAPFWGDA-------DFSSSRGTIFYQDYITFYDEQHQLI-----RKVESLINEFT 2390

  Fly   507 SVGTQRFKWRGKY-FVETPAAATEKIFFTTDDVHKKNPAEIRITWNQYNLTSNANANVMISLWGY 570
            |..:.:.||..|. :|..||...:..|.|                |.|....:.:.:...:|:.|
  Rat  2391 SDWSFKAKWTLKVTWVNVPAYPAQGSFGT----------------NTYQAILSTDGSRSYALFLY 2439

  Fly   571 RETKIEPQLEYIDVIEASYS------NSGSYVITPSNYINRNNINRDMQFGFLQINLTQPDQY-- 627
            :...:.     .||.:..|:      :||......|..|.|..:.:           .:||::  
  Rat  2440 QSGGMR-----WDVTQGLYNRVLMGFSSGDGYFENSPLIFRPAVEK-----------YRPDRFLN 2488

  Fly   628 SGLAISPVLWSRPIPLAWYMAPQWERQHGKRWA--RALCDNWIRADRFLRNFAADLPLCPCTLDQ 690
            |.|.|..:              |..|.|.:.|.  |..|..|:.:.....::..:...|||:..|
  Rat  2489 SKLGIRGL--------------QVYRLHREVWPNYRLKCLQWLESQPQQPSWGWNKISCPCSWQQ 2539

  Fly   691 AVLDKGRFR---------PDRECDKDSNPSCLRHRGAIHCVVSGTPVAQGAEQQCCYDRYGFLML 746
            ...|   ||         ..:.|...|.      ||.: |              |.|..:|    
  Rat  2540 GRWD---FRFWLINTGLWGRQLCSFSSG------RGGV-C--------------CSYGTWG---- 2576

  Fly   747 TYDQMWGSRPRRVHNLGKMPWNEASKVPSLSMWFHDMRPFYSCCYWQEEQAVGCETYRFERRPSQ 811
            .:.:.|     |:|:    || :..:......|         ||.|.::.:. |..|:. |||..
  Rat  2577 EFREGW-----RMHS----PW-QFDEEQEAQNW---------CCRWNDKPSF-CVQYQL-RRPRV 2620

  Fly   812 DCVAYQAPGIAGVFGDPHFVTFDGTAYTFNGLGEFVLARSVDESNKFEVQGRFEQLPVNYYGEVK 876
            .|..|:.|..|..|||||..|.|...|||||||:|:|.::.|.::.|.::||..|.     ....
  Rat  2621 SCAGYRPPRPAWTFGDPHITTLDNAKYTFNGLGDFLLVQAQDRNSSFLLEGRTAQT-----DSAN 2680

  Fly   877 ATQLTAVAMRGNTTTTIEVRLRPLHARWRYRLDVLADGRRVYFDRESLKFQHFD--GVTVYTPTY 939
            ||...|.|.:.||::.    ..|:..:|....:   |..||..:.:::.|...|  .:.|:..|.
  Rat  2681 ATNFIAFAAQYNTSSL----KSPITVQWFLEPN---DTIRVVHNNQTVAFNTSDTEDLPVFNATG 2738

  Fly   940 LL---NQSQVVVQFDAGIGVEVVENEGYMTGRVFLPWKFINKTAGLFGNWSFNKLDDFMLPNGQV 1001
            :|   |.|||...||..:.:.|:.....:.....|..::.|.|.||.|.|:.|..|||.:|||..
  Rat  2739 VLLIQNGSQVSANFDGTVTISVIALSNILHASSSLSEEYRNHTKGLLGVWNDNPEDDFRMPNGST 2803

  Fly  1002 AQLNLNDLRSIHTNFGIKWMLTDREVPGVGAALFKREFGRMSGYYANATFQPNYVLDPADFLPA- 1065
            ...|.::....|  :|:.|.:....:.||.....                       |::|.|. 
  Rat  2804 IPSNSSEETLFH--YGMTWQINGTGLLGVRTDPL-----------------------PSEFTPIF 2843

  Fly  1066 -----NRSYDLERAEELCGECMQCQYDYAMTLNRDLAHFTKNYYDT 1106
                 |:|...|.....|.|..||::|...|.|||:...|.:...|
  Rat  2844 LSQLWNKSGAGEDLISGCNEDAQCKFDILATGNRDIGQSTNSILRT 2889
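
Each sequence row in the alignment carries a label, a start coordinate, the aligned residues (with `-` marking gaps), and an end coordinate. As a sanity check, the number of non-gap residues in a row should equal `end - start + 1`. The sketch below tests this on the first alignment block. The regular expression and the helper name `check_row` are assumptions made for illustration, not a DIOPT format specification.

```python
import re

# label, start coordinate, aligned residues (may contain '-'), end coordinate
ROW = re.compile(r"^\s*(\w+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def check_row(line: str) -> tuple[str, int, int]:
    """Return (label, non-gap residue count, span implied by the coordinates)."""
    match = ROW.match(line)
    if match is None:
        raise ValueError(f"not a sequence row: {line!r}")
    label = match.group(1)
    start, end = int(match.group(2)), int(match.group(4))
    ungapped = len(match.group(3).replace("-", ""))
    return label, ungapped, end - start + 1


# First alignment block, copied from the rows above.
fly_row = "Fly   261 PSDIDQRTPGVYFRVERDLMGRTDRFGVEVRERTMWDIRQGVVGADTFIPKHVVIAT-------- 317"
rat_row = "Rat  2108 PSTSSRRTPSV---------ATSDIF-------TTTDSTSGNAGHTLLTGSHSVITSRVASTTLG 2156"

for row in (fly_row, rat_row):
    label, ungapped, span = check_row(row)
    print(label, ungapped, span, "consistent" if ungapped == span else "MISMATCH")
```

The match lines between the two sequences have no label or coordinates, so they do not fit this pattern and would need to be skipped when parsing a whole alignment.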

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence        Domain  Region      External ID  Identity
mesh  NP_001163782.1  NIDO    259..419    CDD:214712   42/222 (19%)
                      TIG     446..511    CDD:460355   17/73 (23%)
                      AMOP    660..803    CDD:461048   28/153 (18%)
                      VWD     811..999    CDD:214566   60/192 (31%)
                      CCP     1121..1171  CDD:214478
Muc4  XP_063126583.1  None
Blue background indicates that the domain is not in the aligned region.
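
The per-domain identity percentages seem to be rounded to the nearest whole number (42/222 is about 18.9% and is listed as 19%), unlike the overall alignment statistics. The sketch below reproduces those values and also checks which mesh domains overlap the aligned fly region. The aligned range 261..1106 is read off the first and last fly rows of the alignment. The helper names are illustrative, and the rounding rule is an assumption.

```python
def rounded_percent(count: int, total: int) -> int:
    """Whole-number percentage, rounded to nearest (assumed rule)."""
    return (200 * count + total) // (2 * total)


def overlaps(region: tuple[int, int], aligned: tuple[int, int]) -> bool:
    """True if two inclusive coordinate ranges share at least one position."""
    return region[0] <= aligned[1] and aligned[0] <= region[1]


ALIGNED_FLY = (261, 1106)  # first and last fly coordinates in the alignment

# (domain, region on mesh, identity as (matches, columns) or None if not listed)
DOMAINS = [
    ("NIDO", (259, 419), (42, 222)),
    ("TIG", (446, 511), (17, 73)),
    ("AMOP", (660, 803), (28, 153)),
    ("VWD", (811, 999), (60, 192)),
    ("CCP", (1121, 1171), None),
]

for name, region, identity in DOMAINS:
    in_alignment = overlaps(region, ALIGNED_FLY)
    pct = "n/a" if identity is None else f"{rounded_percent(*identity)}%"
    print(f"{name}: region {region[0]}..{region[1]}, aligned={in_alignment}, identity={pct}")
```

Under these assumptions, CCP (1121..1171) is the only domain lying entirely outside the aligned region, which is consistent with it having no identity value in the table.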
