DRSC/TRiP Functional Genomics Resources


Protein Alignment Mep1a and Cubn

DIOPT Version: 10

Sequence 1: NP_037275.1  Gene: Mep1a / 25684  RGDID: 3080  Length: 748  Species: Rattus norvegicus
Sequence 2: NP_727348.2  Gene: Cubn / 326235  FlyBaseID: FBgn0052702  Length: 3750  Species: Drosophila melanogaster


Alignment Length: 947
Identity: 182/947 (19%)
Similarity: 285/947 (30%)
Gaps: 411/947 (43%)
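
The summary figures above can be recomputed from the three rows of each alignment block. The following is a minimal sketch, not DIOPT's own code. It assumes the common pairwise-alignment markup in which `|` marks an identical pair, `:` and `.` mark similar pairs, and `-` marks a gap, and it assumes that similarity includes identities (consistent with 285 being larger than 182). Whether DIOPT counts `.` toward similarity is an assumption. The function names `percent` and `alignment_stats` are hypothetical.

```python
def percent(count: int, total: int) -> int:
    """Whole-number percentage, in the style of the summary lines."""
    return round(100 * count / total)


def alignment_stats(row1: str, match: str, row2: str) -> dict:
    """Count identities, similarities and gaps over aligned columns.

    row1 and row2 are the two sequence rows (with '-' for gaps) and
    match is the markup row between them. All three must have the
    same length, so trailing spaces in the markup row must be kept.
    """
    if not (len(row1) == len(match) == len(row2)):
        raise ValueError("all three rows must have the same length")
    length = len(row1)
    identical = match.count("|")
    # Assumption: similarity = identical pairs plus ':' and '.' pairs.
    similar = identical + match.count(":") + match.count(".")
    # A column is a gap column if either sequence has '-' there.
    gaps = sum(1 for a, b in zip(row1, row2) if a == "-" or b == "-")
    return {
        "length": length,
        "identity": (identical, percent(identical, length)),
        "similarity": (similar, percent(similar, length)),
        "gaps": (gaps, percent(gaps, length)),
    }


if __name__ == "__main__":
    # First ten columns of the first alignment block.
    print(alignment_stats("SFSAHIA-AV", "||||:.. .|", "SFSANATFNV"))
    # Percentages from the reported whole-alignment counts.
    print(percent(182, 947), percent(285, 947), percent(411, 947))
```

To apply this to the full alignment, the blocks would first need to be concatenated row by row, with the leading labels and coordinates stripped.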


- Green bases have known domain annotations that are detailed below.


  Rat    13 SFSAHIA-AVSIQHLSTG------HDHDDVDVGEQQKD----ISEINSAAGLNL---FQG----- 58
            ||||:.. .|||:: .:|      :.:..:|..||.|:    |.|:.:..|.::   |||     
  Fly  2881 SFSANATFNVSIKY-GSGCGGKLVYPYRAIDFAEQYKNNVECIWEVEATMGYHIGLTFQGRFYIE 2944

  Rat    59 -------DILLPRTRNALRDPSSRW----------KPPI-----PY----------ILAD----N 87
                   |.||.:.||   :.:..|          .|.:     ||          ::||    .
  Fly  2945 DSPGCTKDYLLVQQRN---ETTGNWTDLQRICGRVAPEMINTTSPYLRLIFRSDGDVVADGFLAK 3006

  Rat    88 LDLNAKGAIL--------------NAFEMFR----------------LKSCVDFKPYEGESSYII 122
            .:.|..|.:.              |.:|.:.                |.|.|:|...:|..|..:
  Fly  3007 FERNCGGLLYADSTEQELASPGFPNGYEKYLQCNWTIVPRSPSMGGVLVSFVNFDLEQGPISVCL 3071

  Rat   123 F--------------QQFSGCWSMVGDQHVGQ---NISIGEGCDYK------------------- 151
            :              ||.:.|......::.|:   |:.:.....|.                   
  Fly  3072 YDNLTVTTKDKGKDPQQTTLCGVKHNHEYRGKEYVNLLLRTDGSYSGRGFTLLYTSRLCGGIISR 3136

  Rat   152 -AIIEHEILHALGFFHEQSRTDR------DDYVNIWWN------------EIMTDYEHNFN-TYD 196
             :::|..:.|          ||.      |.|    ||            .:..|:|.|.| .||
  Fly  3137 TSMVESPVQH----------TDNTLPPGSDCY----WNLTAPAGYKFNIKFLFIDFEANSNCAYD 3187

  Rat   197 DKTITDLNTPYDYESLMHYGPFSFNKNETIPTITTKIPEFNAIIGQRLD-------FSATDLTRL 254
            ...:.....|   :....:|.|....||.:|.|:  ||:...||....|       |.|  |.|:
  Fly  3188 GVEVFSGPIP---DERYRWGRFCGRINEDLPLIS--IPQERGIIHSFSDDRDPSRGFRA--LVRV 3245

  Rat   255 NRMYNCTRTHTLLDHCAFEKTNICGMIQGTRDDADWVHEDSSQPGQVDHTLVGRCKAAGYFMY-- 317
              |.||.           ||.::.|               ||:                 ::|  
  Fly  3246 --MPNCD-----------EKISLNG---------------SSR-----------------YVYSK 3265

  Rat   318 FNTSSGVTGEVALLESRILYPKRKQQCL-----QFFYKMT-GSPSDRLLIWVRRDDNTGNVRQLA 376
            ||.:.|...:   |:.:|::.....|.:     .|..:.| |..||    :|...|..|....: 
  Fly  3266 FNNAGGYQND---LDCQIVFRVNPDQQISVEFSNFHVQDTDGCRSD----YVELRDGGGTFADI- 3322

  Rat   377 KIQTFQGDSD--------HNWKIAHVTLNE--EKKFRYVF----------------QGTK----G 411
             |..|.|.:.        |...:..||.|:  :..|:...                .|||    .
  Fly  3323 -IGRFCGQNQPPTLRTTRHTLYMRFVTDNKVTDTGFQVTINAIPRLCGSSEITLSADGTKEVTIN 3386

  Rat   412 DPGNSDGGIYLDDITLTETPCPTGV---WTIRNISQVLENTVKGDRLVSPRF------------- 460
            .|..:.||.|           |.||   |.|           |||.|:..:|             
  Fly  3387 SPARTPGGNY-----------PNGVSCFWKI-----------KGDSLLRVQFVNFDLHGPNQNGS 3429

  Rat   461 --------YNSEG-----YGFGVTLYPNGRITSNSGYLGLAFHLYSGD----------NDVILEW 502
                    ||||.     .|.|..|..||:.:|.:|:.....|:|.|:          ::|.|::
  Fly  3430 CVDDYLKIYNSEDAPLLEQGLGTDLVFNGQTSSKNGFGFATEHVYCGNVKPDIYYGRSSEVYLKF 3494

  Rat   503 PVEN-EQ--------AIMTILDQEPDA-RNRMSLSLM------------FTTSKYQTSSAINGSV 545
            ..:. ||        |:.:..::..|. :.|:.||..            :|.|.|.|....    
  Fly  3495 RSKGLEQHGGFQLQVALNSNRERHYDGLQGRVHLSQSADCNIIIRAPPNYTLSLYYTELIF---- 3555

  Rat   546 IWDRPTKVGVYDKDCDCFRSIDWGWGQAISHQMLMRRNFLKDDTLIIFVDFKDLTHLRQTEVPIS 610
                    |.||.:.:.....|      .:::.|.|        :..|||.              
  Fly  3556 --------GTYDCEMENLEVFD------RTNRSLQR--------VCSFVDM-------------- 3584

  Rat   611 SRSVIPRGLLLQGQEPLALGDSRIAMMEESLPRRLDQRQPSRPKRSVENTGP---MEDHNWPQYF 672
            .:|:....           .:.|:.|...|....||....:.|   ||. ||   .:.:|....|
  Fly  3585 GKSLFSNA-----------NELRLQMKTGSYLTSLDLTYLASP---VEK-GPGCGGQFYNTEGIF 3634

  Rat   673 RDPCDPNPCQNEGTCVNVKGMASCRCVSGHAFFYTGE 709
            .:|..||..:|...|..:     .|..|.:..|.|.|
  Fly  3635 SNPFYPNNVRNNSECQWI-----VRVPSNNVVFLTFE 3666

Known Domains:


Indicated by green bases in alignment.

Columns: Gene, Sequence, Domain, Region, External ID, Identity (Gene and Sequence appear only on the first row for each gene)
Mep1a NP_037275.1 ZnMc 32..260 CDD:469599 68/368 (18%)
MAM 270..433 CDD:459878 35/200 (18%)
MATH 432..596 CDD:445786 45/224 (20%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 641..668 7/29 (24%)
EGF 676..710 CDD:394967 9/34 (26%)
Cubn NP_727348.2 cubilin_NTD 21..149 CDD:412063
EGF_CA 156..190 CDD:238011
EGF_CA 192..233 CDD:238011
EGF_CA 282..322 CDD:214542
EGF_CA 324..367 CDD:214542
EGF 430..457 CDD:394967
EGF_CA 469..503 CDD:238011
CUB 509..622 CDD:238001
CUB 627..737 CDD:238001
CUB 744..849 CDD:412131
CUB 853..970 CDD:238001
CUB 978..1094 CDD:238001
CUB 1100..1211 CDD:238001
CUB 1216..1330 CDD:238001
CUB 1446..1549 CDD:238001
CUB 1554..1667 CDD:238001
CUB 1792..1899 CDD:238001
CUB 1910..1998 CDD:412131
CUB 2019..2133 CDD:238001
CUB 2140..2242 CDD:238001
CUB 2263..2379 CDD:238001
CUB 2385..2511 CDD:238001
CUB 2516..2630 CDD:238001
CUB <2833..2892 CDD:412131 5/10 (50%)
CUB 2898..3008 CDD:238001 21/112 (19%)
CUB 3011..3127 CDD:238001 15/115 (13%)
CUB 3130..3241 CDD:238001 27/129 (21%)
CUB 3254..3363 CDD:238001 26/149 (17%)
CUB 3379..3508 CDD:238001 35/150 (23%)
CUB 3531..3601 CDD:412131 16/120 (13%)
CUB 3623..3733 CDD:238001 12/49 (24%)
Blue background indicates that the domain is not in the aligned region.
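
For downstream use, the domain rows can be parsed into structured records. The sketch below is illustrative only. It assumes each row has the form `Domain Region [External ID] [Identity]` with any leading gene name and accession already removed, that a leading `<` on the region marks a partial domain hit, and that a row without an identity value corresponds to a domain outside the aligned region. The names `DomainRow` and `parse_domain_row` are hypothetical.

```python
import re
from dataclasses import dataclass
from typing import Optional, Tuple

# Domain name (may contain spaces), region "start..end" with optional "<",
# optional CDD identifier, optional identity "n/m (p%)".
ROW = re.compile(
    r"^(?P<domain>.+?)\s+(?P<partial><)?(?P<start>\d+)\.\.(?P<end>\d+)"
    r"(?:\s+(?P<ext>CDD:\d+))?"
    r"(?:\s+(?P<ident>\d+)/(?P<cols>\d+)\s+\((?P<pct>\d+)%\))?$"
)


@dataclass
class DomainRow:
    domain: str
    start: int
    end: int
    partial: bool
    external_id: Optional[str]
    identity: Optional[Tuple[int, int, int]]  # (identical, columns, percent)

    @property
    def in_aligned_region(self) -> bool:
        # Assumption: only domains inside the alignment carry an identity value.
        return self.identity is not None


def parse_domain_row(line: str) -> DomainRow:
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised domain row: {line!r}")
    identity = None
    if m.group("ident") is not None:
        identity = (int(m.group("ident")), int(m.group("cols")), int(m.group("pct")))
    return DomainRow(
        domain=m.group("domain"),
        start=int(m.group("start")),
        end=int(m.group("end")),
        partial=m.group("partial") is not None,
        external_id=m.group("ext"),
        identity=identity,
    )


if __name__ == "__main__":
    for row in (
        "ZnMc 32..260 CDD:469599 68/368 (18%)",
        "CUB <2833..2892 CDD:412131 5/10 (50%)",
        "EGF_CA 156..190 CDD:238011",
    ):
        print(parse_domain_row(row))
```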
