DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Mayo and Adgre1

DIOPT Version :10

Sequence 1:NP_651842.3 Gene:Mayo / 43678 FlyBaseID:FBgn0039818 Length:802 Species:Drosophila melanogaster
Sequence 2:NP_001007558.1 Gene:Adgre1 / 316137 RGDID:1359214 Length:932 Species:Rattus norvegicus


Alignment Length:938 Identity:196/938 - (20%)
Similarity:316/938 - (33%) Gaps:276/938 - (29%)


- Green bases have known domain annotations that are detailed below.


  Fly     4 NWCAGLFVVLALVSPIYSACPKENQMCLRNGTFQYCDGEERNSS-DELMQTHLVYCTSMF----- 62
            :|..|        ||.:..|...:: ||   |...|   .:||: ...:.::...|.|.|     
  Rat   120 SWILG--------SPGHFLCTDVDE-CL---TIGIC---PKNSNCSNSVGSYSCTCQSGFVSNGS 169

  Fly    63 -CEPEEFQHDYITRNHDQTPHQNKWKRARIGERATLHDV-----CLLRNGMP----------VTR 111
             ||.|:   :.:|||             ...|.||.|:.     |....|:.          :..
  Rat   170 TCEDED---ECVTRN-------------ACPEHATCHNTLGSYYCTCNEGLEFSGGGPMFQGLEE 218

  Fly   112 ECRVKDNRANWESTEHWDPVVCLR------------------RFREHTISGDLNSLH--DDVLEG 156
            .|...| ..:..||......:|:.                  :...|...|:...:.  ||:...
  Rat   219 SCEDVD-ECSRNSTLCGPSFICINTLGSYSCSCPAGFSLSTFQIPGHPADGNCTDIDECDDICPS 282

  Fly   157 RRRTNDTQGRREMTGIMRNMFRQRGGNLLPADVHMTGQMFGALMQQDKDATVSVDLVSVCKEIMS 221
            .....:|.|....|             ..|......||    |...|::.|        |::|..
  Rat   283 NSSCTNTLGSYFCT-------------CHPGFASSNGQ----LNFTDQEVT--------CEDIDE 322

  Fly   222 CDSKVLRLSAQLNATN-------SLLSQFESYMDALPEQLVPKDRCGKVVAK------PTSDEAE 273
            |.....|.....:.||       |.|..|.  ||....|......|.::..|      |.|::.|
  Rat   323 CTQDPFRCGRNSSCTNVPGSYNCSCLPDFR--MDPGGSQAHGNFTCKRIPFKCKEDLIPKSEQIE 385

  Fly   274 TATTGVETYNFSDIGVQALITGNISVFFANPECDRITGLAIFSAPGDQRKTSAS--------GFW 330
            ....| :..|.........:....::.  :..|:.      .|||...:..:.|        ..|
  Rat   386 QCQAG-QGRNLDYTSFCTFVNATFTIL--DNTCEN------KSAPVSLQSAATSVSLMLEQASTW 441

  Fly   331 YRFIRFSEDLAKVKEESNLET-----------AAFL---------------------------PE 357
            :.|.|        :|.|.|.|           ||.|                           .|
  Rat   442 FEFSR--------EETSTLGTILLETVESTMLAALLTPSGNASQTIRTEYLEIESKVINEECNEE 498

  Fly   358 NLWRQVKSRG-----ATYLI------------FKVYAH-----DALFVETSLQRTRKPR--SKVI 398
            |:...:|:||     ..::|            |..:||     |..|.|.. |.:.|.|  |.|:
  Rat   499 NVSINLKARGDKMDVGCFIIKESESTGTPGVAFVSFAHMDSVLDERFFEDG-QASWKLRMNSHVV 562

  Fly   399 SISIPGLRSNYLSLPLPFLLRNENLRNPDSKAFSIGSGCGYWNYET----WSTEGVSTESSSDLL 459
            ..::.|.|....|.|:.:.|::...:....::.     |..||.:.    |:..|..|..:|   
  Rat   563 GGTVTGERKEDFSKPIVYTLQHIQPKQKSERSI-----CVSWNTDVEDGRWTPSGCETVEAS--- 619

  Fly   460 KDAIIECHT----NHLTQFAFLVGGSYRANDLGEEILITPINEKVLDIISIVGCSLSLLGILGIF 520
                 |.||    |.:|..|.::..       ||..:     |..|.|||.||..:||:.:.   
  Rat   620 -----ETHTVCSCNRMTNLAIIMAS-------GELTM-----EFSLYIISYVGTVISLVCLA--- 664

  Fly   521 LTAALFKSWRSQASTKVLLHLCLAMCLQMMLFVFLNTDDVSEALVVNGNTVRCVALGAAMQYSIL 585
            |..|.|..:|:..:....|||.|.:||.:...:||...|.::      |...|..:...:.|..|
  Rat   665 LAIATFLLFRAVQNHNTYLHLHLCVCLFLAKILFLTGIDKTD------NQTACAIIAGFLHYLFL 723

  Fly   586 VLFSWMLIIAFLQF--------QRYVTVIGIERPPRYILKAAIVAWLLPLVPTLLVALIDPDSY- 641
            ..|.|||:.|.:.|        ..|.:...|:     :|......:.||:|..::.|.:.|..| 
  Rat   724 ACFFWMLVEAVMLFLMVRNLKVVNYFSSRNIK-----MLHLCAFGYGLPVVVVIISATVHPWGYG 783

  Fly   642 VPSAAQLSTDTGICYPSGYGLIFGVVLPVTLITVCNLVIFVYVFYSISHSLSQSIHKNEKKMVVK 706
            :.:...|:|:|        |.|:..:.||.:|...|..:..:..:.:...|. |::....|:  |
  Rat   784 MHNRCWLNTET--------GFIWSFLGPVCMIITINSALLAWTLWVLRQKLC-SVNSEVSKL--K 837

  Fly   707 QIRL----SIMLFFLLGLTWIFGIFAFMQAGVAFSYLFCITATMQGFVMFIYFVLLDSTNRRLWV 767
            ..||    :|...|:||.:|:.|||.........:|||....::||..:|:...||:...|..:.
  Rat   838 DTRLLTFKAIAQIFILGCSWVLGIFQIGPLASIMAYLFTTINSLQGAFIFLIHCLLNRQVRDEYR 902

  Fly   768 GLICPTKMELDVQKRTTE--LQSMTTSS 793
            .|: ..|.:|....:|:.  |.||.::|
  Rat   903 KLL-TRKTDLSSHSQTSGILLSSMPSTS 929

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
MayoNP_651842.3 7tmB2_Adhesion 498..766 CDD:320168 73/280 (26%)
TM helix 1 500..525 CDD:320168 9/24 (38%)
TM helix 2 535..557 CDD:320168 8/21 (38%)
TM helix 3 574..600 CDD:320168 8/33 (24%)
TM helix 4 615..635 CDD:320168 4/19 (21%)
TM helix 5 659..688 CDD:320168 6/28 (21%)
TM helix 6 704..730 CDD:320168 11/29 (38%)
TM helix 7 734..759 CDD:320168 7/24 (29%)
Adgre1NP_001007558.1 EGF_CA 33..63 CDD:238011
EGF_CA 81..>109 CDD:214542
EGF_CA 133..172 CDD:214542 9/45 (20%)
EGF_CA 173..>203 CDD:214542 9/45 (20%)
EGF_CA 222..255 CDD:214542 4/33 (12%)
EGF_CA 272..>301 CDD:214542 6/41 (15%)
EGF_CA 319..349 CDD:429571 6/29 (21%)
Cell attachment site. /evidence=ECO:0000255 507..509 1/1 (100%)
GPS 592..641 CDD:197639 13/68 (19%)
GPS. /evidence=ECO:0000255|PROSITE-ProRule:PRU00098 596..643 15/61 (25%)
7tmB2_EMR 645..907 CDD:320555 74/287 (26%)
TM helix 1 647..672 CDD:320555 11/27 (41%)
TM helix 2 681..703 CDD:320555 8/21 (38%)
TM helix 3 712..739 CDD:320555 8/26 (31%)
TM helix 4 756..776 CDD:320555 4/19 (21%)
TM helix 5 793..816 CDD:320555 7/30 (23%)
TM helix 6 840..865 CDD:320555 10/24 (42%)
TM helix 7 869..894 CDD:320555 7/24 (29%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.