DRSC/TRiP Functional Genomics Resources


Protein Alignment: Egfr and Epha2

DIOPT Version: 10

Sequence 1: NP_476759.1  Gene: Egfr / 37455  FlyBaseID: FBgn0003731  Length: 1426  Species: Drosophila melanogaster
Sequence 2: NP_034269.2  Gene: Epha2 / 13836  MGIID: 95278  Length: 977  Species: Mus musculus


Alignment Length: 937
Identity: 215/937 (22%)  Similarity: 333/937 (35%)  Gaps: 247/937 (26%)
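The percentages in the summary can be recomputed from the ratios. The following is a minimal sketch, not part of DIOPT: the names `SUMMARY`, `parse_summary` and `percent` are hypothetical, and the string is simply the three ratios copied from this page.

```python
import re

# The three ratios from the alignment summary on this page.
SUMMARY = "Identity: 215/937 Similarity: 333/937 Gaps: 247/937"

def parse_summary(text):
    """Return {field: (count, total)} for Identity, Similarity and Gaps."""
    pattern = re.compile(r"(Identity|Similarity|Gaps):\s*(\d+)/(\d+)")
    return {name: (int(n), int(d)) for name, n, d in pattern.findall(text)}

def percent(count, total):
    """Exact percentage as a float, with no rounding applied."""
    return 100.0 * count / total

stats = parse_summary(SUMMARY)
for name, (count, total) in stats.items():
    print(f"{name}: {count}/{total} = {percent(count, total):.1f}%")
```

This prints 22.9%, 35.5% and 26.4%. The page shows 22%, 35% and 26%, so the summary values look truncated rather than rounded; keep that in mind when comparing against a cutoff.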


- Green residues have known domain annotations, which are detailed under Known Domains below.


  Fly   418 RNCTVIDGNIRILDQTFSGF---QDV-YANYTMGPRYIPLD---PERLEVFSTVKEITGYLNIEG 475
            |:|....|......:||:.:   .|| |.......::..:|   |:.:.|.|..:.....||:|.
Mouse   102 RDCNSFPGGASSCKETFNLYYAESDVDYGTNFQKRQFTKIDTIAPDEITVSSDFEARNVKLNVEE 166

  Fly   476 --THPQFRNLSYFRNLETIHGRQLMESMFAALAIVKSSLYSLEMRNLKQISSGSVVIQHNRDLCY 538
              ..|..|...|.          ..:.:.|.:|::...:|..:...:.|                
Mouse   167 RMVGPLTRKGFYL----------AFQDIGACVALLSVRVYYKKCPEMLQ---------------- 205

  Fly   539 VSNIRWPAIQKEPEQKVWVNENLRADLCEKNGTICSD-------------QCNEDGCWGAGTDQC 590
             |..|:|       :.:.|..:....|....|| |.|             .|..||.|.....||
Mouse   206 -SLARFP-------ETIAVAVSDTQPLATVAGT-CVDHAVVPYGGEGPLMHCTVDGEWLVPIGQC 261

  Fly   591 LTCKNFNFNGTCIADCGYISNAYKFDNRTCKICHP----------ECRTC------NGAGADHCQ 639
            | |:                ..|:.....|:.|.|          .|..|      :..||..||
Mouse   262 L-CQ----------------EGYEKVEDACRACSPGFFKSEASESPCLECPEHTLPSTEGATSCQ 309

  Fly   640 ECVHVRDGQHCVSECPKNKYNDRGVCRECHATCDGCTGPKD------TIGIGACTTCNLAIINND 698
             |   .:|.....|.|.:.               .||.|..      .||:||....        
Mouse   310 -C---EEGYFRAPEDPLSM---------------SCTRPPSAPNYLTAIGMGAKVEL-------- 347

  Fly   699 ATVKRCLLKDDKCPDGYFWEYVHPQEQGSLKPLAGRAVCRKCHP------LCELCTNYGYHEQVC 757
                               .:..|::.|..:.:.....|.:|.|      .||....|       
Mouse   348 -------------------RWTAPKDTGGRQDIVYSVTCEQCWPESGECGPCEASVRY------- 386

  Fly   758 SKCTHYKRREQCETEC--PADHYTDEEQRECFQCHPECNGCTGPGADDCKSCRNFKL--FDANET 818
            |:..|...|.......  |..:||...:..        ||.:|     ..:.|:|:.  ...|:|
Mouse   387 SEPPHALTRTSVTVSDLEPHMNYTFAVEAR--------NGVSG-----LVTSRSFRTASVSINQT 438

  Fly   819 GPYV------NSTMFNCTSKCPLEMRHVNYQYTAI----GPYCAASPPRSSKITANLD------- 866
            .|..      ::|..:.|...|:..:...::|...    |...:.:..|:...:..||       
Mouse   439 EPPKVRLEDRSTTSLSVTWSIPVSQQSRVWKYEVTYRKKGDANSYNVRRTEGFSVTLDDLAPDTT 503

  Fly   867 ------------------------------VNMIFIITGAVLVPTICILC---VVTYICRQKQKA 898
                                          .||..|  |.|.|..:.:|.   |..:|.|:::..
Mouse   504 YLVQVQALTQEGQGAGSKVHEFQTLSTEGSANMAVI--GGVAVGVVLLLVLAGVGLFIHRRRRNL 566

  Fly   899 KKETVKMTMALSGCEDSEPLR----PSNI-GANLCKLRIVKDAE---LRKGGVLGMGAFGRVYKG 955
            :.......:..|..|..:||:    |... ..|...|:...:..   :.:..|:|.|.||.||||
Mouse   567 RARQSSEDVRFSKSEQLKPLKTYVDPHTYEDPNQAVLKFTTEIHPSCVARQKVIGAGEFGEVYKG 631

  Fly   956 VWVPEGENVKIPVAIKELLKSTGAESSEEFLREAYIMASVEHVNLLKLLAVCMS-SQMMLITQLM 1019
            .........:||||||.|......:...:||.||.||....|.|:::|..|... ..||:||:.|
Mouse   632 TLKASSGKKEIPVAIKTLKAGYTEKQRVDFLSEASIMGQFSHHNIIRLEGVVSKYKPMMIITEYM 696

  Fly  1020 PLGCLLDYVRNNRDKIGSKALLNWSTQIAKGMSYLEEKRLVHRDLAARNVLVQTPSLVKITDFGL 1084
            ..|.|..::|....:.....|:.....||.||.||.....|||||||||:||.:..:.|::||||
Mouse   697 ENGALDKFLREKDGEFSVLQLVGMLRGIASGMKYLANMNYVHRDLAARNILVNSNLVCKVSDFGL 761

  Fly  1085 AKLLSSDSN-EYKAAGGKMPIKWLALECIRNRVFTSKSDVWAFGVTIWELLTFGQRPHENIPAKD 1148
            :::|..|.. .|..:|||:||:|.|.|.|..|.|||.||||::|:.:||::|:|:||:..:...:
Mouse   762 SRVLEDDPEATYTTSGGKIPIRWTAPEAISYRKFTSASDVWSYGIVMWEVMTYGERPYWELSNHE 826

  Fly  1149 IPDLIEVGLKLEQPEICSLDIYCTLLSCWHLDAAMRPTFKQLTTVFAEFARDPGRYLAIPGDKFT 1213
            :...|..|.:|..|..|...||..::.||..:.:.||.|..:.::..:..|.|        |...
Mouse   827 VMKAINDGFRLPTPMDCPSAIYQLMMQCWQQERSRRPKFADIVSILDKLIRAP--------DSLK 883

  Fly  1214 RLPAYTSQDEKDLIRKLAPTTDGSEAI 1240
            .|..:   |.:..||  .|:|.|||.:
Mouse   884 TLADF---DPRVSIR--LPSTSGSEGV 905
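The middle line of each block can be tallied directly. The sketch below uses the final block of the alignment (Fly 1214..1240, Mouse 884..905) and assumes the common convention that `|` marks identical residues while `:` and `.` mark weaker degrees of similarity; the page itself does not define these symbols. All variable names are hypothetical.

```python
from collections import Counter

# Final alignment block, sequence columns only (copied from the page).
fly   = "RLPAYTSQDEKDLIRKLAPTTDGSEAI"
match = ".|..:   |.:..||  .|:|.|||.:"
mouse = "TLADF---DPRVSIR--LPSTSGSEGV"
assert len(fly) == len(match) == len(mouse)

symbol_counts = Counter(match)
# Identical columns, counted from the sequences rather than the match line.
identical = sum(a == b and a != "-" for a, b in zip(fly, mouse))
# Columns where either sequence has a gap character.
gap_columns = sum("-" in (a, b) for a, b in zip(fly, mouse))

# Restrict to columns at or after mouse residue 887 (the block starts at 884).
mouse_pos = 884 - 1
region_cols = []
for a, b in zip(fly, mouse):
    if b != "-":
        mouse_pos += 1
    if mouse_pos >= 887:
        region_cols.append((a, b))
region_identical = sum(a == b and a != "-" for a, b in region_cols)

print(symbol_counts["|"], identical, gap_columns)
print(f"{region_identical}/{len(region_cols)}")
```

The `|` count agrees with the identities counted from the sequences. The restricted count gives 8/24, the same ratio listed for the 887..977 region under Known Domains, which suggests (but does not confirm) that per-domain identity is counted over aligned columns, gaps included.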

Known Domains:


Indicated by green residues in the alignment.

Gene   Sequence     Domain                                                                 Region     External ID  Identity
Egfr   NP_476759.1  Recep_L_domain                                                         128..239   CDD:460032
                    Furin-like                                                             253..401   CDD:395614
                    Recep_L_domain                                                         419..547   CDD:460032   24/136 (18%)
                    GF_recep_IV                                                            572..684   CDD:464344   28/146 (19%)
                    FU                                                                     662..716   CDD:214589   7/59 (12%)
                    GF_recep_IV                                                            740..>834  CDD:464344   21/109 (19%)
                    PTKc_EGFR_like                                                         930..1208  CDD:270648   99/282 (35%)
Epha2  NP_034269.2  Mediates interaction with CLDN4. /evidence=ECO:0000250                 1..205                  21/112 (19%)
                    EphR_LBD_A2                                                            27..200    CDD:198448   21/107 (20%)
                    fn3                                                                    331..425   CDD:394996   21/140 (15%)
                    fn3                                                                    439..520   CDD:394996   9/80 (11%)
                    EphA2_TM                                                               <570..611  CDD:464211   7/40 (18%)
                    Mediates interaction with ARHGEF16. /evidence=ECO:0000250              607..907                108/312 (35%)
                    Protein Kinases, catalytic domain                                      608..876   CDD:473864   96/267 (36%)
                    Negatively regulates interaction with ARHGEF16. /evidence=ECO:0000250  887..977                8/24 (33%)
                    SAM_EPH-A2                                                             903..972   CDD:188942   1/3 (33%)
                    PDZ-binding. /evidence=ECO:0000255                                     975..977
Blue background indicates that the domain is not in the aligned region; such rows have no Identity value.
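The per-domain figures can be checked the same way as the summary. This is a rough sketch with hypothetical names (`rows`, `checked`); the tuples are a few rows copied from the Known Domains listing, with the region, identity ratio and displayed percentage kept as given.

```python
# (domain, region, identity ratio, percentage shown on the page)
rows = [
    ("Recep_L_domain", "419..547", "24/136", 18),
    ("PTKc_EGFR_like", "930..1208", "99/282", 35),
    ("EphR_LBD_A2", "27..200", "21/107", 20),
    ("Protein Kinases, catalytic domain", "608..876", "96/267", 36),
]

checked = []
for name, region, ratio, shown in rows:
    start, end = (int(x) for x in region.split(".."))
    count, total = (int(x) for x in ratio.split("/"))
    region_length = end - start + 1          # residues spanned by the domain
    recomputed = round(100 * count / total)  # nearest whole percent
    checked.append((name, region_length, total, recomputed, shown))
    print(f"{name}: length {region_length}, denominator {total}, "
          f"{recomputed}% recomputed vs {shown}% shown")
```

For these rows the displayed percentages match ordinary rounding. The denominators do not equal the plain region lengths, so they should not be read as domain sizes; how DIOPT derives them is not stated on the page.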
