DRSC/TRiP Functional Genomics Resources



Protein Alignment: Egfr (Mus musculus) and Egfr (Drosophila melanogaster)

DIOPT Version: 9

Sequence 1: NP_997538.1  Gene: Egfr / 13649  MGI ID: 95294  Length: 1210  Species: Mus musculus
Sequence 2: NP_476759.1  Gene: Egfr / 37455  FlyBase ID: FBgn0003731  Length: 1426  Species: Drosophila melanogaster


Alignment Length: 1417  Identity: 500/1417 (35%)
Similarity: 698/1417 (49%)  Gaps: 341/1417 (24%)
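
The percentages in the summary follow directly from the counts over the alignment length. As a minimal sketch, assuming the page rounds to the nearest whole percent (the helper name `percent` is hypothetical, not part of DIOPT):

```python
def percent(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage (nearest integer)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * count / total)


ALIGNMENT_LENGTH = 1417  # columns in the pairwise alignment

# Counts taken from the summary lines of this report.
summary = {"Identity": 500, "Similarity": 698, "Gaps": 341}

for label, count in summary.items():
    print(f"{label}: {count}/{ALIGNMENT_LENGTH} ({percent(count, ALIGNMENT_LENGTH)}%)")
```

All three values reproduce the reported 35%, 49% and 24%, so the summary is internally consistent.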


- Green residues have known domain annotations, detailed under Known Domains below.


Mouse    24 ALEEK--------KVCQGTSNRLTQLGTFEDHFLSLQRMYNNCEVVLGNLEITYV-QRNYDLSFL 79
            :||:|        |:|.||.:||:.....|.|:.:|:..|.||..|.||||:|:: ..|.|||||
  Fly    87 SLEDKNKNEFVKGKICIGTKSRLSVPSNKEHHYRNLRDRYTNCTYVDGNLELTWLPNENLDLSFL 151

Mouse    80 KTIQEVAGYVLIALNTVERIPLENLQIIRGNALY-----ENTYALAILSNYGTNRTGLRELPMRN 139
            ..|:||.||:||:...|:::....||||||..|:     |..|||.:  .|....|  .|:|  :
  Fly   152 DNIREVTGYILISHVDVKKVVFPKLQIIRGRTLFSLSVEEEKYALFV--TYSKMYT--LEIP--D 210

Mouse   140 LQEILIGAVRFSNNPILCNMDTIQWRDIVQNVFMSNMSMDLQSHPSSCPKCDPSCPNGSCWGGGE 204
            |:::|.|.|.|.||..||:|.||||.:||.|...:..:.|..:....||||..||.:| |||.|.
  Fly   211 LRDVLNGQVGFHNNYNLCHMRTIQWSEIVSNGTDAYYNYDFTAPERECPKCHESCTHG-CWGEGP 274

Mouse   205 ENCQKLTKIICAQQCS-HRCRGRSPSDCCHNQCAAGCTGPRESDCLVCQKFQDEATCKDTCPPLM 268
            :||||.:|:.|:.||: .||.|..|.:|||..||.|||||.:.||:.|:.|.||..||:.|||:.
  Fly   275 KNCQKFSKLTCSPQCAGGRCYGPKPRECCHLFCAGGCTGPTQKDCIACKNFFDEGVCKEECPPMR 339

Mouse   269 LYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGSCVRACGPDYYEVEEDGIRKCKKCDGPC 333
            .||||||.::.||||||::||||||:|| .:::.|:|:|||:|..|  ::::.|  :|..|:|||
  Fly   340 KYNPTTYVLETNPEGKYAYGATCVKECP-GHLLRDNGACVRSCPQD--KMDKGG--ECVPCNGPC 399

Mouse   334 RKVCNGIGIGEFKDTLSINATNIKHFKYCTAISGDLHILPVAFKG-----DSFTRTP---PLDPR 390
            .|.|.|:.:        ::|.||..|:.||.|.|::.||...|.|     .::|..|   ||||.
  Fly   400 PKTCPGVTV--------LHAGNIDSFRNCTVIDGNIRILDQTFSGFQDVYANYTMGPRYIPLDPE 456

Mouse   391 ELEILKTVKEITGFLLIQAWPDNWTDLHAFENLEIIRGRTKQHGQF-SLAVVGLNITSLGLRSLK 454
            .||:..|||||||:|.|:.....:.:|..|.|||.|.||......| :||:|..::.||.:|:||
  Fly   457 RLEVFSTVKEITGYLNIEGTHPQFRNLSYFRNLETIHGRQLMESMFAALAIVKSSLYSLEMRNLK 521

Mouse   455 EISDGDVIISGNRNLCYANTINWKKLFGTPNQKTKIMNNRAEKDCKAVNHVCNPLCSSEGCWGPE 519
            :||.|.|:|..||:|||.:.|.|..:...|.||..:..|.....|:....:|:..|:.:||||..
  Fly   522 QISSGSVVIQHNRDLCYVSNIRWPAIQKEPEQKVWVNENLRADLCEKNGTICSDQCNEDGCWGAG 586

Mouse   520 PRDCVSCQNVSRGRECVEKCNILEGEPREFVENSECIQCHPECLPQAMNITCTGRGPDNCIQCAH 584
            ...|::|:|.:....|:..|..:....:  .:|..|..|||||.      ||.|.|.|:|.:|.|
  Fly   587 TDQCLTCKNFNFNGTCIADCGYISNAYK--FDNRTCKICHPECR------TCNGAGADHCQECVH 643

Mouse   585 YIDGPHCVKTCPA-----------------GIMGENNTL-------------------------- 606
            ..||.|||..||.                 |..|..:|:                          
  Fly   644 VRDGQHCVSECPKNKYNDRGVCRECHATCDGCTGPKDTIGIGACTTCNLAIINNDATVKRCLLKD 708

Mouse   607 -------VWKYADANN-----------VCHLCH----------------ANCTY----------- 626
                   .|:|.....           ||..||                :.||:           
  Fly   709 DKCPDGYFWEYVHPQEQGSLKPLAGRAVCRKCHPLCELCTNYGYHEQVCSKCTHYKRREQCETEC 773

Mouse   627 ---------------------GCAGPGLQGC------------EVWP------------------ 640
                                 ||.|||...|            |..|                  
  Fly   774 PADHYTDEEQRECFQCHPECNGCTGPGADDCKSCRNFKLFDANETGPYVNSTMFNCTSKCPLEMR 838

Mouse   641 ---------------SGPKIPSIATGIVGGLLFIVVVAL---------GIGLFMRRRHIVRKRTL 681
                           |.|:...|...:...::||:..|:         .:....|::...:|.|:
  Fly   839 HVNYQYTAIGPYCAASPPRSSKITANLDVNMIFIITGAVLVPTICILCVVTYICRQKQKAKKETV 903

Mouse   682 R--RLLQERELVEPLTPSGEAPNQAHLRILKETEFKKIKVLGSGAFGTVYKGLWIPEGEKVKIPV 744
            :  ..|...|..|||.||....|...|||:|:.|.:|..|||.||||.||||:|:||||.|||||
  Fly   904 KMTMALSGCEDSEPLRPSNIGANLCKLRIVKDAELRKGGVLGMGAFGRVYKGVWVPEGENVKIPV 968

Mouse   745 AIKELREATSPKANKEILDEAYVMASVDNPHVCRLLGICLTSTVQLITQLMPYGCLLDYVREHKD 809
            |||||.::|..::::|.|.|||:||||::.::.:||.:|::|.:.|||||||.||||||||.::|
  Fly   969 AIKELLKSTGAESSEEFLREAYIMASVEHVNLLKLLAVCMSSQMMLITQLMPLGCLLDYVRNNRD 1033

Mouse   810 NIGSQYLLNWCVQIAKGMNYLEDRRLVHRDLAARNVLVKTPQHVKITDFGLAKLLGAEEKEYHAE 874
            .|||:.||||..||||||:|||::||||||||||||||:||..||||||||||||.::..||.|.
  Fly  1034 KIGSKALLNWSTQIAKGMSYLEEKRLVHRDLAARNVLVQTPSLVKITDFGLAKLLSSDSNEYKAA 1098

Mouse   875 GGKVPIKWMALESILHRIYTHQSDVWSYGVTVWELMTFGSKPYDGIPASDISSILEKGERLPQPP 939
            |||:||||:|||.|.:|::|.:||||::|||:|||:|||.:|::.|||.||..::|.|.:|.||.
  Fly  1099 GGKMPIKWLALECIRNRVFTSKSDVWAFGVTIWELLTFGQRPHENIPAKDIPDLIEVGLKLEQPE 1163

Mouse   940 ICTIDVYMIMVKCWMIDADSRPKFRELILEFSKMARDPQRYLVIQGDERMHLPSPTDSNFYRALM 1004
            ||::|:|..::.||.:||..||.|::|...|::.||||.|||.|.||:...||:.|.       .
  Fly  1164 ICSLDIYCTLLSCWHLDAAMRPTFKQLTTVFAEFARDPGRYLAIPGDKFTRLPAYTS-------Q 1221

Mouse  1005 DEEDM-----------EDVVDADEYLIPQQGFFNSPSTS-RTPLLSSLSATSNNSTVACINRNGS 1057
            ||:|:           |.:.:.|:||.|:.    :|..| ||.....:..               
  Fly  1222 DEKDLIRKLAPTTDGSEAIAEPDDYLQPKA----APGPSHRTDCTDEIPK--------------- 1267

Mouse  1058 CRVKEDAFLQRYSSDPT---GAVTEDNIDDAF-----------LPVPEYVNQSVPKRPAGSVQNP 1108
                    |.||..||:   .:..:|..|.:.           |||.|                .
  Fly  1268 --------LNRYCKDPSNKNSSTGDDETDSSAREVGVGNLRLDLPVDE----------------D 1308

Mouse  1109 VYHNQPLHPAPGRDLHYQNPHSN---AVG----------------NPEYLNTAQ---------PT 1145
            .|......|.|..:.:..||:.|   |||                |||||..||         ||
  Fly  1309 DYLMPTCQPGPNNNNNINNPNQNNMAAVGVAAGYMDLIGVPVSVDNPEYLLNAQTLGVGESPIPT 1373

Mouse  1146 CLSSGFNSPALWIQKGSHQMSLDNPDYQQDFFPKETKPNGIFKGPTAENAEY 1197
               .....|.:.: .|:.::.:..|..:                ||:.:.||
  Fly  1374 ---QTIGIPVMGV-PGTMEVKVPMPGSE----------------PTSSDHEY 1405
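
Each block of the alignment has three rows: the mouse sequence, a markup row, and the fly sequence. The page does not define the markup characters; the sketch below assumes the common pairwise-alignment convention in which `|` marks an identical pair, `:` a conservative substitution, and `-` in either sequence a gap. Under that assumption, the summary counts can be tallied per block. The function name `tally_columns` is hypothetical.

```python
def tally_columns(seq1: str, markup: str, seq2: str) -> dict:
    """Count identities, similarities and gaps in one aligned block."""
    if not (len(seq1) == len(markup) == len(seq2)):
        raise ValueError("all three rows must have the same length")
    identity = markup.count("|")
    # Assumption: ':' columns count as similar, and identical columns
    # are included in the similarity total as well.
    similarity = identity + markup.count(":")
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")
    return {
        "length": len(markup),
        "identity": identity,
        "similarity": similarity,
        "gaps": gaps,
    }


# First 17 columns of the first block (mouse from residue 24, fly from 87).
mouse = "ALEEK" + "-" * 8 + "KVCQ"
markup = ":||:|" + " " * 8 + "|:|."
fly = "SLEDK" + "NKNEFVKG" + "KICI"

print(tally_columns(mouse, markup, fly))
```

Summing these tallies over every block would give totals comparable to the Identity, Similarity and Gaps figures in the summary, provided the assumed markup convention matches the one DIOPT uses.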

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence     Domain          Region      External ID  Identity
Egfr  NP_997538.1  Recep_L_domain  57..168     CDD:279382   51/116 (44%)
                   Approximate     75..300                  110/230 (48%)
                   Furin-like      184..335    CDD:279142   77/151 (51%)
                   FU              231..274    CDD:238021   24/42 (57%)
                   Recep_L_domain  362..481    CDD:279382   54/127 (43%)
                   Approximate     390..600                 79/227 (35%)
                   GF_recep_IV     505..636    CDD:291509   49/239 (21%)
                   FU              506..559    CDD:238021   13/52 (25%)
                   FU              552..598    CDD:214589   22/62 (35%)
                   TM_ErbB1        634..679    CDD:213054   10/98 (10%)
                   Important for dimerization, phosphorylation and activation. /evidence=ECO:0000250
                                   690..706                 6/15 (40%)
                   PTKc_EGFR       706..1018   CDD:270683   177/322 (55%)
                   TyrKc           715..970    CDD:197581   153/254 (60%)
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                                   1113..1137               9/42 (21%)
Egfr  NP_476759.1  Recep_L_domain  419..547    CDD:279382   56/136 (41%)
                   GF_recep_IV     572..684    CDD:291509   39/172 (23%)
                   FU              573..>606   CDD:238021   10/32 (31%)
                   FU              617..659    CDD:214589   19/41 (46%)
                   FU              662..716    CDD:214589   21/190 (11%)
                   FU              739..786    CDD:238021   19/48 (40%)
                   FU              784..834    CDD:214589   32/49 (65%)
                   PTKc_EGFR_like  930..1208   CDD:270648   105/331 (32%)
                   STYKc           938..1194   CDD:214568   91/281 (32%)
                   Recep_L_domain  128..239    CDD:279382   49/116 (42%)
                   Furin-like      253..401    CDD:279142   75/148 (51%)
                   FU              255..292    CDD:214589   18/37 (49%)
                   FU              302..345    CDD:238021   23/42 (55%)


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       1             1.000           341 1.000 Domainoid score I1078
eggNOG          1             0.900           - - E1_KOG1025
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             1             1.010           - - QHG45861
OrthoDB         1             1.010           - - D51516at33208
OrthoFinder     1             1.000           - - FOG0000755
OrthoInspector  1             1.000           - - otm42670
orthoMCL        1             0.900           - - OOG6_101159
Panther         1             1.100           - - O PTHR24416
Phylome         1             0.910           - -
RoundUp         1             1.030           - avgDist Average_Evolutionary_Distance R5222
SonicParanoid   1             1.000           - - X448
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         1             0.960           - -
Total           12            11.820
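
The final row of the table reads as the column totals: 12 tools reported a match, with a combined weighted score of 11.820. As a rough check, the sketch below re-adds the per-tool values transcribed from the table. The `SCORES` mapping and `totals` helper are illustrative names, and `Decimal` is used only to avoid floating-point drift in the sum.

```python
from decimal import Decimal

# (simple score, weighted score) per tool, transcribed from the table.
SCORES = {
    "Compara": (0, "0.000"),
    "Domainoid": (1, "1.000"),
    "eggNOG": (1, "0.900"),
    "Hieranoid": (0, "0.000"),
    "Homologene": (0, "0.000"),
    "Inparanoid": (0, "0.000"),
    "Isobase": (0, "0.000"),
    "OMA": (1, "1.010"),
    "OrthoDB": (1, "1.010"),
    "OrthoFinder": (1, "1.000"),
    "OrthoInspector": (1, "1.000"),
    "orthoMCL": (1, "0.900"),
    "Panther": (1, "1.100"),
    "Phylome": (1, "0.910"),
    "RoundUp": (1, "1.030"),
    "SonicParanoid": (1, "1.000"),
    "SwiftOrtho": (0, "0.000"),
    "TreeFam": (1, "0.960"),
}


def totals(scores: dict) -> tuple:
    """Return (sum of simple scores, sum of weighted scores)."""
    simple = sum(s for s, _ in scores.values())
    weighted = sum(Decimal(w) for _, w in scores.values())
    return simple, weighted


simple_total, weighted_total = totals(SCORES)
print(simple_total, weighted_total)
```

Both sums agree with the totals row, so 12 of the 18 listed tools support this mouse-fly Egfr ortholog pair.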
