DRSC/TRiP Functional Genomics Resources


Protein Alignment Egfr and Erbb2

DIOPT Version: 10

Sequence 1: NP_476759.1  Gene: Egfr / 37455  FlyBaseID: FBgn0003731  Length: 1426  Species: Drosophila melanogaster
Sequence 2: NP_058699.2  Gene: Erbb2 / 24337  RGDID: 2561  Length: 1259  Species: Rattus norvegicus


Alignment Length: 1443
Identity: 475/1443 (32%)
Similarity: 666/1443 (46%)
Gaps: 360/1443 (24%)
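
The percentages in this summary can be reproduced from the raw counts. The sketch below is illustrative only: the helper name `percent_truncated` is hypothetical, and the use of integer truncation is an assumption inferred from the three figures shown here (plain rounding would give 33% identity and 25% gaps instead).

```python
def percent_truncated(count: int, total: int) -> int:
    """Integer percentage, rounded down.

    Truncation is an assumption: it is the rule that reproduces the
    three percentages printed in the alignment summary.
    """
    return count * 100 // total


alignment_length = 1443
stats = {"Identity": 475, "Similarity": 666, "Gaps": 360}

for name, count in stats.items():
    pct = percent_truncated(count, alignment_length)
    print(f"{name}: {count}/{alignment_length} ({pct}%)")
```

All three statistics share the alignment length (1443 columns, gaps included) as the denominator, which is why the values are lower than they would be if computed over aligned residues only.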


- Residues shown in green on the original page have known domain annotations, which are detailed below.


  Fly   100 KICIGTKSRLSVPSNKEHHYRNLRDRYTNCTYVDGNLELTWLPNENLDLSFLDNIREVTGYILIS 164
            ::|.||..:|.:|::.|.|...||..|..|..|.||||||::| .|..||||.:|:||.||:||:
  Rat    27 QVCTGTDMKLRLPASPETHLDMLRHLYQGCQVVQGNLELTYVP-ANASLSFLQDIQEVQGYMLIA 90

  Fly   165 HVDVKKVVFPKLQIIRGRTLFSLSVEEEKYALFV-----------------TYSKMYTLEIPDLR 212
            |..||:|...:|:|:||..||     |:||||.|                 |...:..|::..|.
  Rat    91 HNQVKRVPLQRLRIVRGTQLF-----EDKYALAVLDNRDPQDNVAASTPGRTPEGLRELQLRSLT 150

  Fly   213 DVLNGQVGFHNNYNLCHMRTIQWSEIV-SNGTDAYYNYDFTAPERECPKCHESC-THGCWGEGPK 275
            ::|.|.|....|..||:...:.|.::. .|...|..:.| |...|.||.|..:| .:.||||.|:
  Rat   151 EILKGGVLIRGNPQLCYQDMVLWKDVFRKNNQLAPVDID-TNRSRACPPCAPACKDNHCWGESPE 214

  Fly   276 NCQKFSKLTCSPQCAGGRCYGPKPRECCHLFCAGGCTGPTQKDCIACKNFFDEGVCKEECPPMRK 340
            :||..:...|:..||  ||.|..|.:|||..||.|||||...||:||.:|...|:|:..||.:..
  Rat   215 DCQILTGTICTSGCA--RCKGRLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICELHCPALVT 277

  Fly   341 YNPTTYVLETNPEGKYAYGATCVKECP-GHLLRDNGACVRSCPQDKMDKGGE-----CVPCNGPC 399
            ||..|:....||||:|.:||:||..|| .:|..:.|:|...||.:..:...|     |..|:.||
  Rat   278 YNTDTFESMHNPEGRYTFGASCVTTCPYNYLSTEVGSCTLVCPPNNQEVTAEDGTQRCEKCSKPC 342

  Fly   400 PKTCPGVTVLH--------AGNIDSFRNCTVIDGNIRILDQTFSGFQDVYANYTMGPRYIPLDPE 456
            .:.|.|:.:.|        :.|:..|..|..|.|::..|.::|.|      :.:.|  ..||.||
  Rat   343 ARVCYGLGMEHLRGARAITSDNVQEFDGCKKIFGSLAFLPESFDG------DPSSG--IAPLRPE 399

  Fly   457 RLEVFSTVKEITGYLNIEGTHPQFRNLSYFRNLETIHGRQLMESMFAALAIVKSSLYSLEMRNLK 521
            :|:||.|::||||||.|.......|:||.|:||..|.||.|.:..: :|.:....::||.:|:|:
  Rat   400 QLQVFETLEEITGYLYISAWPDSLRDLSVFQNLRIIRGRILHDGAY-SLTLQGLGIHSLGLRSLR 463

  Fly   522 QISSGSVVIQHNRDLCYVSNIRWPAIQKEPEQKVWVNENLRADLCEKNGTICSDQCNEDGCWGAG 586
            ::.||..:|..|..||:|..:.|..:.:.|.|.:..:.|...:.|...|.:|:..|....|||.|
  Rat   464 ELGSGLALIHRNAHLCFVHTVPWDQLFRNPHQALLHSGNRPEEDCGLEGLVCNSLCAHGHCWGPG 528

  Fly   587 TDQCLTCKNFNFNGTCIADCGY---ISNAYKFDNRTCKICHPECR------TCNGAGADHCQECV 642
            ..||:.|.:|.....|:.:|..   :...|..|.| |..|||||:      ||.|:.||.|..|.
  Rat   529 PTQCVNCSHFLRGQECVEECRVWKGLPREYVSDKR-CLPCHPECQPQNSSETCFGSEADQCAACA 592

  Fly   643 HVRDGQHCVSECPKNKYNDRGVCRECHATCDGCTGPKDTIGIGACTTCNLAIINNDATVKRCLLK 707
            |.:|...||:.||                    :|.|                            
  Rat   593 HYKDSSSCVARCP--------------------SGVK---------------------------- 609

  Fly   708 DDKCPD-GY--FWEYVHPQEQGSLKPLAGRAVCRKCHPLCELCTNYGYHEQVCSKCTHYKRREQC 769
                || .|  .|:|  |.|:|         :|:.| |:               .|||       
  Rat   610 ----PDLSYMPIWKY--PDEEG---------ICQPC-PI---------------NCTH------- 636

  Fly   770 ETECPADHYTDEEQRECFQCHPECNGCTGPGADDCKSCRNFKLFDANETGPYVNSTMFNCTSKCP 834
                                                ||     .|.:|.|             ||
  Rat   637 ------------------------------------SC-----VDLDERG-------------CP 647

  Fly   835 LEMRHVNYQYTAIGPYCAASPPRSSKITANLDVNMIFIITGAVLVPTICILCVVT--YICRQKQK 897
            .|.|              |||             :.|||...|.|....||.||.  .|.|::||
  Rat   648 AEQR--------------ASP-------------VTFIIATVVGVLLFLILVVVVGILIKRRRQK 685

  Fly   898 AKKETVKMTMALSGCEDSEPLRPSNIGANLCKLRIVKDAELRKGGVLGMGAFGRVYKGVWVPEGE 962
            .:|.|  |...|...|..|||.||....|..::||:|:.||||..|||.||||.||||:|:|:||
  Rat   686 IRKYT--MRRLLQETELVEPLTPSGAMPNQAQMRILKETELRKVKVLGSGAFGTVYKGIWIPDGE 748

  Fly   963 NVKIPVAIKELLKSTGAESSEEFLREAYIMASVEHVNLLKLLAVCMSSQMMLITQLMPLGCLLDY 1027
            |||||||||.|.::|..::::|.|.|||:||.|....:.:||.:|::|.:.|:|||||.|||||:
  Rat   749 NVKIPVAIKVLRENTSPKANKEILDEAYVMAGVGSPYVSRLLGICLTSTVQLVTQLMPYGCLLDH 813

  Fly  1028 VRNNRDKIGSKALLNWSTQIAKGMSYLEEKRLVHRDLAARNVLVQTPSLVKITDFGLAKLLSSDS 1092
            ||.:|.::||:.||||..||||||||||:.||||||||||||||::|:.|||||||||:||..|.
  Rat   814 VREHRGRLGSQDLLNWCVQIAKGMSYLEDVRLVHRDLAARNVLVKSPNHVKITDFGLARLLDIDE 878

  Fly  1093 NEYKAAGGKMPIKWLALECIRNRVFTSKSDVWAFGVTIWELLTFGQRPHENIPAKDIPDLIEVGL 1157
            .||.|.|||:||||:|||.|..|.||.:||||::|||:|||:|||.:|::.|||::||||:|.|.
  Rat   879 TEYHADGGKVPIKWMALESILRRRFTHQSDVWSYGVTVWELMTFGAKPYDGIPAREIPDLLEKGE 943

  Fly  1158 KLEQPEICSLDIYCTLLSCWHLDAAMRPTFKQLTTVFAEFARDPGRYLAI---------PGDKFT 1213
            :|.||.||::|:|..::.||.:|:..||.|::|.:.|:..||||.|::.|         |.|...
  Rat   944 RLPQPPICTIDVYMIMVKCWMIDSECRPRFRELVSEFSRMARDPQRFVVIQNEDLGPSSPMDSTF 1008

  Fly  1214 RLPAYTSQDEKDLIRKLAPTTDGSEAIAEPDDYLQPKAAPGPS------HRTDCT---------- 1262
            ........|..||:       |..|.:.....:..|...||..      ||:..|          
  Rat  1009 YRSLLEDDDMGDLV-------DAEEYLVPQQGFFSPDPTPGTGSTAHRRHRSSSTRSGGGELTLG 1066

  Fly  1263 ----DEIPKLNRYCKDPSNKNSSTGDDETDSSAREVGVGNLR---------------------LD 1302
                :|.|.     :.|...:...|.|..|.   ::.:|..:                     |.
  Rat  1067 LEPSEEGPP-----RSPLAPSEGAGSDVFDG---DLAMGVTKGLQSLSPHDLSPLQRYSEDPTLP 1123

  Fly  1303 LPVDEDDYLMP-TCQPGPN--NNNNIN---------------------------NPNQNNMAAVG 1337
            ||.:.|.|:.| .|.|.|.  |.:.:.                           :|.:|.:.   
  Rat  1124 LPPETDGYVAPLACSPQPEYVNQSEVQPQPPLTPEGPLPPVRPAGATLERPKTLSPGKNGVV--- 1185

  Fly  1338 VAAGYMDLIGVPVSVDNPEYLLNAQTLGVGESPIPTQTIGIPVMGVPGTMEVKVPMPGSEPTSSD 1402
                 .|:.....:|:|||||            :|.:          ||.....|.|...|...:
  Rat  1186 -----KDVFAFGGAVENPEYL------------VPRE----------GTASPPHPSPAFSPAFDN 1223

  Fly  1403 HEYYNDTQRELQP 1415
            ..|::....|..|
  Rat  1224 LYYWDQNSSEQGP 1236
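
Each alignment block above is a triple of rows: the fly sequence, a marker row, and the rat sequence. A minimal sketch of how such a block could be tallied follows. The function name `tally_columns` is hypothetical, and the reading of the markers (`|` for identical residues, `:` and `.` for two degrees of similarity) is the common pairwise-alignment convention, which this page does not state explicitly. The input is the first ten columns of the first block.

```python
from collections import Counter


def tally_columns(seq1: str, match: str, seq2: str) -> dict:
    """Count identical columns, marker symbols, and gap columns in one block."""
    if not (len(seq1) == len(match) == len(seq2)):
        raise ValueError("all three rows must have the same number of columns")
    markers = Counter(match)
    pairs = list(zip(seq1, seq2))
    return {
        "columns": len(match),
        # Identity checked directly from the residues, ignoring gap characters.
        "identical": sum(1 for a, b in pairs if a == b and a != "-"),
        # Marker counts; their interpretation is an assumption (see lead-in).
        "bar": markers["|"],
        "colon": markers[":"],
        "dot": markers["."],
        # A column is a gap column if either sequence has '-' there.
        "gaps": sum(1 for a, b in pairs if a == "-" or b == "-"),
    }


# First ten columns of the first alignment block (Fly 100.., Rat 27..).
fly = "KICIGTKSRL"
mid = "::|.||..:|"
rat = "QVCTGTDMKL"

print(tally_columns(fly, mid, rat))
```

Comparing residues directly and counting `|` markers should agree, so the two numbers act as a cross-check on the parsed rows.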

Known Domains:


Indicated by green residues in the alignment on the original page.

Gene   Sequence     Domain          Region       External ID  Identity
Egfr   NP_476759.1  Recep_L_domain  128..239     CDD:460032   47/127 (37%)
                    Furin-like      253..401     CDD:395614   62/154 (40%)
                    Recep_L_domain  419..547     CDD:460032   45/127 (35%)
                    GF_recep_IV     572..684     CDD:464344   36/120 (30%)
                    FU              662..716     CDD:214589   5/56 (9%)
                    GF_recep_IV     740..>834    CDD:464344   10/93 (11%)
                    PTKc_EGFR_like  930..1208    CDD:270648   163/286 (57%)
Erbb2  NP_058699.2  Recep_L_domain  55..177      CDD:460032   47/127 (37%)
                    Furin-like      193..342     CDD:395614   60/150 (40%)
                    Recep_L_domain  370..486     CDD:460032   44/124 (35%)
                    GF_recep_IV     514..647     CDD:464344   55/273 (20%)
                    TM_ErbB2        645..688     CDD:213055   21/82 (26%)
                    PTKc_HER2       716..994     CDD:270684   162/277 (58%)
                    PHA03247        <1037..1219  CDD:223021   37/219 (17%)
