DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Egfr and Csf1r

DIOPT Version :10

Sequence 1:NP_476759.1 Gene:Egfr / 37455 FlyBaseID:FBgn0003731 Length:1426 Species:Drosophila melanogaster
Sequence 2:NP_001032948.2 Gene:Csf1r / 12978 MGIID:1339758 Length:977 Species:Mus musculus


Alignment Length:746 Identity:178/746 - (23%)
Similarity:299/746 - (40%) Gaps:195/746 - (26%)


- Green bases have known domain annotations that are detailed below.


  Fly   659 YNDRGVCRECHATCDGCTGPKDTIGIGACT-TCNLAIINN---DATVKRCLLKDDKCPDG----- 714
            :.|.|:.        .|....|   :|..| |.|..::.:   :.|.::.||::....|.     
Mouse   270 FQDAGIY--------SCVASND---VGTRTATMNFQVVESAYLNLTSEQSLLQEVSVGDSLILTV 323

  Fly   715 ----------YFWEYVHP--QEQGSLKPLAGRAVCRKCHPL-------------CELCTNYGYHE 754
                      |.|.|:.|  ::|..|:.:..||:.|....|             ..:..|.....
Mouse   324 HADAYPSIQHYNWTYLGPFFEDQRKLEFITQRAIYRYTFKLFLNRVKASEAGQYFLMAQNKAGWN 388

  Fly   755 QVCSKCTHYKRREQCETECPADHYTDEEQRECFQC----HP-------ECNGCTGPGADDCKSCR 808
            .:..:.|.....|...|..|.:      ..:...|    :|       ||.|.|    |.|...:
Mouse   389 NLTFELTLRYPPEVSVTWMPVN------GSDVLFCDVSGYPQPSVTWMECRGHT----DRCDEAQ 443

  Fly   809 NFKLFDANETGPYVNSTM-FN---CTSKCPLEMRHVNYQY-----TAIG---PYCAASPPRSSKI 861
            ..:::  |:|.|.|.|.. |:   ..|:.|:.....|..|     .::|   .|..|.....|| 
Mouse   444 ALQVW--NDTHPEVLSQKPFDKVIIQSQLPIGTLKHNMTYFCKTHNSVGNSSQYFRAVSLGQSK- 505

  Fly   862 TANLDVNMIF--IITGAVLVPTICILCVVTYICRQKQKAKKET-VKMTMALSGCEDSEPLRPSNI 923
              .|....:|  ::...:.|.::.:|.::..:.:.|||.|.:. .|:.....| .....:.|:.:
Mouse   506 --QLPDESLFTPVVVACMSVMSLLVLLLLLLLYKYKQKPKYQVRWKIIERYEG-NSYTFIDPTQL 567

  Fly   924 GANLCKLRIVKDAELRKGGVLGMGAFGRVYKGVWVPEG-ENVKIPVAIKELLKSTG-AESSEEFL 986
            ..|. |....:: .|:.|..||.||||:|.:......| |:..:.||:| :||||. |:..|..:
Mouse   568 PYNE-KWEFPRN-NLQFGKTLGAGAFGKVVEATAFGLGKEDAVLKVAVK-MLKSTAHADEKEALM 629

  Fly   987 REAYIMASV-EHVNLLKLLAVCM-SSQMMLITQLMPLGCLLDYVR-------------------- 1029
            .|..||:.: :|.|::.||..|. ...:::||:....|.||:::|                    
Mouse   630 SELKIMSHLGQHENIVNLLGACTHGGPVLVITEYCCYGDLLNFLRRKAEAMLGPSLSPGQDSEGD 694

  Fly  1030 -------------------------------------------NNRDKIGSKA-----LLNWSTQ 1046
                                                       .:.||..|:.     ||::|:|
Mouse   695 SSYKNIHLEKKYVRRDSGFSSQGVDTYVEMRPVSTSSSDSFFKQDLDKEASRPLELWDLLHFSSQ 759

  Fly  1047 IAKGMSYLEEKRLVHRDLAARNVLVQTPSLVKITDFGLAKLLSSDSNEYKAAGGKMPIKWLALEC 1111
            :|:||::|..|..:|||:||||||:.:..:.||.|||||:.:.:|||.......::|:||:|.|.
Mouse   760 VAQGMAFLASKNCIHRDVAARNVLLTSGHVAKIGDFGLARDIMNDSNYVVKGNARLPVKWMAPES 824

  Fly  1112 IRNRVFTSKSDVWAFGVTIWELLTFGQRPHENIPAKD-IPDLIEVGLKLEQPEICSLDIYCTLLS 1175
            |.:.|:|.:||||::|:.:||:.:.|..|:..|...: ...|::.|.::.||.....:||..:.|
Mouse   825 IFDCVYTVQSDVWSYGILLWEIFSLGLNPYPGILVNNKFYKLVKDGYQMAQPVFAPKNIYSIMQS 889

  Fly  1176 CWHLDAAMRPTFKQLTTVFAEFARDPGRYLAIPGDKFTRLPA----------------YTSQDEK 1224
            ||.|:...||||:|:..:..|.||     |......:..||:                .:|:.|:
Mouse   890 CWDLEPTRRPTFQQICFLLQEQAR-----LERRDQDYANLPSSGGSSGSDSGGGSSGGSSSEPEE 949

  Fly  1225 DLIRKLAPTTDGSEAIA--EPDDYLQPKAAP 1253
            :         ..||.:|  ||.|..||...|
Mouse   950 E---------SSSEHLACCEPGDIAQPLLQP 971

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
EgfrNP_476759.1 Recep_L_domain 128..239 CDD:460032
Furin-like 253..401 CDD:395614
Recep_L_domain 419..547 CDD:460032
GF_recep_IV 572..684 CDD:464344 4/24 (17%)
FU 662..716 CDD:214589 11/72 (15%)
GF_recep_IV 740..>834 CDD:464344 21/121 (17%)
PTKc_EGFR_like 930..1208 CDD:270648 102/350 (29%)
Csf1rNP_001032948.2 IG_like 28..101 CDD:214653
Ig strand B 38..42 CDD:409353
Ig strand E 67..71 CDD:409353
Ig strand F 81..86 CDD:409353
IG_like 113..195 CDD:214653
Ig 203..295 CDD:472250 8/35 (23%)
Ig strand B 220..224 CDD:409353
Ig strand C 234..238 CDD:409353
Ig strand E 259..263 CDD:409353
Ig strand F 275..280 CDD:409353 1/12 (8%)
Ig strand G 288..291 CDD:409353 1/2 (50%)
Ig 299..398 CDD:472250 16/98 (16%)
Ig strand B 319..323 CDD:409353 0/3 (0%)
Ig strand C 333..337 CDD:409353 1/3 (33%)
Ig strand E 362..366 CDD:409353 1/3 (33%)
Ig strand F 376..381 CDD:409353 0/4 (0%)
Ig strand G 389..392 CDD:409353 0/2 (0%)
Ig_3 399..487 CDD:464046 21/99 (21%)
Regulatory juxtamembrane domain. /evidence=ECO:0000250 540..572 7/33 (21%)
PTKc_CSF-1R 541..913 CDD:133237 106/375 (28%)
Activation loop. /evidence=ECO:0000250 794..816 8/21 (38%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 921..957 6/44 (14%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.