DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment eys and Egflam

DIOPT Version :10

Sequence 1:NP_001027571.3 Gene:eys / 3771890 FlyBaseID:FBgn0031414 Length:2176 Species:Drosophila melanogaster
Sequence 2:XP_006232075.1 Gene:Egflam / 365691 RGDID:1306592 Length:1013 Species:Rattus norvegicus


Alignment Length:1196 Identity:266/1196 - (22%)
Similarity:429/1196 - (35%) Gaps:354/1196 - (29%)


- Green bases have known domain annotations that are detailed below.


  Fly   740 TEYPAEVLITTHRTSAGRFTTVQPPAGVTTTSPTEDSSVELPTPHTPQIVVTILDSNEVIPSLIT 804
            |||...:...: :|..||.:.   |..|||.|  :||.:....||.|.::|       |..|.:.
  Rat   108 TEYRVSIAAYS-QTGKGRLSF---PRHVTTLS--QDSCLPPEAPHQPHVLV-------VSDSEVA 159

  Fly   805 TTGSPTTHHHHHHHPHHEAEGTTLQPLEEDEHHHHHHHDEFTTPQPVEITT--GHPLQTEDLI-- 865
            .:..|           .|.||:.  |::.       :..||..|...:..|  ...||.:.::  
  Rat   160 LSWRP-----------GENEGSA--PIQS-------YSVEFIRPDFDKSWTIIQERLQMDSMVIK 204

  Fly   866 GVQEPAVVTTESPFA--------------PAETTVVPVVVPATIAPLGTAAPPATPAPVPPATTT 916
            |: :|   .|...||              |:.|.       .|:.|....:....|..:   |.|
  Rat   205 GL-DP---DTNYQFAVRAMNAYGFSLRSQPSNTI-------RTLGPGEAGSGRYGPGYI---TDT 255

  Fly   917 PPPSPPSLATETPTLPPTLPPVTLPPVTQ--PPPTIPPTPPSTQSAQTLPPPTSAINVYTTPDGP 979
            ........:.:...|..:...|...|.|:  ...:...:..:::....|..||||....||...|
  Rat   256 GVSEDDDASEDELDLDVSFEEVKPLPATKVGNKKSKKTSVSNSEMDSRLAQPTSASLPETTVAVP 320

  Fly   980 PTASQTKPSVTESSEEVEGTNTVSTGGRGSGGVPEEKAGDVDCIKLGCYNGGTCVTTSE--GSRC 1042
            ||.:|.|           |.|:|:...|         ..|:.|.:..|.....||....  ||||
  Rat   321 PTPAQRK-----------GKNSVAVMSR---------LFDMSCDETLCSADSFCVNDYAWGGSRC 365

  Fly  1043 VCRFDRQGPLCELPIIIRNAAFSGDSYVSHRIYKDIGGHESLDAVLPMHIQLKVRTRATNGLIML 1107
            .|...:.|..|...|.|:...|.|.|||:....|:     |..|   ..|.|:.|..|.:||::.
  Rat   366 HCNLGKGGEACSEDIFIQYPQFFGHSYVTFEPLKN-----SYQA---FQITLEFRAEAEDGLLLY 422

  Fly  1108 AAAQGTKGGHYMALFLQKGLMQFQFSCGL-QTMLLSELETPVNTGHEITIRAELDFSRNYTHCNA 1171
            ........|.:|:|.|.:..:.|:|:||. ..:::||.:..:...|.:|:..:        ..|.
  Rat   423 CGESEHGRGDFMSLALIRRSLHFRFNCGTGMAIIISETKIKLGAWHSVTLYRD--------GLNG 479

  Fly  1172 SLLVNDTLAMSG-DQPTWLKLLPPRLHTPEAILNTWLHLGGAPQAPIGLIIELPPAQSGSGFTGC 1235
            .|.:|:...::| .|..:.|:   ...||       |:|||||.| ..|:   ....:..||.||
  Rat   480 LLQLNNGTPVTGQSQGQYSKI---TFRTP-------LYLGGAPSA-YWLV---RATGTNRGFQGC 530

  Fly  1236 LHTLRINGQAREI----FGDALDGFGITECGSLACLSSPCRNGAACIKIETNDLDENGEKAEKWK 1296
            :.:|.:||:..::    .|.||:|..:.||.|                                 
  Rat   531 VQSLAVNGKKIDMRPWPLGKALNGADVGECSS--------------------------------- 562

  Fly  1297 CKCPTGYMGPTCEISVCEDNPCQYGGTCVQFPGSGYLCLCPLGKHGHYCEHNLEVALPSFSGSVN 1361
                          .:|::..|..||||.......|:||||||..|.:||....:.:|.|..|  
  Rat   563 --------------GICDEASCINGGTCAAIKADSYICLCPLGFRGRHCEDAFTLTIPQFRES-- 611

  Fly  1362 GLSSFVAYTVPIPLEYSLELSFKILPQTMSQISLLAFFGQSG-------YHDEKSDHLAVSFIQG 1419
             |.|:.|  .|.|||....|||     |..:|:   |...||       |.....|.|::....|
  Rat   612 -LRSYAA--TPWPLEPQHYLSF-----TEFEIT---FRPDSGDGVLLYSYDTSSKDFLSIIMAAG 665

  Fly  1420 YIMLTWNLGAGPRRIFTQKPIDFRLDAPRVPYEIKVGRIGRQAWLSVDGKFNITGRSPGSGSRMD 1484
            ::...::.|:|...:.::..:..     ...::::|.|..:...|.||.:..:.|.:.|..:::.
  Rat   666 HVEFRFDCGSGTGVLRSEDTLTL-----GQWHDLRVSRTAKNGILQVDKQKVVEGMAEGGFTQIK 725

  Fly  1485 VLPILYLGGHEIANFNTLPHDLPLHSG----FQGCIYDVQLKAGQVTVPLQETRGVRGRGVGQCG 1545
            ....:::||  :.|::    |:..:||    |.|.|..:.|....:.|....|.||......   
  Rat   726 CNTDIFIGG--VPNYD----DVKKNSGILHPFSGSIQKIILNDRTIHVRHDFTSGVNVENAA--- 781

  Fly  1546 TRECHRHACQHDGACLQHGATFTCICQEGWYGPLCAQPTNPCDSFNNKCYEDATCVPLVNGYECD 1610
                  |.|.  ||...||                                 .:|.|...|||||
  Rat   782 ------HPCV--GAPCAHG---------------------------------GSCRPRKEGYECD 805

  Fly  1611 CPVGRTGKNCEE---------VIRSLSDVSLTGRRSYLAVRWPYLYDGGDKLGAKRSQMVSYRNF 1666
            ||:|..|.||::         :..::......| ||||.      ||..:               
  Rat   806 CPLGFEGLNCQKECGNYCLNTITEAIEIPQFIG-RSYLT------YDNPN--------------- 848

  Fly  1667 TKKLMPPKPITTPSSHFVMKLLNEVEKQRSFSPVPLMGSKSFEEHHRVQFFFIEFQLRPLSERGL 1731
                                :|..|...||.:                   |:.|  :..::.||
  Rat   849 --------------------ILKRVSGSRSNA-------------------FMRF--KTTAKDGL 872

  Fly  1732 LLYFG--TLNNNQDKKIGFVSLSLQGGVVEFRISGPSNHVTVVRSVRM---LAIGEWHKIKMAQR 1791
            ||:.|  .:..|.|    |:||.|:.|.:.|..:..|.    |.|:.:   .:.|.||::|..:.
  Rat   873 LLWRGDSPMRPNSD----FISLGLRDGALVFSYNLGSG----VASIMVNGSFSDGRWHRVKAVRD 929

  Fly  1792 GRWLTLWVEG-SASSALAPSAEVLVEPDSLLYIGGLKDVSKLPHNAISGFPIPFRGCVRGLVVSG 1855
            |:...:.|:. .|.:..:|.....:..:..||:||:|::: |..|         |..:||||...
  Rat   930 GQSGKITVDDYGARTGKSPGMMRQLNINGALYVGGMKEIA-LHTN---------RQYMRGLVGCI 984

  Fly  1856 TRIVLNE-------TNIVESRNIRDC 1874
            :...|:.       .:.|:.:||..|
  Rat   985 SHFTLSTDYHISLVEDAVDGKNINTC 1010

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
eysNP_001027571.3 EGF_CA 184..218 CDD:238011
EGF_CA 220..256 CDD:238011
EGF_CA <270..298 CDD:238011
EGF_CA 301..336 CDD:238011
EGF 342..371 CDD:394967
EGF_CA 378..413 CDD:238011
Laminin_G_2 1096..1244 CDD:460494 38/149 (26%)
EGF_CA 1314..1346 CDD:238011 13/31 (42%)
LamG 1355..1521 CDD:238058 39/176 (22%)
EGF 1549..1578 CDD:394967 6/28 (21%)
EGF_CA 1585..1621 CDD:238011 12/35 (34%)
Laminin_G_2 1723..1856 CDD:460494 36/138 (26%)
LamG 1956..2144 CDD:238058
EgflamXP_006232075.1 FN3 36..133 CDD:238020 8/28 (29%)
FN3 142..236 CDD:238020 24/131 (18%)
LamG 387..537 CDD:238058 47/179 (26%)
EGF_CA 556..598 CDD:238011 17/88 (19%)
LamG 614..761 CDD:238058 36/167 (22%)
EGF_CA 786..816 CDD:238011 16/62 (26%)
Laminin_G_2 864..989 CDD:460494 37/144 (26%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.