DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment dpy and Egf

DIOPT Version :10

Sequence 1:NP_001260032.1 Gene:dpy / 318824 FlyBaseID:FBgn0053196 Length:22949 Species:Drosophila melanogaster
Sequence 2:NP_036974.2 Gene:Egf / 25313 RGDID:2542 Length:1132 Species:Rattus norvegicus


Alignment Length:865 Identity:195/865 - (22%)
Similarity:285/865 - (32%) Gaps:291/865 - (33%)


- Green bases have known domain annotations that are detailed below.


  Fly   211 TDVDECRNPENCGPNALCTNTPGNYTCSCPDGYVGNNPYREGCQDVDECSYPN-VCGPGAICTNL 274
            :|.:.|:........:|....|.:.:|:|.:||..:.. |:.|:||:||:..| .|..|  |.|:
  Rat   314 SDSERCKQRRGQCLYSLSERDPNSDSCACAEGYTLSRD-RKYCEDVNECALQNHGCTLG--CENI 375

  Fly   275 EGSYRCDCPPGY----DGDGRSESGCVDQDECA--RTPCGRNADCLNTDGSFRCLCPD----GYS 329
            .|||.|.||.|:    ||     ..|.:...|.  |:.|..  ||:.|.....|:||.    |..
  Rat   376 PGSYYCTCPTGFVLLPDG-----KRCHELVACPGNRSECSH--DCILTSDGPLCICPAGSVLGKD 433

  Fly   330 GDPMNGCEDVDECATNNPCGLGAECVNLG-GSFQCRCPSGFVLEHDPH--ADQLPQPL----NTQ 387
            |....||...|    |..|  ...|:.|. .|::|.|..|:.|:.|..  |...|||.    |:|
  Rat   434 GKTCTGCSSPD----NGGC--SQICLPLSLASWECDCFPGYDLQLDRKSCAASGPQPFLLFANSQ 492

  Fly   388 QLGYGP-GATDIAPYQRTSGAGLACLDIDECNQPDGVAKCGTNAKCI---NFPGSYRCLCPSGFQ 448
            .:.:.. ..||...........:..||.|........|:  |..|.|   |..||.|        
  Rat   493 DIRHMHFDGTDYKTLLSRQMGMVFALDYDPVESKIYFAQ--TALKWIERANLDGSQR-------- 547

  Fly   449 GQGYLHCEN-INECQDNPCGENAICTDTVGSFVCTCKPDYTGDPFRGCVDIDECTALDKPCGQHA 512
                   |. |.|..|.|.|   :..|.:|..:      |..|..:..::..:.:      |:| 
  Rat   548 -------ERLITEGVDTPEG---LAVDWIGRRI------YWTDSGKSVIEGSDLS------GKH- 589

  Fly   513 VCENTVPGYNCKCPQGYDGKP------------DPKVACEQV---DVNILCSSN----------- 551
              ...:...:...|:|....|            .|::....:   |..::.|||           
  Rat   590 --HQIIIKESISRPRGIAVHPKARRLFWTDTGMSPRIESSSLQGSDRTLIASSNLLEPSGIAIDY 652

  Fly   552 -----FDCTNNAECIE-----------------NQCFCLDGFE---------------------- 572
                 :.|......||                 ...|.|..||                      
  Rat   653 LTDTLYWCDTKLSVIEMADLDGSKRRRLTQNDVGHPFSLAVFEDHVWFSDWAIPSVIRVNKRTGQ 717

  Fly   573 -----------PIGSSCVDIDE-CRTHAEVC-----GPHAQCLNTPGSYGCECEAGYVGSPPRMA 620
                       |  ||.|.:.. .:..|:.|     |....|..:.|:..|.|..|:|.:|....
  Rat   718 NRVRLRGSMLKP--SSLVVVHPLAKPGADPCLHRNGGCEHICQESLGTAQCLCREGFVKAPDGKM 780

  Fly   621 CKQPCED-VRCGAHA-----YCKPDQNEAYCVCEDGWTYNPSDVA----AGCVDIDECDVMHGPF 675
            |....:| :..|.:|     ....|.:....|.:|..|.:.:.||    :|....|:|    || 
  Rat   781 CLTRKDDQILAGDNADLSKEVASLDNSPKAYVPDDDRTESSTLVAEIMVSGLNYEDDC----GP- 840

  Fly   676 GSCGQNATCTNSAGGFTCACPPGFSGDPHSKCVDVDECRTGASKC-GAGAECVNVPGGGYTCRCP 739
            |.||.:|.|.:......|.|..||:|| .:.|.|:|||..|:|.| ...:.|:|.. |||.|:  
  Rat   841 GGCGSHAHCISEGEAAVCQCLKGFAGD-GNLCSDIDECELGSSDCPPTSSRCINTE-GGYVCQ-- 901

  Fly   740 GNTIADPDPSVRCVPIVSCSANEDCPGNSICDATKRCLCPEPNIGNDCRHPCEALNCGAHAQCML 804
                                                                             
  Rat   902 ----------------------------------------------------------------- 901

  Fly   805 ANGQAQCLCAPGYTGNSALAGGCNDIDECR--ANPCAEKAICSNTAGGYLCQCPG--GSSGDPY- 864
                    |:.||.|:...   |.|:|||:  ::.|:|.|.|:||.|||.|.|.|  .:.|.|. 
  Rat   902 --------CSEGYEGDGIY---CLDVDECQQGSHGCSENATCTNTEGGYNCTCAGCPSAPGLPCP 955

  Fly   865 ---------REGC--ITSKTVGC--SDANPCATGETCVQDSYTGNSVCICRQGY----------- 905
                     ::||  :.:...||  |....|..|..|:........||.|..||           
  Rat   956 DSTSPSLLGKDGCHWVRNSITGCPPSYDGYCLNGGVCMYVESVDRYVCNCVIGYIGERCQHRDLR 1020

  Fly   906 ---ERNSENGQCQDVDECSV 922
               .|:::.||..|:...||
  Rat  1021 WWKLRHADYGQRHDITVVSV 1040

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
dpyNP_001260032.1 EGF_3 137..166 CDD:463759
EGF_CA 212..247 CDD:238011 8/34 (24%)
EGF_CA 255..>286 CDD:214542 15/31 (48%)
EGF_CA 298..331 CDD:238011 10/38 (26%)
EGF_CA 338..373 CDD:238011 10/35 (29%)
EGF_CA 413..456 CDD:238011 10/45 (22%)
EGF_CA 457..490 CDD:238011 8/33 (24%)
EGF_CA 497..>529 CDD:214542 3/31 (10%)
EGF_CA 580..>612 CDD:214542 7/37 (19%)
EGF_3 676..702 CDD:463759 9/25 (36%)
EGF_CA 1022..1056 CDD:214542
EGF_CA 2227..2260 CDD:238011
EGF_CA 2393..>2422 CDD:214542
Herpes_BLLF1 <4032..4489 CDD:282904
Herpes_BLLF1 <4334..4797 CDD:282904
PHA03255 4845..>5021 CDD:165513
DUF5585 4959..5356 CDD:465521
PHA03247 <5170..5799 CDD:223021
PRK12495 <5749..5877 CDD:183558
PHA03247 <5865..6485 CDD:223021
Chi1 6385..>6609 CDD:442692
Herpes_BLLF1 <6607..7102 CDD:282904
Herpes_BLLF1 <7027..7490 CDD:282904
DUF5585 7550..7954 CDD:465521
DUF5585 7881..>8130 CDD:465521
Atrophin-1 18652..>19081 CDD:460830
PHA03247 <18873..19514 CDD:223021
ZP 22576..22811 CDD:214579
EgfNP_036974.2 LDL-receptor class B 1 87..128
LDL-receptor class B 2 129..170
LY 152..193 CDD:214531
LDL-receptor class B 3 171..212
LDL-receptor class B 4 213..259
cEGF 339..360 CDD:463661 8/21 (38%)
FXa_inhibition 361..396 CDD:464251 15/41 (37%)
FXa_inhibition 409..437 CDD:464251 9/29 (31%)
FXa_inhibition 440..477 CDD:464251 12/42 (29%)
LDL-receptor class B 5 484..524 8/39 (21%)
LY 505..547 CDD:214531 10/43 (23%)
LDL-receptor class B 6 525..567 15/61 (25%)
LY 548..590 CDD:214531 12/59 (20%)
LDL-receptor class B 7 568..610 7/56 (13%)
LY 591..634 CDD:214531 4/42 (10%)
LDL-receptor class B 8 611..654 5/42 (12%)
LY 634..677 CDD:214531 6/42 (14%)
LDL-receptor class B 9 655..697 7/41 (17%)
LY 678..719 CDD:214531 4/40 (10%)
FXa_inhibition 746..781 CDD:464251 9/34 (26%)
EGF_3 839..871 CDD:463759 13/33 (39%)
EGF_CA 873..913 CDD:214542 17/118 (14%)
EGF_CA 915..943 CDD:429571 14/27 (52%)
PHA03099 <974..>1016 CDD:165381 11/41 (27%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1068..1096
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.