DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment dpy and Notch2

DIOPT Version :10

Sequence 1:NP_001260032.1 Gene:dpy / 318824 FlyBaseID:FBgn0053196 Length:22949 Species:Drosophila melanogaster
Sequence 2:NP_077334.2 Gene:Notch2 / 29492 RGDID:3188 Length:2471 Species:Rattus norvegicus


Alignment Length:2091 Identity:530/2091 - (25%)
Similarity:729/2091 - (34%) Gaps:692/2091 - (33%)


- Green bases have known domain annotations that are detailed below.


  Fly    91 LKC---TNDCDKDGTKCT-HGACLNGVCHCNDGYGGCNCVDKDENECKQRPCDVFAHCTN--TLG 149
            |:|   ...|..:||..| |..  .|.|.|.:|:.|..|..:|  .|::..|.....|..  .||
  Rat    26 LQCRGGQEPCVNEGTCVTYHNG--TGYCRCPEGFLGEYCQHRD--PCEKNRCQNGGTCVTQAMLG 86

  Fly   150 SFTCTCFPGYRGNGFHCEDIDECQ----DPAIAARCVENAECCNLPA--HFLCKCKDGYEGDGEV 208
            ..||.|.||:.|        ::||    .|...:|..:|...|::.:  .:.|.|:.|:.|.   
  Rat    87 KATCRCAPGFTG--------EDCQYSTSHPCFVSRPCQNGGTCHMLSWDTYECTCQVGFTGK--- 140

  Fly   209 LCTDVDECRNPENCGPNALCTNTPGNYTCSCPDGYVGNNPYREGCQ-DVDECSYPNVCGPGAICT 272
            .|...|.|.: ..|...:.|::....::|.||.|..|     :.|. |::||..|..|..|..|.
  Rat   141 QCQWTDVCLS-HPCENGSTCSSVANQFSCRCPAGITG-----QKCDADINECDIPGRCQHGGTCL 199

  Fly   273 NLEGSYRCDCPPGYDGDGRSESGCVDQDECARTPCGRNADCLNT-DGSFRCLCPDGYSGDPMNGC 336
            ||.|||||.||.|:.|. ..:|..|   .||.:||.....|..| |.:..|.|..|:.|   :.|
  Rat   200 NLPGSYRCQCPQGFTGQ-HCDSPYV---PCAPSPCVNGGTCRQTGDFTSECHCLPGFEG---SNC 257

  Fly   337 E-DVDECATNNPCGLGAECVNLGGSFQCRCPSGFVLEHDPHADQLPQPLNTQQLGYGPGATDIAP 400
            | ::|:| .|:.|..|..||:...::.||||              ||                  
  Rat   258 ERNIDDC-PNHKCQNGGVCVDGVNTYNCRCP--------------PQ------------------ 289

  Fly   401 YQRTSGAGLACL-DIDEC-NQPDGVAKCGTNAKCINFPGSYRCLCPSGFQGQGYLHC-ENINECQ 462
                 ..|..|. |:||| .||:.   |.....|.|..|.|.|:|.:|:.|.   .| |||::|.
  Rat   290 -----WTGQFCTEDVDECLLQPNA---CQNGGTCTNRNGGYGCVCVNGWSGD---DCSENIDDCA 343

  Fly   463 DNPCGENAICTDTVGSFVCTCKPDYTGDPFRGCVDIDECTALDKPCGQHAVCE-NTVPG-YNCKC 525
            ...|...:.|.|.|.||.|.|.....|   ..|...|.|  :..||.:.|:|: |.:.| |.|.|
  Rat   344 FASCTPGSTCIDRVASFSCLCPEGKAG---LLCHLDDAC--ISNPCHKGALCDTNPLNGQYICTC 403

  Fly   526 PQGYDGKPDPKVACEQVDVNILCSSNFDCTNNAECIENQCFCLDGFEPIGSSCVDIDEC-RTHAE 589
            ||||.|                          |:|.|                 |:||| ..::.
  Rat   404 PQGYKG--------------------------ADCTE-----------------DVDECAMANSN 425

  Fly   590 VCGPHAQCLNTPGSYGCECEAGYVGSPPRMACKQPCEDVRCGAHAYCKPDQNEAYCVCEDGWTYN 654
            .|....:|:||.|::.|||..||.|  ||                            ||      
  Rat   426 PCEHAGKCVNTDGAFHCECLKGYAG--PR----------------------------CE------ 454

  Fly   655 PSDVAAGCVDIDECDVMHGPFGSCGQNATCTNSAGGFTCACPPGFSGDPHSKCVDVDECRTGASK 719
                    :||:||   |.  ..|..:|||.:..|||||.|.|||.| .|.: ::|:||::.   
  Rat   455 --------MDINEC---HS--DPCQNDATCLDKIGGFTCLCMPGFKG-VHCE-LEVNECQSN--- 501

  Fly   720 CGAGAECVNVPGGGYTCRCPGNTIADPDPSVRCVPIVSCSANEDCPGNSICDATKRCLCPEPNIG 784
                 .|||                                                        
  Rat   502 -----PCVN-------------------------------------------------------- 505

  Fly   785 NDCRHPCEALNCGAHAQCMLANGQAQCLCAPGYTGNSALAGGCN-DIDECRANPCAEKAICSNTA 848
                          :.||:....:.||||.||:||..     |. |||:|.:.||...|.|.:..
  Rat   506 --------------NGQCVDKVNRFQCLCPPGFTGPV-----CQIDIDDCSSTPCLNGAKCIDHP 551

  Fly   849 GGYLCQCPGGSSGDPYREGCITSKTVGCSDANPCATGETCVQ--DSYTGNSVCICRQGYERNSEN 911
            .||.|||..|.:      |.:..:.:...|.:||..|: |..  ||||    |||..||    ..
  Rat   552 NGYECQCATGFT------GTLCDENIDNCDPDPCHHGQ-CQDGIDSYT----CICNPGY----MG 601

  Fly   912 GQCQD-VDECSVQRGKPACGLNALCKNLPGSYECRCPQGHNGNPFIMCEICNTPECQCQSPYKLV 975
            ..|.| :|||.    ...|..:..|.:|...|:|.|..|.:|   :.||| |..:|        .
  Rat   602 AICSDQIDECY----SSPCLNDGRCIDLVNGYQCNCQPGTSG---LNCEI-NFDDC--------A 650

  Fly   976 GNSCVLSGCSSGQACPSGAECISIAGGVSYCACPKGYQTQPDGSC-VDVDECEERGAQLCAFGAQ 1039
            .|.| |.|     ||..|....|       |.|..|:..|   .| :|:|||   .:..|..||.
  Rat   651 SNPC-LHG-----ACVDGINRYS-------CVCSPGFTGQ---RCNIDIDEC---ASNPCRKGAT 696

  Fly  1040 CVNKPGSYSCHCPEG-YQGDAYNGLCALAQRKCAADRECAANEKCIQPGECVCPPPYFLDPQDNN 1103
            |:|....:.|.|||| :....|:                       |..||:             
  Rat   697 CINDVNGFRCMCPEGPHHPSCYS-----------------------QVNECL------------- 725

  Fly  1104 KCKSPCERFPCGINAKCTPS-DPPQCMCEAGFKGDPLLGC-TDEDECSHLPCAYGAYCVNKKGGY 1166
              .|||      |:..||.. ...:|:|:||:.|   :.| .|::||...||..|..|.|...||
  Rat   726 --SSPC------IHGNCTGGLSGYKCLCDAGWVG---INCEVDKNECLSNPCQNGGTCNNLVNGY 779

  Fly  1167 QCVCPKDYTGDPYKSGCIFESGTPKSKCLSN-DDCASNLACLEGSCVSPCSSLLCGSNAYCETEQ 1230
            :|.|.|.:.|               ..|..| |:||||....:|:|:...|...           
  Rat   780 RCTCKKGFKG---------------YNCQVNIDECASNPCLNQGTCLDDVSGYT----------- 818

  Fly  1231 HAGWCRCRVGYVKNGDGDCVSQCQDVICGDGALC--IPTSEGPTCKCPQGQLGNPFPGGSCS--T 1291
                |.|.:.|........::.|....|.:.|:|  .|..|..||.|..|     :.|..|:  .
  Rat   819 ----CHCMLPYTGKNCQTVLAPCSPNPCENAAVCKEAPNFESFTCLCAPG-----WQGQRCTVDV 874

  Fly  1292 DQCSAARPCGERQICINGRCKERCEGVVCGIGATCDRNNGKCICEPNFVGNPDLICMPPIEQAKC 1356
            |:| .::||....||.|.:....||                  |.|.|.|   :.|...|.....
  Rat   875 DEC-VSKPCMNNGICHNTQGSYMCE------------------CPPGFSG---MDCEEDINDCLA 917

  Fly  1357 SPGCGENAHCEYGLGQSRCACNPGTFGNPYEGCGAQSKNVCQPNSCGPNAECRAVGNHISCLCPQ 1421
            :| |.....|...:....|.|.||..|:..:    ...|.|....|.....|....|..:|.||.
  Rat   918 NP-CQNGGSCVDKVNTFSCLCLPGFVGDKCQ----TDMNECLSEPCKNGGTCSDYVNSYTCTCPA 977

  Fly  1422 GFSGNPYIGCQ-DVDECANKPCGLNAACLNRAGGFECLCLSGHAGNPYSSCQPIESKFC-QDANK 1484
            ||.|   :.|: ::|||....|.....|::....|.|||..|..|           .|| .|.|:
  Rat   978 GFHG---VHCENNIDECTESSCFNGGTCVDGINSFSCLCPVGFTG-----------PFCLHDINE 1028

  Fly  1485 CQCNERVECPEGYSCQKGQCKNLCSQASCGPRAICDAGNCICPMGYIGDPHDQVHGCSIRGQCGN 1549
            |..|   .|....:|..|                .....|.||:||                   
  Rat  1029 CSSN---PCLNSGTCVDG----------------LGTYRCTCPLGY------------------- 1055

  Fly  1550 DADCLHSEICFQLGKGLRKCVDACSKIQCGPNALCVSEDHRSSCICSDGFFGNPSN-LQVGCQP- 1612
                        .||..:..|:.||...|.....|..|..|..|:|..|:.|...: |.|.|:. 
  Rat  1056 ------------TGKNCQTLVNLCSPSPCKNKGTCAQEKARPRCLCPPGWDGAYCDVLNVSCKAA 1108

  Fly  1613 --ERTVPEEEDKCKSDQDCSRGYGCQASVNGIKECINLCSNVVCGPNELCKINPAGHA-ICNCAE 1674
              ::.||.|             :.||.|  ||  |||                 ||:. .|.|..
  Rat  1109 ALQKGVPVE-------------HLCQHS--GI--CIN-----------------AGNTHHCQCPL 1139

  Fly  1675 SYVWNPVVSSCEKPSLPDCTSDANCPDASACRPDVLGVLKCVAICDAFTCPANSVCVARQHQGRC 1739
            .|..    |.||: .|.:|.|:. |...:.|...:.|.                         ||
  Rat  1140 GYTG----SYCEE-QLDECASNP-CQHGATCSDFIGGY-------------------------RC 1173

  Fly  1740 DCLNGFVGNPNDRNGCQPAQKHHCRNHAECQESEACIK---------DESTQTLGCRPACDTVKC 1795
            :|:.|:.|     ..|: .:...|:|. .||....||.         ...|:.|.|....|....
  Rat  1174 ECVPGYQG-----VNCE-YEVDECQNQ-PCQNGGTCIDLVNHFKCSCPPGTRGLLCEENIDDCAG 1231

  Fly  1796 GPRAV----CVTNNHQAQCQCPPGPFAGDPYD-PFNGCQSVPC-----------VYNHDCP-PSQ 1843
            ||..:    ||.......|:|.|| |||:..: ..|.|.|.||           ..|:.|. .|.
  Rat  1232 GPHCLNGGQCVDRIGGYSCRCLPG-FAGERCEGDINECLSNPCSSEGSLDCIQLKNNYQCVCRSA 1295

  Fly  1844 MCNRMTHTCFDVCDEESCGDNAICLAEDH---RAVCQCPPGFKG----DPLPEVACTK------- 1894
            ...|...|..|||.::.|.:...|....:   ..:|:|||||.|    ....:|.|.:       
  Rat  1296 FTGRHCETFLDVCPQKPCLNGGTCAVASNVPDGFICRCPPGFSGARCQSSCGQVKCRRGEQCVHT 1360

  Fly  1895 --------------QGGCAAGTCHPSAICEVTPEGPV--CKCPPLFVGDAKSGGCRPDGQCPNGD 1943
                          :.|||:..|.....|....:.|.  |:|.|.|.|........|....|   
  Rat  1361 ASGPHCFCLNRKDCESGCASNPCQHGGTCYPQRQPPYYSCRCSPPFWGSHCESYTAPTSTPP--- 1422

  Fly  1944 ADCPANTICAGGVCQNPCDNACGSNA------ECKVINRKPVCSC--PLR-FQPISDTAKDGCAR 1999
            |.| .:..||.......||.||.|:|      :|.:....|..:|  .|| ::.|::...:.|  
  Rat  1423 ATC-LSQYCADKARDGICDEACNSHACQWDGGDCSLTMEDPWANCTSSLRCWEYINNQCDELC-- 1484

  Fly  2000 TISKCLTDVDCGGALCYNGQCRIACRNSQDCSDGESCLKNVCVVACLDHSQCASGLACVEGHCTI 2064
            ..::||.|         |.:|:   |||:.|...:         .|.||.:        :.||..
  Rat  1485 NTAECLFD---------NFECQ---RNSKTCKYDK---------YCADHFK--------DNHCDK 1520

  Fly  2065 GCRSNKECKQD 2075
            || :|:||..|
  Rat  1521 GC-NNEECGWD 1530

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
dpyNP_001260032.1 EGF_3 137..166 CDD:463759 10/30 (33%)
EGF_CA 212..247 CDD:238011 9/34 (26%)
EGF_CA 255..>286 CDD:214542 16/30 (53%)
EGF_CA 298..331 CDD:238011 10/33 (30%)
EGF_CA 338..373 CDD:238011 11/34 (32%)
EGF_CA 413..456 CDD:238011 16/44 (36%)
EGF_CA 457..490 CDD:238011 11/32 (34%)
EGF_CA 497..>529 CDD:214542 13/33 (39%)
EGF_CA 580..>612 CDD:214542 12/32 (38%)
EGF_3 676..702 CDD:463759 13/25 (52%)
EGF_CA 1022..1056 CDD:214542 14/34 (41%)
EGF_CA 2227..2260 CDD:238011
EGF_CA 2393..>2422 CDD:214542
Herpes_BLLF1 <4032..4489 CDD:282904
Herpes_BLLF1 <4334..4797 CDD:282904
PHA03255 4845..>5021 CDD:165513
DUF5585 4959..5356 CDD:465521
PHA03247 <5170..5799 CDD:223021
PRK12495 <5749..5877 CDD:183558
PHA03247 <5865..6485 CDD:223021
Chi1 6385..>6609 CDD:442692
Herpes_BLLF1 <6607..7102 CDD:282904
Herpes_BLLF1 <7027..7490 CDD:282904
DUF5585 7550..7954 CDD:465521
DUF5585 7881..>8130 CDD:465521
Atrophin-1 18652..>19081 CDD:460830
PHA03247 <18873..19514 CDD:223021
ZP 22576..22811 CDD:214579
Notch2NP_077334.2 EGF_CA 109..143 CDD:238011 7/36 (19%)
EGF_CA 182..218 CDD:238011 18/36 (50%)
EGF_CA 260..296 CDD:238011 14/73 (19%)
EGF_CA 298..335 CDD:238011 15/42 (36%)
EGF_CA 415..454 CDD:238011 17/68 (25%)
EGF_CA 456..492 CDD:238011 20/41 (49%)
EGF_CA 495..530 CDD:238011 16/117 (14%)
EGF_CA 532..567 CDD:238011 15/40 (38%)
EGF_CA 570..604 CDD:238011 14/42 (33%)
EGF_CA 608..643 CDD:238011 11/41 (27%)
EGF_CA 645..679 CDD:238011 14/57 (25%)
EGF_CA 682..717 CDD:238011 14/37 (38%)
EGF_CA 757..793 CDD:238011 14/50 (28%)
EGF_CA 795..831 CDD:238011 12/50 (24%)
EGF_CA 873..909 CDD:238011 13/57 (23%)
EGF_CA 911..947 CDD:238011 9/36 (25%)
EGF_CA 949..985 CDD:238011 11/38 (29%)
EGF_CA 987..1022 CDD:238011 11/45 (24%)
EGF_CA 1025..1061 CDD:238011 14/85 (16%)
EGF_CA 1117..1147 CDD:238011 15/67 (22%)
EGF_CA 1151..1185 CDD:238011 10/64 (16%)
EGF_CA 1188..1223 CDD:238011 8/35 (23%)
EGF_CA 1225..1262 CDD:238011 12/37 (32%)
EGF_CA 1264..1302 CDD:238011 9/37 (24%)
EGF_CA <1312..1343 CDD:238011 9/30 (30%)
Notch 1423..1456 CDD:459658 10/33 (30%)
Negative regulatory region (NRR). /evidence=ECO:0000250 1425..1677 36/139 (26%)
Notch 1463..1497 CDD:459658 9/44 (20%)
Notch 1501..1534 CDD:459658 12/48 (25%)
NOD 1539..1594 CDD:462014
NODP 1618..1673 CDD:462229
JMTM_Notch2 1659..1740 CDD:411986
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1751..1788
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1794..1813
ANK repeat 1827..1874 CDD:293786
ANK 1 1827..1871
ANKYR <1851..2066 CDD:440430
ANK 2 1876..1905
ANK repeat 1877..1907 CDD:293786
ANK 3 1909..1939
ANK repeat 1943..1974 CDD:293786
ANK 4 1943..1972
ANK repeat 1976..2007 CDD:293786
ANK 5 1976..2005
ANK repeat 2009..2040 CDD:293786
ANK 6 2009..2038
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2097..2116
Atrophin-1 2105..>2422 CDD:460830
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2380..2471
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.