DRSC/TRiP Functional Genomics Resources


Protein Alignment dpy and NOTCH2

DIOPT Version: 9

Sequence 1: NP_001260032.1 Gene: dpy / 318824 FlyBaseID: FBgn0053196 Length: 22949 Species: Drosophila melanogaster
Sequence 2: NP_077719.2 Gene: NOTCH2 / 4853 HGNCID: 7882 Length: 2471 Species: Homo sapiens


Alignment Length: 2224 Identity: 542/2224 (24%)
Similarity: 736/2224 (33%) Gaps: 834/2224 (37%)
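
The percentages in the summary follow directly from the counts over the alignment length. The sketch below reproduces them in Python. It assumes the page truncates to a whole percent rather than rounding, since 834/2224 is exactly 37.5% yet is reported as 37%. The function name `percent_of_alignment` is hypothetical and not part of DIOPT.

```python
def percent_of_alignment(count: int, alignment_length: int) -> int:
    """Whole-number percentage of alignment columns, truncated (assumed behavior)."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    # Integer arithmetic avoids floating-point rounding surprises.
    return count * 100 // alignment_length


ALIGNMENT_LENGTH = 2224
summary = {"Identity": 542, "Similarity": 736, "Gaps": 834}

for label, count in summary.items():
    pct = percent_of_alignment(count, ALIGNMENT_LENGTH)
    print(f"{label}: {count}/{ALIGNMENT_LENGTH} ({pct}%)")
```

With ordinary rounding the gap figure would come out as 38%, which is why truncation is assumed here.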


- Residues shown in green on the original page have known domain annotations, listed under Known Domains below.


  Fly     1 MKIFLPLVTWIVLLL----SSAVHSQYSQQPQPFKTNLRANSRFRGEVFYLNLENGYFGCQVNES 61
            |....|.:.|.:|.|    ::..|:                         |...:||..| |||.
Human     1 MPALRPALLWALLALWLCCAAPAHA-------------------------LQCRDGYEPC-VNEG 39

  Fly    62 TEYLQLFNLSKLC----DGTQDC-----FLG-----ADELSKELKCTNDCDKDGTKCTHGACL-N 111
                       :|    :||..|     |||     .|...|     |.|...|| |...|.| .
Human    40 -----------MCVTYHNGTGYCKCPEGFLGEYCQHRDPCEK-----NRCQNGGT-CVAQAMLGK 87

  Fly   112 GVCHCNDGYGGCNCVDKDENECKQRPCDVFAHCTN-------TLGSFTCTCFPGYRGNGFHCEDI 169
            ..|.|..|:.|.:|     ......||.|...|.|       :..::.|||..|:.|.       
Human    88 ATCRCASGFTGEDC-----QYSTSHPCFVSRPCLNGGTCHMLSRDTYECTCQVGFTGK------- 140

  Fly   170 DECQ--DPAIAARCVENAECCNLPAHFLCKCKDGYEGDGEVLC-TDVDECRNPENCGPNALCTNT 231
             |||  |..::..|...:.|..:...|.|||..|:.|.   .| |||:||..|.:|.....|.|.
Human   141 -ECQWTDACLSHPCANGSTCTTVANQFSCKCLTGFTGQ---KCETDVNECDIPGHCQHGGTCLNL 201

  Fly   232 PGNYTCSCPDGYVGNN--------------------------------PYREG--CQ-DVDECSY 261
            ||:|.|.||.|:.|..                                |..||  |: ::|:|  
Human   202 PGSYQCQCPQGFTGQYCDSLYVPCAPSPCVNGGTCRQTGDFTFECNCLPGFEGSTCERNIDDC-- 264

  Fly   262 PN-VCGPGAICTNLEGSYRCDCPPGYDGDGRSESGCVDQDECARTP--CGRNADCLNTDGSFRCL 323
            || .|..|.:|.:...:|.|.|||.:.|...:|    |.|||...|  |.....|.|.:|.:.|:
Human   265 PNHRCQNGGVCVDGVNTYNCRCPPQWTGQFCTE----DVDECLLQPNACQNGGTCANRNGGYGCV 325

  Fly   324 CPDGYSGDPMNGCEDVDECATNNPCGLGAECVNLGGSFQCRCPSGFVLEHDPHADQLPQPLNTQQ 388
            |.:|:|||..:  |::|:||..: |..|:.|::...||.|.||.|                    
Human   326 CVNGWSGDDCS--ENIDDCAFAS-CTPGSTCIDRVASFSCMCPEG-------------------- 367

  Fly   389 LGYGPGATDIAPYQRTSGAGLACLDIDEC-NQPDGVAKCGTNAKCINFP--GSYRCLCPSGFQGQ 450
                             .|||.|...|.| :.|     |...|.|...|  |.|.|.||.|::| 
Human   368 -----------------KAGLLCHLDDACISNP-----CHKGALCDTNPLNGQYICTCPQGYKG- 409

  Fly   451 GYLHC-ENINEC---QDNPCGENAICTDTVGSFVCTCKPDYTGDPFRGCVDIDECTALDKPCGQH 511
              ..| |:::||   ..|||.....|.:|.|:|.|.|...|.|.  |..:||:||.:  .||...
Human   410 --ADCTEDVDECAMANSNPCEHAGKCVNTDGAFHCECLKGYAGP--RCEMDINECHS--DPCQND 468

  Fly   512 AVCENTVPGYNCKCPQGYDGKPDPKVACEQVDVNILCSSNFDCTNNAECIEN----QCFCLDGFE 572
            |.|.:.:.|:.|.|..|:.|     |.|| :::| .|.|| .|.||.:|::.    ||.|..|| 
Human   469 ATCLDKIGGFTCLCMPGFKG-----VHCE-LEIN-ECQSN-PCVNNGQCVDKVNRFQCLCPPGF- 524

  Fly   573 PIGSSC-VDIDECRTHAEVCGPHAQCLNTPGSYGCECEAGYVGSPPRMACKQPCEDVRCGAHAYC 636
             .|..| :|||:|  .:..|...|:|::.|..|.|:|..|:.|                      
Human   525 -TGPVCQIDIDDC--SSTPCLNGAKCIDHPNGYECQCATGFTG---------------------- 564

  Fly   637 KPDQNEAYCVCEDGWTYNPSDVAAGCVDIDECD---VMHGPFGSCGQNATCTNSAGGFTCACPPG 698
                    .:||:              :||.||   ..||         .|.:....:||.|.||
Human   565 --------VLCEE--------------NIDNCDPDPCHHG---------QCQDGIDSYTCICNPG 598

  Fly   699 FSGDPHSKCVD-VDECRTGASKCGAGAECVNVPGGGYTCRCPGNTIADPDPSVRCVPIVSCSAN- 761
            :.|   :.|.| :|||.  :|.|.....|:::. .||.|.|...|..           |:|..| 
Human   599 YMG---AICSDQIDECY--SSPCLNDGRCIDLV-NGYQCNCQPGTSG-----------VNCEINF 646

  Fly   762 EDCPGNSICDATKRCLCPEPNIGNDCRHPCEALNCGAHAQCMLANGQAQCLCAPGYTGNSALAGG 826
            :||..|                      ||      .|..||....:..|:|:||:||..     
Human   647 DDCASN----------------------PC------IHGICMDGINRYSCVCSPGFTGQR----- 678

  Fly   827 CN-DIDECRANPCAEKAICSNTAGGYLCQCPGGSSGDPYREGCITSKTVGCSDANPCATGETCVQ 890
            || |||||.:|||.:.|.|.|...|:.|.||.|    |:...|.       |..|.|.: ..|:.
Human   679 CNIDIDECASNPCRKGATCINGVNGFRCICPEG----PHHPSCY-------SQVNECLS-NPCIH 731

  Fly   891 DSYTG---NSVCICRQGYERNSENGQCQ-DVDECSVQRGKPACGLNALCKNLPGSYECRCPQGHN 951
            .:.||   ...|:|..|:    ....|: |.:||   ...| |.....|.||...|.|.|.:|..
Human   732 GNCTGGLSGYKCLCDAGW----VGINCEVDKNEC---LSNP-CQNGGTCDNLVNGYRCTCKKGFK 788

  Fly   952 G------------NPFIMCEIC----NTPECQCQSPYKLVGNSC--VLSGCSSGQACPSGAECIS 998
            |            ||.:....|    :...|.|..||  .|.:|  ||:.||. ..|.:.|.|..
Human   789 GYNCQVNIDECASNPCLNQGTCFDDISGYTCHCVLPY--TGKNCQTVLAPCSP-NPCENAAVCKE 850

  Fly   999 IAGGVSY-CACPKGYQTQPDGSC-VDVDECEERGAQLCAFGAQCVNKPGSYSCHCPEGYQGDAYN 1061
            .....|| |.|..|:|.|   .| :|:|||..:.   |.....|.|..|||.|.||.|:.|    
Human   851 SPNFESYTCLCAPGWQGQ---RCTIDIDECISKP---CMNHGLCHNTQGSYMCECPPGFSG---- 905

  Fly  1062 GLCALAQRKCAAD-RECAANEKCIQPG---------ECVCPPPYFLD--PQDNNKCKSPCERFPC 1114
                   ..|..| .:|.|| .|...|         .|:|.|.:..|  ..|.|:|.|.    ||
Human   906 -------MDCEEDIDDCLAN-PCQNGGSCMDGVNTFSCLCLPGFTGDKCQTDMNECLSE----PC 958

  Fly  1115 GINAKCTP-SDPPQCMCEAGFKGDPLLGCTDE-DECSHLPCAYGAYCVNKKGGYQCVCPKDYTGD 1177
            .....|:. .:...|.|:|||.|   :.|.:. :||:...|..|..||:....:.|:||..:|| 
Human   959 KNGGTCSDYVNSYTCKCQAGFDG---VHCENNINECTESSCFNGGTCVDGINSFSCLCPVGFTG- 1019

  Fly  1178 PYKSGCIFESGTPKSKCLSNDDCASNLACLEGSCVSPCSSLLCGSNAYCETEQHAGWCRCRVGYV 1242
               |.|:.|.          ::|:|:....||:||.       |...|        .|.|.:||.
Human  1020 ---SFCLHEI----------NECSSHPCLNEGTCVD-------GLGTY--------RCSCPLGYT 1056

  Fly  1243 KNGDGDCVSQCQDVICGDGALCIPTSEGPTCKCPQGQLGNPFPGGSCSTDQCSAARPCGERQICI 1307
            .......|:.|....|.:...|:.......|.||.|     :.|..|.....|.......|.:.:
Human  1057 GKNCQTLVNLCSRSPCKNKGTCVQKKAESQCLCPSG-----WAGAYCDVPNVSCDIAASRRGVLV 1116

  Fly  1308 NGRCKERCEGVVCGIGATCDRNNGKCICEPNFVGNPDLICMPPIEQAKCSPGCGENAHCEYGLGQ 1372
            ...|:.  .||....|     |...|.|...:.|:   .|...:::...:| |...|.|...:|.
Human  1117 EHLCQH--SGVCINAG-----NTHYCQCPLGYTGS---YCEEQLDECASNP-CQHGATCSDFIGG 1170

  Fly  1373 SRCACNPGTFGNPYEGCGAQSK-NVCQPNSCGPNAECRAVGNHISCLCPQGFSGNPYIGCQD-VD 1435
            .||.|.||     |:|...:.: :.||...|.....|..:.||..|.||.|..|   :.|:: :|
Human  1171 YRCECVPG-----YQGVNCEYEVDECQNQPCQNGGTCIDLVNHFKCSCPPGTRG---LLCEENID 1227

  Fly  1436 ECANKPCGLNAA-CLNRAGGFECLCLSGHAGNPYSSCQPIESKFCQDANKCQCNERVECPEGYSC 1499
            :||..|..||.. |::|.||:.|.||.|.||   ..|:       .|.|:|..|.          
Human  1228 DCARGPHCLNGGQCMDRIGGYSCRCLPGFAG---ERCE-------GDINECLSNP---------- 1272

  Fly  1500 QKGQCKNLCSQASCGPRAICDAGNCICPMGYIGDPHDQVHGCSIRGQCGNDADCLHSEICFQLGK 1564
                                                     ||..|    ..||:.         
Human  1273 -----------------------------------------CSSEG----SLDCIQ--------- 1283

  Fly  1565 GLRKCVDACSKIQCGPNALCVSEDHRSSCICSDGFFGNPSNLQVGCQPERTVPEEEDKCKSDQDC 1629
                                ::.|:  .|:|...|.|.                           
Human  1284 --------------------LTNDY--LCVCRSAFTGR--------------------------- 1299

  Fly  1630 SRGYGCQASVNGIKECINLCSNVVCGPNELCKINPAGHAICNCAESYVWNPVVSSCEKPSLP--- 1691
                .|:..|:            ||                                 |.:|   
Human  1300 ----HCETFVD------------VC---------------------------------PQMPCLN 1315

  Fly  1692 --DCTSDANCPDASACRPDVLGVLKCVAICDAFTCPANSVCVARQHQGRCDCLNGFVGNPNDRNG 1754
              .|...:|.||...||                 ||.                 ||.|       
Human  1316 GGTCAVASNMPDGFICR-----------------CPP-----------------GFSG------- 1339

  Fly  1755 CQPAQKHHCRNHAECQESEACIKDESTQTLGCRPACDTVKCGPRAVCVTNNHQAQCQCPPGPFAG 1819
                        |.||.|                 |..|||.....||......:|.||      
Human  1340 ------------ARCQSS-----------------CGQVKCRKGEQCVHTASGPRCFCP------ 1369

  Fly  1820 DPYDPFNGCQSVPCVYNHDCPPSQMCNRMTHTCFDVCDEESCGDNAICLAEDHRAVCQCPPGFKG 1884
            .|.|..:||.|.||.:...|.|.:                          :.....|||.|.|.|
Human  1370 SPRDCESGCASSPCQHGGSCHPQR--------------------------QPPYYSCQCAPPFSG 1408

  Fly  1885 DPLPEVACTKQGGCAAGTCHPSAICEVTPEGPVCKCPPLFVGDAKSGGCRPDGQCPNGDADCPAN 1949
                       ..|...|..||.             ||                     |.| .:
Human  1409 -----------SRCELYTAPPST-------------PP---------------------ATC-LS 1427

  Fly  1950 TICAGGVCQNPCDNACGSNA------ECKVINRKP--VCSCPLRFQPISDTAKDGC---ARTISK 2003
            ..||.......||.||.|:|      :|.:....|  .||.||   |..|...:.|   ..|: :
Human  1428 QYCADKARDGVCDEACNSHACQWDGGDCSLTMENPWANCSSPL---PCWDYINNQCDELCNTV-E 1488

  Fly  2004 CLTDVDCGGALCYNGQCRIACRNSQDCSDGESCLKNVCVVACLDHSQCASGLACVEGHCTIGCRS 2068
            ||.|         |.:|:   .||:.|...:         .|.||.:        :.||..||.|
Human  1489 CLFD---------NFECQ---GNSKTCKYDK---------YCADHFK--------DNHCDQGCNS 1524

  Fly  2069 NK------ECKQDQ 2076
            .:      :|..||
Human  1525 EECGWDGLDCAADQ 1538

Known Domains:


Indicated by green residues in the alignment.

Gene Sequence Domain Region External ID Identity
dpy NP_001260032.1 EGF_3 137..166 CDD:289699 10/35 (29%)
EGF_CA 212..247 CDD:238011 16/34 (47%)
EGF_CA 255..>286 CDD:214542 12/31 (39%)
EGF_CA 298..331 CDD:238011 13/34 (38%)
EGF_CA 338..373 CDD:238011 12/34 (35%)
EGF_CA 413..456 CDD:238011 15/46 (33%)
EGF_CA 457..490 CDD:238011 12/35 (34%)
EGF_CA 497..>529 CDD:214542 11/31 (35%)
EGF_CA 580..>612 CDD:214542 11/31 (35%)
EGF_3 676..702 CDD:289699 6/25 (24%)
EGF_CA 1022..1056 CDD:214542 14/33 (42%)
EGF_CA 2227..2260 CDD:238011
EGF_CA 2393..>2422 CDD:214542
DUF4758 4088..4282 CDD:292572
DUF4696 4127..4678 CDD:292395
DUF4758 4275..4448 CDD:292572
DUF4758 4377..4574 CDD:292572
DUF4758 4581..4754 CDD:292572
DUF4758 4683..4847 CDD:292572
DUF4758 4785..4964 CDD:292572
DUF4696 4841..5385 CDD:292395
DUF4758 4887..5098 CDD:292572
DUF4758 5193..5371 CDD:292572
DUF4758 5294..5487 CDD:292572
DUF4758 5445..5650 CDD:292572
DUF4758 5700..5877 CDD:292572
DUF4696 5756..6396 CDD:292395
DUF4758 5802..5979 CDD:292572
DUF4758 5964..6171 CDD:292572
DUF4758 6181..6360 CDD:292572
DUF4696 6339..6999 CDD:292395
DUF4758 6662..6839 CDD:292572
DUF4758 6764..6941 CDD:292572
DUF4758 6866..7045 CDD:292572
DUF4758 6968..7179 CDD:292572
DUF4696 7024..7569 CDD:292395
DUF4758 7172..7383 CDD:292572
DUF4696 7330..7964 CDD:292395
DUF4758 7400..7587 CDD:292572
DUF4758 7538..7707 CDD:292572
DUF4758 7798..7979 CDD:292572
DUF4758 7946..8126 CDD:292572
YppG 18767..>18832 CDD:290883
Med25_SD1 18795..18955 CDD:288132
MISS 19026..19258 CDD:292450
ZP 22576..22811 CDD:214579
Zona_pellucida <22714..22810 CDD:278526
NOTCH2 NP_077719.2 EGF_CA 182..218 CDD:238011 16/35 (46%)
EGF_CA 260..296 CDD:238011 13/37 (35%)
EGF_CA 298..335 CDD:238011 15/36 (42%)
EGF_CA 415..454 CDD:238011 14/40 (35%)
EGF_CA 456..492 CDD:238011 14/42 (33%)
EGF_CA 495..530 CDD:238011 14/38 (37%)
EGF_CA 532..568 CDD:238011 13/67 (19%)
EGF_CA 570..604 CDD:238011 13/45 (29%)
EGF_CA 608..643 CDD:238011 12/48 (25%)
EGF_CA 645..679 CDD:238011 15/66 (23%)
EGF_CA 682..717 CDD:238011 17/38 (45%)
EGF_CA 757..793 CDD:238011 13/39 (33%)
EGF_CA 795..831 CDD:238011 8/37 (22%)
EGF_CA 873..909 CDD:238011 15/49 (31%)
EGF_CA 911..947 CDD:238011 10/36 (28%)
EGF_CA 949..985 CDD:238011 13/42 (31%)
EGF_CA 987..1022 CDD:238011 12/38 (32%)
EGF_CA 1026..1061 CDD:238011 12/59 (20%)
EGF_CA 1117..1147 CDD:238011 8/39 (21%)
EGF_CA 1151..1185 CDD:238011 12/39 (31%)
EGF_CA 1188..1223 CDD:238011 11/37 (30%)
EGF_CA 1225..1262 CDD:238011 16/39 (41%)
EGF_CA 1264..1302 CDD:238011 14/154 (9%)
EGF_CA <1312..1343 CDD:238011 13/83 (16%)
Notch 1423..1456 CDD:278494 10/33 (30%)
Notch 1463..1497 CDD:278494 12/46 (26%)
Notch 1501..1534 CDD:278494 9/49 (18%)
NOD 1540..1591 CDD:284282
NODP 1620..1673 CDD:284987
Ank_2 <1820..1907 CDD:289560
ANK repeat 1827..1874 CDD:293786
ANK 1872..1997 CDD:238125
ANK repeat 1877..1907 CDD:293786
Ank_2 1881..1974 CDD:289560
ANK 1938..2062 CDD:238125
ANK repeat 1943..1974 CDD:293786
Ank_2 1948..2040 CDD:289560
ANK repeat 1976..2007 CDD:293786
ANK repeat 2009..2040 CDD:293786
Chorion_3 <2292..2421 CDD:253174
DUF3454 2381..2444 CDD:288764
Domains listed without an Identity value were shown with a blue background on the original page, indicating that the domain is not in the aligned region.
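
The Region column can be checked against the aligned range mechanically. The sketch below parses region strings such as `137..166`, `255..>286`, and `<22714..22810`, then tests whether each falls inside the aligned fly range of 1 to 2076 (the last fly coordinate in the alignment above). Reading `<` and `>` as markers of a partial domain match is an assumption based on common conserved-domain notation. The names `Region`, `parse_region`, and `overlaps` are hypothetical.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    start: int
    end: int
    start_open: bool  # "<" precedes the start coordinate (assumed: partial match)
    end_open: bool    # ">" precedes the end coordinate (assumed: partial match)


_REGION = re.compile(r"^(<?)(\d+)\.\.(>?)(\d+)$")


def parse_region(text: str) -> Region:
    """Parse a region string like '255..>286' into coordinates and flags."""
    match = _REGION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized region: {text!r}")
    lt, start, gt, end = match.groups()
    return Region(int(start), int(end), lt == "<", gt == ">")


def overlaps(region: Region, first: int, last: int) -> bool:
    """True if the region shares at least one position with [first, last]."""
    return region.start <= last and region.end >= first


FLY_ALIGNED = (1, 2076)  # fly coordinates covered by the alignment above

for text in ("137..166", "255..>286", "1022..1056", "2227..2260", "<22714..22810"):
    region = parse_region(text)
    print(text, region, overlaps(region, *FLY_ALIGNED))
```

In the table, the dpy domains that start at 2227 or later carry no Identity value, which matches the overlap test returning false for them.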


Information from Original Tools:


Tool             Simple Score  Weighted Score  Original Tool Information (BLAST Result, Score, Score Type, Cluster ID)
Compara          0             0.000           Not matched by this tool.
Domainoid        1             1.000           47  1.000  Domainoid score  I12013
eggNOG           0             0.000           Not matched by this tool.
Hieranoid        0             0.000           Not matched by this tool.
Homologene       0             0.000           Not matched by this tool.
Inparanoid       0             0.000           Not matched by this tool.
Isobase          0             0.000           Not matched by this tool.
OMA              0             0.000           Not matched by this tool.
OrthoDB          0             0.000           Not matched by this tool.
OrthoFinder      0             0.000           Not matched by this tool.
OrthoInspector   0             0.000           Not matched by this tool.
orthoMCL         0             0.000           Not matched by this tool.
Panther          0             0.000           Not matched by this tool.
Phylome          1             0.910           -  -
RoundUp          0             0.000           Not matched by this tool.
SonicParanoid    0             0.000           Not matched by this tool.
SwiftOrtho       0             0.000           Not matched by this tool.
TreeFam          0             0.000           Not matched by this tool.
User_Submission  0             0.000           Not matched by this tool.
Total            2             1.910
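
The final row of the table is consistent with summing the two score columns: two tools report a match, giving a simple score of 2 and a weighted score of 1.910. The sketch below checks that arithmetic in Python. It assumes the totals are plain column sums, which the numbers support but the page does not state. `Decimal` is used so that the three-decimal formatting is preserved exactly.

```python
from decimal import Decimal

# (tool, simple score, weighted score) for the tools that report a match.
# Every other tool in the table contributes 0 and 0.000.
matches = [
    ("Domainoid", 1, Decimal("1.000")),
    ("Phylome", 1, Decimal("0.910")),
]

simple_total = sum(simple for _, simple, _ in matches)
weighted_total = sum((weighted for _, _, weighted in matches), Decimal("0"))

print(simple_total, weighted_total)
```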
