DRSC/TRiP Functional Genomics Resources




Protein Alignment uif and Notch2

DIOPT Version: 10

Sequence 1: NP_001162899.1 Gene: uif / 33983 FlyBaseID: FBgn0031879 Length: 3589 Species: Drosophila melanogaster
Sequence 2: NP_035058.2 Gene: Notch2 / 18129 MGIID: 97364 Length: 2473 Species: Mus musculus


Alignment Length: 1868 Identity: 463/1868 (24%)
Similarity: 629/1868 (33%) Gaps: 672/1868 (35%)
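The summary percentages can be recomputed from the raw counts. The sketch below is an observation from the numbers on this page, not documented DIOPT behaviour: the three values shown (24%, 33%, 35%) match integer truncation of count/length, whereas rounding to the nearest integer would give 25%, 34% and 36%. The function names are hypothetical.

```python
def percent_truncated(count: int, length: int) -> int:
    """Percentage with the fractional part dropped (floor for non-negative input)."""
    return (100 * count) // length


def percent_rounded(count: int, length: int) -> int:
    """Percentage rounded to the nearest integer, for comparison."""
    return round(100 * count / length)


ALIGNMENT_LENGTH = 1868
summary = {"Identity": 463, "Similarity": 629, "Gaps": 672}

for label, count in summary.items():
    print(label,
          percent_truncated(count, ALIGNMENT_LENGTH),
          percent_rounded(count, ALIGNMENT_LENGTH))
```

Only the truncated column reproduces the figures in the summary above.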


- Green residues have known domain annotations, which are detailed below.


  Fly  1821 GNQCPPLRALKSQISR---GFN-----------------CNVGEVLNMDTSDVPRCLHCPAG--- 1862
            |..|.|...|.....|   ||.                 |..|...:|.:.|...|. |..|   
Mouse    76 GGTCVPQGMLGKATCRCAPGFTGEDCQYSTSHPCFVSRPCQNGGTCHMLSRDTYECT-CQVGFTG 139

  Fly  1863 ----------TYVSEGQNSCTYCPRGYYQNRDRQGTCLRCPAGTYTKEEGTKSQAD---C-IP-V 1912
                      ::..|..::||        :...|.:| :||||.    .|.|.:||   | || .
Mouse   140 KQCQWTDACLSHPCENGSTCT--------SVASQFSC-KCPAGL----TGQKCEADINECDIPGR 191

  Fly  1913 CGYGTYSPTGLVPCLECPRNSFTAEPPTGGFKDCQACPAQSFT-------YQPAAS----NKDLC 1966
            |.:|.       .||..|          |.:: || || |.||       |.|.|.    |...|
Mouse   192 CQHGG-------TCLNLP----------GSYR-CQ-CP-QGFTGQHCDSPYVPCAPSPCVNGGTC 236

  Fly  1967 R--------AKCAPGTYSATGLAPCSPCPLHHYQGAA----GAQSCN-ECPSNMRTDSPASKGRE 2018
            |        ..|.||...:|.......||.|..|...    |..:.| .||... |....::..:
Mouse   237 RQTGDFTFECNCLPGFEGSTCERNIDDCPNHKCQNGGVCVDGVNTYNCRCPPQW-TGQFCTEDVD 300

  Fly  2019 QCKPVVCGEGACQHGGLCVPMGHDIQCFCPAGFSGRRCEQDIDECA------------------- 2064
            :|   :....|||:||.|........|.|..|:||..|.::||:||                   
Mouse   301 EC---LLQPNACQNGGTCTNRNGGYGCVCVNGWSGDDCSENIDDCAYASCTPGSTCIDRVASFSC 362

  Fly  2065 ------------------SQPCYNGGQCKDLPQG--YRCECPAGYSGINCQEEASDC---GNDTC 2106
                              |.||:.|..|...|..  |.|.||.||.|.:|.|:..:|   .::.|
Mouse   363 LCPEGKAGLLCHLDDACISNPCHKGALCDTNPLNGQYICTCPQGYKGADCTEDVDECAMANSNPC 427

  Fly  2107 PARAMCKNEPGYKNVTCLCRSGYTGDQCDVTIDPCTANGNPCGNGASCQALEQ-GRYKCECVPGW 2170
            .....|.|..|..:  |.|..||.|.:|::.|:.|  :.:||.|.|:|  |:: |.:.|.|:||:
Mouse   428 EHAGKCVNTDGAFH--CECLKGYAGPRCEMDINEC--HSDPCQNDATC--LDKIGGFTCLCMPGF 486

  Fly  2171 EGIHCEQNINDCSENPCLLGANCTDLVNDFQCACPPGFTGK------------------------ 2211
            :|:|||..:|:|..|||:....|.|.||.|||.|||||||.                        
Mouse   487 KGVHCELEVNECQSNPCVNNGQCVDKVNRFQCLCPPGFTGPVCQIDIDDCSSTPCLNGAKCIDHP 551

  Fly  2212 --------------RCEQKIDLCLSEPCKHGTCVDRLFDHECVCHPGWTGSACDINIDDCENRPC 2262
                          .|::.||.|..:||.||.|.|.:..:.|:|:||:.|:.|...||:|.:.||
Mouse   552 NGYECQCATGFTGILCDENIDNCDPDPCHHGQCQDGIDSYTCICNPGYMGAICSDQIDECYSSPC 616

  Fly  2263 ANEGTCVDLVDGYSCNCEPGYTGKNCQHTIDDCASNPCQHGATCVDQLDGFSCKCRPGYVGLSCE 2327
            .|:|.|:|||:||.|||:||.:|.||:...||||||||.|| .|||.::.:||.|.||:.|..|.
Mouse   617 LNDGRCIDLVNGYQCNCQPGTSGLNCEINFDDCASNPCMHG-VCVDGINRYSCVCSPGFTGQRCN 680

  Fly  2328 AEIDECLSDPCNPVGTERCLDLDNKFECVCRDGFKGPLCATDIDDCEAQPCLNNGICRDRVGGFE 2392
            .:||||.|:||....|  |::..|.|.|:|.:|...|.|.:.:::|.:.||: :|.|...:.|::
Mouse   681 IDIDECASNPCRKGAT--CINDVNGFRCICPEGPHHPSCYSQVNECLSNPCI-HGNCTGGLSGYK 742

  Fly  2393 CGCEPGWSGMRCEQQVTTCGAQAPCQNDASCIDLFQDYFCVCPSGTDGKNCETAPERCIGDPCMH 2457
            |.|:.||.|:.||.....| ...||||..:|.:|...|.|.|..|..|.||:...:.|..:||::
Mouse   743 CLCDAGWVGVNCEVDKNEC-LSNPCQNGGTCNNLVNGYRCTCKKGFKGYNCQVNIDECASNPCLN 806

  Fly  2458 GGKCQDFGSGLNCSCPADYSGIGCQYEYDACEEHVCQNGAT------------------------ 2498
            .|.|.|..||..|.|...|:|..||.....|..:.|:|.|.                        
Mouse   807 QGTCFDDVSGYTCHCMLPYTGKNCQTVLAPCSPNPCENAAVCKEAPNFESFSCLCAPGWQGKRCT 871

  Fly  2499 ----------CVDNG------AGYSCQCPPGFTGRNCEQDIVDCKDNSCPPGATCVDLTNGFYCQ 2547
                      |::||      ..|.|:|||||:|.:||:||.||..|.|..|.:|||..|.|.||
Mouse   872 VDVDECISKPCMNNGVCHNTQGSYVCECPPGFSGMDCEEDINDCLANPCQNGGSCVDHVNTFSCQ 936

  Fly  2548 CPFNMTGDDCRKAIQVDYDLYFSDPSRS--TAAQVVPFPTGEANSLTVAMWVQFAQKDDRGIFFT 2610
            |.....||.|    |.|.:...|:|.::  |.:..|       ||.|......|           
Mouse   937 CHPGFIGDKC----QTDMNECLSEPCKNGGTCSDYV-------NSYTCTCPAGF----------- 979

  Fly  2611 LYGVQSARMTQQRRMLLQAHSSGVQVSLFEDQPDAFLSFGEYTSVNDGQWHHVAVVWDGISGQLQ 2675
             :||..                       |:..|   ...|.:..|.|      ...|||:....
Mouse   980 -HGVHC-----------------------ENNID---ECTESSCFNGG------TCVDGINSFSC 1011

  Fly  2676 LITEGLIASKMEYGAGGSLPGYLWAVLGLPQPYGLSNELAYSDSGFQGTITKAQVWARALDITSE 2740
            |.                             |.|.:......|                      
Mouse  1012 LC-----------------------------PVGFTGPFCLHD---------------------- 1025

  Fly  2741 IQKQVRDCRSEPVLYPGLILNWAGYEVTSGGVERNVPSLCGQRKCPVGYTGANCQQLV--VDKEP 2803
                :.:|.|.|.|.       ||..|...|..|.:        ||:||||.|||.||  ..:.|
Mouse  1026 ----INECSSNPCLN-------AGTCVDGLGTYRCI--------CPLGYTGKNCQTLVNLCSRSP 1071

  Fly  2804 PVVEHCPGDLWVIAKN-GSAVVSWDEPHFSDNIGVTKIYERNGHRSGTTLLW-GTY-DITYIASD 2865
                         .|| |:.|.....||.....|                 | |.| |:..::..
Mouse  1072 -------------CKNKGTCVQEKARPHCLCPPG-----------------WDGAYCDVLNVSCK 1106

  Fly  2866 A---------------------AGNTASCSFKVSLLTDFCPALAD-----PVGGSQVCKDWGAGG 2904
            |                     ||||..|...:.....:|....|     |......|.|:..| 
Mouse  1107 AAALQKGVPVEHLCQHSGICINAGNTHHCQCPLGYTGSYCEEQLDECASNPCQHGATCNDFIGG- 1170

  Fly  2905 QFKVCEIACNAGLRFSEPVPEFYTCGAEGFWRPTREPSMPLVYPSCSPSKPAQRVFRIKMLFPSD 2969
                                  |.|                   .|.|                 
Mouse  1171 ----------------------YRC-------------------ECVP----------------- 1177

  Fly  2970 VLCNKAGQAVLRQKVTNSVNGLNRDWNFCSYAIE----------GTRECKDIQIDVKCDHYRGTQ 3024
                             ...|:|     |.|.::          ||  |.|:....||....||:
Mouse  1178 -----------------GYQGVN-----CEYEVDECQNQPCQNGGT--CIDLVNHFKCSCPPGTR 1218

  Fly  3025 NNRVRRQAKD--GGVYVMEAELPVVNDDDDDLTLTGRQGRQQTGGDTYTLEIAFPAANDPVVHTS 3087
            .........:  ||.:                .|.|.|...:.||.|......|           
Mouse  1219 GLLCEENIDECAGGPH----------------CLNGGQCVDRIGGYTCRCLPGF----------- 1256

  Fly  3088 TGERSTVKQLLEKLILEDDQFAVQEILPNTVPDPASL---ELGSEYACPVGQVVMIPDCVPCAIG 3149
            .|||           .|.|   :.|.|.|......||   :|.:.|.|..........|     .
Mouse  1257 AGER-----------CEGD---INECLSNPCSSEGSLDCVQLKNNYNCICRSAFTGRHC-----E 1302

  Fly  3150 TFYD-SANKTCIACSRGTYQSEAGQLQCSKCPVIAGRPGVTAGPGARSAADCKERCPAGKYFDAE 3213
            ||.| ...|.|:  :.||            |.|.:..|         ....|  |||.|  |.. 
Mouse  1303 TFLDVCPQKPCL--NGGT------------CAVASNMP---------DGFIC--RCPPG--FSG- 1339

  Fly  3214 TGLCR-SCGHGFYQPNEGSFSCELCGLGQTTRSTEATSR------KECRDECSSGQQLGADGRCE 3271
             ..|: |||.            ..|..|:....|::..|      |:|...|:|          .
Mouse  1340 -ARCQSSCGQ------------VKCRRGEQCIHTDSGPRCFCLNPKDCESGCAS----------N 1381

  Fly  3272 PCPRGTYRLQGVQPSCAACPLGRTTPKVGASSVEECTLPVCSAGTYLNATQNMCIECRKGYYQSE 3336
            ||..|.......||...:|   |..|..|.|..|..|.|          |......|:..|...:
Mouse  1382 PCQHGGTCYPQRQPPHYSC---RCPPSFGGSHCELYTAP----------TSTPPATCQSQYCADK 1433

  Fly  3337 SQQTSCLQCPPNHSTKITGATSKSECTNPCEHIAEGKPHCDVNAYCIMVPETSDFKCECKPGFNG 3401
            ::...|.:...:|:.:..|    .:|:...|         |..|.|     ||..:|        
Mouse  1434 ARDGICDEACNSHACQWDG----GDCSLTME---------DPWANC-----TSTLRC-------- 1472

  Fly  3402 TGMACTDVCDGFCENSGACVKD----LKGTPSCR----CVGSFTGPHC 3441
             .....:.||..| |:..|:.|    .:.:.:|:    |...|...||
Mouse  1473 -WEYINNQCDEQC-NTAECLFDNFECQRNSKTCKYDKYCADHFKDNHC 1518
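The middle line of each alignment block can be tallied to cross-check the summary counts. This is a minimal sketch that assumes the usual pairwise-alignment convention, in which `|` marks an identical pair and `:` or `.` mark substitutions of decreasing similarity; the page itself does not define these symbols, and the function name is hypothetical.

```python
from collections import Counter


def tally_match_line(match_line: str) -> dict:
    """Count the symbols on the line between the Fly and Mouse rows."""
    counts = Counter(match_line)
    return {
        "identical": counts["|"],  # assumed: same residue in both sequences
        "strong": counts[":"],     # assumed: conservative substitution
        "weak": counts["."],       # assumed: weaker similarity or mismatch
        "unmarked": counts[" "],   # columns with no symbol (gaps or mismatches)
    }


# First ten columns of the match line under Fly 2263 / Mouse 617.
print(tally_match_line(".|:|.|:|||"))
```

Trailing spaces on match lines may be lost when the page is copied as text, so the "unmarked" count is only reliable if the line is padded to the width of the sequence rows.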

Known Domains:


Indicated by green residues in the alignment.

Gene Sequence Domain Region External ID Identity
uif NP_001162899.1 CLECT 36..167 CDD:214480
LDLa 172..206 CDD:238060
CUB 220..319 CDD:238001
CUB 326..435 CDD:238001
CUB 439..550 CDD:238001
PHA02927 <568..733 CDD:222943
CCP 675..731 CDD:153056
PHA02639 697..>834 CDD:165022
FA58C 833..976 CDD:330301
FXa_inhibition 988..>1016 CDD:464251
CCP 1051..1106 CDD:153056
FA58C 1303..1443 CDD:330301
HYR 1463..1547 CDD:460572
HYR 1548..1631 CDD:460572
Ephrin_rec_like 1862..1909 CDD:429604 14/62 (23%)
Ephrin_rec_like 1916..1966 CDD:429604 16/60 (27%)
Ephrin_rec_like 1973..2020 CDD:429604 11/51 (22%)
EGF 2025..2055 CDD:394967 11/29 (38%)
EGF_CA 2059..2095 CDD:238011 17/74 (23%)
EGF_CA 2138..2176 CDD:238011 15/38 (39%)
EGF_CA 2178..2214 CDD:238011 19/73 (26%)
EGF_CA 2253..2289 CDD:238011 20/35 (57%)
EGF_CA 2292..2327 CDD:238011 19/34 (56%)
EGF_CA 2369..2405 CDD:238011 11/35 (31%)
EGF_CA 2411..2444 CDD:238011 13/32 (41%)
EGF_CA 2486..2520 CDD:238011 15/73 (21%)
EGF_CA 2522..2557 CDD:238011 17/34 (50%)
LamG 2590..2735 CDD:473984 17/144 (12%)
HYR 2799..2877 CDD:460572 18/101 (18%)
Ephrin_rec_like 3149..3200 CDD:429604 10/51 (20%)
Ephrin_rec_like 3207..3254 CDD:429604 11/53 (21%)
Ephrin_rec_like 3268..3307 CDD:429604 11/38 (29%)
Ephrin_rec_like 3315..3362 CDD:429604 6/46 (13%)
Notch2 NP_035058.2 EGF_CA 109..143 CDD:238011 7/34 (21%)
EGF_CA 182..218 CDD:238011 17/55 (31%)
EGF_CA 260..296 CDD:238011 9/36 (25%)
EGF_CA 298..335 CDD:238011 12/39 (31%)
EGF_CA 415..454 CDD:238011 10/40 (25%)
EGF_CA 456..492 CDD:238011 15/39 (38%)
EGF_CA 495..530 CDD:238011 19/34 (56%)
EGF_CA 532..567 CDD:238011 0/34 (0%)
EGF_CA 570..604 CDD:238011 14/33 (42%)
EGF_CA 608..643 CDD:238011 20/34 (59%)
EGF_CA 645..679 CDD:238011 19/34 (56%)
EGF_CA 682..717 CDD:238011 15/36 (42%)
EGF_CA 757..793 CDD:238011 13/36 (36%)
EGF_CA 795..831 CDD:238011 12/35 (34%)
EGF_CA 873..909 CDD:238011 11/35 (31%)
EGF_CA 911..947 CDD:238011 18/39 (46%)
EGF_CA 949..985 CDD:238011 11/77 (14%)
EGF_CA 987..1022 CDD:238011 10/72 (14%)
EGF_CA 1025..1061 CDD:238011 17/76 (22%)
EGF_CA 1117..1147 CDD:238011 5/29 (17%)
EGF_CA 1151..1185 CDD:238011 11/114 (10%)
EGF_CA 1188..1223 CDD:238011 8/36 (22%)
EGF_CA 1225..1262 CDD:238011 12/74 (16%)
EGF_CA 1264..1302 CDD:238011 10/45 (22%)
EGF_CA <1312..1343 CDD:238011 12/59 (20%)
Notch 1423..1456 CDD:459658 5/36 (14%)
Negative regulatory region (NRR). /evidence=ECO:0000250 1425..1679 24/122 (20%)
Notch 1463..1497 CDD:459658 11/48 (23%)
Notch 1501..1534 CDD:459658 5/18 (28%)
NOD 1539..1594 CDD:462014
NODP 1620..1675 CDD:462229
JMTM_Notch2 1661..1742 CDD:411986
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1755..1778
ANK repeat 1828..1875 CDD:293786
ANK 1 1828..1872
ANKYR <1852..2067 CDD:440430
ANK 2 1877..1906
ANK repeat 1878..1908 CDD:293786
ANK 3 1910..1940
ANK repeat 1944..1975 CDD:293786
ANK 4 1944..1973
ANK repeat 1977..2008 CDD:293786
ANK 5 1977..2006
ANK repeat 2010..2041 CDD:293786
ANK 6 2010..2039
PHA03247 <2069..2424 CDD:223021
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2098..2117
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2122..2169
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2382..2473
Blue background indicates that the domain is not in the aligned region.
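
The domain rows above follow a regular pattern: domain name, residue range, an optional CDD identifier, and an optional identity fraction with its percentage. The sketch below parses one such row with a regular expression. The pattern and the function name are assumptions inferred from the rows on this page, not a format published by DIOPT; rows that also carry the gene and sequence columns would need those stripped first.

```python
import re

# name, start..end (either bound may carry '<' or '>'),
# optional "CDD:<id>", optional "n/m (p%)"
ROW = re.compile(
    r"^(?P<name>.+?)\s+"
    r"(?P<start>[<>]?\d+)\.\.(?P<end>[<>]?\d+)"
    r"(?:\s+(?P<xref>CDD:\d+))?"
    r"(?:\s+(?P<ident>\d+)/(?P<total>\d+)\s+\((?P<pct>\d+)%\))?$"
)


def parse_domain_row(line: str):
    """Return a dict of fields for one domain row, or None if it does not match."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    row = m.groupdict()
    for key in ("ident", "total", "pct"):
        if row[key] is not None:
            row[key] = int(row[key])
    return row


row = parse_domain_row("EGF_CA 2253..2289 CDD:238011 20/35 (57%)")
print(row["name"], row["start"], row["end"], row["ident"], row["total"], row["pct"])
```

The per-domain percentages in the table are consistent with rounding to the nearest integer (for example, 19/34 is about 55.9% and is listed as 56%), so `round(100 * ident / total)` can serve as a sanity check on parsed rows.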
