DRSC/TRiP Functional Genomics Resources


Protein Alignment: uif and Notch1

DIOPT Version: 9

Sequence 1: NP_001162899.1  Gene: uif / 33983  FlyBaseID: FBgn0031879  Length: 3589  Species: Drosophila melanogaster
Sequence 2: NP_001099191.1  Gene: Notch1 / 25496  RGDID: 3187  Length: 2531  Species: Rattus norvegicus


Alignment Length: 2053  Identity: 513/2053 (24%)
Similarity: 705/2053 (34%)  Gaps: 677/2053 (32%)
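
The percentages in the summary above appear to be truncated rather than rounded: 513/2053 is roughly 24.99%, yet it is reported as 24%. The following is a minimal sketch of that arithmetic, assuming integer truncation. The function name `percent_truncated` is hypothetical and is not taken from the DIOPT source.

```python
def percent_truncated(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage, truncated toward zero.

    Assumption: the report truncates instead of rounding. A zero total is
    rejected explicitly so the caller never divides by zero.
    """
    if total == 0:
        raise ValueError("total must be non-zero")
    return (100 * count) // total


# Counts copied from the alignment summary.
ALIGNMENT_LENGTH = 2053
counts = {"identity": 513, "similarity": 705, "gaps": 677}

percentages = {
    name: percent_truncated(count, ALIGNMENT_LENGTH)
    for name, count in counts.items()
}

for name, value in percentages.items():
    print(f"{name}: {counts[name]}/{ALIGNMENT_LENGTH} ({value}%)")
```

Under this assumption the three values come out as 24, 34 and 32, which matches the reported figures. Ordinary rounding would give 25% for identity and 33% for gaps instead.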


- Residues shown in green on the original page have known domain annotations, which are detailed below.


  Fly  1632 KASPCVDWELQPPANGAINCLPGDRGIECIATCKPGFRFTDGEPLKTFSCETSRLWRPTSVVPDC 1696
            :|.||..   .|.|||. .|||.:....|  .|.|||.                       .|.|
  Rat   140 QADPCAS---NPCANGG-QCLPFESSYIC--GCPPGFH-----------------------GPTC 175

  Fly  1697 ---VSENTEQAAYHVTASITYRANGAVAQSCLGQYQEVLAQHYGGL--NQLLSQRCS--AVNVNM 1754
               |:|.::...                          |.:| ||.  |::.|.||:  |.:...
  Rat   176 RQDVNECSQNPG--------------------------LCRH-GGTCHNEIGSYRCACRATHTGP 213

  Fly  1755 NVTFVKSVPMLLEENVVKMDFILSILPAVRQP-QLYDLCGSTLNLIFDLS-VP-YASAVIDDLLN 1816
            :..                   |..:|....| |....|..|.:...:.: :| :|....::   
  Rat   214 HCE-------------------LPYVPCSPSPCQNGGTCRPTGDTTHECACLPGFAGQNCEE--- 256

  Fly  1817 IANIGNQCPPLRALKSQISRGFNCNVGEVLNMDTSDVPRCLHCP---AGTYVSEGQNSCTYCPRG 1878
              |: :.||           |.||..|... :|..:...| .||   .|.|.:|..:.|...|..
  Rat   257 --NV-DDCP-----------GNNCKNGGAC-VDGVNTYNC-RCPPEWTGQYCTEDVDECQLMPNA 305

  Fly  1879 YYQNRDRQGTCLRCPAG-------TYTKEEGTKSQADCI-PVCGYGT--------------YSPT 1921
             .||   .|||.....|       .:|.|:.:::..||. ..|..|.              :..|
  Rat   306 -CQN---GGTCHNSHGGYNCVCVNGWTGEDCSENIDDCASAACFQGATCHDRVASFYCECPHGRT 366

  Fly  1922 GLV----------PCLE---CPRNSFTAEPPTGGFKDCQACPAQSFTYQPAASNKDLCRAKCAPG 1973
            ||:          ||.|   |..|      |..|...| .||: .:|....:.:.|.|       
  Rat   367 GLLCHLNDACISNPCNEGSNCDTN------PVNGKAIC-TCPS-GYTGPACSQDVDEC------- 416

  Fly  1974 TYSATGLAPCS------------PCP-LHHYQGAAGAQSCNECPSNMRTDSPASKGREQCKPVVC 2025
               |.|..||.            .|. |..|.|.......|||.||                   
  Rat   417 ---ALGANPCEHAGKCLNTLGSFECQCLQGYTGPRCEIDVNECISN------------------- 459

  Fly  2026 GEGACQHGGLCVPMGHDIQCFCPAGFSGRRCEQDIDECASQPCYNGGQCKDLPQGYRCECPAGYS 2090
               .||:...|:....:.||.|..|:.|..||.:.|||||.||.:.|:|.|....:.|:||.|:|
  Rat   460 ---PCQNDATCLDQIGEFQCICMPGYEGVYCEINTDECASSPCLHNGRCVDKINEFLCQCPKGFS 521

  Fly  2091 GINCQEEASDCGNDTCPARAMCKNEPGYKNVTCLCRSGYTGDQCDVTIDPCTANGNPCGNGASCQ 2155
            |..||.:..:|.:..|...|.|.:.|  ...||:|..||||..|:|.||.|  :.:||..|....
  Rat   522 GHLCQYDVDECASTPCKNGAKCLDGP--NTYTCVCTEGYTGTHCEVDIDEC--DPDPCHYGLCKD 582

  Fly  2156 ALEQGRYKCECVPGWEGIHCEQNINDCSENPCLLGANCTDLVNDFQCACPPGFTGKRCEQKIDLC 2220
            .:  ..:.|.|.||:.|.|||.|||:|...||..|..|.|..|.:.|.|..|.||..||..:|.|
  Rat   583 GV--ATFTCLCQPGYTGHHCETNINECHSQPCRHGGTCQDRDNYYLCLCLKGTTGPNCEINLDDC 645

  Fly  2221 LSEPCKHGTCVDRLFDHECVCHPGWTGSACDINIDDCENRPCANEGTCVDLVDGYSCNCEPGYTG 2285
            .|.||..|||:|::..:||.|.||:|||.|::|||:|...||.|.|||.|.:.|::|.|..||..
  Rat   646 ASNPCDSGTCLDKIDGYECACEPGYTGSMCNVNIDECAGSPCHNGGTCEDGIAGFTCRCPEGYHD 710

  Fly  2286 KNCQHTIDDCASNPCQHGATCVDQLDGFSCKCRPGYVGLSCEAEIDECLSDPCNPVGTERCLDLD 2350
            ..|...:::|.||||.||| |.|.|:|:.|.|.||:.|.:|:...:||.|:||...||  |.|:.
  Rat   711 PTCLSEVNECNSNPCIHGA-CRDGLNGYKCDCAPGWSGTNCDINNNECESNPCVNGGT--CKDMT 772

  Fly  2351 NKFECVCRDGFKGPLCATDIDDCEAQPCLNNGICRDRVGGFECGCEPGWSGMRCEQQVTTCGAQA 2415
            :.:.|.||:||.||.|.|:|::|.:.||||.|.|.|.|.|::|.|...::|..||..:..| |.:
  Rat   773 SGYVCTCREGFSGPNCQTNINECASNPCLNQGTCIDDVAGYKCNCPLPYTGATCEVVLAPC-ATS 836

  Fly  2416 PCQNDASCIDL--FQDYFCVCPSGTDGKNCETAPERCIGDPCMHGGKCQDFGSGLNCSCPADYSG 2478
            ||:|...|.:.  ::.:.||||:|..|:.||.....|:..||.||..||:......|.|.|.|:|
  Rat   837 PCKNSGVCKESEDYESFSCVCPTGWQGQTCEIDINECVKSPCRHGASCQNTNGSYRCLCQAGYTG 901

  Fly  2479 IGCQYEYDACEEHVCQNGATCVDNGAGYSCQCPPGFTGRNCEQDIVDCKDNSCPPGATCVDLTNG 2543
            ..|:.:.|.|..:.|.||.:|.|......|.|.|||.|..||:||.:|..|.|..||.|.|..:.
  Rat   902 RNCESDIDDCRPNPCHNGGSCTDGVNAAFCDCLPGFQGAFCEEDINECASNPCQNGANCTDCVDS 966

  Fly  2544 FYCQCPFNMTGDDCRKAIQVDYDLYFSDPSRSTAAQVVPFPTGEANSLTVAMWVQFAQKDDRGIF 2608
            :.|.||....|..|                             |.|:                  
  Rat   967 YTCTCPTGFNGIHC-----------------------------ENNT------------------ 984

  Fly  2609 FTLYGVQSARMTQQRRMLLQAHSSGVQVSLFEDQPDAFLSFGEYTSVNDGQWHHVAVVWDGISGQ 2673
                                              ||.    .|.:..|.|      ...|||:..
  Rat   985 ----------------------------------PDC----TESSCFNGG------TCVDGINSF 1005

  Fly  2674 LQLITEGLIASKMEYGAGGSLPGYLWAVLGLPQPYGLSNELAYSDSGFQGTITKAQVWARALDIT 2738
            ..|...|...|..:|                                                  
  Rat  1006 TCLCPPGFTGSYCQY-------------------------------------------------- 1020

  Fly  2739 SEIQKQVRDCRSEPVLYPGLILNWAG-YEVTSGGVERNVPSLCGQRKCPVGYTGANCQQLV--VD 2800
                 .|.:|.|.|.|:.|...:..| |:.|                ||.||||.|||.||  .|
  Rat  1021 -----DVNECDSRPCLHGGTCQDSYGTYKCT----------------CPQGYTGLNCQNLVRWCD 1064

  Fly  2801 KEPPVVEHCPGDLWVIAKNGSAVVSWDEPHFSDNIGVTKIYERNGH-----RSGTTLLWGTYDIT 2860
            ..|             .|||.                 |.::.|..     |||.|         
  Rat  1065 SAP-------------CKNGG-----------------KCWQTNTQYHCECRSGWT--------- 1090

  Fly  2861 YIASDAAGNTASCSFKVSLLTDFCPALADPVGGSQVCKDWGAGGQFKVCEIACNAGLRFSEPVPE 2925
                         .|...:|:..|...|...|             ..|..:..:.||...|....
  Rat  1091 -------------GFNCDVLSVSCEVAAQKRG-------------IDVTLLCQHGGLCVDEEDKH 1129

  Fly  2926 FYTCGAEGFWRPTREPSMPLVYPSCSPSKPAQRVFRIKMLFPSDVLCNKAGQAVLRQKVTNSVNG 2990
            :..|.| |:.....|..:    ..|||: |.|.....     :|.|...:.:.|.....:|....
  Rat  1130 YCHCQA-GYTGSYCEDEV----DECSPN-PCQNGATC-----TDYLGGFSCKCVAGYHGSNCSEE 1183

  Fly  2991 LNRDWNFC-SYAIEGTRECKDIQIDVKCDHYRGTQNNRVRRQAKDGGVYVMEAELPVVNDDD--D 3052
            :|.    | |...:....|.|:....||...||||           ||:.      .:|.||  .
  Rat  1184 INE----CLSQPCQNGGTCIDLTNTYKCSCPRGTQ-----------GVHC------EINVDDCHP 1227

  Fly  3053 DLTLTGRQGR--------QQTGGDTYTLEIAFPAANDPVVHTSTGERSTVKQLLEKLILEDDQFA 3109
            .|....|..:        .|.||.|.|....|           .|||           .|.|   
  Rat  1228 PLDPASRSPKCFNNGTCVDQVGGYTCTCPPGF-----------VGER-----------CEGD--- 1267

  Fly  3110 VQEILPNTVPDPAS----LELGSEY--ACPVGQV-----VMIPDC--VPCAIGTFYDSANKTCIA 3161
            |.|.|.|.. ||..    ::..:::  .|..|..     .:|..|  .||..|.....|:.|   
  Rat  1268 VNECLSNPC-DPRGTQNCVQRVNDFHCECRAGHTGRRCESVINGCRGKPCRNGGVCAVASNT--- 1328

  Fly  3162 CSRGTYQSEAGQLQCSKCPVIAGRPGVTAGPGARSAADCKERCPAGKYFDAETGLCRSCGHGFYQ 3226
             :||        ..| :||  ||..|.|....||:....  ||..|       |.|.|   |...
  Rat  1329 -ARG--------FIC-RCP--AGFEGATCENDARTCGSL--RCLNG-------GTCIS---GPRS 1369

  Fly  3227 PN---EGSFSCELCGLGQTTRSTEATSRKECRDECSSGQQLGADGRCEPCPRGT-YRLQGVQPSC 3287
            |.   .|||:...|   |...|:.......|.::          |.|||..... ||        
  Rat  1370 PTCLCLGSFTGPEC---QFPASSPCVGSNPCYNQ----------GTCEPTSESPFYR-------- 1413

  Fly  3288 AACP------------------LGRTTPKVGASSVEE-CTLPVCSAGTYLNATQNMC-IECRK-- 3330
            ..||                  .||..|   ...:|| |.||.|..    :|...:| ::|..  
  Rat  1414 CLCPAKFNGLLCHILDYSFTGGAGRDIP---PPQIEEACELPECQE----DAGNKVCNLQCNNHA 1471

  Fly  3331 -GYYQSESQQTSCLQCPPNHSTKITGATSKSECTNPCEHIAEGKPHCDV---NAYCIMVPETSDF 3391
             |:...:        |..|.:......|...:|   .::.::|  |||.   :|.|:.    ..|
  Rat  1472 CGWDGGD--------CSLNFNDPWKNCTQSLQC---WKYFSDG--HCDSQCNSAGCLF----DGF 1519

  Fly  3392 KCECKPGFNGTGMACTDVCDGFCE---NSGACVKDLKGTPSCRCVGSFTGPHCAERSEFAYIAGG 3453
            .|:...|      .|..:.|.:|:   :.|.|.   :|..|..|  .:.|..|||.......|| 
  Rat  1520 DCQLTEG------QCNPLYDQYCKDHFSDGHCD---QGCNSAEC--EWDGLDCAEHVPERLAAG- 1572

  Fly  3454 IAGAVIFIIIIVLLIWMICVRSTK---RRDPKKMLTPAI----DQTGSQVNF-YYGAHTPYAESI 3510
                   .:::|:|:....:|:..   .|:...:|...:    |..|.|:.| |||......:  
  Rat  1573 -------TLVLVVLLPPDQLRNNSFHFLRELSHVLHTNVVFKRDAQGQQMIFPYYGREEELRK-- 1628

  Fly  3511 APSHHSTYAHYYDDEEDGWEMPNFYNETYMKDGLHGGK 3548
                     |.......||.      .|.:..|.:||:
  Rat  1629 ---------HPIKRSAVGWA------TTSLLPGTNGGR 1651

Known Domains:


Indicated by green residues in the alignment on the original page.

Gene    Sequence        Domain          Region      External ID  Identity
uif     NP_001162899.1  CLECT           36..167     CDD:214480
                        LDLa            172..206    CDD:238060
                        CUB             220..319    CDD:238001
                        CUB             326..435    CDD:238001
                        CUB             439..550    CDD:238001
                        PHA02927        <568..733   CDD:222943
                        CCP             675..731    CDD:153056
                        CCP             736..791    CDD:153056
                        EGF_CA          792..829    CDD:214542
                        FA58C           828..977    CDD:214572
                        FA58C           833..976    CDD:238014
                        FXa_inhibition  988..>1016  CDD:291342
                        CCP             1051..1106  CDD:153056
                        FA58C           1297..1444  CDD:214572
                        FA58C           1303..1443  CDD:238014
                        DUF5011         1463..1547  CDD:295940
                        DUF5011         1548..1631  CDD:295940
                        GCC2_GCC3       1862..1909  CDD:285001   13/53 (25%)
                        GCC2_GCC3       1916..1966  CDD:285001   16/76 (21%)
                        GCC2_GCC3       1973..2020  CDD:285001   13/59 (22%)
                        EGF             2025..2055  CDD:278437   8/29 (28%)
                        EGF_CA          2059..2095  CDD:238011   16/35 (46%)
                        EGF_CA          2138..2176  CDD:238011   12/37 (32%)
                        EGF_CA          2178..2214  CDD:238011   15/35 (43%)
                        EGF_CA          2253..2289  CDD:238011   16/35 (46%)
                        EGF_CA          2292..2327  CDD:238011   17/34 (50%)
                        EGF_CA          2330..2366  CDD:238011   16/35 (46%)
                        EGF_CA          2369..2405  CDD:238011   14/35 (40%)
                        EGF_CA          2411..2444  CDD:238011   12/34 (35%)
                        EGF_CA          2486..2520  CDD:238011   13/33 (39%)
                        EGF_CA          2522..2557  CDD:238011   13/34 (38%)
                        LamG            2590..2735  CDD:304605   12/144 (8%)
                        DUF5011         2799..2877  CDD:295940   12/82 (15%)
                        GCC2_GCC3       3149..3200  CDD:285001   14/50 (28%)
                        GCC2_GCC3       3207..3254  CDD:285001   12/49 (24%)
                        GCC2_GCC3       3268..3307  CDD:285001   13/58 (22%)
                        GCC2_GCC3       3315..3362  CDD:285001   7/50 (14%)
Notch1  NP_001099191.1  EGF_CA          142..175    CDD:238011   15/61 (25%)
                        EGF_CA          178..216    CDD:238011   11/64 (17%)
                        EGF_CA          257..293    CDD:238011   13/49 (27%)
                        EGF_CA          295..332    CDD:238011   10/40 (25%)
                        EGF_CA          335..370    CDD:238011   7/34 (21%)
                        EGF_CA          412..450    CDD:238011   10/47 (21%)
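
Each domain row follows the pattern: domain name, residue range, CDD identifier, and, where the domain falls inside the aligned region, an identity count. The following is a minimal sketch of how such rows could be parsed for downstream analysis. The names `DomainHit` and `parse_domain_row` are hypothetical, and the regular expression is an assumption based only on the rows shown here, not on any documented DIOPT export format.

```python
import re
from typing import NamedTuple, Optional


class DomainHit(NamedTuple):
    name: str
    start: int
    end: int
    cdd_id: int
    matches: Optional[int]  # identical residues, if reported
    length: Optional[int]   # residues compared, if reported


# A leading "<" or ">" on a coordinate (as in "<568..733" or "988..>1016")
# is accepted and dropped.
_ROW = re.compile(
    r"(?P<name>\S+)\s+[<>]?(?P<start>\d+)\.\.[<>]?(?P<end>\d+)"
    r"\s+CDD:(?P<cdd>\d+)"
    r"(?:\s+(?P<matches>\d+)/(?P<length>\d+)\s+\(\d+%\))?"
)


def parse_domain_row(line: str) -> Optional[DomainHit]:
    """Parse one domain row; return None if the line is not a domain row."""
    m = _ROW.search(line)
    if m is None:
        return None
    matches = m.group("matches")
    length = m.group("length")
    return DomainHit(
        name=m.group("name"),
        start=int(m.group("start")),
        end=int(m.group("end")),
        cdd_id=int(m.group("cdd")),
        matches=int(matches) if matches is not None else None,
        length=int(length) if length is not None else None,
    )


hit = parse_domain_row("EGF_CA 2059..2095 CDD:238011 16/35 (46%)")
print(hit)
```

Because the pattern is searched for anywhere in the line, a row that also carries the gene symbol and sequence accession in front of the domain name is parsed the same way.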