DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment uif and Notch2

DIOPT Version :10

Sequence 1:NP_001162899.1 Gene:uif / 33983 FlyBaseID:FBgn0031879 Length:3589 Species:Drosophila melanogaster
Sequence 2:NP_077334.2 Gene:Notch2 / 29492 RGDID:3188 Length:2471 Species:Rattus norvegicus


Alignment Length:2001 Identity:490/2001 - (24%)
Similarity:690/2001 - (34%) Gaps:711/2001 - (35%)


- Green bases have known domain annotations that are detailed below.


  Fly  1696 CVSENTEQAAYHVTASITYRANGAVAQSC----LGQYQEVLAQH----------YGG--LNQLL- 1743
            ||:|.|         .:||. ||.....|    ||:|    .||          .||  :.|.: 
  Rat    35 CVNEGT---------CVTYH-NGTGYCRCPEGFLGEY----CQHRDPCEKNRCQNGGTCVTQAML 85

  Fly  1744 ---SQRCSAVNVNMNVTFVKSVPMLLE-----------------ENVVKMDFILSILPAVRQPQL 1788
               :.||:......:..:..|.|..:.                 |...::.|      ..:|.|.
  Rat    86 GKATCRCAPGFTGEDCQYSTSHPCFVSRPCQNGGTCHMLSWDTYECTCQVGF------TGKQCQW 144

  Fly  1789 YDLCGSTLNLIFDLSVPYASAVIDDLLNIANIGNQCPPLRALKSQISRGFNCN-----VGEVLNM 1848
            .|:|         ||.|..:            |:.|       |.::..|:|.     .|:..:.
  Rat   145 TDVC---------LSHPCEN------------GSTC-------SSVANQFSCRCPAGITGQKCDA 181

  Fly  1849 DTS--DVP-RCLHCPAGTYVS-EGQNSCTYCPRGYY-QNRD------------RQGTCLRCPAGT 1896
            |.:  |:| ||.|  .||.:: .|...| .||:|:. |:.|            ..|||.:  .|.
  Rat   182 DINECDIPGRCQH--GGTCLNLPGSYRC-QCPQGFTGQHCDSPYVPCAPSPCVNGGTCRQ--TGD 241

  Fly  1897 YTKE-------EGTKSQA---DC---------IPVCGYGTYS----P--TG------LVPCLECP 1930
            :|.|       ||:..:.   ||         :.|.|..||:    |  ||      :..||..|
  Rat   242 FTSECHCLPGFEGSNCERNIDDCPNHKCQNGGVCVDGVNTYNCRCPPQWTGQFCTEDVDECLLQP 306

  Fly  1931 ---RNSFTAEPPTGGFKDCQACPAQSFTYQPAASNKDLCR-AKCAPGTYSATGLAPCS-PCPLHH 1990
               :|..|.....||: .| .| ...::....:.|.|.|. |.|.||:.....:|..| .||   
  Rat   307 NACQNGGTCTNRNGGY-GC-VC-VNGWSGDDCSENIDDCAFASCTPGSTCIDRVASFSCLCP--- 365

  Fly  1991 YQGAAGAQSC---NECPSNMRTDSPASKGREQCKPVVCGEGACQHGGLC--VPMGHDIQCFCPAG 2050
             :|.||. .|   :.|.||                      .|..|.||  .|:.....|.||.|
  Rat   366 -EGKAGL-LCHLDDACISN----------------------PCHKGALCDTNPLNGQYICTCPQG 406

  Fly  2051 FSGRRCEQDIDECA---SQPCYNGGQCKDLPQGYRCECPAGYSGINCQEEASDCGNDTCPARAMC 2112
            :.|..|.:|:||||   |.||.:.|:|.:....:.|||..||:|..|:.:.::|.:|.|...|.|
  Rat   407 YKGADCTEDVDECAMANSNPCEHAGKCVNTDGAFHCECLKGYAGPRCEMDINECHSDPCQNDATC 471

  Fly  2113 KNEPGYKNVTCLCRSGYTGDQCDVTIDPCTANGNPCGNGASCQALEQGRYKCECVPGWEGIHCEQ 2177
            .::.|  ..||||..|:.|..|::.::.|  ..|||.|...| ..:..|::|.|.||:.|..|:.
  Rat   472 LDKIG--GFTCLCMPGFKGVHCELEVNEC--QSNPCVNNGQC-VDKVNRFQCLCPPGFTGPVCQI 531

  Fly  2178 NINDCSENPCLLGANCTDLVNDFQCACPPGFTGKRCEQKIDLCLSEPCKHGTCVDRLFDHECVCH 2242
            :|:|||..|||.||.|.|..|.::|.|..||||..|::.||.|..:||.||.|.|.:..:.|:|:
  Rat   532 DIDDCSSTPCLNGAKCIDHPNGYECQCATGFTGTLCDENIDNCDPDPCHHGQCQDGIDSYTCICN 596

  Fly  2243 PGWTGSACDINIDDCENRPCANEGTCVDLVDGYSCNCEPGYTGKNCQHTIDDCASNPCQHGATCV 2307
            ||:.|:.|...||:|.:.||.|:|.|:|||:||.|||:||.:|.||:...||||||||.||| ||
  Rat   597 PGYMGAICSDQIDECYSSPCLNDGRCIDLVNGYQCNCQPGTSGLNCEINFDDCASNPCLHGA-CV 660

  Fly  2308 DQLDGFSCKCRPGYVGLSCEAEIDECLSDPCNPVGTERCLDLDNKFECVCRDGFKGPLCATDIDD 2372
            |.::.:||.|.||:.|..|..:||||.|:||....|  |::..|.|.|:|.:|...|.|.:.:::
  Rat   661 DGINRYSCVCSPGFTGQRCNIDIDECASNPCRKGAT--CINDVNGFRCMCPEGPHHPSCYSQVNE 723

  Fly  2373 CEAQPCLNNGICRDRVGGFECGCEPGWSGMRCEQQVTTCGAQAPCQNDASCIDLFQDYFCVCPSG 2437
            |.:.||: :|.|...:.|::|.|:.||.|:.||.....| ...||||..:|.:|...|.|.|..|
  Rat   724 CLSSPCI-HGNCTGGLSGYKCLCDAGWVGINCEVDKNEC-LSNPCQNGGTCNNLVNGYRCTCKKG 786

  Fly  2438 TDGKNCETAPERCIGDPCMHGGKCQDFGSGLNCSCPADYSGIGCQYEYDACEEHVCQNGAT---- 2498
            ..|.||:...:.|..:||::.|.|.|..||..|.|...|:|..||.....|..:.|:|.|.    
  Rat   787 FKGYNCQVNIDECASNPCLNQGTCLDDVSGYTCHCMLPYTGKNCQTVLAPCSPNPCENAAVCKEA 851

  Fly  2499 ------------------------------CVDNG------AGYSCQCPPGFTGRNCEQDIVDCK 2527
                                          |::||      ..|.|:|||||:|.:||:||.||.
  Rat   852 PNFESFTCLCAPGWQGQRCTVDVDECVSKPCMNNGICHNTQGSYMCECPPGFSGMDCEEDINDCL 916

  Fly  2528 DNSCPPGATCVDLTNGFYCQCPFNMTGDDCRKAIQVDYDLYFSDPSRS--TAAQVVPFPTGEANS 2590
            .|.|..|.:|||..|.|.|.|.....||.|    |.|.:...|:|.::  |.:..|       ||
  Rat   917 ANPCQNGGSCVDKVNTFSCLCLPGFVGDKC----QTDMNECLSEPCKNGGTCSDYV-------NS 970

  Fly  2591 LTVAMWVQFAQKDDRGIFFTLYGVQSARMTQQRRMLLQAHSSGVQVSLFEDQPDAFLSFGEYTSV 2655
            .|......|            :||..                       |:..|   ...|.:..
  Rat   971 YTCTCPAGF------------HGVHC-----------------------ENNID---ECTESSCF 997

  Fly  2656 NDGQWHHVAVVWDGISGQLQLITEGLIASKMEYGAGGSLPGYLWAVLGLPQPYGLSNELAYSDSG 2720
            |.|      ...|||:....|.                             |.|.:......|  
  Rat   998 NGG------TCVDGINSFSCLC-----------------------------PVGFTGPFCLHD-- 1025

  Fly  2721 FQGTITKAQVWARALDITSEIQKQVRDCRSEPVLYPGLILNWAG-YEVTSGGVERNVPSLCGQRK 2784
                                    :.:|.|.|.|..|..::..| |..|                
  Rat  1026 ------------------------INECSSNPCLNSGTCVDGLGTYRCT---------------- 1050

  Fly  2785 CPVGYTGANCQQLVVDKEPPVVEH--------------CPGDLWVIAKNGSAVVSWDEPHFSDNI 2835
            ||:||||.|||.||....|...::              ||             ..||        
  Rat  1051 CPLGYTGKNCQTLVNLCSPSPCKNKGTCAQEKARPRCLCP-------------PGWD-------- 1094

  Fly  2836 GVTKIYERNGHRSGTTLLWGTY-DITYIASDA---------------------AGNTASCSFKVS 2878
                               |.| |:..::..|                     ||||..|...:.
  Rat  1095 -------------------GAYCDVLNVSCKAAALQKGVPVEHLCQHSGICINAGNTHHCQCPLG 1140

  Fly  2879 LLTDFCPALAD-----PVGGSQVCKDWGAGGQFKVCEIACNAGLRFSEPVPEFYTCGAEGFWRPT 2938
            ....:|....|     |......|.|:..|                       |.|         
  Rat  1141 YTGSYCEEQLDECASNPCQHGATCSDFIGG-----------------------YRC--------- 1173

  Fly  2939 REPSMPLVYPSCSPSKPAQRVFRIKMLFPSDVLCNKAGQAVLRQKVTNSVNGLNRDWNFCSYAIE 3003
                      .|.|                                  ...|:|     |.|.::
  Rat  1174 ----------ECVP----------------------------------GYQGVN-----CEYEVD 1189

  Fly  3004 ----------GTRECKDIQIDVKCDHYRGTQNNRVRRQAKD--GGVYVMEAELPVVNDDDDDLTL 3056
                      ||  |.|:....||....||:.........|  ||.:                .|
  Rat  1190 ECQNQPCQNGGT--CIDLVNHFKCSCPPGTRGLLCEENIDDCAGGPH----------------CL 1236

  Fly  3057 TGRQGRQQTGGDTYTLEIAFPAANDPVVHTSTGERSTVKQLLEKLILEDDQFAVQEILPNTVPDP 3121
            .|.|...:.||.:......|           .|||           .|.|   :.|.|.|.....
  Rat  1237 NGGQCVDRIGGYSCRCLPGF-----------AGER-----------CEGD---INECLSNPCSSE 1276

  Fly  3122 ASL---ELGSEYACPVGQVVMIPDCVPCAIGTFYD-SANKTCIACSRGTYQSEAGQLQCSKCPVI 3182
            .||   :|.:.|.|..........|     .||.| ...|.|:........|........:||  
  Rat  1277 GSLDCIQLKNNYQCVCRSAFTGRHC-----ETFLDVCPQKPCLNGGTCAVASNVPDGFICRCP-- 1334

  Fly  3183 AGRPGVTAGPGARSAADCKE-RCPAGKYFDAETGLCRSCGHGFYQPNEGSFSCELCGLGQTTRST 3246
               ||.:   |||..:.|.: :|..|:          .|.|....|:     | .|         
  Rat  1335 ---PGFS---GARCQSSCGQVKCRRGE----------QCVHTASGPH-----C-FC--------- 1368

  Fly  3247 EATSRKECRDECSSGQQLGADGRCEPCPRGTYRLQGVQPSCAACPLGRTTPKVGASSVEECTLPV 3311
              .:||:|...|:|          .||..|.......||...:|   |.:|....|..|..|.| 
  Rat  1369 --LNRKDCESGCAS----------NPCQHGGTCYPQRQPPYYSC---RCSPPFWGSHCESYTAP- 1417

  Fly  3312 CSAGTYLNATQNMCIECRKGYYQSESQQTSCLQCPPNHSTKITGATSKSECTNPCEHIAEGKPHC 3376
                     |......|...|...:::...|.:...:|:.:..|    .:|:...|         
  Rat  1418 ---------TSTPPATCLSQYCADKARDGICDEACNSHACQWDG----GDCSLTME--------- 1460

  Fly  3377 DVNAYCIMVPETSDFKCECKPGFNGTGMACTDVCDGFCENSGACVKD----LKGTPSCR----CV 3433
            |..|.|     ||..:|         .....:.||..| |:..|:.|    .:.:.:|:    |.
  Rat  1461 DPWANC-----TSSLRC---------WEYINNQCDELC-NTAECLFDNFECQRNSKTCKYDKYCA 1510

  Fly  3434 GSFTGPHC-----------------AERSEFAYIAGGIAGAVIFIIIIVLL 3467
            ..|...||                 |::.|  .:|.||      ::|:|||
  Rat  1511 DHFKDNHCDKGCNNEECGWDGLDCAADQPE--NLAEGI------LVIVVLL 1553

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
uifNP_001162899.1 CLECT 36..167 CDD:214480
LDLa 172..206 CDD:238060
CUB 220..319 CDD:238001
CUB 326..435 CDD:238001
CUB 439..550 CDD:238001
PHA02927 <568..733 CDD:222943
CCP 675..731 CDD:153056
PHA02639 697..>834 CDD:165022
FA58C 833..976 CDD:330301
FXa_inhibition 988..>1016 CDD:464251
CCP 1051..1106 CDD:153056
FA58C 1303..1443 CDD:330301
HYR 1463..1547 CDD:460572
HYR 1548..1631 CDD:460572
Ephrin_rec_like 1862..1909 CDD:429604 17/70 (24%)
Ephrin_rec_like 1916..1966 CDD:429604 16/64 (25%)
Ephrin_rec_like 1973..2020 CDD:429604 12/50 (24%)
EGF 2025..2055 CDD:394967 10/31 (32%)
EGF_CA 2059..2095 CDD:238011 16/38 (42%)
EGF_CA 2138..2176 CDD:238011 12/37 (32%)
EGF_CA 2178..2214 CDD:238011 18/35 (51%)
EGF_CA 2253..2289 CDD:238011 20/35 (57%)
EGF_CA 2292..2327 CDD:238011 20/34 (59%)
EGF_CA 2369..2405 CDD:238011 11/35 (31%)
EGF_CA 2411..2444 CDD:238011 13/32 (41%)
EGF_CA 2486..2520 CDD:238011 15/73 (21%)
EGF_CA 2522..2557 CDD:238011 16/34 (47%)
LamG 2590..2735 CDD:473984 17/144 (12%)
HYR 2799..2877 CDD:460572 14/113 (12%)
Ephrin_rec_like 3149..3200 CDD:429604 13/51 (25%)
Ephrin_rec_like 3207..3254 CDD:429604 8/46 (17%)
Ephrin_rec_like 3268..3307 CDD:429604 10/38 (26%)
Ephrin_rec_like 3315..3362 CDD:429604 6/46 (13%)
Notch2NP_077334.2 EGF_CA 109..143 CDD:238011 3/39 (8%)
EGF_CA 182..218 CDD:238011 14/38 (37%)
EGF_CA 260..296 CDD:238011 9/35 (26%)
EGF_CA 298..335 CDD:238011 9/39 (23%)
EGF_CA 415..454 CDD:238011 16/38 (42%)
EGF_CA 456..492 CDD:238011 12/37 (32%)
EGF_CA 495..530 CDD:238011 12/37 (32%)
EGF_CA 532..567 CDD:238011 18/34 (53%)
EGF_CA 570..604 CDD:238011 14/33 (42%)
EGF_CA 608..643 CDD:238011 20/34 (59%)
EGF_CA 645..679 CDD:238011 20/34 (59%)
EGF_CA 682..717 CDD:238011 15/36 (42%)
EGF_CA 757..793 CDD:238011 13/36 (36%)
EGF_CA 795..831 CDD:238011 12/35 (34%)
EGF_CA 873..909 CDD:238011 11/35 (31%)
EGF_CA 911..947 CDD:238011 17/39 (44%)
EGF_CA 949..985 CDD:238011 11/77 (14%)
EGF_CA 987..1022 CDD:238011 10/72 (14%)
EGF_CA 1025..1061 CDD:238011 16/77 (21%)
EGF_CA 1117..1147 CDD:238011 5/29 (17%)
EGF_CA 1151..1185 CDD:238011 11/114 (10%)
EGF_CA 1188..1223 CDD:238011 8/36 (22%)
EGF_CA 1225..1262 CDD:238011 12/74 (16%)
EGF_CA 1264..1302 CDD:238011 10/45 (22%)
EGF_CA <1312..1343 CDD:238011 9/38 (24%)
Notch 1423..1456 CDD:459658 5/36 (14%)
Negative regulatory region (NRR). /evidence=ECO:0000250 1425..1677 33/165 (20%)
Notch 1463..1497 CDD:459658 11/48 (23%)
Notch 1501..1534 CDD:459658 5/32 (16%)
NOD 1539..1594 CDD:462014 8/23 (35%)
NODP 1618..1673 CDD:462229
JMTM_Notch2 1659..1740 CDD:411986
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1751..1788
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1794..1813
ANK repeat 1827..1874 CDD:293786
ANK 1 1827..1871
ANKYR <1851..2066 CDD:440430
ANK 2 1876..1905
ANK repeat 1877..1907 CDD:293786
ANK 3 1909..1939
ANK repeat 1943..1974 CDD:293786
ANK 4 1943..1972
ANK repeat 1976..2007 CDD:293786
ANK 5 1976..2005
ANK repeat 2009..2040 CDD:293786
ANK 6 2009..2038
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2097..2116
Atrophin-1 2105..>2422 CDD:460830
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2380..2471
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.