Message Boards

WOLFRAM COMMUNITY

14065 Views

6 Replies

48 Total Likes

View groups...

Follow this post

Share this post:

GROUPS:

Scoping constructs in Wolfram Language

Leonid Shifrin

Leonid Shifrin, Wolfram Research

Posted 7 years ago

NOTE: Please see the original version of this post and related discussion HERE. Cross-posted here per suggestion of Vitaliy Kaurov. You will find a lot of information in this answer. I will add a few personal notes. Module Use `Module` when you want to localize variables inside your function's body, and those variables will potentially acquire and/or change their values during the computation. Basic use For example: f[x_]:=Module[{y=x^2},y=y+x;{x,y}] Here, a local mutable variable (symbol) `y` is local to the `Module`, and is, indeed, a symbol with a unique name. This is the closest you have in Mathematica to, say, local variables in C. Advanced uses `Module` also has advanced uses. One of them is to create closures - functions with a persistent state. My third post in this thread illustrates many cases of that and has further references. One example I will steal from there: the following function will produce the next Fibonacci number on demand, and yet it will be as fast as the iterative loop implementation for generation of consecutive Fibonacci numbers (since Module is invoked only once, when the function is defined): Module[{prev, prevprev, this}, reset[] := (prev = 1; prevprev = 1); reset[]; nextFib[] := (this = prev + prevprev; prevprev = prev; prev = this) ]; reset[]; Table[nextFib[], {1000}]; // Timing (* ---> {0.01, Null} ) One problem with persistence created with `Module`-variables is that one should not generally serialize such state (definitions), for example by saving the state via `Save` or `DumpSave`. This is because, the uniqueness of names for `Module`-generated symbols is guaranteed only within a single Mathematica session. `Module` also allows one to create local functions, which `With` does not (except pure functions). This is a very powerful capability. It is particularly useful for writing recursive functions, but not only. In the link mentioned above, there were examples of this. One problem with local functions created by `Module` is that these symbols won't be automatically garbage-collected when `Module` finishes (if they have `DownValues`, `SubValues` or `UpValues`. `OwnValues` are fine), and so may lead to memory leaks. To avoid that, one can `Clear` these symbols inside `Module` before returning the result. With Use `With` to define local constants, which can not be changed inside the body of your function. Basic use For example, f[x_,y_]:=With[{sum = x+y},{sum x, sum y}] It is instructive to trace the execution of `f`. You will notice that `sum` gets replaced by its value very early on, before the body starts evaluating. This is quite unlike `Module`, where variable entries get replaced by their values in the process of evaluation, just as it would normally happen were the variables global. Advanced uses On an advanced level, `With` can be used to inject some evaluated code deep into some expression which is otherwise unevaluated: With[{x=5},Hold[Hold[x^2]]] ( Hold[Hold[5^2]] ) and is thus an important meta-programming tool. There are lots of uses for this feature, in particular one can use this to inject code into `Compile` at run-time right before compilation. This can extend the capabilities / flexibility of `Compile` quite a bit. One example can be found in my answer to this question. The semantics of `With` is similar to that of rule substitutions, but an important difference is that `With` cares about inner scoping constructs (during variable name collisions), while rules don't. Both behaviors can be useful in different situations. Module vs With Both of these are lexical* scoping constructs, which means that they bind their variables to lexical their occurrences in the code. Technically, the major difference between them is that you can not change the values of constants initialized in `With`, in the body of `With`, while you can change values of `Module` variables inside the body. On a deeper level, this is because `With` does not generate any new symbols. It does all the replacements before the body evaluates, and by that time no "constant symbols" are at all present, all of them replaced with their values. `Module`, OTOH, does generate temporary symbols (which are normal symbols with an attribute `Temporary`), which can store a mutable state. Stylistically, it is better to use `With` if you know that your variables are in fact constants, i.e. they won't change during the code execution. Since `With` does not create extra (mutable) state, the code is cleaner. Also, you have more chances to catch an occasional erroneous attempt in the code to modify such a constant. Performance-wise, `With` tends to be faster than `Module`, because it does not have to create new variables and then destroy them. This however usually only shows up for very light-weight functions. I would not base my preference of one over another on performance boosts. Block Basic use `Block` localizes the value of the variable. In this example, `a` does not refer to `i` literally inside `Block`, but still uses the value set by `Block`. a:=i Block[{i=2},a] {a,i} `Block` therefore affects the evaluation stack, not just the literal occurrences of a symbol inside the code of its body. Its effects are much less local than those of lexical scoping constructs, which makes it much harder to debug programs which use `Block` extensively. It is not much different from using global variables, except that `Block`guarantees that their values will be restored to their previous values once the execution exits `Block` (which is often a big deal). Even so, this non-transparent and non-local manipulation of the variable values is one reason to avoid using `Block` where `With` and / or `Module` can be used. But there are more (see below). In practice, my advice would be to avoid using `Block` unless you know quite well why you need it. It is more error-prone to use it for variable localization than `With` or `Module`, because it does not prevent variable name collisions, and those will be quite hard to debug. One of the reasons people suggest to use `Block` is that they claim it is faster. While it is true, my opinion is that the speed advantage is minimal while the risk is high. I elaborated on this point here, where at the bottom there is also an idiom which allows one to have the best of both worlds. In addition to these reasons, as noted by @Albert Retey, using `Block` with the `Dynamic` - related functionality may lead to nasty surprises, and errors resulting from that may also be quite non-local and hard to find. One valid use of `Block` is to temporarily redefine some global system settings / variables. One of the most common such use cases is when we want to temporarily change the value of $RecursionLimit or $IterationLimit variables. Note however that while using Block[{$IterationLimit = Infinity}, ...] is generally okay, using Block[{$RecursionLimit = Infinity}, ...] is not, since the stack space is limited and if it gets exhausted, the kernel will crash. A detailed discussion of this topic and how to make functions tail-recursive in Mathematica, can be found e.g. in my answer to this question. It is quite interesting that the same ability of `Block` can be used to significantly extend the control the user has over namespaces/symbol encapsulation. For example, if you want to load a package, but not add its context to the `$ContextPath` (may be, to avoid shadowing problems), all you have to do is Block[{$ContextPath}, Needs[Your-package]] As another example, some package you want to load modifies some other function (say, System`SomeFunction), and you want to prevent that without changing the code of the package. Then, you use something like Block[{SomeFunction}, Needs[That-package]] which ensures that all those modifications did not affect actual definitions for `SomeFunction` - see this answer for an example of this. Advanced uses `Block` is a very powerful metaprogramming device, because you can make every symbol (including system functions) temporarily "forget" what it is (its definitions and other global properties), and this may allow one to change the order of evaluation of an expression involving that symbol(s) in non-trivial ways, which may be hard to achieve by other means of evaluation control (this won't work on `Locked` symbols). There are many examples of this at work, one which comes to mind now is the `LetL` macro from my answer to this question. Another more advanced use of `Block` is to ensure that all used variables would be restored to their initial values, even in the case of Abort or exception happening somewhere inside the body of `Block`. In other words, it can be used to ensure that the system will not find itself in an illegal state in the case of sudden failure. If you wrap your critical (global) variables in `Block`, it will guarantee you this. A related use of `Block` is when we want to be sure that some symbols will be cleared at the end. This question and answers there represent good examples of using `Block` for this purpose. Variable name conflicts In nested scoping constructs, it may happen that they define variables with the same names. Such conflicts are typically resolved in favor of the inner scoping construct. The documentation contains more details. Block vs Module/With So, `Block` implements dynamic scoping, meaning that it binds variables in time rather than in space. One can say that a variable localized by `Block` will have its value during the time this `Block` executes (unless further redefined inside of it, of course). I tried to outline the differences between `Block` and `With`/`Module` (dynamic vs lexical scoping) in this answer. Some conclusions For most common purposes of variable localization, use `Module` For local constants, use `With` Do not ordinarily use `Block` for introducing local variables All of the scoping constructs under discussion have advanced uses. For `Module` this is mostly creating and encapsulating non-trivial state (persistent or not). For `With`, this is mostly injecting inside unevaluated expressions. For `Block`, there are several advanced uses, but all of them are, well, advanced. I'd be worried if I found myself using `Block` a lot, but there are cases when it is indispensable.

POSTED BY: Leonid Shifrin

6 Replies

Sort By:

Patrick Scheibe

Patrick Scheibe, University Leipzig

Posted 7 years ago

With vs. Block example One basic example that even new user often struggle is to insert values of parameters into an analytic solution. Let's say we have something like this solution = DSolveValue[{y'[x] + y[x] == a Sin[x], y[0] == b}, y[x], x] (* Out[1]= -(1/2) E^-x (-a - 2 b + a E^x Cos[x] - a E^x Sin[x]) ) If you now want to insert values for `a` and `b`, the usual way is to use a replacement rule `{a->aValue, b->bValue}`. Nevertheless, users might try to insert values using With[{a = 1, b = 1}, solution ] ( Out[6]= -(1/2) E^-x (-a - 2 b + a E^x Cos[x] - a E^x Sin[x]) ) which fails. As Leonid already wrote `With` "does all the replacements before the body evaluates" and this makes the approach fail. `Module` cannot be used as well, because it uses lexical scoping that introduces new variable names that only look like normal `a` and `b`. `Block`, however, can be used Block[{a = 1, b = 1}, solution ] ( Out[7]= -(1/2) E^-x (-3 + E^x Cos[x] - E^x Sin[x]) *) Although in general, using replacement rules is the better alternative, for some cases this gives a good alternative.

POSTED BY: Patrick Scheibe

Daniel Lichtblau

Daniel Lichtblau, Wolfram Research

Posted 7 years ago

Could also do: `With[{a = 1, b = 1},Evaluate@solution]`. The point of `Evaluate` here is to get past the `HoldAll` attribute of `With` for purposes of injecting those values.

POSTED BY: Daniel Lichtblau

Anonymous User

Posted 7 years ago

For differentiating between With, Module, and Block, I find this example helpful (from the docs for With, "Examples", "Properties and Relations"): In[1]:= {Block[{x = 5}, Hold[x]], With[{x = 5}, Hold[x]], Module[{x = 5}, Hold[x]]} Out[1]= {Hold[x], Hold[5], Hold[x$119582]} In[2]:= ReleaseHold[%] Out[2]= {x, 5, 5}

POSTED BY: Anonymous User

Leonid Shifrin

Leonid Shifrin, Wolfram Research

Posted 7 years ago

It's a useful example, but to fully understand it, one needs to also understand the non-standard evaluation process and how garbage collection works, so it is by no means trivial. I would say that it illustrates some advanced uses of these constructs rather than really differentiates between them. I would rather use an example like this to illustrate a basic difference: ClearAll[i, a]; i = 1; a := i^2; Module[{i = 2}, i = 3; a] Block[{i = 2}, i = 3; a] With[{i = 2}, i = 3; a] (* 1 9 During evaluation of In[12]:= Set::setraw: Cannot assign to raw object 2. 1 *)

POSTED BY: Leonid Shifrin

Sander Huisman

Sander Huisman, University of Twente

Posted 7 years ago

Thanks for sharing! very informational. I mostly use With/Module, Block very rarely. For the case of some (sub)routine, I found one generally uses Module over With because a routine generally has some temporary variable that needs to be stored/manipulated, so then one can only use Module (not With as the variables can not be manipulated).

POSTED BY: Sander Huisman

Leonid Shifrin

Leonid Shifrin, Wolfram Research

Posted 7 years ago

Glad you found it useful. Re: `Block` - there are cases where it is very useful, here is one relevant M SE discussion. One difference between techniques based on `Block` vs those based on `Module` or `With` is that, while one can emulate `Module` or `With` reasonably well with the top-level code, one can't do the same with `Block` as reliably. Re: `Module` vs `With`: I prefer to use `With` when possible, because immutability of its variables allows to catch certain bugs, and in general it is best to avoid side effects when possible. But there is a tradeoff between being side effect - free and keeping the code readable / flexible, so I personally sometimes use `Module` for readability, where nested `With` could be used instead. An alternative would be to implement one's own scoping constructs, such as a nested `With` - which particular case was discussed e.g. here and then here. There is a lot more to scoping in Mathematica / WL, than is covered in my answer above. For example, there are many subtleties related to how lexical scoping is emulated with variable renaming mechanism. There have been many discussions of that, in particular I can refer to this and this, and references / links therein. Here also belongs the topic of how variable conflict resolution is performed in nested lexical scoping constructs - it has a number of subtleties (see e.g. this answer and links therein, as well as the documentation for scoping constructs). Another topic not covered above is the use of Internal`InheritedBlock, which is an extremely useful scoping construct in some cases, and some other less known scoping constructs. There are also topics such as safe resource management and dynamic environments (one example can be found in this answer), which logically belong here too. Yet another topic is garbage collection, which is mostly relevant for `Module` (since it creates new temporary symbols). Some of the issues with this are discussed in the documentation, but there are some subtleties in how `Module` - generated symbols are garbage-collected, and in some cases very non-obvious memory leaks may happen. Some details on this behavior can be found in this answer, and more detailed discussion (in Russian, but one can google-translate) can be found here. The large-scale encapsulation constructs (contexts and packages) can also be viewed as scoping constructs, since they also are used to encapsulate / localize symbols and limit their visibility. Since this is a large topic in its own right, I will only mention here that there are lesser known subtleties in the interaction of those with local variables in usual scoping constructs (`Module`, `With`, `Function`, `RuleDelayed`). In this context, I would refer to this and this discussions for further details.

POSTED BY: Leonid Shifrin

Reply to this discussion

Reply Preview

Attachments

Remove Add a file to this post

Follow this discussion

or Discard

Group Abstract

Feedback

Scoping constructs in Wolfram Language

Module

Basic use

Advanced uses

With

Basic use

Advanced uses

Module vs With

Block

Basic use

Advanced uses

Variable name conflicts

Block vs Module/With

Some conclusions

With vs. Block example