Week 14: Notes

This week's lecture was an introduction to the λ-calculus. Here are a few notes reviewing the material from the lecture. For more information, I recommend these books:

Benjamin Pierce, Types and Programming Languages (MIT Press, 2002)
Greg Michaelson, An Introduction to Functional Programming Through Lambda Calculus (Dover, 2011)
J. Roger Hindley, Lambda-Calculus and Combinators: An Introduction (Cambridge, 2008)

introduction

The lambda calculus is a formal system for expressing computation. It was invented by Alonzo Church in the 1930s. It is an important and very widely used formalism in the theory of programming languages.

Other general models of computation include general recursive functions (Gödel, 1933) and Turing machines (Turing, 1936). In the 1930s it was proved that all of these models have the same computational power. We say that a function that can be encoded in any of these models is computable.

In this course we are studying the pure untyped lambda-calculus.

syntax

The syntax of the λ-calculus is defined by this context-free grammar:

expr = var | (expr expr) | (λ var . expr)

In other words, every term is one of the following:

A variable. We assume that an infinite number of these exist.
An application.
An abstraction, i.e. a λ-term.

Here are some sample terms:

(λx.(xy))
((λy.y)(λx.(xy)))
(x(λx.(λx.x)))
(λx.(yz))
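To make the grammar concrete, here is a minimal Python sketch that builds the four sample terms and prints them with full parentheses. The nested-tuple encoding and the helper names var, app, lam and show are assumptions made for illustration; they are not part of the lecture.

```python
# Hypothetical encoding (not from the lecture): each term is a nested tuple.
#   ("var", name) | ("app", function, argument) | ("lam", parameter, body)

def var(name):
    return ("var", name)

def app(fun, arg):
    return ("app", fun, arg)

def lam(param, body):
    return ("lam", param, body)

def show(term):
    """Render a term with full parentheses, exactly as the grammar writes it."""
    tag = term[0]
    if tag == "var":
        return term[1]
    if tag == "app":
        return "(" + show(term[1]) + show(term[2]) + ")"
    return "(λ" + term[1] + "." + show(term[2]) + ")"

x, y, z = var("x"), var("y"), var("z")

# The four sample terms listed above.
samples = [
    lam("x", app(x, y)),
    app(lam("y", y), lam("x", app(x, y))),
    app(x, lam("x", lam("x", x))),
    lam("x", app(y, z)),
]

for term in samples:
    print(show(term))
```

Each of the three constructors corresponds to one alternative of the grammar, so every value built from them is a well-formed term.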

We use lowercase letters to denote variables. In the following, uppercase letters denote λ-terms, e.g. T = (λx.(xx)).

We adopt a couple of precedence rules so we don't need to write so many parentheses:

Application is left-associative: MNP is (MN)P.
Application binds more tightly than abstraction: λx.yz is λx.(yz), not (λx.y)z .

Furthermore, λxyz.M means λx.λy.λz.M.

The notation M ≡ N means that M and N are syntactically equal, i.e. the same term.

scope

In a subterm λx.M, M is the scope of λx.

An occurrence of a variable x is bound if it is in the scope of some λx; otherwise it is free.

If x has at least one free occurrence in M, x is a free variable of M. We write FV(M) to denote the set of all free variables of M. For example, FV(xv(λyz.yv)w) = {x, v, w}.

A closed term or combinator has no free variables.
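The definition of FV translates directly into a recursive function. The sketch below assumes a nested-tuple encoding of terms, ("var", name), ("app", M, N) and ("lam", x, M); this encoding is chosen here for illustration only.

```python
# Assumed encoding (illustrative only):
#   ("var", name) | ("app", function, argument) | ("lam", parameter, body)

def free_vars(term):
    """Return FV(term) as a Python set of variable names."""
    tag = term[0]
    if tag == "var":
        return {term[1]}
    if tag == "app":
        return free_vars(term[1]) | free_vars(term[2])
    # Abstraction: the parameter is bound throughout the body.
    return free_vars(term[2]) - {term[1]}

def is_closed(term):
    """A closed term (combinator) has no free variables."""
    return not free_vars(term)

def v(name):
    return ("var", name)

# xv(λyz.yv)w, written out with left-associative application.
inner = ("lam", "y", ("lam", "z", ("app", v("y"), v("v"))))
example = ("app", ("app", ("app", v("x"), v("v")), inner), v("w"))

print(sorted(free_vars(example)))
print(is_closed(("lam", "x", v("x"))))
```

Note that v occurs twice in the example, once at the top level and once inside the abstraction, and both occurrences are free.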

substitution

The notation M[x := N] denotes substituting N for x in M. This means replacing all free occurrences of x with N, renaming bound variables to avoid clashes. For example:

x(yx)(λx.zx) [x := y] ≡ y(yy)(λx.zx)
(λy.x)[x := w] ≡ λy.w
(λw.x)[x := w] ≡ λz.w (here we must rename w to z!)

Here is a formal recursive definition of substitution:

x[x := N] ≡ N
y[x := N] ≡ y (if y ≢ x)
(PQ)[x := N] ≡ (P[x := N])(Q[x := N])
(λx.P)[x := N] ≡ λx.P
(λy.P)[x := N] ≡ λy.P[x := N] (if y ≢ x and y ∉ FV(N))
(λy.P)[x := N] ≡ λz.P[y := z][x := N] (if y ≢ x and y ∈ FV(N)) (choosing z ∉ FV(NP), z ≢ x)
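The recursive definition can be sketched in Python, one branch per rule. The nested-tuple encoding of terms and the helper names free_vars, fresh and subst are assumptions made for this illustration, as is the scheme for generating fresh names (z0, z1, ...).

```python
from itertools import count

# Assumed encoding (illustrative only):
#   ("var", name) | ("app", function, argument) | ("lam", parameter, body)

def free_vars(term):
    if term[0] == "var":
        return {term[1]}
    if term[0] == "app":
        return free_vars(term[1]) | free_vars(term[2])
    return free_vars(term[2]) - {term[1]}

def fresh(avoid):
    """Pick the first name z0, z1, ... that is not in the set `avoid`."""
    for i in count():
        name = f"z{i}"
        if name not in avoid:
            return name

def subst(term, x, n):
    """Compute term[x := n], renaming bound variables to avoid capture."""
    tag = term[0]
    if tag == "var":
        return n if term[1] == x else term
    if tag == "app":
        return ("app", subst(term[1], x, n), subst(term[2], x, n))
    y, body = term[1], term[2]
    if y == x:
        # x is bound here, so it has no free occurrences to replace.
        return term
    if y not in free_vars(n):
        return ("lam", y, subst(body, x, n))
    # y is free in n: rename y to a fresh z first. z must also differ from x,
    # otherwise the renamed occurrences would be hit by the substitution.
    z = fresh(free_vars(n) | free_vars(body) | {x})
    return ("lam", z, subst(subst(body, y, ("var", z)), x, n))

def v(name):
    return ("var", name)

# (λw.x)[x := w]: the bound w must be renamed before w is substituted for x.
print(subst(("lam", "w", v("x")), "x", v("w")))
```

The three examples from the start of this section can be checked the same way; the only difference from the notes is the particular fresh name chosen.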

α-conversion

Notice that in the last substitution rule above we renamed the variable y to z by replacing (λy.P) with λz.P[y := z].

More generally, if we replace some subterm (λy.P) of M with λz.P[y := z], where z ∉ FV(P), and the result is N, we write

M →_α N ("M α-converts to N")

If we can convert M to N by a series of 0 or more α-conversions, then we write

M ≡_α N ("M and N are α-equivalent")

Some basic facts about α-equivalence:

≡_α is an equivalence relation.
If P ≡_α Q, then FV(P) = FV(Q).

Most authors identify all α-equivalent terms, and I will too. Accordingly, from now on ≡ will actually mean ≡_α .
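One way to test α-equivalence mechanically, without performing any renaming, is to compare two terms while recording for each bound variable how deep its binder sits. The sketch below assumes a nested-tuple encoding of terms; the function name alpha_eq and its extra parameters are illustrative choices, not something from the lecture.

```python
# Assumed encoding (illustrative only):
#   ("var", name) | ("app", function, argument) | ("lam", parameter, body)

def alpha_eq(m, n, env_m=None, env_n=None, depth=0):
    """Return True if m and n differ only in the names of bound variables.

    env_m and env_n map each bound variable name to the depth of the
    binder that currently binds it (inner binders shadow outer ones).
    """
    env_m = env_m or {}
    env_n = env_n or {}
    if m[0] != n[0]:
        return False
    if m[0] == "var":
        a, b = env_m.get(m[1]), env_n.get(n[1])
        if a is None and b is None:
            # Both occurrences are free: the names must agree.
            return m[1] == n[1]
        # Otherwise both must be bound by binders at the same depth.
        return a == b
    if m[0] == "app":
        return (alpha_eq(m[1], n[1], env_m, env_n, depth)
                and alpha_eq(m[2], n[2], env_m, env_n, depth))
    # Two abstractions: bind both parameters at the current depth.
    return alpha_eq(m[2], n[2],
                    {**env_m, m[1]: depth},
                    {**env_n, n[1]: depth},
                    depth + 1)

def v(name):
    return ("var", name)

print(alpha_eq(("lam", "x", v("x")), ("lam", "y", v("y"))))   # λx.x vs λy.y
print(alpha_eq(("lam", "x", v("y")), ("lam", "y", v("y"))))   # λx.y vs λy.y
```

The second pair is not α-equivalent because y is free in λx.y but bound in λy.y, which also illustrates the fact that α-equivalent terms have the same free variables.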

β-reduction

(λx.M)N is called a β-redex and contracts to M[x := N], which is called its contractum.

If P contains a β-redex and we replace it with its contractum, yielding Q, we write

P →_β Q ("P β-reduces in one step to Q")

If we can transform P to Q by a series of 0 or more β-reductions, we write

P →*_β Q ("P β-reduces to Q")

A β-normal form is a term that contains no β-redexes.

If P →*_β Q and Q is in normal form, then we say that Q is a normal form of P.

Does every term have a normal form? No – consider Ω ≡ (λx.x x)(λx.x x), or Ω₁ ≡ (λx.xxy)(λx.xxy).
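A single β-step can be sketched in Python to watch these two terms fail to settle. The code assumes a nested-tuple encoding of terms and always contracts the leftmost outermost redex; the names free_vars, subst and step are illustrative, not from the lecture.

```python
from itertools import count

# Assumed encoding (illustrative only):
#   ("var", name) | ("app", function, argument) | ("lam", parameter, body)

def free_vars(t):
    if t[0] == "var":
        return {t[1]}
    if t[0] == "app":
        return free_vars(t[1]) | free_vars(t[2])
    return free_vars(t[2]) - {t[1]}

def subst(t, x, n):
    """Capture-avoiding substitution t[x := n]."""
    if t[0] == "var":
        return n if t[1] == x else t
    if t[0] == "app":
        return ("app", subst(t[1], x, n), subst(t[2], x, n))
    y, body = t[1], t[2]
    if y == x:
        return t
    if y in free_vars(n):
        avoid = free_vars(n) | free_vars(body) | {x}
        z = next(f"v{i}" for i in count() if f"v{i}" not in avoid)
        y, body = z, subst(body, y, ("var", z))
    return ("lam", y, subst(body, x, n))

def step(t):
    """Contract one β-redex (leftmost outermost); None if t is in β-normal form."""
    if t[0] == "app":
        f, a = t[1], t[2]
        if f[0] == "lam":
            return subst(f[2], f[1], a)      # (λx.M)N contracts to M[x := N]
        r = step(f)
        if r is not None:
            return ("app", r, a)
        r = step(a)
        return None if r is None else ("app", f, r)
    if t[0] == "lam":
        r = step(t[2])
        return None if r is None else ("lam", t[1], r)
    return None

x, y = ("var", "x"), ("var", "y")
w = ("lam", "x", ("app", x, x))
omega = ("app", w, w)                          # Ω
w1 = ("lam", "x", ("app", ("app", x, x), y))
omega1 = ("app", w1, w1)                       # Ω₁

print(step(omega) == omega)                    # Ω reduces to itself
print(step(omega1) == ("app", omega1, y))      # Ω₁ reduces to Ω₁ y, and keeps growing
```

Since Ω contains exactly one redex and contracting it gives Ω back, no sequence of reductions can ever reach a normal form.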

confluence

The Church-Rosser theorem (1935) states: If P →*_β M and P →*_β N, then there exists T such that M →*_β T and N →*_β T. In other words, β-reduction is confluent.

evaluation strategies

When we β-reduce a term, at each step we must choose which β-redex to reduce next. There are various evaluation strategies for doing so.

Normal-order reduction always reduces the leftmost outermost redex. An outermost redex is one that is not contained inside any other.

Call by name is like normal order, but performs no reductions inside abstractions. For example, it will stop at λx.((λy.y)z) rather than reducing it to λx.z. This is similar to expression evaluation in Haskell and other non-strict programming languages.

Call by value also performs no reductions inside abstractions, and applies all possible reductions to a redex's right-hand side (its argument) before reducing the redex. This is similar to expression evaluation in strict programming languages, which include most mainstream languages such as C and Java.

The leftmost reduction theorem (Curry, 1958) states: If P has a normal form Q, then the normal-order reduction of P is finite and ends at Q.
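The difference between the strategies shows up on a term such as (λx.y)Ω, whose argument has no normal form. The following Python sketch implements one step function per strategy as described above. The nested-tuple encoding of terms, the function names, and the step limit (fuel) used to cut off non-terminating reductions are all assumptions made for this illustration.

```python
from itertools import count

# Assumed encoding (illustrative only):
#   ("var", name) | ("app", function, argument) | ("lam", parameter, body)

def free_vars(t):
    if t[0] == "var":
        return {t[1]}
    if t[0] == "app":
        return free_vars(t[1]) | free_vars(t[2])
    return free_vars(t[2]) - {t[1]}

def subst(t, x, n):
    """Capture-avoiding substitution t[x := n]."""
    if t[0] == "var":
        return n if t[1] == x else t
    if t[0] == "app":
        return ("app", subst(t[1], x, n), subst(t[2], x, n))
    y, body = t[1], t[2]
    if y == x:
        return t
    if y in free_vars(n):
        avoid = free_vars(n) | free_vars(body) | {x}
        z = next(f"v{i}" for i in count() if f"v{i}" not in avoid)
        y, body = z, subst(body, y, ("var", z))
    return ("lam", y, subst(body, x, n))

def step_normal(t, under_lambda=True):
    """Leftmost outermost redex first; optionally never look inside abstractions."""
    if t[0] == "app":
        f, a = t[1], t[2]
        if f[0] == "lam":
            return subst(f[2], f[1], a)
        r = step_normal(f, under_lambda)
        if r is not None:
            return ("app", r, a)
        r = step_normal(a, under_lambda)
        return None if r is None else ("app", f, r)
    if t[0] == "lam" and under_lambda:
        r = step_normal(t[2], under_lambda)
        return None if r is None else ("lam", t[1], r)
    return None

def step_name(t):
    """Call by name: like normal order, but no reductions inside abstractions."""
    return step_normal(t, under_lambda=False)

def step_value(t):
    """Call by value: reduce the argument as far as possible before contracting."""
    if t[0] != "app":
        return None                      # never reduce inside an abstraction
    f, a = t[1], t[2]
    r = step_value(f)
    if r is not None:
        return ("app", r, a)
    r = step_value(a)
    if r is not None:
        return ("app", f, r)
    return subst(f[2], f[1], a) if f[0] == "lam" else None

def normalize(term, step, fuel=100):
    """Apply `step` until it returns None; give up (return None) after `fuel` steps."""
    for _ in range(fuel):
        nxt = step(term)
        if nxt is None:
            return term
        term = nxt
    return None

x, y, z = ("var", "x"), ("var", "y"), ("var", "z")
w = ("lam", "x", ("app", x, x))
term = ("app", ("lam", "x", y), ("app", w, w))   # (λx.y)Ω

print(normalize(term, step_normal))   # reaches the normal form y
print(normalize(term, step_value))    # None: call by value keeps reducing Ω
```

Normal order discards the argument Ω without ever evaluating it, which is what the leftmost reduction theorem guarantees whenever a normal form exists. Call by value insists on evaluating Ω first and therefore never finishes.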

Church booleans

We can encode a variety of data types in the λ-calculus.

To begin with, the Church booleans are defined as follows:

true = λx.λy.x
false = λx.λy.y

Now we can define

if = λb.λx.λy.bxy
and = λb.λc. if b c false
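These definitions can be tried out directly with Python's lambda expressions. In the sketch below, if_ and and_ carry a trailing underscore because if and and are reserved words in Python, and to_bool is a helper added here (not part of the encoding) to convert a Church boolean into a native one.

```python
# Church booleans as curried Python functions.
true = lambda x: lambda y: x
false = lambda x: lambda y: y

# if_ and and_ mirror the definitions above.
if_ = lambda b: lambda x: lambda y: b(x)(y)
and_ = lambda b: lambda c: if_(b)(c)(false)

def to_bool(b):
    """Helper (an addition for inspection): turn a Church boolean into a Python bool."""
    return b(True)(False)

print(to_bool(and_(true)(true)))
print(to_bool(and_(true)(false)))
print(if_(false)("then branch")("else branch"))
```

Python is a strict language, so both branches passed to if_ are evaluated before one is selected. That is harmless here, but it means if_ cannot be used to guard a non-terminating computation.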

pairs

pair = λx.λy.λb.if b x y
fst = λp. p true
snd = λp.p false
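As a quick check, the pair encoding can be transcribed into Python lambdas. The Church booleans are repeated so that the snippet stands alone, and if_ is spelled with an underscore because if is a reserved word in Python.

```python
# Church booleans, needed by the pair encoding.
true = lambda x: lambda y: x
false = lambda x: lambda y: y
if_ = lambda b: lambda x: lambda y: b(x)(y)

# A pair stores x and y and waits for a boolean that selects one of them.
pair = lambda x: lambda y: lambda b: if_(b)(x)(y)
fst = lambda p: p(true)
snd = lambda p: p(false)

p = pair("left")("right")
print(fst(p))
print(snd(p))
```

Here ordinary Python strings stand in for the components; in the pure λ-calculus the components would themselves be λ-terms.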

Church numerals

0 = λf.λx.x
1 = λf.λx.fx
2 = λf.λx.f(fx)
succ = λn.λf.λx. f (n f x)
plus = λm.λn.λf.λx. m f (n f x)
times = λm.λn.m (plus n) 0
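The numerals and arithmetic operations can be checked with Python lambdas as well. The helper to_int is an addition for inspection only: it converts a Church numeral to a native integer by applying it to the increment function and 0.

```python
# Church numerals: the numeral n applies f to x exactly n times.
zero = lambda f: lambda x: x
one = lambda f: lambda x: f(x)
two = lambda f: lambda x: f(f(x))

succ = lambda n: lambda f: lambda x: f(n(f)(x))
plus = lambda m: lambda n: lambda f: lambda x: m(f)(n(f)(x))
times = lambda m: lambda n: m(plus(n))(zero)

def to_int(n):
    """Helper (an addition for inspection): count how many times n applies its function."""
    return n(lambda k: k + 1)(0)

three = succ(two)
print(to_int(plus(two)(three)))
print(to_int(times(two)(three)))
```

The definition of times reads naturally in this form: starting from zero, add n a total of m times.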

recursion

It is possible to go further: using a fixed-point combinator we can define recursive functions and, ultimately, encode any computable function. This is, however, a subject for a more advanced class.
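Without going into the theory, a brief sketch can at least show what such a combinator looks like. The Python code below uses the Z combinator, a variant of the fixed-point combinator that works under strict evaluation; the choice of Z and the factorial example are assumptions made for this illustration, and native Python integers are used instead of Church numerals to keep it short.

```python
# Z combinator: Z(f) behaves like a fixed point of f, i.e. Z(f)(v) == f(Z(f))(v).
# The inner "lambda v: x(x)(v)" delays the self-application so that a strict
# language such as Python does not loop forever while building the function.
Z = lambda f: (lambda x: f(lambda v: x(x)(v)))(lambda x: f(lambda v: x(x)(v)))

# A factorial written without referring to its own name: the recursive call
# goes through the parameter `self`, which Z ties back to the function itself.
fact = Z(lambda self: lambda n: 1 if n == 0 else n * self(n - 1))

print(fact(5))
```

The function passed to Z never mentions fact; the recursion arises purely from self-application inside Z.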