This is an article I wrote several years ago. It is part of the “Functional Programming in Haskell” online course. It discusses one of the aspects of functional programming that I like in particular, the fact that the entire language can be build starting from the lambda calculus.
In a functional language, there are only functions
Although it might seem that a language like Haskell has a lot of different objects and constructs, they can all be reduced to functions. We will demonstrate how variables, tuples, lists, conditionals, Booleans and numbers can all be constructed from lambda functions. The article assumes some familiarity with Haskell, but here is a quick introduction.
Haskell: a quick introduction
Haskell is whitespace-sensitive like Python, but has a markedly different syntax. Because everything is a function, there is no keyword to mark a function; because there is only lexical scope, there is no need for any special scope identifiers. Function arguments are separated by spaces; anonymous functions are called lambda functions and have a special syntax to identify them:
-- named function
square x = x*x
-- lambda function bound to a named variable
anon_square = \x -> x*x
Several of the examples use the let ... in ...
construct, which behaves as a lexically scoped block:
let_square x a b c = let
x2 = a*x*x
x1 = b*x
x0 = c
in
x2+x1+x0
x0
, x1
and x2
are in scope only in the expression after the in
keyword.
Haskell is statically typed, and the type of a function or variable is written in a separate annotation, for example:
isEmpty :: [a] -> Bool
isEmpty lst = length lst == 0
The isEmpty
function has a type signature, identified by ::
, that reads “isEmpty
is a function from a list of anything to a Boolean”. Types must be written with an initial capital. The a
is a type variable which can take on any type, as explained in my post on algebraic data types. Most of the time, you don’t need to write type declarations as Haskell works them out automatically. In the rest of the article, I’m not focusing on the types so I have omitted them.
Variables and let
Haskell uses let
expressions to define variables used in a final expression, for example:
let
n = 10
f x = x+1
in
f n
We can rewrite this to use only one variable per let
:
let
n = 10
in
let
f x = x+1
in
f n
Now we rewrite any named functions (f
) as lambda functions:
let
n = 10
in
let
f = \x -> x+1
in
f n
Then we rewrite the let
expressions themselves as lambdas, first the inner let
:
let
n = 10
in
(\f -> f n) (\x -> x+1)
We do this by turning the variable in the let
part of the expression (f
) into a parameter of a lambda function (\f -> ...
). The body of the function is the expression after the in
(f n
). Then we apply this lambda function to the expression bound to the variable (\x -> x+1
).
Then we rewrite outer let
in the same way:
( \n -> ((\f -> f n) ( \x -> x+1 )) ) 10
This expression consists only of lambda expressions, which shows that variables and let
-expressions are just syntactic sugar for lambda expressions.
Tuples
Haskell has tuples, also called record types or product types, ordered collections of expressions of potentially different types:
tp = (1,"2",[3])
The tuple notation is syntactic sugar for a function application:
tp = mkTup 1 "2" [3]
The tuple construction function can again be defined purely using lambdas:
mkTup x y z = \t -> t x y z
What we do here is to use the elements of the tuple as the arguments of a lambda function. So what mkTup
returns is also a lambda function, in other words mkTup
is a higher-order function. Now we rewrite the mkTup
named function as lambda function as well:
mkTup = \x y z -> \t -> t x y z
So our tuples are now also encoded purely as lambda functions.
The same goes for the tuple accessor functions:
fst tp = tp (\x y z -> x)
snd tp = tp (\x y z -> y)
Let’s see what happens here: the argument tp
of fst
is a function: \t -> t x' y' z'
. We now apply this function to another function, \x y z -> x
:
(\t -> t x' y' z') (\x y z -> x)
Applying the function gives:
(\x y z -> x) x' y' z'
And so the result will of course be x'
, which is indeed the first element of the tuple.
Lists
Lists can be defined in terms of the empty lists []
and the cons
operation (:)
.
ls = [1,2,3]
Rewriting this using :
and []
:
ls = 1 : 2 : 3 : []
Or using cons
explicitly:
ls = cons 1 (cons 2 (cons 3 []))
Defining cons
We can define cons
using only lambda functions as
cons = \x xs ->
\c -> c x xs
We’ve used the same approach as for the tuples: cons
returns a lambda function. So we can write a list as:
ls = cons 1 (...)
= \c -> c 1 (...)
We can also define head
and tail
using only lambdas, similar to what we did for fst
and snd
above:
head ls = ls (\x y -> x)
tail ls = ls (\x y -> y)
The empty list
We can define the empty list as follows:
[] = \f -> true
This is a lambda function which always returns true
, regardless of its argument. The definitions for true
and false
are given below under Booleans. With this definition we can check if a list is empty or not:
isEmpty lst = lst (\x xs -> false)
Let’s see how this works. A non-empty list is always defined as:
lst = x:xs
which with our definition of (:)
(i.e. cons
) is:
lst = (\x xs -> \c -> c x xs) x xs
= \c -> c x xs
And therefore:
isEmpty lst
= isEmpty (\c -> c x xs)
= (\c -> c x xs) (\x xs -> false)
= false
isEmpty []
= isEmpty (\f -> true)
= (\f->true) (\x xs -> false)
= true
And so we have a pure-lambda definition of lists, including construction, access and testing for empty.
Recursion on lists
Now that we can test for the empty list we can define recursions on lists such as foldl
, map
etc.:
foldl f acc lst =
if isEmpty lst
then acc
else foldl f (f (head lst) acc) (tail lst)
and
map f lst =
let
map' f lst lst' =
if isEmpty lst
then (reverse lst')
else map' f (tail lst) (head lst: lst')
in
map' f lst []
with
reverse lst = (foldl (\acc elt -> (elt:acc)) [] lst
The definitions of foldl
and map
use an if-then-else expression which is defined below under Conditionals.
#### List concatenation
With foldl
and reverse
it is easy to express list concatenation:
(++) lst1 lst2 =
foldl (\acc elt -> (elt:acc)) lst2 (reverse lst1)
#### The length of a list
To compute the length of a list we need integers, they are defined below. We increment the lent counter for every element of the list consumed by the fold.
length lst = foldl calc_length 0 lst
where
calc_length _ len = inc len
Conditionals
We have used conditionals in the above expressions:
if cond then if_true_exp else if_false_exp
Here cond
is an expression returning either true
or false
, these are defined below.
We can write the if-then-else clause as a pure function:
ifthenelse cond if_true_exp if_false_exp
Booleans
To evaluate the condition we need to define Booleans as lambda functions:
true = \x y -> x
false = \x y -> y
The Boolean is a function selecting the expression corresponding to true or false. With this definition, the if-then-else becomes simply:
ifthenelse cond if_true_exp if_false_exp =
cond if_true_exp if_false_exp
Basic Boolean operations: and
, or
and not
Using ifthenelse
we can define and
, or
and not
:
and x y = ifthenelse x (ifthenelse y true) false
or x y = ifthenelse x true (ifthenelse y true false)
not x = ifthenelse x false true
Boolean equality: xnor
We note that to test equality of Booleans we can use xnor
, and we can of course define xor
in terms of and
, or
and not
:
xor x y = (x or y) and (not x or not y)
and
xnor x y = not (xor x y)
Signed Integers
The common way to define integers in the lambda calculus is as Church numerals. Here we take a different approach, but it is of course equivalent. We define an integer as a list of Booleans, using thermometer code, and with the following definitions:
We define unsigned 0
as a 1-element list containing false
. To get signed integers we simply define the first bit of the list as the sign bit. We define unsigned and signed versions of 0
:
u0 = false:[]
0 = +0 = true:u0
-0 = false:u0
For convenience we define also:
isPos n = head n
isNeg n = not (head n)
isZero n = not (head (tail n))
sign n = head n
Integer equality
The definition of 0
makes the integer equality (==)
easier:
(==) n1 n2 = let
s1 = head n1
s2 = head n2
b1 = head (tail n1)
b2 = head (tail n2)
if (xnor s1 s2) then
if (and (not b1) (not b2))
then true
else
if (and (not b1) (not b2))
then (==) (s1:(tail n1)) (s2:(tail n2))
else false
else false
Negation
We can also easily define negation:
neg n = (not (head n)):(tail n)
Increment and Decrement
For convenience we define also define increment and decrement operations:
inc n = if isPos n
then true:true:(tail n)
else if isZero n
then 1
else false:(tail (tail n))
dec n = if isZero n
then -1
else if isNeg n
then false:true:(tail n) n
else true:(tail (tail n))
Addition and Subtraction
General addition is quite easy:
add n1 n2 = foldl add_if_true n1 n2
where
add_if_true elt n1 = if elt then true:n1 else n1
In the same way, subtraction is also straightforward:
sub n1 n2 = foldl sub_if_true n1 n2
where
sub_if_true elt n1 = of elt then (tail n1) else n1
Multiplication
An easy way to define multiplication is by defining the replicate
and sum
operations:
replicate n m =
let
repl n m lst = if n==0
then lst
else repl (dec n) m (m:lst)
in
repl n m []
sum lst = foldl add 0 lst
Then multiplication simply becomes
mult n m = sum (replicate n m)
In a similar way we can define integer division and modulo.
Floats, Characters and Strings
We note that floating-point numbers and characters use an integer representation, and strings are simply lists of characters. So we don’t need to do any additional work to represent them, and the operations on them are analogous to the ones defined above.
Conclusion
In this way, we have defined a language with variables, (higher-order) functions, conditionals and recursion. We can manipulate lists and tuples of integers, floats, chars and strings. And yet it consists of nothing more than lambda functions!