We now know enough Prolog to understand many of the standard library predicates listed on our quick reference page.
On that page, you'll see that each argument of each predicate is annotated with a mode indicator, which is either + or ?. These indicators are not part of the Prolog language, but are a standard convention in Prolog documentation. Any argument with a ? can be used either as input or output to the predicate. However, an argument marked with + can only be used as input.
To put it differently, if all arguments of a predicate are marked with ?, then the predicate is pure: it will work in all directions. Many standard library predicates are impure. For example, the predicate msort(+L, ?M) will sort a list:
?- msort([horse, cow, dog, bird], L). L = [bird, cow, dog, horse].
However it won't work in the other direction:
?- msort(L, [horse, cow, dog, bird]). ERROR: Arguments are not sufficiently instantiated
The Prolog library contains impure predicates for practical reasons. Some predicates might be difficult or inefficient to implement in a pure way, and some operations (such as I/O) are inherently impure.
Also note that old-style arithmetic operators such
as "is
", "<
" and ">
"
are not pure since they don't work multidirectionally. By contrast,
integer constraints such as #=
, #<
and
#>
are pure.
Any program written only using pure predicates has some nice properties:
Any predicate can run in multiple directions (though it may possibly fail to terminate).
The language is sound, i.e. all results that Prolog returns will be logically correct.
Any query that terminates will always return results that are logically complete.
Reordering goals may affect termination, but will never change the results of a query.
These are nice properties to have, since they allow you to reason about a program declaratively (by considering its logical meaning) rather than procedurally (by considering the series of operations that it will perform). For this reason, I recommend that you write pure code whenever possible.
Prolog supports floating-point numbers:
?- X = 14.3. X = 14.3.
Notice that the = operator will never consider integers and floating point numbers to be equal, since they are structurally different:
?- 1 = 1.0. false.
The old-style 'is' operator and the comparison operators <, =<, >, and >= will work with floats:
?- X is 4.5 + 2.2. X = 6.7. ?- 4.8 < 10.0. true.
The '/' operator performs floating-point division:
?- X is 9.0 / 2.0. X = 4.5.
The abs()
, min()
and max()
functions that we saw with integer arithmetic will also work on
floating-point numbers. Furthermore, in floating-point expressions we
may use the constants e
and pi
, and the
trigonometric functions sin()
, cos()
, and
tan()
:
?- X is cos(pi). X = -1.0. ?- X is e^2. X = 7.3890560989306495.
Because is
and the comparison
operators such as '<' and '=<' will only work in one direction,
usually we will want to use new-style floating-point expressions,
which are based on real constraints. To enable these, add this
line to your init.pl file as described above:
:- use_module(library(clpr)).
New-style floating-point expressions use a completely different syntax from integer expressions. Instead of using operators such as #= and #<, we write floating-point expressions using ordinary operators such as '=', '<', '=<', but written inside braces. For example:
?- { X = 2 + 3 }. X = 5.0 ?- { 2 + 3 = X }. X = 5.0 ?- { X + 2 = 5 }. X = 3.0 ?- { X + 1 = X }. false. ?- { X * 2 = 10 }. X = 5.0 ?- { X * 2 = 9 }. X = 4.5 ?- { 2^X = 128 }. X = 7.0 ?- { 2^X = 100 }. X = 6.643856189774725 ?- { X^2 = 16 }. X = 4.0 ; X = -4.0
Notice that the last query above reports two solutions separately, unlike the integer solver which was able to combine them into a single result.
Prolog's real solver can also work with
inequalities, though it doesn't support the range syntax (e.g.
"4..8
") that we saw with integer constraints:
?- { 4 =< X, X =< 6 }. {X>=4.0, X=<6.0}. ?- { 10 < X, X < 30, 20 < X, X < 40 }. {X>20.0, X<30.0}. ?- { X^2 = 16, X > 0 }. X = 4.0
Unlike the integer solver, the real solver can actually solve systems of linear equations:
?- { X + Y + Z = 9, X + Y - Z = -1, X + 2 * Y + 3 * Z = 22 }. X = 1.0, Y = 3.0, Z = 5.0.
However it cannot solve a quadratic equation:
?- { X^2 - X - 2 = 0 }. {-2.0-X+X^2=0.0}.
Let's write a few predicates using floating-point arithmetic:
% X, Y, Z are sides of a right triangle triangle(X, Y, Z) :- { X > 0, Y > 0, Z > 0, X^2 + Y^2 = Z^2 }. % C = celsius, F = fahrenheit temperature(C, F) :- { F = 9 / 5 * C + 32 }. % currency conversion currency(Kc, Eur, USD) :- { 1.13 * Eur = USD, 21.7 * USD = Kc }.
They all work multidirectionally:
?- triangle(5, 12, H). H = 13.0 ?- triangle(5, X, 10). X = 8.660254037844387 ?- temperature(10, F). F = 50.0 ?- temperature(C, 50). C = 10.0 ?- currency(1000, Eur, USD). Eur = 40.78137106969537, USD = 46.082949308755765 ?- currency(Kc, Eur, 1000). Kc = 21700.0, Eur = 884.9557522123895
Last week we saw how to write recursive predicates on lists. Let's explore this topic a bit more.
The library predicate select(X, L, M)
is true if we can remove X anywhere in L to make M. Equivalently,
it's true if we can insert X anywhere in M to make L.
One possible implementation of 'select' is recursive:
select(X, [X | L], L). select(X, [Y | L], [Y | M]) :- select(X, L, M).
Or we may write 'select' using 'append':
select(X, L, M) :- append(A, [X | B], L), append(A, B, M).
Unfortunately this version of the predicate fails to terminate for some queries:
?- select(a, L, [b, c, d]). L = [a,b,c,d] ; L = [b,a,c,d] ; L = [b,c,a,d] ; L = [b,c,d,a] ; <hang>
The problem is that the first goal 'append(A, [X | B], L)
'
may produce an infinite number of results:
?- X = a, append(A, [X | B], L). X = a, A = [], L = [a|B] ; X = a, A = [_A], L = [_A, a|B] ; X = a, A = [_A, _B], L = [_A, _B, a|B] ...
If the first goal produces an infinite number of results, then Prolog's depth-first search can never terminate (at least in the pure core language that we have studied so far).
To help Prolog's search terminate, we need to add
a constraint that limits the search space. Whenever
select(X, L, M)
is true, L must have one more
element than M. Let's add a constraint that says that:
select(X, L, M) :- same_length(L, [_ | M]), append(A, [X | B], L), append(A, B, M).
Now the predicate will terminate in any direction. If the caller
specifies L, then the call to same_length()
will fix the
length of M. If the caller specifies M, then the call to
same_length()
will fix the length of L. In either case,
since the lengths of L and M are known, the following calls to
append()
will produce only a finite number of solutions,
and so the predicate select()
will terminate either as
written above, or even if we swap the two calls to append()
.
Adding a call to same_length()
will
help many other list-based predicates terminate as well. I call this
the "same_length trick".
A list may contain sublists:
?- L = [[a, b], [c, d], [e, f]]. L = [[a, b], [c, d], [e, f]].
In Prolog, two lists are equal if they have the same structure and elements:
?- [[a, b], [c, d], [e, f]] = [[a, b], [c, d], [e, f]]. true.
To put it differently, equality is deep and recursive.
Prolog can perform unification even between nested lists:
?- [[X, Y], [A, B]] = [L, [3, 4]]. A = 3, B = 4, L = [X, Y]. ?- [[X, Y], [A, B]] = [[3, X], [B, A]]. X = Y, Y = 3, A = B.
As an example of a predicate that uses nested lists, let's write a
predicate flatten(LL, L)
that is true if L is the
concatenation of all lists in LL:
flatten([], []). flatten([L | LL], M) :- append(L, M1, M), flatten(LL, M1). ?- flatten([[a, b], [c, d], [e, f]], L). L = [a, b, c, d, e, f]. ?- flatten([[a, b], L, [e, f]], [a, b, c, d, e, f]). L = [c, d]
As usual, placing the recursive call last helps the predicate terminate in both directions.
Actually this predicate is also built into the
standard library – it's called append()
, with two
arguments:
?- append([[a, b], [c, d], [e, f]], L). L = [a, b, c, d, e, f].
We can use Prolog to solve combinatorial recursive problems such as generating subsets, permutations, combinations and so on. In Prolog we will typically write predicates that generate one solution at a time, rather than producing a list of all solutions.
For example, let's write a predicate subset(S,
T)
that succeeds if T is a subset of S, i.e. contains some
elements of S in the same order in which they appear in S. Here is
one possible implementation:
subset([], []). subset([_ | L], M) :- subset(L, M). subset([X | L], [X | M]) :- subset(L, M).
It works:
?- subset([a, b, c], L). L = [] ; L = [c] ; L = [b] ; L = [b, c] ; L = [a] ; L = [a, c] ; L = [a, b] ; L = [a, b, c].
Here's one way to think about this predicate. Suppose that we are
given a set, and want to generate its subsets. If the set is empty,
its only subset is the empty set. Otherwise it must have the form [X
| L]
. Then for any subset, either X is absent or it is
present. The subsets without X are simply the subsets of L. The
subsets with X are the subsets of L, with X prepended to each of
them.
We'll solve more combinatorial problems in the tutorials.
A structure is a kind of term that consists of a functor and zero or more arguments. The functor has the same syntax as an atom. The components are arbitrary terms. Here are some examples:
point(
4
,
5
)
date(
25
,
2
,
2020
)
line(point(
1.0
,
2.5
), point(
3.2
,
4.6
))
box(box(
a
,
b
),
c
)
Two structures are equal only if they have the same functor name and the same arguments. Prolog can perform unification between arbitrary structures:
?- foo(3, 4, X) = foo(Y, 4, 5). X = 5, Y = 3. ?- foo(3, 4, X) = bar(Y, 4, 5). false. ?- foo(3, 4, X) = foo(3, Y). false. ?- a(X) = Y. Y = a(X). ?- Y = a(X), Z = b(Y). Y = a(X), Z = b(a(X)).
We may define predicates that use structures. For example, here is a
predicate that reverses the elements of a pair, where we represent
pairs using a functor pair
:
flip(pair(X, Y), pair(Y, X)).
Let's try it:
?- flip(pair(3, 4), P). P = pair(4, 3). ?- flip(P, pair(4, 3)). P = pair(3, 4). ?- flip(P, Q). P = pair(_A, _B), Q = pair(_B, _A).
Notice that predicates and structures have a similar syntax. Be sure
to understand that in the clause above, flip
is a
predicate and pair
is a functor.
As another example, suppose that we represent
points (X, Y) using a structure point(X, Y)
, and that we
represent a line between two points P and Q using a structure line(P,
Q)
. Now we may write predicates that succeed if a line is
vertical or horizontal:
vertical(line(point(X, _), point(X, _))). horizontal(line(point(_, Y), point(_, Y))).
Let's make some queries:
?- vertical(line(point(2, 4), point(2, 6))). true. ?- vertical(line(point(1, 1), point(1, Y))). true. ?- vertical(line(point(1, 1), point(2, Y))). false. ?- vertical(line(point(2, 3), P)). P = point(2, _).
Suppose the we represent a time of day using a
structure such as time(11, 26, 13)
, representing the
time 11:26:13. We'd like to write a predicate add(T1, N, T2)
,
which says that time T2 is N seconds after T1. Here is an
implementation:
% Convert a time to a number of seconds after midnight. convert(time(H, M, S), N) :- H in 0 .. 23, [M, S] ins 0 .. 59, N #= 3600 * H + 60 * M + S. % Add time T1 plus N seconds to form time T2. add(T1, N, T2) :- convert(T1, N1), convert(T2, N2), N1 + N #= N2.
Remarkably, add()
will work in every
direction:
?- add(time(11, 30, 0), 75, T). T = time(11, 31, 15). ?- add(time(11, 30, 0), N, time(11, 31, 15)). N = 75. ?- add(T, 75, time(11, 31, 15)). T = time(11, 30, 0).
Often we may wish to map keys to values. We have already seen that we may represent such a mapping using a 2-argument predicate:
color(red, 10). color(green, 20). color(blue, 30). color(green, 40).
This is called a temporal representation of the data. "temporal" means related to time. When we query this predicate, we get results one at a time:
?- color(K, V). K = red, V = 10 ; K = green, V = 20 ; K = blue, V = 30 ; K = green, V = 40.
Alternatively, we may want a spatial representation of the
data, i.e. to hold all the keys and values in a single dictionary
data structure. A simple dictionary representation in Prolog is an
association list, which is simply a list of key/value pairs.
For now, let's represent a key/value pair as a structure p(K,
V)
. (We'll see a syntactially nicer possibility in the next
lecture.)
For example, here is a dictionary:
colors(D) :- D = [ p(red, 10), p(green, 20), p(blue, 30), p(orange, 40) ].
Prolog does not have traditional global variables, but we can use a predicate such as this as a workaround. If we want access to this dictionary at any place in our program, we can simply call the predicate to assign it to a variable of our choice.
Let's now write a predicate 'lookup' that will let us look up keys in any dictionary:
lookup(D, K, V) :- member(p(K, V), D).
Now we may look up keys in the dictionary above. As usual, queries work in both directions, so we may even map values back to keys:
?- colors(_D), lookup(_D, blue, X). X = 30 ?- colors(_D), lookup(_D, C, 20). C = green ?- colors(_D), lookup(_D, K, V). K = red, V = 10 ; K = green, V = 20 ; K = blue, V = 30 ; K = orange, V = 40.
Notice that _D
is an anonymous variable since it
begins with an underscore. I recommend that you add this line to your
init.pl file:
:- set_prolog_flag(toplevel_print_anon, false).
This will disable the printing of anonymous
variables in the REPL. So then the queries above won't print
_D
out, which would only clutter the output.
We have seen that we can encode a key/value mapping either temporally (using a 2-argument predicate) or spatially (using a dictionary data structure). Which approach is better? Either is reasonable in Prolog, and the best approach may depend on the problem we are trying to solve. However, note that we can easily translate a spatial to a temporal representation: the last query above does exactly that. On the other hand, we cannot translate a temporal representation to a spatial representation using the subset of Prolog that we know so far.