This post is a literate Haskell file. To follow along, cut and paste the entire text of this post into a file called StarSemiring.lhs and load it up in GHCi.
I would like to introduce you to a very general algorithm that I like to call the Gauss-Jordan-Floyd-Warshall-McNaughton-Yamada algorithm.
With this simple algorithm (an algorithm whose implementation is not very much longer than its name) you can solve almost half of the problems you might encounter in computer science.
For instance, this algorithm will let you:
compute transitive closures
compute shortest paths
compute largest capacity paths
compute most reliable paths
compute the regular expression for a finite automaton
solve linear equations
This is all accomplished by working over an abstract mathematical structure called a *-semiring (read as "star-semiring").
We will see many examples of a *-semirings in this post implemented in literate Haskell.
First we define a semiring as a commutative monoid and a monoid satisfying the usual distributive and absorption laws.
Next we add one more method star (read as "asteration") to create a *-semiring.
A derived operation called "plus" can be defined in terms of star by x+ := xx∗.
Actually the two operations are interdefinable because x∗ = 1 + x+.
Most of our examples of a *-semiring will be instances of a more specialized structure called a Kleene algebra.
The most important examples of a *-semirings for us will be matrix *-semirings.
Let us make a quick type for square matrices indexed by i
Our smart constructor matrix builds square matrix indexed over all elements of i from a given function f.
All of our matrices for a given index type are all the same size and thus Matrix i is an applicative functor.
We postpone the obvious Applicative instances to the end as well as some pretty printing code.
All matrices over a *-semirings are themselves *-semirings. The additive operation for matrices simply lifts the additive operation from the underlying *-semiring.
The multiplication operation for a *-semiring is the usual multiplication operation for matrices.
The final operation, star, is the heart of this post.
The asteration operation solves the equation: x∗ = 1 + xx∗.
Expending this recurrence equation out we see that asteration is a solution to the infinite series:
x∗ = 1 + x + x2 + ….
Unfortunately we cannot just sum an infinite series, so instead we implement plus using the famous Gauss-Jordan-Floyd-Warshall-McNaughton-Yamada algorithm.
And we are done. Well almost. We will also note that matricies over a Kleene algebra is itself a Kleene algebra.
Now, let us look at some applications of this algorithm.
Our first example will be computing the reflexive-transitive closure of a directed graph.
We will represent directed graph by its adjacency matrix.
We build our own version of the Booleans called Connection
We will give a small example of a directed graph over five nodes.
We can print the adjacency matrix of this example directed graph.
To compute the reflexive-transitive closure of this directed graph we will turn Connection into our first example of a *-semiring. The additive operation is parallel composition of connections and the multiplicative operation is the sequential composition of connections.
Asteration is the solution to x∗ = 1 + xx∗, but, for Connections, 1 plus anything is 1. Therefore the asteration of anything is one.
We note that this Connection *-semiring is also a Kleene algebra.
Now that Connections are a *-semiring, that means matrices over Connections is a *-semiring.
In particular, Graph Node is also a *-semiring.
We can now compute the reflexive-transitive closure of our example directed graph using the asteration operation.
If you want just the transitive closure we simply compute x+.
This is great for deciding if two nodes are connected or not; however it does not tell us how to get from one node to another.
To denote paths through a directed graph we will need to label the edges.
For directed graphs with at most one edge from a source node to a destination node, we can simply label the edge by the two endpoints.
For directed multigraphs we would need to assign names some other way, but this simple labeling method will do for our example.
Let us look at our labeled adjacency matrix.
We see that each edge in the graph is label with the pair of its source node and its target node.
When there is no edge between a pair of nodes, the adjacency matrix is marked with Nothing.
There could be an infinite number of paths from a given starting node to a target node.
However, we can represent this infinite number of paths in a finite way by using regular expressions.
Regular expressions will be our third example of a *-semiring (the second example was matrices of Connections).
First we make a type for expressions of *-semirings over a given set of variable names.
The type of regular expressions over a given set of variables will be a copy of this type of *-semiring expressions; however regular expressions will have its own instance of equality and other operations, so we give it a new type.
An algorithm for deciding if two regular expressions are equal is beyond the scope of this blog post.
The problem is PSPACE complete, so it can be quite slow.
We will not need to compare regular expressions in this post so we can skip this implementation for now and instead focus on the operations for the *-semiring.
While we could simply implement *-semiring operations directly as the constructors of StarSemiringExpression, instead we will take advantage of this opportunity to implement a few local simplifications: identities of ε and 0, absorption of 0, idempotency of asteration, and the following theorems of regular expressions:
ε + ε = ε
ε + x∗ = x∗
x∗ + ε = x∗
We note that regular expressions are Kleene algebras. In fact, they are the mother of all Kleene algebras, which we will make use of later.
Now we can transform our labeled graph into a matrix of regular expressions.
Then we can compute all the regular expressions for the paths through these edges starting from and ending with any given nodes.
The regular expression we compute is not going to be minimal in general.
In fact, the resulting matrix is a bit too big to fit in this blog so I will leave it for you to try to compute printMatrix . star . reGraph . labelGraph $ exampleGraph yourself.
If you look again at what we have done, you will see that we can view the graph as a finite automaton.
What we have done is computed all the languages accepted by that finite automaton for every pair of possible initial and final states.
Although we only had at most one edge between any pair of nodes in our example, if we instead create a directed multigraph and label it with tokens accepted by that edge, the asteration operation will still compute the regular expression of the language accepted for every pair of initial and final nodes.
In fact, we can label edges with arbitrary regular expressions.
In particular we can express ε-NFAs.
Next, let us look at the question from the title of this post: how to computing the shortest path between two nodes in a graph.
We will take our example graph from the Wikipedia page for Dijkstra's algorithm.
We label the edges with their weights.
To compute the shortest path we will define a tropical *-semiring.
A tropical semiring over non-negative numbers has minimum as the additive operation, and addition as the multiplicative operation.
We add positive infinity as the identity element for minimum.
In a tropical semiring the multiplicative identity, one, is 0 which is the smallest element possible.
This means the asteration operation is the constant one.
We note that the tropical *-semiring is a Kleene algebra.
We can convert the labels in our example matrix to tropical values.
The asteration of this tropical matrix will then tell us the minimum distance between any two nodes in the graph.
Again, this only tells us what the minimum distance between two nodes is. It does not tell us what a minimal path is.
To find what the minimal paths are, we need to annotate our values with ancillary data.
We create a new type called ShortestPath to contain this annotation.
When we compute the additive operation of the shortest path we will take ancillary data corresponding to the smaller tropical value.
In case both tropical values are equal, then we take both pieces of ancillary data together by adding them.
The multiplicative operation is simply lifted from the tropical operation and the multiplicative operation on the ancillary data.
The star operation simply returns one (which is the tropical value 0) in almost all cases.
However, when the tropical value is already one (which is the tropical value 0), we can freely
sequence this value as many times as we want.
Therefore, in this case we return the asteration of the ancillary data.
We note that the resulting structure is a Kleene algebra (I think) when the ancillary data is.
If we annotate our example graph with edge names for regular expressions we can compute the regular expression corresponding to all the shortest paths in a graph.
Having a regular expression of paths is nice, but what if we want to just find one shortest path.
What we can do is compute a lazy list of all shortest paths and take the first one.
We call a lazy list of lists of labels a Language and it is our next example of a *-semiring.
The *-semiring operations on a language are ones for regular languages.
Here we allow the representation of languages to contain repeated strings.
With some care I think should be possible to eliminate repeated strings and perhaps even make the stream sorted.
Again, we note that Languages are a Kleene algebra.
Now we can annotate our ShortestPath with a language instead of a regular expression and extract the first shortest path, if it exists.
This is one very general way of computing shortest paths. By varying the *-semiring, you can compute largest capacity paths, and most reliable paths in the same way.
Recall that I said that regular expressions were the mother of all Kleene algebras.
What I mean by this is that regular expressions can interpret any other Kleene algebra if we have an interpretation for the variables.
This means that instead of computing the asteration of a matrices of different values that we are interested in, we can compute one matrix of regular expressions and then interpret that matrix in any Kleene algebra to get the same result.
This is quite useful if we want to look at many different Kleene algebra interpretations of a single graph because we only have to compute the asteration of a matrix once.
However, the resulting regular expressions are often large, so if you are interested in only one Kleene algebra interpretation, it can be faster to compute it directly.
The last example I want to show is how to solve linear equations.
This will also be our first example of a *-semiring that is not a Kleene algebra.
If we take the one point compactification of the real line, we can turn it into a *-semiring.
The additive and multiplicative operations are the usual addition and multiplication operations.
However we need to take care that 0 absorbs all elements under multiplication, including ∞.
The asteration of a number is the solution to x∗ = 1 + xx∗, which by simple algebra
must
can
be x∗ := (1 - x)-1.
We define 1∗ := ∞, and ∞∗ := ∞ to satisfy the recursion equation.
Matrix asteration solves fixpoints of affine equations. Notice that A∗B = (AA∗ + 1)B, and hence A∗B = AA∗B + B. Therefore A∗B is
the
a
solution to the equation X = AX+ B, which is a very common problem in linear algebra.
Lets compute the asteration of an 2×2 example matrix.
Asteration can be used to solve traditional problems in linear algebra too. Asteration can be used to invert a matrix A because (1 - A)∗ = A-1.
This *-semiring is not a Kleene algebra, so we cannot deduce this asteration from a matrix of regular expressions.
However we can deduce it from a matrix of *-semiring expressions.
*-semiring expressions have the same syntax as regular expressions; however it has fewer laws.
This means we can implement *-semiring expressions as a *-semiring in a similar manner to our implementation of regular expression.
All we have to do is remove some of the simplifications that we had implemented for regular expressions.
Similar to regular expressions, we can interpret any *-semiring expression in any *-semiring.
Now, if we want to, we can save an intermediate matrix of *-semiring expressions and interpret it using real numbers, or any other *-semiring (including any Kleene algebra) later.
That it for this introduction to *-semirings and their applications.
In the next installment we will see a faster method to compute the shortest path/regular expression for just a particular pair of source and target nodes by using eliminants.
References:
Transitive closure and related semiring properties via eliminants by S. Kamal Abdali and B. David Saunders
Algebraic Structures for Transitive Closure by Rafael Penaloza
Below you will find an appendix of functions that completes this module.
Edits:
Removed Enum constraints following a tip by L. Augustsson.
Implemented dovetail to properly sequence two potentially infinite languages.
Performing foldr-map fusion in star function for Matrix. It looks more natural this way anyway.
Defining Matrix to be indexed by Edge.
Adding Eq constraints to add compatibility with modern GHC prelude.