CMPT231
Lecture 10: ch22
Graph Algorithms
Some material from [Sedgewick + Wayne, "Algorithms"](http://algs4.cs.princeton.edu/)
Romans 10:13-15 (NIV)
“Everyone who calls on the name of the Lord will be saved.”
How, then, can they call on
the one they have not believed in?
And how can they believe in
the one of whom they have not heard?
And how can they hear
without someone preaching to them?
And how can anyone preach unless they are sent?
As it is written:
“How beautiful are the feet of those who bring good news!”
Outline for today
- Intro to graph algorithms
- Applications and typical problems
- Edge list, adjacency list, adjacency matrix
- Breadth-first graph traversal
- Depth-first graph traversal
- Parenthesis structure
- Edge classification
- Topological sort
- Finding strongly-connected components
Intro to graph algorithms
- Representing graphs: G = (V, E)
- V: vertices / nodes
- storage: array, linked-list, etc.
- E: edges connecting vertices
- directed or undirected
- storage: edge list, adjacency matrix, etc.
- Some corner cases:
- Self-loop: edge from vertex to itself
- Parallel edges: multiple edges with same start/end
- Complexity of graph algorithms in terms of
Applications of graphs
graph |
vertex |
edge |
air transport |
airport |
flight path |
social |
person |
friendship/relationship |
internet |
computer |
network connection |
finance |
stock/asset |
transaction |
neural net |
neuron |
synapse |
protein net |
protein |
protein-protein interaction |
Map of the internet (coloured by RIR):
[OPTE 2015](http://www.opte.org/the-internet/)

24hrs of flights in/out of Europe:
[422South for NATS](http://422south.com/work/euro-24-air-traffic-visualization-for-nats)

[Connectome](https://en.wikipedia.org/wiki/Connectome)
of 20 adult human brains: map of white-matter connections.
[CC-BY-SA 4.0, Andreashorn](https://commons.wikimedia.org/wiki/File%3AThe_Human_Connectome.png)

[Proteome](https://en.wikipedia.org/wiki/Proteome)
(protein-protein interaction network) of syphilis spirochete, T. pallidum.
[CC-BY 1.0, Häuser et al.](https://commons.wikimedia.org/wiki/File%3AThe_protein_interaction_network_of_Treponema_pallidum.png)

Problems in graph theory
- Path finding: is there a path from u to v?
- Shortest path: find the shortest path from u to v
- Cycle: does the graph have any cycles?
- Euler cycle: traverse each edge exactly once
- Hamilton cycle: touch each vertex exactly once
- Connectivity: are all the vertices connected?
- Bi-connectivity: can you disconnect the graph by removing one vertex?
- Planarity: draw graph in 2D w/o crossing edges?
- Isomorphism: are two graphs equivalent?
Outline for today
- Intro to graph algorithms
- Applications and typical problems
- Edge list, adjacency list, adjacency matrix
- Breadth-first graph traversal
- Depth-first graph traversal
- Parenthesis structure
- Edge classification
- Topological sort
- Finding strongly-connected components
Representing edges
- Edge list: array/list of (u,v) pairs of nodes
- [ (1,2), (1,3), (2,4) ]
- How to find neighbours of a vertex u?
- Adjacency list: indexed by start node
- [ {1: [2, 3]}, {2: [1, 4]}, {3: [1]}, {4: [2]} ]
- How to find the (out)-degree of each vertex?
- Adjacency matrix: boolean |V| x |V| matrix
- A[i,j] = 1 iff (i,j) is an edge:
( (0, 1, 1, 0), (1, 0, 0, 1), (1, 0, 0, 0), (0, 1, 0, 0) )
- What about directed graphs? Weighted graphs?
- In practise, most graphs are sparse: what representation?
Graph traversal: breadth-first
- Traversal: visits each node exactly once
- BFS: overlay a breadth-first tree
- Choose a start (root) node
- Path in tree = shortest path from root
- Only nodes reachable from start node
- BFS tree not necessarily unique
- Graph could have loops:
- Need to track which nodes we’ve seen
- Assign colours to nodes as we traverse graph:
- White: unvisited
- Grey: on border (some unvisited neighbours)
- Black: no unvisited neighbours
- Use FIFO queue to manage grey nodes
BFS algorithm
- In: vertex list, adjacency (linked) list, start node
- Out: modify vertex list, adding parent pointers
def BFS( V, E, start ):
init V all white and NULL parent
start.colour = grey
init FIFO: Q.push( start )
while Q.notempty():
u = Q.pop()
for v in E.adj[ u ]:
if v.colour == white:
v.colour = grey
v.parent = u
Q.push( v )
u.colour = black
Complexity?

BFS properties
- BFS examines nodes in order of distance from source
- Queue first holds all nodes of distance k,
- Then all nodes of distance k+1, etc.
- Levels of BFS tree = nodes of same distance from source
- ⇒ BFS computes shortest paths from source
to all other reachable nodes in time O(|V|+|E|)
Outline for today
- Intro to graph algorithms
- Applications and typical problems
- Edge list, adjacency list, adjacency matrix
- Breadth-first graph traversal
- Depth-first graph traversal
- Parenthesis structure
- Edge classification
- Topological sort
- Finding strongly-connected components
Trémaux maze solving
- Graph representation of a maze:
- Vertex = intersection, edge = passage
- Theseus slaying the Minotaur in the Labyrinth
- Ariadne gave him a tool: ball of string:
- Unwind string as you go
- Track each visited intersection + passage
- Retrace steps when no unvisited passages

2nd c. AD Roman mosaic, Kunsthistorische Museum, Vienna
[(Asaf Braverman, CC-BY-NC-SA 2.0)](https://www.flickr.com/photos/theheartindifferentkeys/4172187101)
Depth-first search
- First explore as deep as we can
- Backtrack to explore other paths
- Recursive algorithm (ball of string = call stack)
- Colouring: white = undiscovered, grey = discovered,
black = finished (visited all neighbours)
- Add timestamps on discover and finish
- Overlay a forest on the graph
- Subtree at a node = all nodes visited between
this node’s discovery and finish
DFS algorithm
def DFS( V, E ):
init V all white and NULL parent
time = 0
for u in V: # why loop over ALL verts?
if u.colour == white:
DFS-Visit( V, E, u )
def DFS-Visit( V, E, u ):
time++
u.discovered = time
u.colour = gray
for v in E.adj[ u ]:
if v.colour == white:
v.parent = u
DFS-Visit( V, E, v )
u.colour = black
time++
u.finished = time


Outline for today
- Intro to graph algorithms
- Applications and typical problems
- Edge list, adjacency list, adjacency matrix
- Breadth-first graph traversal
- Depth-first graph traversal
- Parenthesis structure
- Edge classification
- Topological sort
- Finding strongly-connected components
DFS: parenthesis structure
- Each node’s subtree is visited
between its discovery and finish times
- Print a (._u when we discover node u
- Print a )_u when we finish it
- Output is a valid parenthesisation:
- e.g., (._u (._v (._w )_w )_v (._x (._y )_y )_x )_u (._z )_z
- But not (._u (._v )_u )_v
- The (discover, finish) intervals for any two vertices are
- Either completely disjoint
- Or one contained in the other
DFS: white-path theorem
- The (d, f) interval for v is contained in u
⇔ v is a descendant of u in the DFS
- i.e., u.d < v.d < v.f < u.f
- White-path theorem:
- v is a descendant of u in the DFS ⇔
- When u is discovered, there is
a path u → v all of white vertices

DFS: flood-fill
- Vertex: pixel
- Edge: adjacent pixels of similar colour
- Blob: all pixels connected to given pixel


DFS: edge classification
- All edges in a graph are either
- Tree edges: in the DFS forest
- Back edges: up to ancestor in same DFS tree (incl self-loop)
- Forward edges: down to descendant
- Cross edges: different subtrees or DFS trees
- For directed graphs: acyclic ⇔ no back edges

DFS: preparing for a date (XKCD)

Outline for today
- Intro to graph algorithms
- Applications and typical problems
- Edge list, adjacency list, adjacency matrix
- Breadth-first graph traversal
- Depth-first graph traversal
- Parenthesis structure
- Edge classification
- Topological sort
- Finding strongly-connected components
DFS: topological sort
- Linear ordering of vertices such that:
- for every edge u → v, u comes before v in the sort
- Assumes no cycles! (i.e., DAG: directed acyclic)
- Applications: dependency resolution, compiling files,
task planning / Gantt chart
- Use DFS to sort in decreasing order of finish time
- As each vertex finishes, insert at head of a linked list
- DFS might not be unique, so
topological sort might not be unique
Topological sort: example

Topological sort: proof
- Recall DFS colouring: white = undiscovered
- grey = discovered, black = finished
- Proof of correctness: (u,v) in E => v.f < u.f
- When DFS explores (u,v), what colour is v?
- if gray: then v is an ancestor of u
- So (u,v) is a back edge
- So graph has a loop (disallowed)
- if white: then v becomes a child of u:
- if black: then v is done, but not u yet:
DFS: connected components
- Largest completely-connected set of vertices:
- Every vertex has a path
to every other vertex in the component
- Algorithm:
- Compute DFS to find finishing times
- Transpose the graph: reverse all edges
- Compute DFS on transposed graph
- Start at vertex that finished last in orig DFS
- Each tree in final DFS is a separate component

Connected components
- (a) Original graph:
- DFS trees shaded
- DFS starts at c
- (b) Transpose graph:
- All edges reversed
- DFS trees shaded
- DFS starts at b (last to finish in orig DFS)
- (c) Coalesce vertices into component graph

Problems in graph theory
- Path finding: is there a path from u to v?
- Shortest path: find the shortest path from u to v
- Cycle: does the graph have any cycles?
- Euler cycle: traverse each edge exactly once
- Hamilton cycle: touch each vertex exactly once
- Connectivity: are all the vertices connected?
- Bi-connectivity: can you disconnect the graph by removing one vertex?
- Planarity: draw graph in 2D w/o crossing edges?
- Isomorphism: are two graphs equivalent?
Outline for today
- Intro to graph algorithms
- Applications and typical problems
- Edge list, adjacency list, adjacency matrix
- Breadth-first graph traversal
- Depth-first graph traversal
- Parenthesis structure
- Edge classification
- Topological sort
- Finding strongly-connected components
Online demos
- Breadth-first search:
- Depth-first search:
- U San Fran
(only one tree of the DFS forest)
- VisuAlgo
(edge classification, only one tree)
- Topological sort:
- Connected components:
CMPT231
Lecture 10: ch22
Graph Algorithms
Some material from [Sedgewick + Wayne, "Algorithms"](http://algs4.cs.princeton.edu/)