Floyd–Warshall Algorithm

Overview

The Floyd–Warshall algorithm computes shortest paths between all pairs of vertices in a weighted directed graph (can be undirected by adding symmetric edges). It works even with negative edge weights (but not negative cycles), and it detects negative cycles cleanly. The algorithm is a compact dynamic programming routine with a triple nested loop that considers, for each pair (i, j), whether detouring through an intermediate vertex k yields a shorter path.

Compared to Dijkstra’s Algorithm, Floyd–Warshall is usually preferred when:

you need all pairs anyway,
the graph is dense or modest in size, or
you have negative edges (but no negative cycles).

Its simplicity and small code footprint make it a standard baseline in teaching and practice.

Core Idea

Let dist[i][j] be the best-known distance from i to j. The DP invariant after finishing outer loop value k is:

All paths from i to j that use intermediates only from the set {0,1,...,k} have length dist[i][j].

Initialization allows no intermediates (just direct edges). Each iteration k asks: is it shorter to go i → k → j than the current i → j? If yes, update.

This yields correctness by induction on k. Negative cycles are visible because allowing all vertices as intermediates lets a cycle “shrink” the cost of returning to its own start: dist[v][v] < 0.

Algorithm Steps / Pseudocode

Inputs.

n: number of vertices, labeled 0..n-1.
W: adjacency matrix weights where:
- W[i][i] = 0
- W[i][j] = weight(i→j) if edge exists,
- W[i][j] = +∞ (a large sentinel) otherwise.

Outputs.

dist[i][j]: length of the shortest path from i to j (or +∞ if none).
Optionally next[i][j] to reconstruct paths.

function FLOYD_WARSHALL(W, n):
    dist = W.clone()
    next = matrix(n,n)
    for i in 0..n-1:
        for j in 0..n-1:
            if i != j and W[i][j] < +∞:
                next[i][j] = j
            else:
                next[i][j] = NIL
 
    for k in 0..n-1:
        for i in 0..n-1:
            // small guard: skip rows/cols already +∞ to save work
            if dist[i][k] == +∞: continue
            for j in 0..n-1:
                if dist[k][j] == +∞: continue
                alt = dist[i][k] + dist[k][j]
                if alt < dist[i][j]:
                    dist[i][j] = alt
                    next[i][j] = next[i][k]   // path i→...→k then continue toward j
 
    // Negative cycle detection:
    for v in 0..n-1:
        if dist[v][v] < 0:
            // mark reachability from/to this cycle as -∞ or signal "affected by negative cycle"
            // optional handling; see notes below
    return (dist, next)

Path reconstruction (if next is maintained):

function RECONSTRUCT_PATH(next, i, j):
    if next[i][j] == NIL: return []  // no path
    path = [i]
    while i != j:
        i = next[i][j]
        path.append(i)
        if length(path) > N_LIMIT: error "negative cycle on path?"
    return path

Tip

If your application needs only distances (no paths), you can drop next entirely and keep just dist. If you only need to know whether a pair is affected by a negative cycle, run the triple loop once more: any (i, j) that can be improved again (or is reachable from and to a vertex with dist[v][v] < 0) is tainted.

Example or Trace

Consider n=4, vertices 0..3, with edges (directed) and weights:

0→1: 4, 0→2: 11
1→2: -2, 1→3: 2
2→3: 3
3→0: 1

Initialize dist with direct edges and zeros on the diagonal, +∞ otherwise. Initially, for example, dist[0][3] = +∞.

Iterate k = 0..3:

k=0: Allow paths via vertex 0. You might update dist[3][1] because 3→0→1 yields 1+4=5 if it beats any existing value.
k=1: Allow via 1. dist[0][2] improves: 0→1→2 has 4 + (-2) = 2, better than 11. Then 0→3 improves via 0→1→3 (4+2=6).
k=2: Allow via 2. 0→3 can also go 0→1→2→3 for 4 + (-2) + 3 = 5, one better than 6, so it updates again.
k=3: Allow via 3. Some cycles may enable new improvements; here no negative cycles exist, so dist[v][v] stays 0.

Final dist gives the shortest cost between every pair. Using next, RECONSTRUCT_PATH(0,3) yields [0,1,2,3].

Complexity Analysis

Let n = |V|.

Time: O(n^3) for the triple loop. On dense graphs this competes well; on sparse graphs with many vertices it can be slow compared to running Dijkstra n times (O(n m log n) with a heap).
Space: O(n^2) for dist, plus O(n^2) for next if kept.
Numerical concerns: Distances can become very large in magnitude; choose a sentinel +∞ that cannot be reached by legitimate sums, and guard against overflow if weights are large.

Optimizations or Variants

In-place updates: You can reuse the single dist matrix (no k-indexed layers) as in the pseudocode. It works because each update for k uses only values already restricted to intermediates {0..k}.
Bitset reachability: When weights are unit (or you need just reachability), Warshall’s algorithm with boolean matrices computes the transitive closure in O(n^3) bit operations; with bit-packing (word-level parallelism), it’s fast in practice.
Blocking / cache-friendly loops: Reorder the triple loop to improve cache reuse (for k { for i { for j { ... }}} is standard). For large n, block i, j loops (tiling) to keep working submatrices in cache.
Early exits: If an application only needs some rows/columns (e.g., from a small subset of sources), prefer running Dijkstra/Bellman–Ford from those sources instead of full Floyd–Warshall.
Negative-cycle propagation: After the main phase, a final pass can mark (i, j) as -∞ if there exists a k with dist[k][k] < 0 and both i can reach k and k can reach j. This explicitly signals “no well-defined shortest path.”

Applications

Routing and policy analysis: All-pairs latency/cost maps in networks, road graphs (when small enough), or inter-module call graphs.
Prerequisite/ordering graphs: Detect inconsistencies (negative cycles) when modeling constraints as weights.
Transitive closure / reachability (Warshall): task dependency planning, database query optimization.
Centrality / similarity measures: Pairwise shortest-path distances for small graphs feed into clustering, layout, or centrality scores.

Common Pitfalls or Edge Cases

Warning

Negative cycles. If dist[v][v] < 0 for any v after the algorithm, shortest paths are undefined for pairs that can reach and be reached from that cycle (you can drive cost to −∞). Do not use those distances as finite answers.

Warning

Infinity handling. Always check for +∞ before adding dist[i][k] + dist[k][j] to avoid overflow or accidental wraparound.

Warning

Path reconstruction without next. If you skip the next matrix, you cannot reconstruct paths later without re-running or storing additional parent info.

Warning

Indexing drift. Ensure vertices are consistently indexed 0..n-1. Mixing human labels with indices is a common source of off-by-one bugs.

Implementation Notes or Trade-offs

Choosing representations: Floyd–Warshall assumes an adjacency matrix style dist. For sparse graphs with large n, an adjacency list plus repeated single-source runs is often superior (see Graph Representations).
Undirected graphs: Insert both directions with the same weight; initialize W[i][i]=0.
Edge cases in initialization: Multiple edges? Use the minimum weight. No self-loops? Keep 0 on the diagonal.
Numeric types: Use 64-bit integers or doubles as appropriate; for doubles, beware of rounding if negative cycles nearly cancel out. Consider a large INF = 1e18 for integers and check abs(dist) < INF/2 when reporting.
Space-saving next: Instead of next[i][j]=j, some implementations store predecessors; both are fine if used consistently. The “successor” approach shown above is straightforward for forward path building.

Summary

Floyd–Warshall is a compact dynamic program for all-pairs shortest paths that tolerates negative edges and flags negative cycles. Initialize a dist matrix, run a simple triple loop that tries each vertex k as an intermediate, and optionally maintain a next matrix for path reconstruction. Its O(n^3) time and O(n^2) space make it ideal for small to medium graphs or as a clear, reliable baseline.

Cam's Cyberspace

Recent Notes

Home

Linked Lists — Searching

Selection Sort

Explorer

Floyd–Warshall Algorithm

Overview

Core Idea

Algorithm Steps / Pseudocode

Example or Trace

Complexity Analysis

Optimizations or Variants

Applications

Common Pitfalls or Edge Cases

Implementation Notes or Trade-offs

Summary

See also

Graph View

Table of Contents

Backlinks