Overview
Space complexity measures how much additional memory an algorithm needs as a function of the input size n. It complements time complexity and guides choices in environments where memory is scarce, data volume is large, or latency depends on cache and allocation behavior. Understanding space lets you decide when an algorithm is in-place, when recursion is too deep, and when a streaming or external-memory strategy is warranted.
Note
Unless stated otherwise, auxiliary space means memory beyond the input and the required output. Some conventions count the recursion stack as part of auxiliary space; this article does.
Motivation
Memory limits are common:
- Embedded systems and microservices use small heaps or fixed arenas.
- Data pipelines process inputs larger than RAM; paging thrashes performance.
- Concurrency multiplies memory footprints across threads or tasks.
Choosing an algorithm with sublinear or in-place memory can be the difference between success and failure, even if it is not the fastest in raw CPU time.
Definition and Formalism
Let S(n) denote auxiliary space for an input of size n.
- Constant space: S(n) = Θ(1). In-place, aside from the recursion stack if any.
- Logarithmic space: S(n) = Θ(log n). Typical of divide-and-conquer with tail recursion eliminated or shallow stacks.
- Linear space: S(n) = Θ(n). Buffers proportional to the input, e.g., stable merging.
- Sublinear: S(n) = o(n). Streaming/sketching; limited working memory.
- Superlinear: S(n) = ω(n). Rare, but can occur in high-dimensional dynamic programs or memoization of large state spaces.
Total space sometimes refers to input + output + auxiliary; here we focus on auxiliary unless otherwise specified.
What counts toward S(n)
- Data structures allocated by the algorithm (arrays, hash tables, trees).
- Recursion stack frames and iterative stacks/queues used for traversal.
- Temporary buffers for merging, partitioning, or copying.
- Metadata (visited flags, parent arrays, path reconstructions).
What usually does not count:
- The input stored before the algorithm runs.
- The output that the problem requires to exist at the end (e.g., the sorted array replacing the input).
Tip
State your space model explicitly in documentation: “counts recursion stack; in-place means O(1) auxiliary words excluding the input and final output.”
Example or Illustration
Consider three classic cases:
- Binary Search on a sorted array uses Θ(1) space if written iteratively. A recursive version uses Θ(log n) stack frames, one per level of the decision tree (both forms are sketched after this list).
- Merge Sort on arrays is stable but needs a Θ(n) extra buffer for merging. Linked-list merge sort can be made more memory-frugal (relink nodes), but the array version’s buffer dominates S(n).
- Quick Sort partitions in place and recurses on subarrays. With good pivot selection and tail-call elimination, stack depth is Θ(log n) expected, so S(n) = Θ(log n). In the worst case it can reach Θ(n) if not guarded (introspective fallback helps).
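A minimal sketch of the binary-search contrast in Python (function names are illustrative): the iterative form keeps only two indices, Θ(1) auxiliary space, while the recursive form holds one stack frame per halving, Θ(log n).

```python
def binary_search_iterative(a, target):
    """Θ(1) auxiliary space: only two index variables, no extra frames."""
    lo, hi = 0, len(a) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        if a[mid] == target:
            return mid
        elif a[mid] < target:
            lo = mid + 1
        else:
            hi = mid - 1
    return -1


def binary_search_recursive(a, target, lo=0, hi=None):
    """Θ(log n) auxiliary space: one stack frame per halving."""
    if hi is None:
        hi = len(a) - 1
    if lo > hi:
        return -1
    mid = (lo + hi) // 2
    if a[mid] == target:
        return mid
    if a[mid] < target:
        return binary_search_recursive(a, target, mid + 1, hi)
    return binary_search_recursive(a, target, lo, mid - 1)
```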
Properties and Relationships
Auxiliary vs total space
Let Aux(n) be space beyond input and required output. If the algorithm must assemble a separate output (e.g., building a new list while keeping the input intact), then total space can be Θ(n) even if Aux(n) is small. Be precise about which notion is relevant for your system.
Recursion, iteration, and stack depth
- Every recursive call adds a frame with parameters, locals, and return addresses.
- If the recursion tree height is h(n), then stack space is Θ(h(n)).
- Tail recursion can compile to loops in some languages, but do not assume guaranteed tail-call elimination; it is not universal.
Warning
Stack overflows occur when h(n) exceeds runtime stack limits (often 0.5–8 MB by default). Deep recursions on skewed trees or lists are common culprits; prefer iterative forms or explicit stacks.
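As a hedged sketch of the explicit-stack advice, the traversal below assumes a graph given as a dict of adjacency lists: both recursive and iterative DFS use Θ(|V|) space in the worst case, but the explicit stack lives on the heap and is not bound by the call-stack limit.

```python
def dfs_iterative(graph, start):
    """Depth-first traversal with an explicit stack instead of recursion.

    graph: dict mapping a node to a list of neighbors (assumed shape).
    Returns nodes in visitation order; auxiliary space is Θ(|V|).
    """
    visited = set()
    order = []
    stack = [start]                     # heap-allocated, no call-stack overflow
    while stack:
        node = stack.pop()
        if node in visited:
            continue
        visited.add(node)
        order.append(node)
        # Push neighbors; reversed() keeps the order closer to recursive DFS.
        for neighbor in reversed(graph.get(node, [])):
            if neighbor not in visited:
                stack.append(neighbor)
    return order
```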
In-place algorithms
An algorithm is in-place when S(n) = O(1) words beyond the input/output. Subtleties:
- Many “in-place” algorithms still need O(log n) for recursion or a few pointers.
- Some operations require swap or rotate primitives; large-object swaps may incur hidden copies unless handled by reference.
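For concreteness, a minimal in-place sketch: two-pointer reversal touches only a pair of indices, so it uses O(1) auxiliary words, and in Python the swap exchanges references rather than copying element contents.

```python
def reverse_in_place(a):
    """Reverse a list using two pointers; O(1) auxiliary space."""
    i, j = 0, len(a) - 1
    while i < j:
        a[i], a[j] = a[j], a[i]   # swaps references, not deep copies
        i += 1
        j -= 1
    return a
```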
Space–time trade-offs
- Memoization / DP replaces exponential recomputation with a table, trading Θ(n) or more space for time (sketched after this list).
- Hashing accelerates membership queries at the cost of Θ(n) space.
- Compression reduces space but adds CPU time; e.g., succinct structures or run-length encoding of bitsets.
- Bidirectional search halves search depth (time) using two frontiers (space) instead of one.
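A small illustration of the memoization trade-off, using Fibonacci purely as a stand-in problem: the naive recursion recomputes exponentially many subproblems with only Θ(n) stack space, while the cached version spends Θ(n) table space to bring the time down to Θ(n).

```python
from functools import lru_cache

def fib_naive(n):
    """Exponential time, Θ(n) stack space, no table."""
    return n if n < 2 else fib_naive(n - 1) + fib_naive(n - 2)

@lru_cache(maxsize=None)
def fib_memo(n):
    """Θ(n) time after trading Θ(n) cache space for recomputation."""
    return n if n < 2 else fib_memo(n - 1) + fib_memo(n - 2)
```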
Memory hierarchy & locality
Space affects cache behavior:
- Smaller working sets stay in L1/L2, reducing cache misses.
- Streaming algorithms with contiguous scans outperform pointer-chasing structures of the same asymptotic space.
- Blocking/tiling trades modest extra buffers for locality wins.
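A hedged sketch of blocking, using an out-of-place matrix transpose as a stand-in: the tile size is an assumed tuning parameter, and the extra space is only a handful of loop indices, yet visiting the matrix tile by tile keeps the touched elements cache-resident.

```python
def transpose_blocked(src, n, block=64):
    """Out-of-place n x n transpose, processed in block x block tiles.

    src: flat list of length n*n in row-major order (assumed layout).
    The tiling adds no asymptotic space; it only reorders accesses so
    each tile stays within cache while it is being read and written.
    """
    dst = [0] * (n * n)                      # required output, not auxiliary
    for bi in range(0, n, block):
        for bj in range(0, n, block):
            for i in range(bi, min(bi + block, n)):
                for j in range(bj, min(bj + block, n)):
                    dst[j * n + i] = src[i * n + j]
    return dst
```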
Implementation or Practical Context
How to compute S(n) (checklist)
- List all allocations: arrays, maps, trees, buffers, queues.
- Account for recursion/iteration: maximum frame depth, explicit stacks/queues.
- Find peak usage: overlapping lifetimes matter; reuse buffers where lifetimes do not overlap.
- Ignore input and required output unless documenting total space.
- Express the maximum as a function, e.g., S(n) = a·n + b·log n + c.
Tip
For multi-phase algorithms, sketch a timeline of lifetimes to identify buffers that can be reused (arena/stack allocators, object pools).
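As one illustration of that timeline idea, a bottom-up merge sort can allocate a single scratch buffer once and reuse it across every pass, so peak auxiliary space stays at one Θ(n) buffer plus a few indices; this is a sketch under those assumptions, not a tuned implementation.

```python
def merge_sort_bottom_up(a):
    """Bottom-up merge sort reusing one scratch buffer across all passes."""
    n = len(a)
    scratch = [None] * n                  # single Θ(n) buffer, allocated once
    width = 1
    while width < n:
        for lo in range(0, n, 2 * width):
            mid = min(lo + width, n)
            hi = min(lo + 2 * width, n)
            i, j, k = lo, mid, lo
            while i < mid and j < hi:     # merge a[lo:mid] and a[mid:hi]
                if a[i] <= a[j]:
                    scratch[k] = a[i]
                    i += 1
                else:
                    scratch[k] = a[j]
                    j += 1
                k += 1
            scratch[k:hi] = a[i:mid] if i < mid else a[j:hi]
            a[lo:hi] = scratch[lo:hi]     # copy the merged run back in place
        width *= 2
    return a
```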
Common patterns
- Graph traversals: BFS keeps a queue and a visited set, each up to Θ(|V|) (a BFS sketch follows this list). DFS recursion uses Θ(|V|) worst-case stack depth; iterative DFS uses an explicit stack of the same order.
- Prefix sums / scans: Θ(1) extra if the output overwrites the input, or Θ(n) if it is written to a separate array of the same length.
- String algorithms: KMP uses a failure table of size Θ(m), where m is the pattern length; Boyer–Moore stores shift tables sized by the alphabet, Θ(|Σ|).
- Counting and radix: Counting sort keeps a Θ(k) frequency array, where k is the key range; radix sort uses Θ(b) per pass for bucket counts, where b is the radix, in addition to a Θ(n) output buffer.
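A minimal BFS sketch matching the first pattern above, assuming the graph is a dict of adjacency lists; the queue and the visited set are the auxiliary structures, each bounded by Θ(|V|).

```python
from collections import deque

def bfs_order(graph, start):
    """Breadth-first traversal; Θ(|V|) auxiliary space for queue + visited."""
    visited = {start}
    queue = deque([start])
    order = []
    while queue:
        node = queue.popleft()
        order.append(node)
        for neighbor in graph.get(node, []):
            if neighbor not in visited:
                visited.add(neighbor)
                queue.append(neighbor)
    return order
```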
Engineering for low space
- Prefer in-place partitioning (e.g., Hoare/Lomuto variants with care for stability needs).
- Use two-pointer and sliding-window techniques to keep O(1) state in linear passes (see the sliding-window sketch after this list).
- Replace recursion with explicit stacks/queues for control over memory and to avoid stack limits.
- When a full table is too large, switch to streaming sketches (e.g., HyperLogLog for cardinality, Count-Min Sketch for frequencies) at the cost of approximation.
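A sliding-window sketch for the two-pointer bullet above: the maximum sum over any window of k consecutive values is maintained with a single running sum and a best-so-far, O(1) state regardless of input length.

```python
def max_window_sum(values, k):
    """Maximum sum of any k consecutive values; O(1) auxiliary state."""
    if k <= 0 or k > len(values):
        raise ValueError("window size must be in 1..len(values)")
    window = sum(values[:k])                 # current window sum
    best = window
    for i in range(k, len(values)):
        window += values[i] - values[i - k]  # slide: add new, drop old
        best = max(best, window)
    return best
```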
External memory and streaming
When n exceeds RAM, minimize random access and pass count:
- External merge sort: sort chunks in memory, spill, then multiway-merge; uses buffers sized to RAM and sequential disk I/O.
- Single-pass streaming: maintain only o(n) state (reservoir sampling, quantile sketches, online statistics); a reservoir-sampling sketch follows this list.
- Chunked DP: compute tiles and discard intermediates; recompute on demand if necessary.
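A reservoir-sampling sketch (Algorithm R) for the single-pass streaming bullet: it keeps a uniform sample of k items from a stream of unknown length using only O(k) state.

```python
import random

def reservoir_sample(stream, k):
    """Uniform sample of k items from a stream of unknown length; O(k) state."""
    reservoir = []
    for i, item in enumerate(stream):
        if i < k:
            reservoir.append(item)
        else:
            j = random.randint(0, i)     # inclusive bounds
            if j < k:
                reservoir[j] = item
    return reservoir
```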
Language/runtime considerations
- Garbage-collected languages free memory eventually; peak usage depends on GC cycles and retention. Reuse buffers and null references when safe.
- Allocator overhead: many small objects (linked structures) incur per-object headers and fragmentation; a flat array often reduces peak space and improves locality.
- Immutability/persistence (functional data structures) increases sharing but may create versioned nodes; asymptotic space can remain Θ(n) while constants differ.
Common Misunderstandings
Warning
“In-place means zero extra space.” In practice, “in-place” allows a small number of machine words for indices/pivots and sometimes O(log n) stack space.
Warning
“Recursion is free.” Each call consumes a frame; even trivial locals add up on deep inputs. Convert to an iterative form, or rely on tail-call–friendly structure only if your runtime guarantees elimination.
Warning
“Counting space ignores output.” Be clear whether you are reporting auxiliary space or total space. Constructing a distinct output array necessarily incurs Θ(n) additional space.
Warning
“Lower space is always better.” Not if it destroys locality or forces multiple passes. Sometimes a modest buffer reduces cache misses enough to win in time and energy.
Broader Implications
Space bounds shape algorithm choice:
- Stable sorts like merge sort trade Θ(n) space for guarantees and predictable access.
- Hashing vs trees: hashing uses Θ(n) space to get O(1) expected time; balanced trees use similar space for ordered operations at O(log n).
- DP vs recomputation: memoization uses Θ(state space); if memory is tight, consider divide-and-conquer DP with Hirschberg-style space reductions (e.g., LCS from Θ(nm) to Θ(min(n, m)) extra with recomputation); a row-reuse sketch follows this list.
- Space classes (theory): L (log-space) and PSPACE (poly-space) bound what is computable within memory caps; in practice, log-space algorithms often correspond to streaming and on-the-fly traversals.
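A minimal sketch of the row-reuse idea behind such reductions, computing only the LCS length (full Hirschberg recovers the alignment as well): keeping two rolling rows needs Θ(min(n, m)) extra space instead of the full Θ(nm) table.

```python
def lcs_length(a, b):
    """LCS length with two rolling rows: Θ(min(|a|, |b|)) space, not Θ(nm)."""
    if len(b) > len(a):
        a, b = b, a                      # keep the shorter string as columns
    prev = [0] * (len(b) + 1)
    curr = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                curr[j] = prev[j - 1] + 1
            else:
                curr[j] = max(prev[j], curr[j - 1])
        prev, curr = curr, prev          # reuse rows instead of a full table
    return prev[len(b)]
```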
Summary
Space complexity captures the working memory required by algorithms. Distinguish auxiliary from total space, account for recursion stacks, and be explicit about what “in-place” means. Use iterative forms, buffer reuse, and streaming when memory is scarce; accept larger buffers when they materially improve locality and throughput. The best choice balances time, space, and system constraints rather than optimizing one metric in isolation.