Intuition
The expected value tells you where a distribution is centered, but not how spread out it is. Variance fills that gap: it measures the average squared distance from the mean. A low variance means outcomes cluster tightly; a high variance means they are dispersed.
Covariance extends this idea to pairs of variables. It answers: when $X$ is above its mean, does $Y$ tend to be above its mean too (positive covariance), below it (negative), or neither (zero)? Covariance is the raw material for correlation, regression, and dimensionality reduction.
Definition
Variance
The variance of a random variable $X$ with mean $\mu = E[X]$ is:

$$\operatorname{Var}(X) = E\big[(X - \mu)^2\big]$$

Expanding the square gives the computational formula, which is often easier to evaluate:

$$\operatorname{Var}(X) = E[X^2] - (E[X])^2$$
The standard deviation $\sigma = \sqrt{\operatorname{Var}(X)}$ has the same units as $X$ and is more interpretable as a measure of spread.
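As a quick check that the two formulas agree, here is a minimal sketch (assuming a fair six-sided die as the illustrative distribution) that evaluates both exactly with `fractions`:

```python
from fractions import Fraction

# Fair six-sided die: each outcome 1..6 has probability 1/6.
outcomes = range(1, 7)
p = Fraction(1, 6)

mean = sum(p * x for x in outcomes)                   # E[X]
var_def = sum(p * (x - mean) ** 2 for x in outcomes)  # E[(X - mu)^2]
ex2 = sum(p * x * x for x in outcomes)                # E[X^2]
var_comp = ex2 - mean ** 2                            # computational formula

print(mean)      # 7/2
print(var_def)   # 35/12
print(var_comp)  # 35/12 -- both formulas give the same value
```

Exact rational arithmetic makes the agreement between the two formulas visible without floating-point noise.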
Covariance
The covariance of $X$ and $Y$ with means $\mu_X$ and $\mu_Y$:

$$\operatorname{Cov}(X, Y) = E\big[(X - \mu_X)(Y - \mu_Y)\big]$$

The computational form:

$$\operatorname{Cov}(X, Y) = E[XY] - E[X]\,E[Y]$$
Note
$\operatorname{Cov}(X, X) = \operatorname{Var}(X)$. Variance is just the covariance of a variable with itself.
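These definitions translate directly into code; a small self-contained sketch on illustrative sample data (population-style averaging, dividing by $n$):

```python
def mean(xs):
    return sum(xs) / len(xs)

def cov(xs, ys):
    # Population covariance: average product of deviations from the means.
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)

def var(xs):
    return cov(xs, xs)  # Cov(X, X) = Var(X)

hours = [1, 2, 3, 4, 5]
score = [52, 55, 61, 68, 74]

print(var(hours))         # 2.0
print(cov(hours, score))  # positive: scores rise with hours
```

Defining `var` in terms of `cov` mirrors the identity in the note above.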
Key Formulas
Properties of variance
$$\operatorname{Var}(X + c) = \operatorname{Var}(X), \qquad \operatorname{Var}(aX) = a^2\,\operatorname{Var}(X)$$

Adding a constant $c$ shifts the distribution but does not change spread. Scaling by $a$ scales variance by $a^2$.
Variance of a sum
In general:

$$\operatorname{Var}(X + Y) = \operatorname{Var}(X) + \operatorname{Var}(Y) + 2\operatorname{Cov}(X, Y)$$

If $X$ and $Y$ are independent, then $\operatorname{Cov}(X, Y) = 0$ and:

$$\operatorname{Var}(X + Y) = \operatorname{Var}(X) + \operatorname{Var}(Y)$$
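This can be verified exactly on two independent fair dice (an illustrative choice), computing the expectations over the full joint distribution:

```python
from fractions import Fraction
from itertools import product

# Two independent fair dice; each joint outcome has probability 1/36.
p = Fraction(1, 36)
joint = list(product(range(1, 7), repeat=2))

def E(f):
    """Expectation of f(x, y) under the joint distribution."""
    return sum(p * f(x, y) for x, y in joint)

mean_sum = E(lambda x, y: x + y)                   # E[X + Y] = 7
var_sum = E(lambda x, y: (x + y - mean_sum) ** 2)  # Var(X + Y)

var_single = Fraction(35, 12)  # variance of one fair die
print(var_sum)                    # 35/6
print(var_sum == 2 * var_single)  # True: variances add under independence
```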
Independence and covariance
If $X$ and $Y$ are independent, then $\operatorname{Cov}(X, Y) = 0$.
Warning
The converse is false: zero covariance does not imply independence. Example: let $X$ be uniform on $\{-1, 0, 1\}$ and $Y = X^2$. Then $\operatorname{Cov}(X, Y) = 0$, but $Y$ is entirely determined by $X$.
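The warning's example can be checked directly; a minimal sketch, assuming $X$ uniform on $\{-1, 0, 1\}$ and $Y = X^2$:

```python
from fractions import Fraction

# X uniform on {-1, 0, 1}; Y = X^2 is fully determined by X.
support = [-1, 0, 1]
p = Fraction(1, 3)

ex = sum(p * x for x in support)             # E[X] = 0
ey = sum(p * x * x for x in support)         # E[Y] = E[X^2] = 2/3
exy = sum(p * x * (x * x) for x in support)  # E[XY] = E[X^3] = 0

cov = exy - ex * ey
print(cov)  # 0 -- yet X and Y are clearly dependent
```

The covariance vanishes because the positive and negative products cancel by symmetry, even though knowing $X$ pins down $Y$ exactly.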
Correlation coefficient
The Pearson correlation normalizes covariance to $[-1, 1]$:

$$\rho_{XY} = \frac{\operatorname{Cov}(X, Y)}{\sigma_X \sigma_Y}$$

$\rho_{XY} = \pm 1$ indicates a perfect linear relationship; $\rho_{XY} = 0$ means no linear association.
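A minimal implementation sketch of this normalization (`pearson` is a hypothetical helper, using population-style averaging); perfectly linear inputs should land at the $\pm 1$ extremes:

```python
from math import sqrt

def pearson(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / n
    sx = sqrt(sum((x - mx) ** 2 for x in xs) / n)  # sigma_X
    sy = sqrt(sum((y - my) ** 2 for y in ys) / n)  # sigma_Y
    return cov / (sx * sy)

xs = [1, 2, 3, 4]
print(pearson(xs, [2 * x + 1 for x in xs]))  # ~1.0: perfect increasing line
print(pearson(xs, [-x for x in xs]))         # ~-1.0: perfect decreasing line
```

Python 3.10+ also ships `statistics.correlation` for the same computation.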
Example
Manufacturing consistency. Two companies produce resistors rated at 100 ohms. Sample measurements:
| Company | Sample mean | Sample variance |
|---|---|---|
| A | 100.2 | 1.4 |
| B | 99.8 | 8.7 |
Both hit the target mean, but Company A’s resistors are far more consistent ($s_A^2 = 1.4$ vs. $s_B^2 = 8.7$). For precision circuits, Company A is the clear choice.
Covariance in practice. Suppose study hours $X$ and exam score $Y$ have $\sigma_X = 2$, $\sigma_Y = 10$, and $\operatorname{Cov}(X, Y) = 12$. The correlation:

$$\rho_{XY} = \frac{12}{2 \cdot 10} = 0.6$$

A moderately strong positive linear association - more study hours correlate with higher scores.
Why It Matters in CS
- PCA and dimensionality reduction. Principal Component Analysis finds directions of maximum variance by computing eigenvectors of the covariance matrix. Features with high covariance are collapsed into single components, reducing dimensionality while preserving information.
- Stability of randomized algorithms. Low variance in a randomized algorithm’s runtime means its performance is predictable. Chebyshev’s inequality bounds tail probabilities using variance: $P(|X - \mu| \geq k\sigma) \leq 1/k^2$.
- Sensor fusion and robotics. Kalman filters propagate covariance matrices to track how uncertainty evolves over time. Sensor measurements with lower variance receive more weight in the fused estimate.
- Portfolio and resource optimization. In distributed systems, covariance between server loads determines whether load-balancing reduces total variance or not - negatively correlated loads are ideal.
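To make the PCA bullet concrete, a minimal 2-D sketch with illustrative data: the first principal component is the top eigenvector of the covariance matrix, which for a symmetric $2 \times 2$ matrix has the closed-form angle $\theta = \tfrac{1}{2}\operatorname{atan2}(2b,\, a - c)$.

```python
from math import atan2, cos, sin

def principal_direction(points):
    """Unit top eigenvector of the 2x2 covariance matrix of (x, y) points."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    # Entries of the covariance matrix [[a, b], [b, c]].
    a = sum((x - mx) ** 2 for x, _ in points) / n
    c = sum((y - my) ** 2 for _, y in points) / n
    b = sum((x - mx) * (y - my) for x, y in points) / n
    # Closed-form principal-axis angle for a symmetric 2x2 matrix.
    theta = 0.5 * atan2(2 * b, a - c)
    return cos(theta), sin(theta)

# Points scattered along the line y = x: the direction of maximum
# variance should be roughly (1, 1) normalized, i.e. (0.707, 0.707).
pts = [(0, 0), (1, 1.1), (2, 1.9), (3, 3.05)]
ux, uy = principal_direction(pts)
print(ux, uy)
```

Real pipelines would use an eigendecomposition (e.g. of the full covariance matrix) for higher dimensions; the 2-D closed form just makes the variance-maximizing direction easy to see.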
Related Notes
- Expected Value - variance measures spread around the expected value
- Probability Distributions - each distribution has characteristic variance formulas
- Regression Fundamentals - regression coefficients are ratios of covariance to variance