Random Variable

Intuition

A random variable is just a rule that turns outcomes into numbers. Flip a coin and you get heads or tails - but if you assign heads = 1 and tails = 0, you now have a random variable. Roll two dice and sum the faces: that sum is a random variable. The outcome is still random, but now it lives on a number line, so you can compute averages, measure spread, and apply the full machinery of mathematics.

Random variables are the entry point to everything else in statistics. Without them, concepts like expected value, variance, and distributions have no object to act on.

Definition

A random variable $X$ is a function from a sample space $S$ to the real numbers:

$X : S \to \mathbb{R}$

Each outcome $s \in S$ maps to a real number $X(s)$. The randomness comes from the underlying experiment, not from $X$ itself: $X$ is a deterministic function applied to a random outcome.
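To make the definition concrete, here is a minimal sketch of the two-dice sum from the Intuition section. The names `sample_space`, `X`, and `pmf` are illustrative choices, and the dice are assumed to be fair so that every outcome is equally likely.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Sample space S: every ordered pair of faces from two six-sided dice.
sample_space = list(product(range(1, 7), repeat=2))

def X(s):
    """The random variable: a deterministic map from an outcome to a number."""
    return s[0] + s[1]

# Assuming fair dice, P(X = x) is the fraction of outcomes that X maps to x.
counts = Counter(X(s) for s in sample_space)
pmf = {x: Fraction(c, len(sample_space)) for x, c in sorted(counts.items())}

for x, p in pmf.items():
    print(f"P(X = {x}) = {p}")
```

Nothing inside `X` is random. The distribution of the sum comes entirely from which outcome of the experiment gets passed in.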

Discrete vs. continuous

Type	Values	Described by	Example
Discrete	Countable set $\{x_{1}, x_{2}, \dots\}$	Probability mass function $P(X = x)$	Number of bugs in a release
Continuous	Uncountable set, such as an interval of real numbers	Probability density function $f(x)$	Response time of an API call

For a discrete random variable, probabilities are assigned to individual values: $P(X = x)$. For a continuous random variable, probability is defined over intervals: $P(a \leq X \leq b) = \int_{a}^{b} f(x) \, dx$, and $P(X = x) = 0$ for any single point.
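The interval-versus-point distinction can be checked numerically. The sketch below assumes, purely for illustration, that an API response time follows an exponential distribution with a hypothetical rate `RATE`; the note itself does not specify a distribution. It integrates the density with a simple midpoint rule and compares the result with the closed-form CDF difference.

```python
import math

RATE = 5.0  # hypothetical rate (per second); mean response time is 1 / RATE = 0.2 s

def pdf(x):
    """Density of the assumed exponential response-time model."""
    return RATE * math.exp(-RATE * x) if x >= 0 else 0.0

def cdf(x):
    """Closed-form P(X <= x) for the same model."""
    return 1.0 - math.exp(-RATE * x) if x >= 0 else 0.0

def integrate(f, a, b, steps=10_000):
    """Midpoint-rule approximation of the integral of f over [a, b]."""
    width = (b - a) / steps
    return sum(f(a + (i + 0.5) * width) for i in range(steps)) * width

p_interval = integrate(pdf, 0.1, 0.3)  # P(0.1 <= X <= 0.3) by integration
p_exact = cdf(0.3) - cdf(0.1)          # same probability via the CDF
p_point = integrate(pdf, 0.2, 0.2)     # zero-width interval: P(X = 0.2)

print(p_interval, p_exact, p_point)
```

The zero-width integral comes out as exactly 0, which is the numerical counterpart of $P(X = x) = 0$ for a continuous random variable.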

Figure: a random variable as a function mapping the sample space to the real numbers.

Key Formulas

Probability mass function (discrete):

$f(x) = P(X = x), \quad \sum_{x} f(x) = 1$

Probability density function (continuous):

$P(a \leq X \leq b) = \int_{a}^{b} f(x) \, dx, \quad \int_{-\infty}^{\infty} f(x) \, dx = 1$

Cumulative distribution function (both types):

$F (x) = P (X \leq x)$

Expected value:

$E[X] = \sum_{x} x f(x) \quad \text{or} \quad E[X] = \int_{-\infty}^{\infty} x f(x) \, dx$

Variance:

$\operatorname{Var}(X) = E[(X - \mu)^{2}] = E[X^{2}] - (E[X])^{2}, \quad \text{where } \mu = E[X]$
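As a quick check of the discrete formulas, the following sketch evaluates them for a fair six-sided die, an example chosen here for illustration. The helper `expectation` is a hypothetical name, and exact fractions are used so the two variance expressions can be compared without rounding.

```python
from fractions import Fraction

# PMF of a fair six-sided die: f(x) = 1/6 for x = 1..6.
pmf = {x: Fraction(1, 6) for x in range(1, 7)}

def expectation(pmf, g=lambda x: x):
    """E[g(X)] = sum over x of g(x) * f(x) for a discrete random variable."""
    return sum(g(x) * p for x, p in pmf.items())

total = sum(pmf.values())                             # should equal 1
mu = expectation(pmf)                                 # E[X]
var_def = expectation(pmf, lambda x: (x - mu) ** 2)   # E[(X - mu)^2]
var_short = expectation(pmf, lambda x: x ** 2) - mu ** 2  # E[X^2] - (E[X])^2
cdf_4 = sum(p for x, p in pmf.items() if x <= 4)      # F(4) = P(X <= 4)

print(total, mu, var_def, var_short, cdf_4)
```

Both variance expressions give the same value, which is the identity stated in the variance formula.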

Example

Modelling packet loss. A network link drops each packet independently with probability $p = 0.02$. Let $X$ be the number of dropped packets in a batch of $n = 100$.

Each packet is a Bernoulli trial, so $X \sim \text{Binomial}(100, 0.02)$:

$E[X] = n p = 100 \times 0.02 = 2$

$\operatorname{Var}(X) = n p (1 - p) = 100 \times 0.02 \times 0.98 = 1.96$

The random variable $X$ lets us move from “packets might get dropped” to precise quantitative statements: on average 2 drops per batch, with standard deviation $\sqrt{1.96} = 1.4$. This informs retry buffer sizing and SLA calculations.
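The same numbers can be recovered directly from the binomial PMF. This is a minimal sketch using only the standard library; the function name `binom_pmf` and the threshold of 5 drops in the tail probability are illustrative assumptions, not part of the example above.

```python
import math

n, p = 100, 0.02  # batch size and per-packet drop probability from the example

def binom_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

pmf = [binom_pmf(k, n, p) for k in range(n + 1)]

mean = sum(k * pk for k, pk in enumerate(pmf))              # should match n * p
var = sum((k - mean) ** 2 * pk for k, pk in enumerate(pmf)) # should match n * p * (1 - p)
std = math.sqrt(var)

# Tail probability for a hypothetical buffer that absorbs up to 5 drops per batch.
p_more_than_5 = 1.0 - sum(pmf[:6])

print(mean, var, std, p_more_than_5)
```

A tail probability like `p_more_than_5` is the kind of quantity a buffer-sizing or SLA argument would use: it estimates how often a batch exceeds whatever capacity has been provisioned.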

Why It Matters in CS

Formalizing randomness. Randomized algorithms (quicksort pivot selection, hash functions, skip lists) are analyzed by defining random variables over their internal coin flips.
Algorithm analysis. The running time of a randomized algorithm on a fixed input is a random variable. Its expected value gives the expected running time (averaged over the algorithm's own random choices); its variance tells you how tightly individual runs cluster around that expectation.
Probabilistic data structures. Bloom filters, count-min sketches, and HyperLogLog all define random variables whose distributions determine error guarantees.
Machine learning. Features are random variables. Labels are random variables. The entire supervised learning framework is built on the joint distribution $P(X, Y)$.
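To illustrate the algorithm-analysis point, the sketch below treats the comparison count of randomized quicksort on one fixed input as a random variable and samples it repeatedly. It assumes distinct keys and charges `len(items) - 1` comparisons per partition step, which is the usual accounting for an in-place partition; the list comprehensions are only there for brevity. The input size, trial count, and seed are arbitrary choices, and the closed form $2(n+1)H_{n} - 4n$ is the classical expected comparison count under those assumptions.

```python
import random
import statistics

def quicksort_comparisons(items, rng):
    """Return the number of pivot comparisons made on items (distinct keys assumed)."""
    if len(items) <= 1:
        return 0
    pivot = items[rng.randrange(len(items))]  # the algorithm's internal coin flip
    smaller = [x for x in items if x < pivot]
    larger = [x for x in items if x > pivot]
    # One partition step compares the pivot with each of the other elements.
    return (len(items) - 1
            + quicksort_comparisons(smaller, rng)
            + quicksort_comparisons(larger, rng))

rng = random.Random(0)   # fixed seed so the run is repeatable
n = 200
data = list(range(n))    # the input is fixed; only the pivots are random

samples = [quicksort_comparisons(data, rng) for _ in range(500)]
sample_mean = statistics.mean(samples)
sample_std = statistics.stdev(samples)

# Classical expectation for distinct keys: 2 (n + 1) H_n - 4 n.
harmonic = sum(1 / k for k in range(1, n + 1))
exact_mean = 2 * (n + 1) * harmonic - 4 * n

print(sample_mean, sample_std, exact_mean)
```

The sample mean estimates the expected comparison count, and the sample standard deviation shows how much individual runs vary around it.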

Related Notes

Expected Value - the mean of a random variable
Variance and Covariance - measures spread and co-movement of random variables
Probability Distributions - the families that random variables follow
Binomial Distribution - a discrete random variable counting successes
Normal Distribution - the most common continuous random variable model
Poisson Distribution - a discrete random variable for event counts
