Welcome to the World of Continuous Random Variables!

In your previous studies, you might have looked at Discrete Random Variables—things you can count, like the number of heads in three coin flips or the number of students in a class. But what about things we measure? Think about the time you wait for a bus, the weight of an apple, or the exact height of a tree. These can take any value in a range. These are called Continuous Random Variables (CRVs).

By the end of these notes, you’ll understand how to describe these variables using math, how to find the "average" result, and how to calculate the chances of something happening. Don’t worry if it sounds a bit heavy on the calculus at first; we will break it down step-by-step!

1. Discrete vs. Continuous: The Big Difference

To understand a CRV, it helps to compare it to a Discrete variable.

Discrete: Like a digital clock that only shows minutes. The time "jumps" from 10:01 to 10:02. You can list every possible outcome.
Continuous: Like an old-fashioned analog clock where the second hand sweeps smoothly. Between 10:01 and 10:02, there are an infinite number of tiny moments (10:01.0001, 10:01.00012, etc.).

Did you know? Because there are infinite possible values, the probability of a continuous variable being exactly one specific number (like exactly 1.750000... meters tall) is actually zero! Instead, we always look for the probability that a value falls within a range.

2. The Probability Density Function (PDF)

Since we can't list probabilities in a table like we do for discrete variables, we use a graph or a formula called a Probability Density Function, written as \( f(x) \).

Think of the PDF as a "map" of where the probability is most likely to be. The higher the curve, the more likely the variable is to fall in that area.

The Golden Rules of PDFs:
1. The curve must never go below the x-axis. Mathematically: \( f(x) \ge 0 \) for all \( x \). (You can't have negative probability!)
2. The total area under the entire curve must equal 1. This represents the 100% chance that something happens.

Quick Review: To find the total area, we use integration! For a PDF defined between the limits \( a \) and \( b \):
\( \int_{a}^{b} f(x) dx = 1 \)

Key Takeaway: A PDF describes the "shape" of the probability, and the total area under it is always 1.

3. Finding Probabilities (The Area Method)

As we mentioned, we only find probabilities for ranges. To find the probability that our variable \( X \) is between two values \( c \) and \( d \), we find the area under the PDF curve between those two points.

The Formula:
\( P(c \le X \le d) = \int_{c}^{d} f(x) dx \)

Common Mistake to Avoid: Students often worry about whether to use \( < \) or \( \le \). In Continuous Random Variables, it doesn't matter! Because the probability of being exactly one point is zero, \( P(X < 5) \) is the same as \( P(X \le 5) \).

Example: If the time waiting for a train is modeled by a PDF, the probability you wait between 2 and 5 minutes is just the integral of that function from 2 to 5.

4. The Cumulative Distribution Function (CDF)

The Cumulative Distribution Function, written as \( F(x) \), is like a "running total" of the probability. It tells you the probability that the variable is less than or equal to a certain value \( x \).

The Definition:
\( F(x) = P(X \le x) = \int_{-\infty}^{x} f(t) dt \)

How to switch between them:
- If you have the PDF \( f(x) \) and want the CDF \( F(x) \): Integrate.
- If you have the CDF \( F(x) \) and want the PDF \( f(x) \): Differentiate (\( f(x) = F'(x) \)).

Memory Aid: Think of CDF as the Collector. It collects all the probability from the start up to the point you are looking at.

Key Takeaway: \( F(x) \) always starts at 0 and ends at 1. It represents the probability "so far."

5. Measures of Central Tendency (Mean and Median)

Just like with lists of numbers, we want to find the "middle" or "average" of a continuous distribution.

The Mean (Expectation)

The mean, or \( E(X) \), is the "balance point" of the PDF. To find it, we multiply \( x \) by the PDF and integrate.

The Formula:
\( E(X) = \mu = \int_{all \space x} x f(x) dx \)

The Median

The median \( m \) is the value where exactly 50% of the probability is to the left and 50% is to the right. To find it, we use the CDF!

The Formula:
Solve \( F(m) = 0.5 \) for \( m \).

Analogy: If the PDF was a piece of wood, the Mean is where you could balance it on your finger. The Median is where you would saw it in half so both pieces have the same weight.

6. Variance and Standard Deviation

Variance tells us how "spread out" the probability is. If the variance is high, the values are likely to be far from the mean.

Step 1: Find \( E(X^2) \) using \( \int x^2 f(x) dx \).
Step 2: Use the variance formula:
\( Var(X) = E(X^2) - [E(X)]^2 \)
Step 3: If you need the Standard Deviation (\( \sigma \)), just take the square root of the variance.

Quick Trick: Always calculate \( E(X) \) first, as you need it for the variance formula anyway!

7. Summary Checklist for Exam Questions

When you face a Continuous Random Variable question, follow this mental map:

1. Is there an unknown constant \( k \) in the PDF?
Integrate the function over its whole range, set it equal to 1, and solve for \( k \).

2. Need to find a probability?
Integrate the PDF between the two numbers given in the question.

3. Need the Median or a Percentile?
Find the CDF \( F(x) \) first by integrating, then set it equal to 0.5 (for median) or the percentile (e.g., 0.25 for the lower quartile) and solve for \( x \).

4. Need the Mean?
Integrate \( x \times f(x) \).

Don't worry if the integration looks scary! Most Oxford AQA problems involve polynomials (like \( x^2 + 2x \)), which follow the simple rule: increase the power by 1 and divide by the new power. You've got this!