GradGPT LogoGradGPTAP

AP Statistics

AP Statistics FRQ Tips: How to Write Answers That Actually Score

6 FRQs90 min50% of your score5 short + 1 Investigative Task
15

days until your AP Stats exam

Thu, May 7 · Afternoon session

How the AP Stats FRQ points are scored

Five short FRQs (~4 points each) and one Investigative Task (~8 points). The Investigative Task asks you to apply statistical reasoning to a novel situation - it's the highest-leverage FRQ to practice.

Points per FRQ type

Short FRQs (×5) · 20 pt
Investigative Task · 8 pt

Suggested time budget · 90 min

Short FRQs (×5) · 55 min
Investigative Task · 35 min

Short FRQs (×5)

20 pt · 55 min

Standard inference, probability, and study design prompts. 1–3 sub-parts each.

Investigative Task

8 pt · 35 min

Novel application of statistical reasoning. Open-ended. Highest-leverage prep target.

Exam composite weighting

6 FRQs carry the same weight as 40 MCQs

50%
50%
Multiple choice40 questions · 90 min
Free response6 FRQs · 90 min

Decode the task word before you write

Every AP FRQ begins with a task word. It tells you the minimum sentence type required for the point. Writing too much costs time; writing the wrong kind costs the point outright.

Task word
Earns the point when you…
Common miss
Identify
Name the term, concept, or value. No explanation required.
Writing a paragraph when one phrase is asked.
Define
Give the textbook meaning in one sentence.
Defining with an example instead of the concept itself.
Describe
Give 2–3 sentences of specific detail - names, numbers, mechanisms.
Staying vague or abstract when specifics are required.
Explain
Show cause → effect with a real mechanism.
Describing instead of explaining - no causal verb.
Compare
Mention both sides in the same sentence with a linking word.
Describing each separately, never connecting them.
Justify
State your claim and back it with evidence or reasoning.
Offering the claim without the 'because' that supports it.

Want to know if you're actually doing this?

Write one Stats FRQ. Get your setup, conditions, and interpretation scored line by line.

Weak vs. strong: hypothesis test

Inference FRQs are scored on four steps: hypotheses, conditions, mechanics, and conclusion. Missing any one step caps your score.

Prompt

A researcher claims that more than 60% of adults exercise weekly. A random sample of 200 adults found 130 who exercise weekly. Test at α = 0.05.

Weak answer

2/4

"p-hat = 130/200 = 0.65. z = (0.65 - 0.60) / sqrt(0.6*0.4/200) = 1.44. p-value ≈ 0.075. Since 0.075 > 0.05, fail to reject H₀. There isn't enough evidence."

Why it lost points

  • Hypotheses H₀ and Hₐ are never written.
  • Conditions (random, independent, large sample n·p ≥ 10) are not checked.
  • Conclusion is in statistical language but doesn't reference the context (adults exercising).

Strong answer

4/4

"H₀: p = 0.60; Hₐ: p > 0.60. Conditions: (1) Random - stated in prompt. (2) Independent - 200 < 10% of all adults. (3) Large counts - np₀ = 120 ≥ 10 and n(1-p₀) = 80 ≥ 10. All satisfied. One-proportion z-test: p̂ = 130/200 = 0.65. z = (0.65 − 0.60) / √(0.6 · 0.4 / 200) = 1.44. p-value = P(Z > 1.44) ≈ 0.0749. Since p-value 0.0749 > α = 0.05, fail to reject H₀. We do not have significant evidence at the 5% level that more than 60% of adults exercise weekly."

Why it scores full marks

  • All four steps clearly present: hypotheses, conditions, mechanics, conclusion.
  • Conditions explicitly numbered and verified with context.
  • Conclusion references both the statistical result AND the context (adults exercising weekly).

Weak vs. strong: confidence interval interpretation

Interpretation prompts are where most students lose points. Readers look for the specific language: 'We are X% confident that the true [parameter in context] is between [L] and [U].'

Prompt

A 95% confidence interval for the mean height of students at a university is (65.2, 67.8) inches. Interpret this interval in context.

Weak answer

0/2

"There is a 95% chance that the mean is between 65.2 and 67.8 inches."

Why it lost points

  • '95% chance' is wrong - the confidence level refers to the METHOD, not the specific interval.
  • No mention of 'true mean' or 'students at this university.'
  • Missing the key phrasing readers look for.

Strong answer

2/2

"We are 95% confident that the true mean height of all students at this university is between 65.2 and 67.8 inches."

Why it scores full marks

  • '95% confident' - the correct frequentist phrasing.
  • References 'true mean' (the parameter) explicitly.
  • Identifies the population (students at this university) in context.

What you see in GradGPT

This is what your feedback looks like

Every Stats FRQ you write gets scored against the same rubric AP readers use. Strengths, improvements, and notes are highlighted inline.

Your response

StrengthImprovementNote

p̂ = 0.65. Using a one-proportion z-test, z = 1.44, p-value = 0.075. Since p > 0.05, we fail to reject H₀. There isn't enough evidence.

Inline feedback

ImprovementHypotheses (H₀ and Hₐ) should be written explicitly before mechanics - it's a separate scoring row.
StrengthCorrect test name and mechanics. This is the middle of the 4-step procedure.
NoteAlso verify conditions: random, independent (10% rule), large counts (np₀ ≥ 10).
ImprovementConclusion needs context - 'more than 60% of adults exercise weekly,' not just 'enough evidence.'

Rubric breakdown

You scored higher than 55% of students on this prompt

Hypotheses stated

0/1

H₀ and Hₐ missing

Conditions checked

0/1

Not verified

Mechanics + conclusion

2/2

Calculation + p-value comparison correct

Get this on your own answer.

The 5 FRQ patterns that cover AP Stats

Every AP Stats FRQ fits one of these five patterns. Learn the four-step structure for each and you have the framework for the whole exam.

1

Hypothesis test (4-step)

Hypotheses → Conditions → Mechanics → Conclusion. Missing any step caps the score.

  • Write H₀ and Hₐ in symbols AND in context.
  • Verify each condition with a specific check (random, 10%, large counts).
2

Confidence interval interpretation

One sentence. Specific phrasing required: 'We are X% confident that the true [parameter in context] is between L and U.'

  • Always reference 'true' parameter.
  • Include context - not just statistical language.
3

Study design / sampling

Describe a method (SRS, stratified, cluster) or evaluate a design's validity.

  • Name the sampling method and how it's executed (random-number table, etc.).
  • Explain WHY it reduces bias or protects validity.
4

Probability / distributions

Binomial, normal, geometric. Name the distribution and parameters explicitly.

  • State distribution + parameters: 'X ~ Binomial(n = 50, p = 0.3).'
  • Identify the calculator command used (binomcdf, normalcdf) with inputs.
5

Investigative Task

Open-ended novel application. Readers reward reasoning and connection across multiple concepts.

  • Map the familiar technique onto the unfamiliar context.
  • Show your reasoning even when the problem is non-standard - label each step.

The mistakes that quietly cost points

These show up every year. Each one is a single habit - fix the habit and you bank points.

Skipping the hypotheses. H₀ and Hₐ are always a separate scoring row on inference FRQs.

Not checking conditions in context. 'Sample is random because…' needs a reason from the prompt.

Interpreting a confidence interval as 'the probability that the true mean is in this interval.' Wrong - it's about the method, not the specific interval.

Writing conclusions in pure statistics language. 'Fail to reject H₀' without context loses the conclusion point.

Forgetting to name the distribution on probability FRQs. 'X ~ Binomial(n, p)' is the notation readers reward.

Rushing the Investigative Task. It's worth more than any short FRQ - spend the full 35 minutes.

Trusted by 10,000+ AP students

What Stats students say after their first graded FRQ

"I tried Barron's, Princeton Review & YouTube. GradGPT-AP is the only thing that actually graded my essays."

Sarah K.

scored a 5

"This is exactly like the real AP exam. Without practicing in GradGPT-AP, I would have run out of time on exam day!"

Marcus T.

3 → 5 on AP Lang in 2 weeks

"Wasted so many hours on Khan Academy. GradGPT-AP showed me exactly what I was doing wrong with my essays. I finally feel confident for the exam."

Priya R.

Your first FRQ scored against the real AP Stats rubric.

Frequently Asked Questions

Six. Five short FRQs (~4 points each) and one Investigative Task (~8 points). You have 90 minutes total - roughly 11 minutes per short FRQ and 35 minutes for the Investigative Task.

Yes. Conditions are almost always a separate scoring row. The three conditions for proportions: random, independent (10% rule), and large counts. For means: random, independent, and Normal/CLT.

Four steps, all present: (1) Hypotheses in symbols AND context. (2) Conditions verified specifically. (3) Test mechanics (statistic, p-value). (4) Conclusion referencing both the statistical result and the context.

Interpretation in pure statistical language. Every conclusion, CI interpretation, or regression inference must be stated in the CONTEXT of the prompt - not just 'we reject' or 'we are confident.'

Very. It's worth roughly twice as much as a short FRQ. Even partial progress scores well if your reasoning is clear and connects familiar concepts to the novel situation.

50/50. Six FRQs carry the same weight as 40 MCQs. FRQs are where most students have the most room to grow, making them the highest-leverage prep.

Write one Stats FRQ. See exactly where you lost points.

Paste your answer and get a rubric breakdown with inline feedback in seconds.