Free SOA Exam ASTAM (Advanced Short-Term Actuarial Mathematics) Construction and Selection of Parametric Models Practice Questions

Practice construction and selection of parametric models for Exam ASTAM. Questions test MLE, minimum distance estimation, hypothesis testing, model selection criteria, and simulation methods.

220 Questions
57 Easy
114 Medium
49 Hard
2026 Syllabus

Sample Questions

Question 1 Easy
Which of the following best describes the Akaike Information Criterion (AIC) used in model selection?
Solution
D is correct. The Akaike Information Criterion is defined as:
AIC=2^+2k\text{AIC} = -2\hat{\ell} + 2k
where ^\hat{\ell} is the maximized log-likelihood and kk is the number of estimated parameters. The criterion balances fit (via 2^-2\hat{\ell}) against complexity (via 2k2k). Lower AIC is preferred.

Why each other option is incorrect:
- (B) This is the Bayesian Information Criterion (BIC/SBC), not AIC; BIC uses klnnk\ln n instead of 2k2k as the penalty.
- (C) The parameter penalty in AIC is linear (2k2k), not quadratic (2k22k^2); a quadratic penalty is not used in standard information criteria.
- (D) The sign convention is reversed; AIC uses 2^-2\hat{\ell} (positive for typical negative log-likelihoods), and lower values are preferred, not higher.
- (E) Normalizing the log-likelihood by nn before adding the penalty is not part of the AIC formula; AIC uses the raw maximized log-likelihood.
Question 2 Medium
Which of the following correctly describes the relationship between the AIC and BIC model selection criteria and their preference for model complexity?
Solution
E is correct. AIC =2^+2p= -2\hat{\ell} + 2p penalizes each parameter by 22. (__T(B)MP__)IC =2^+plnn= -2\hat{\ell} + p\ln n penalizes each by lnn\ln n. The (__T(B)MP__)IC penalty exceeds AIC's when lnn>2\ln n > 2, i.e., n>e27.4n > e^2 \approx 7.4. For any dataset with n8n \ge 8 — which includes virtually every actuarial application — (__T(B)MP__)IC penalizes additional parameters more heavily than AIC and therefore selects simpler (fewer-parameter) models.

Why each other option is incorrect:
- (A) Claims (__T(B)MP__)IC always penalizes more for any n>1n > 1; this is false for n7n \le 7, where lnn<2\ln n < 2 and AIC has the larger per-parameter penalty.
- (C) Reverses the formulas: AIC uses 2p2p and (__T(B)MP__)IC uses plnnp\ln n, not the other way around.
- (D) Reverses the consistency properties of AIC and (__T(B)MP__)IC; (__T(B)MP__)IC is consistent (converges to the true model as nn\to\infty) while AIC tends to overfit asymptotically.
- ((B)) States both criteria minimize the same objective but with different coefficients; while true as a narrow arithmetic statement, the more important error is calling them equivalent in purpose — AIC minimizes expected Kullback-Leibler divergence (prediction) while (__T(B)MP__)IC approximates the log marginal likelihood (model probability).
Question 3 Hard
The MLE of the exponential mean based on nn complete observations is θ^=Xˉ\hat{\theta} = \bar{X}. Using the delta method, find the asymptotic variance of the MLE of S(t)=et/θS(t) = e^{-t/\theta}, evaluated at t=500t = 500 and θ^=1000\hat{\theta} = 1000.
Solution
D is correct. The survival function is S(t)=et/θS(t) = e^{-t/\theta}. By the delta method:
Var(S^(t))(dSdθ)2Var(θ^)\text{Var}(\hat{S}(t)) \approx \left(\frac{dS}{d\theta}\right)^2 \text{Var}(\hat{\theta})
dSdθ=tθ2et/θ\frac{dS}{d\theta} = \frac{t}{\theta^2}e^{-t/\theta}
Var(θ^)=θ2n\text{Var}(\hat{\theta}) = \frac{\theta^2}{n}
Var(S^(t))=(tθ2et/θ)2θ2n=t2e2t/θnθ2\text{Var}(\hat{S}(t)) = \left(\frac{t}{\theta^2}e^{-t/\theta}\right)^2 \cdot \frac{\theta^2}{n} = \frac{t^2 e^{-2t/\theta}}{n\theta^2}
At t=500t = 500, θ=1000\theta = 1000: Var=5002e1n×106=250000e1106n=e14n\text{Var} = \frac{500^2 e^{-1}}{n \times 10^6} = \frac{250000 e^{-1}}{10^6 n} = \frac{e^{-1}}{4n}.

Why each other option is incorrect:
- (B) The binomial variance S(1S)/nS(1-S)/n applies to a Bernoulli proportion, not to a parametric MLE of a continuous survival function; the delta method is the correct approach.
- (C) Omitting the e2t/θe^{-2t/\theta} factor drops the squared exponential term that arises when squaring the derivative dS/dθdS/d\theta.
- (D) A linear (first-order) Taylor expansion retains (dS/dθ)2(dS/d\theta)^2, which still includes two powers of the exponential; there is no single-power simplification.
- (E) While algebraically similar, option E uses θ2\theta^2 in the numerator implicitly and then factors differently; the exact simplified form is t2e2t/θ/(nθ2)t^2 e^{-2t/\theta}/(n\theta^2), which is choice A.
Create a Free Account to Access All 220 Questions →

More Exam ASTAM Topics

About FreeFellow

FreeFellow is a free exam prep platform for actuarial (SOA & CAS), CFA, CFP, CPA, CAIA, and securities licensing candidates. Every question includes a detailed solution. Full lessons, flashcards with spaced repetition, timed mock exams, performance analytics, and a personalized study plan are all included — no paywalls, no ads.