Model Evaluation and Selection(Part-3)
Hypothesis Testing Given a hypothesis, test if it is true with a particular confidence • Hypotheses: errorD (h1) > errorD (h2) • What % of the probability mass is associated with errorD (h1) - errorD (h2) > 0 • Example • Let the error rates measured for the two hypotheses h1 and h2 on a sample of size 100 be 0.3 and 0.2 respectively • The standard deviation for the normal distribution defined on d’ = errorS1 (h1 )- errorS2 (h2 ) is • 1.640.061 = 0.1 • 1.64 standard deviation corresponds to the 90% confidence interval and hence the % mass of the probability distribution > 0 is 95% • Result: Accept the hypothesis with 95% confidence that h2 is a more accurate hypothesis than h1 on D (the underlying population) Comparing Two Algorithms Given two learning algorithms, L1 and L2 , which one is better, on average, at learning a particular target function • Estimating the relative performance • Calculate the Expected Value of the difference in errors • For all samples of