Question 1: The process of cherry-picking promising findings and thereby risking spurious statistically-significant results is sometimes called: |
Reference: | R.L. Wasserstein and N.A. Lazar, The ASA’s statement on p-values, context, process, and purpose, The American Statistician, accepted version published online 3/7/2016. |
Choice A: | Data dredging. |
Choice B: | P-hacking. |
Choice C: | Significance chasing. |
Choice D: | Selective inference. |
Choice E: | All of the above. |
Question 2: A p-value can best be described as the probability, under a specified statistical model, that ______________. |
Reference: | Reference: R.L. Wasserstein and N.A. Lazar, The ASA’s statement on p-values, context, process, and purpose, The American Statistician, accepted version published online 3/7/2016. |
Choice A: | The null hypothesis is true. |
Choice B: | A statistical summary of the data would be equal to or more extreme than its observed value. |
Choice C: | The statistical summary of the data was a random coincidence unique to the given data under analysis. |
Choice D: | The results would not hold up if the experiment is repeated. |
Choice E: | The data has sufficient statistical power to detect a given effect. |
Question 3: True or false: Bayes theorem provides a method for computing the probability that a hypothesis is true given some observed data or event. |
Reference: | M. E. J. Masson, “A tutorial on a practical Bayesian alternative to null-hypothesis significance testing”, Behavioral Research 43, 2011. |
Choice A: | True. |
Choice B: | False. |
Question 4: When describing how a dataset deviates from a normal distribution, kurtosis is a measure of: |
Reference: | A. Fields et al., Discovering Statistics Using R, Sage, 2012. |
Choice A: | The sharpness of the peak of the frequency distribution of the dataset. |
Choice B: | The asymmetry of the frequency distribution of the dataset. |
Choice C: | The fraction of missing data in the dataset. |
Choice D: | The area under the frequency distribution curve of the dataset. |
Choice E: | None of the above. |
Question 5: True or False: The Mann-Whitney test and the Wilcoxon’s rank-sum test are non-parametric equivalents of the independent t-test. |
Reference: | A. Fields et al., Discovering Statistics Using R, Sage, 2012. |
Choice A: | True. |
Choice B: | False. |
Question 6: In simple linear regression, what is the x-axis variable? |
Reference: | Agresti, A. An introduction to categorical data analysis 2nd ed. John Wiley & Sons, Hoboken, NJ, 2007. |
Choice A: | Response variable. |
Choice B: | Outcome variable. |
Choice C: | Independent variable. |
Choice D: | Deterministic variable. |
Question 7: The Coefficient of Determination conveys what portion of data variation is accounted for by the model. |
Reference: | Agresti, A. An introduction to categorical data analysis 2nd ed. John Wiley & Sons, Hoboken, NJ, 2007. |
Choice A: | True. |
Choice B: | False. |
Question 8: What type of logistic regression is used when the categories have no natural ordering? |
Reference: | Reference: Agresti, A. An introduction to categorical data analysis 2nded. John Wiley & Sons, Hoboken, NJ, 2007. |
Choice A: | Binary. |
Choice B: | Nominal. |
Choice C: | Ordinal. |
Question 9: True or False: The ½ standard deviation method is an accepted criterion for determining clinical significance in quality of life (QOL) studies. |
Reference: | J. Sloan, et al., “Clinical significance of patient-reported questionnaire data: another step towards consensus”, J. Clinical Epidemiology 58, 2005. |
Choice A: | True. |
Choice B: | False. |
Question 10: True or False: The Bonferroni correction is the most appropriate method by which to address multiple hypotheses testing in large-scale radiomics projects. |
Reference: | A. Kumar, et al., “Radiomics: the process and the challenges”, Magnetic Resonance Imaging 30, 2012. |
Choice A: | True. |
Choice B: | False. |