Small sample sizes: A big data problem in high-dimensional data analysis

Frank Konietschke* (Corresponding Author), Karima Schwab, Markus Pauly

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

27 Citations (Scopus)
8 Downloads (Pure)

Abstract

In many experiments and especially in translational and preclinical research, sample sizes are (very) small. In addition, data designs are often high dimensional, i.e. more dependent than independent replications of the trial are observed. The present paper discusses the applicability of max t-test-type statistics (multiple contrast tests) in high-dimensional designs (repeated measures or multivariate) with small sample sizes. A randomization-based approach is developed to approximate the distribution of the maximum statistic. Extensive simulation studies confirm that the new method is particularly suitable for analyzing data sets with small sample sizes. A real data set illustrates the application of the methods. </jats:p>
Original languageEnglish
Pages (from-to)687–701
Number of pages15
JournalStatistical Methods in Medical Research
Volume30
Issue number3
Early online date24 Nov 2020
DOIs
Publication statusPublished - 1 Mar 2021

Bibliographical note

Acknowledgements
The authors are grateful to the Editor, Associate Editor and three anonymous referees for their helpful suggestions, which greatly improved the manuscript.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The research is supported by the German Science Foundation awards number DFG KO 4680/3-2 and PA 2409/3-2.

Keywords

  • Multiple contrast tests
  • max t-test
  • repeated measures
  • resampling
  • simultaneous confidence intervals

Fingerprint

Dive into the research topics of 'Small sample sizes: A big data problem in high-dimensional data analysis'. Together they form a unique fingerprint.

Cite this