Book contents
- Frontmatter
- Contents
- Preface
- 1 Basic Concepts in Probability and Statistics
- 2 Hypothesis Tests
- 3 Confidence Intervals
- 4 Statistical Tests Based on Ranks
- 5 Introduction to Stochastic Processes
- 6 The Power Spectrum
- 7 Introduction to Multivariate Methods
- 8 Linear Regression: Least Squares Estimation
- 9 Linear Regression: Inference
- 10 Model Selection
- 11 Screening: A Pitfall in Statistics
- 12 Principal Component Analysis
- 13 Field Significance
- 14 Multivariate Linear Regression
- 15 Canonical Correlation Analysis
- 16 Covariance Discriminant Analysis
- 17 Analysis of Variance and Predictability
- 18 Predictable Component Analysis
- 19 Extreme Value Theory
- 20 Data Assimilation
- 21 Ensemble Square Root Filters
- Appendix
- References
- Index
7 - Introduction to Multivariate Methods
Published online by Cambridge University Press: 03 February 2022
- Frontmatter
- Contents
- Preface
- 1 Basic Concepts in Probability and Statistics
- 2 Hypothesis Tests
- 3 Confidence Intervals
- 4 Statistical Tests Based on Ranks
- 5 Introduction to Stochastic Processes
- 6 The Power Spectrum
- 7 Introduction to Multivariate Methods
- 8 Linear Regression: Least Squares Estimation
- 9 Linear Regression: Inference
- 10 Model Selection
- 11 Screening: A Pitfall in Statistics
- 12 Principal Component Analysis
- 13 Field Significance
- 14 Multivariate Linear Regression
- 15 Canonical Correlation Analysis
- 16 Covariance Discriminant Analysis
- 17 Analysis of Variance and Predictability
- 18 Predictable Component Analysis
- 19 Extreme Value Theory
- 20 Data Assimilation
- 21 Ensemble Square Root Filters
- Appendix
- References
- Index
Summary
Many scientific questions lead to hypotheses about random vectors. For instance, the question of whether global warming has occurred over a geographic region is a question about whether temperature has changed at each spatial location within the region. One approach to addressing such a question is to apply a univariate test to each location separately and then use the results collectively to make a decision. This approach is called multiple testing or multiple comparisons and is common in genomics for analyzing gene expressions. The disadvantage of this approach is that it does not fully account for correlation between variables. Multivariate techniques provide a framework for hypothesis testing that takes into account correlations between variables. Although multivariate tests are more comprehensive, they require estimating more parameters and therefore have low power when the number of variables is large. Multivariate statistical analysis draws heavily on linear algebra and includes a generalization of the normal distribution, called the multivariate normal distribution, whose population parameters are the mean vector and the covariance matrix.
Keywords
- Type
- Chapter
- Information
- Statistical Methods for Climate Scientists , pp. 156 - 184Publisher: Cambridge University PressPrint publication year: 2022