The paper proposes a hidden semi-Markov model for breakpoint rainfall data that consist of both the times at which rain-rate changes and the steady rates between such changes. The model builds on and extends the seminal work of Ferguson (1980) on variable duration models for speech. For the rainfall data the observations are modelled as mixtures of log-normal distributions within unobserved states where the states evolve in time according to a semi-Markov process. For the latter, parametric forms need to be specified for the state transition probabilities and dwell-time distributions.
Recursions for constructing the likelihood are developed and the EM algorithm used to fit the parameters of the model. The choice of dwell-time distribution is discussed with a mixture of distributions over disjoint domains providing a flexible alternative. The methods are also extended to deal with censored data. An application of the model to a large-scale bivariate dataset of breakpoint rainfall measurements at Wellington, New Zealand, is discussed.