Accurately estimating risk preferences is of critical importance when evaluating data from many economic experiments or strategic interactions. I use a simulation model to conduct power analyses over two lottery batteries designed to classify individual subjects as being best explained by one of a number of alternative specifications of risk preference models. I propose a case in which there are only two possible alternatives for classification and find that the statistical methods used to classify subjects result in type I and type II errors at rates far beyond traditionally acceptable levels. These results suggest that subjects in experiments must make significantly more choices, or that traditional lottery pair batteries need to be substantially redesigned to make accurate inferences about the risk preference models that characterize a subject’s choices.