We generalize a result of Ziv and Merhav on universal estimation of the specific cross (or relative) entropy, originally proved for pairs of multilevel Markov measures, to a broader class of decoupled measures. Our generalization relies on abstract decoupling conditions and covers pairs of suitably regular g-measures, as well as pairs of equilibrium measures arising from the “small space of interactions” in mathematical statistical mechanics.
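For orientation, one standard convention for the quantity being estimated is the following (our notation; the paper's precise setting and normalization may differ). For stationary measures $p$ and $q$ on a shift space:

```latex
% Specific cross entropy of p relative to q (one common convention):
h_q(p) = \lim_{n\to\infty} -\frac{1}{n}\,
         \mathbb{E}_p\bigl[\log q(X_1,\dots,X_n)\bigr],
% and the specific relative entropy is its difference with the
% specific entropy h(p) of p itself:
h(p \,\|\, q) = h_q(p) - h(p).
```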
This chapter introduces the machine learning side of this book. Although we assume some prior experience with machine learning, we start with a full recap of the basic concepts and key terminology. This includes a discussion of learning paradigms, such as supervised and unsupervised learning, and of the machine learning life cycle, articulating the steps that take us from data collection to model deployment. We cover topics such as data preparation and preprocessing, model evaluation and selection, and machine learning pipelines, showing how every stage of this cycle can be compromised in large-scale data analytics. The rest of the chapter is devoted to Spark's machine learning library, MLlib. Basic concepts such as Transformers, Estimators, and Pipelines are presented with an example based on linear regression. The example requires a pipeline of methods to get the data ready for training, which allows us to introduce some of Spark's data preparation packages (e.g., VectorAssembler or StandardScaler). Finally, we explore evaluation packages (e.g., RegressionEvaluator) and how to perform hyperparameter tuning.
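As a taste of what the chapter covers, here is a minimal, self-contained PySpark sketch; the toy data and column names are ours, not the book's. It chains VectorAssembler, StandardScaler, and LinearRegression into a Pipeline, evaluates with RegressionEvaluator, and sets up a small grid for hyperparameter tuning:

```python
from pyspark.sql import SparkSession
from pyspark.ml import Pipeline
from pyspark.ml.feature import VectorAssembler, StandardScaler
from pyspark.ml.regression import LinearRegression
from pyspark.ml.evaluation import RegressionEvaluator
from pyspark.ml.tuning import CrossValidator, ParamGridBuilder

spark = SparkSession.builder.appName("mllib-intro").getOrCreate()

# Toy data: two numeric features and a continuous label.
df = spark.createDataFrame(
    [(1.0, 2.0, 3.5), (2.0, 0.5, 4.1), (3.0, 1.5, 6.2),
     (4.0, 2.5, 8.0), (5.0, 1.0, 9.1), (6.0, 3.0, 12.3)],
    ["x1", "x2", "y"])

# Transformers and an Estimator chained into a single Pipeline.
assembler = VectorAssembler(inputCols=["x1", "x2"], outputCol="raw")
scaler = StandardScaler(inputCol="raw", outputCol="features")
lr = LinearRegression(featuresCol="features", labelCol="y")
pipeline = Pipeline(stages=[assembler, scaler, lr])

model = pipeline.fit(df)     # fits the whole chain at once
preds = model.transform(df)  # in practice, apply to a held-out test set

evaluator = RegressionEvaluator(labelCol="y", metricName="rmse")
print("RMSE:", evaluator.evaluate(preds))

# Hyperparameter tuning: grid search with cross-validation.
grid = ParamGridBuilder().addGrid(lr.regParam, [0.01, 0.1]).build()
cv = CrossValidator(estimator=pipeline, estimatorParamMaps=grid,
                    evaluator=evaluator, numFolds=3)
best = cv.fit(df).bestModel
```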
We introduce principles of point estimation, that is, the estimation of a value for the vector of unknown parameters of the density of a variate. The chapter starts by considering some desirable properties of point estimators, a sort of “the good, the bad, and the ugly” classification! The topics covered include bias, efficiency, mean-squared error (MSE), consistency, robustness, invariance, and admissibility. We then introduce methods of summarizing the data via statistics that retain the relevant sample information about the parameter vector, and we see how they achieve the desirable properties of estimators. We discuss sufficiency, Neyman's factorization theorem, ancillarity, Rao–Blackwellization, completeness, the Lehmann–Scheffé theorem and the minimum-variance unbiasedness of an estimator, and Basu's theorem. We consider the exponential family and special cases and conclude by introducing the most common model in statistics, the linear model, which is used for illustrations in this chapter and is covered more extensively in the following chapters.
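As a concrete instance of how these properties interact, the standard bias-variance decomposition of the MSE (stated here in our notation) shows why an estimator may profitably trade a little bias for a large reduction in variance:

```latex
% Mean-squared error of an estimator \hat{\theta} of \theta:
\mathrm{MSE}(\hat{\theta})
  = \mathbb{E}\bigl[(\hat{\theta} - \theta)^2\bigr]
  = \operatorname{Var}(\hat{\theta})
    + \bigl(\mathbb{E}[\hat{\theta}] - \theta\bigr)^2 ,
% where the second term is the squared bias of \hat{\theta}.
```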
We estimate the anisotropic index of an anisotropic fractional Brownian field. For every direction, we give a convergent estimator of the anisotropic index in that direction, based on generalized quadratic variations, and we also prove a central limit theorem. First, we present an identification result that relies on the asymptotic behavior of the spectral density of a process. Then, we define Radon transforms of the anisotropic fractional Brownian field and prove that these processes admit a spectral density satisfying the previous assumptions. Finally, we use simulated fields to test the proposed estimator in different anisotropic and isotropic cases. Results show that the estimator behaves similarly in all cases and is able to detect anisotropy quite accurately.
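To make the quadratic-variation idea concrete, here is a minimal one-dimensional sketch (our own function and lag choices, not the paper's estimator, which works direction by direction on Radon transforms of the field). For fractional Brownian motion, second-order increments at lag $\delta$ have mean square proportional to $\delta^{2H}$, so comparing two lags recovers $H$:

```python
import numpy as np

def hurst_from_quadratic_variations(x):
    """Estimate a Hurst-type index H from a regularly sampled path x.

    For fractional Brownian motion, the generalized quadratic
    variation built from second-order increments at lag d scales
    like d**(2H), so the ratio of the variations at lags 2 and 1
    gives H = 0.5 * log2(V2 / V1).
    """
    d1 = x[2:] - 2.0 * x[1:-1] + x[:-2]   # second-order increments, lag 1
    d2 = x[4:] - 2.0 * x[2:-2] + x[:-4]   # second-order increments, lag 2
    v1 = np.mean(d1 ** 2)
    v2 = np.mean(d2 ** 2)
    return 0.5 * np.log2(v2 / v1)

# Sanity check on ordinary Brownian motion (H = 0.5):
rng = np.random.default_rng(0)
path = np.cumsum(rng.standard_normal(100_000))
print(hurst_from_quadratic_variations(path))  # close to 0.5
```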
Wind direction is a circular variable, so the algorithms used to find its standard deviation differ from those used for linear variables. In particular, the requirement to store all the data points before the standard deviation can be computed strains the limited storage of remote data acquisition systems. Various algorithms have therefore been developed to estimate the standard deviation while reducing the number of terms stored. The following work is a comparative analysis of such estimators, together with the parameters they use. It emerges that some of the assumptions adopted to derive the equations being analysed do not hold in practice, although this does not significantly affect the performance of the estimators that depend on them. On the other hand, the parameter that shows the best trend with the adopted algorithm is the magnitude of the vector to the centre of gravity of the system. However, such a result gives rise to some concerns, since it does not account for the ‘vectorial’ nature of the angle being treated.
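One well-known single-pass estimator of this kind is Yamartino's, which needs only the running means of sin θ and cos θ. The sketch below (illustrative only, and not necessarily one of the exact estimators compared in this paper) shows how the magnitude of the centre-of-gravity vector enters through ε:

```python
import numpy as np

def yamartino_std(theta_rad):
    """Single-pass estimate of the standard deviation of wind
    direction (in radians), following Yamartino's method.

    Only the running means of sin and cos are accumulated, so a
    remote logger need not store the individual samples.
    """
    s = np.mean(np.sin(theta_rad))
    c = np.mean(np.cos(theta_rad))
    # 1 - (s^2 + c^2) measures how far the centre of gravity of the
    # unit direction vectors falls from the unit circle.
    eps = np.sqrt(max(0.0, 1.0 - (s * s + c * c)))
    # Yamartino's empirical correction factor; 2/sqrt(3) - 1 ~ 0.1547.
    return np.arcsin(eps) * (1.0 + 0.1547 * eps ** 3)

# Directions clustered around 350 deg, wrapping through north:
angles = np.deg2rad([350.0, 355.0, 5.0, 10.0, 0.0])
print(np.rad2deg(yamartino_std(angles)))  # a few degrees, despite the wrap
```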
Results of a previous paper (Liebetrau (1977a)) are extended to higher dimensions. An estimator $V^*(t_1, t_2)$ of the variance function $V(t_1, t_2)$ of a two-dimensional process is defined, and its first- and second-moment structure is given assuming the process to be Poisson. Members of a class of estimators of the form $T^{\beta}\bigl(V^*(t_1', t_2') - V(t_1', t_2')\bigr)$, where $t_1' = t_1T^{\alpha_1}$ and $t_2' = t_2T^{\alpha_2}$ for $0 < \alpha_i < 1$, are shown to converge weakly to a non-stationary Gaussian process. Similar results hold when the $t_i'$ are taken to be constants, when $V$ is replaced by a suitable estimator, and when the dimensionality of the underlying Poisson process is greater than two.
The second-moment structure of an estimator $V^*(t)$ of the variance-time curve $V(t)$ of a weakly stationary point process is obtained in the case where the process is Poisson. This result is used to establish the weak convergence of a class of estimators of the form $T^{\beta}\bigl(V^*(tT^{\alpha}) - V(tT^{\alpha})\bigr)$, $0 < \alpha < 1$, to a non-stationary Gaussian process. Similar results are shown to hold when $\alpha = 0$ and in the case where $V(tT^{\alpha})$ is replaced by a suitable estimator.
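For context, the variance-time curve referenced in both of these abstracts is the standard second-moment summary of a stationary point process; a brief reminder, in our notation and under the usual conventions:

```latex
% Variance-time curve of a (weakly) stationary point process,
% where N(0,t] counts the points falling in the interval (0,t]:
V(t) = \operatorname{Var}\bigl(N(0,t]\bigr).
% For a homogeneous Poisson process of rate \lambda, the counts are
% Poisson distributed, so the curve is linear:
V(t) = \lambda t .
```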