Product formalisms for measures on spaces with binary tree structures: representation, visualization, and multiscale noise

Devasis Bassu; Peter W. Jones; Linda Ness; David Shallcross

doi:10.1017/fms.2020.40

Product formalisms for measures on spaces with binary tree structures: representation, visualization, and multiscale noise

Part of: Measure and integration Statistics

Published online by Cambridge University Press: 13 November 2020

Linda Ness and

Devasis Bassu: Affiliation:
Blackboard Insurance, Bedminster, NJ, US; E-mail: devasis_bassu@hotmail.com
Peter W. Jones: Affiliation:
Yale University, New Haven, CN, US; E-mail: peterwjones@comcast.net
Linda Ness: Affiliation:
Rutgers University, New Brunswick, NJ, US; E-mail: nesslinda@gmail.com
David Shallcross: Affiliation:
Perspecta Labs, Basking Ridge, NJ, US; E-mail: david.shallcross@perspecta.com

Article contents

Abstract
References

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

In this paper, we present a theoretical foundation for a representation of a data set as a measure in a very large hierarchically parametrized family of positive measures, whose parameters can be computed explicitly (rather than estimated by optimization), and illustrate its applicability to a wide range of data types. The preprocessing step then consists of representing data sets as simple measures. The theoretical foundation consists of a dyadic product formula representation lemma, and a visualization theorem. We also define an additive multiscale noise model that can be used to sample from dyadic measures and a more general multiplicative multiscale noise model that can be used to perturb continuous functions, Borel measures, and dyadic measures. The first two results are based on theorems in [15, 3, 1]. The representation uses the very simple concept of a dyadic tree and hence is widely applicable, easily understood, and easily computed. Since the data sample is represented as a measure, subsequent analysis can exploit statistical and measure theoretic concepts and theories. Because the representation uses the very simple concept of a dyadic tree defined on the universe of a data set, and the parameters are simply and explicitly computable and easily interpretable and visualizable, we hope that this approach will be broadly useful to mathematicians, statisticians, and computer scientists who are intrigued by or involved in data science, including its mathematical foundations.

Keywords

measures dyadic sets quasi-conformal mappings weights multiscale noise

MSC classification

Primary: 28-02: Research exposition (monographs, survey articles)

Secondary: 62-07: Data analysis

Type: Applied Analysis
Information: Forum of Mathematics, Sigma , Volume 8 , 2020 , e47

DOI: https://doi.org/10.1017/fms.2020.40 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: © The Author(s), 2020. Published by Cambridge University Press

References

Ahlfors, L., Lectures on Quasi-Conformal Mappings , Mathematical Studies, vol. 10 (1966), van Nostrand.Google Scholar

Astala, K., Kupiainen, A., Saksman, E., and Jones, P., ‘Random conformal weldings’, Acta Math. 207(2), (2011) 203–254.CrossRef Google Scholar

Beurling, A. and Ahlfors, L., ‘The boundary correspondence under quasi-conformal mappings’, Acta Math. 96 (1956), 125–142.CrossRef Google Scholar

Bassu, D., Izmailov, R., McIntosh, A., Ness, L., and Shallcross, D., ‘Centralized multi-scale singular value decomposition for feature construction in LiDAR image classification problems’, IEEE AIPR 1–6 (2012).Google Scholar

Belkin, M. and Niyogi, P., ‘Laplacian eigenmaps for dimensionality reduction and data representation’, Neural Computation 15 (2003), 1373–1396.CrossRef Google Scholar

Billingsley, P., Probability and Measure (Wiley, 2012).Google Scholar

Brodu, N. and Lague, D., ‘3D terrestrial LiDAR data classification of complex natural scenes using a multi-scale dimensionality criterion: applications in geomorphology’, ISPRS Journal of Photogrammetry and Remote Sensing 16 (2012), 121–134.CrossRef Google Scholar

Bruna, J. and Mallat, S., ‘Invariant scattering convolutional networks’, IEEE Trans. on Pattern Analysis and Machine Intelligence 35(8) (2013).CrossRef Google Scholar

Bruna, J., Szlam, A., and LeCun, Y., ‘Learning stable group invariant representations with convolutional networks, ICLR (January 2013).Google Scholar

Campbell, J. B., Introduction to Remote Sensing, (3rd ed.) (The Guilford Press, 2002).Google Scholar

Coifman, R. and Lafon, S., ‘Diffusion maps’, Applied and Computational Harmonic Analysis 21 (2006), 5–30.CrossRef Google Scholar

Coifman, R., Lafon, S., Maggioni, M., Nadler, B., Warner, F., and Zucker, S. W., ‘Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps’, Proc. Natl. Acad. Sci. USA 102 (2005), 7426–7431.CrossRef Google Scholar PubMed

Comer, D., Internetworking with TCP/IP 4th Edition: Principles , Protocol and Architecture (Pearson, 2000), vol. 1.Google Scholar

Cybenko, G., ‘Approximation by superpositions of a sigmoidal function’, Mathematics of Control, Signals, and Systems 2, 303–314.CrossRef Google Scholar

Fefferman, R., Kenig, C., and Pipher, J., ‘The theory of weights and the Dirichlet problem for elliptical equations’, Ann. of Math 134 (1991), 65–124.CrossRef Google Scholar

Fowlkes, C., Belongie, S., Chung, F., and Malik, J., ‘Spectral grouping using the Nyström method’, IEEE Transactions on Pattern Analysis and Machine Intelligence 26(2) (2004), 214–225.CrossRef Google Scholar PubMed

Golberg, Y., ‘A primer on neural network models for natural language processing’, J. Artificial Intelligence (2016).Google Scholar

Goodfellow, I., Bengio, Y., and Courville, A., Deep Learning (MIT Press, 2016), Section 6.4.1.Google Scholar

Grebenkov, D. S., Beliaev, D., and Jones, P. W., ‘A multiscale guide to Brownian motion’, Journal of Physics A: Mathematical and Theoretical 49(4) (2015), 043001.CrossRef Google Scholar

Hornik, K., and White, H., ‘Multilayer feedforward networks are universal approximators’, Neural Networks 2 (1989), 359–366.CrossRef Google Scholar

Kahane, J.-P. and Peyriere, J, ‘Sur certaines martingales de B. Mandelbrot’, Adv. Math. 22 (1976), 131–145.CrossRef Google Scholar

Kahane, J.-P., ‘Sur le chaos multiplicative’, Ann . Sci. Math. 9(2) (1985) 105–150.Google Scholar

Kunin, D., Bloom, J., Goeva, A., and Seed, C., ‘Loss landscapes of regularized linear autoencoders’, ICML (2019), 3560–3569.Google Scholar

Jones, P. W., ‘On removable sets for Sobolev spaces’, in: Fefferman, C., et al. eds., Essays on Fourier Analysis in Honor of E.M. Stein (Princeton University Press, 1995), 250–267.CrossRef Google Scholar

Medina, F. P., Ness, L., Weber, M., and Yacoubou Djima, K., ‘Heuristic framework for multiscale testing of the multi-manifold hypothesis’, in: Domeniconi, C. and Gasparovic, E., eds., Research in Data Science (Springer AWM Series, 2019).Google Scholar

Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., and Dean, J., ‘Distributed representations of words and phrases and their compositionality’, NIPS’13 2 (2014), 3111–3119.Google Scholar

Mallat, S., ‘Group invariant scattering’, Comm. Pure Appl. Math. 65(10) (2012), 1331–1398.CrossRef Google Scholar

Mandelbrot, B. B., ‘A possible refinement of the lognormal hypothesis concerning the distribution of energy in intermittent turbulence’, in: Statistical Models and Turbulence, Lecture Notes in Phys. no. 12 (Springer, 1972), 333–351.CrossRef Google Scholar

Mumford, D., and Sharon, E., ‘

$2D$ -shape analysis using conformal mapping’, Int. J. of Computer Vision 70 (2006), 55–75.Google Scholar

Ness, L., ‘Dyadic product formula representations of confidence measures and decision rules for dyadic data set samples’, MISNC SI, DS ’16, August 2016, Union, NJ, USA.Google Scholar

Ness, L., ‘Inference of a dyadic measure and its simplicial geometry from binary feature data and application to data quality’, in: Domeniconi, C. and Gasparovic, E., eds., Research in Data Science (Springer AWM Series, 2019).Google Scholar

Oikawa, K., ‘Welding of polygons and the type of Riemann surfaces’, Kodai Math. Sem. Rep 13(1) (1961), 37–52.CrossRef Google Scholar

Okikiolu, K., ‘Characterization of subsets of rectifiable curves in

${R}^n$ , J. London Math. Soc . 46(2) (1992), 336–348.CrossRef Google Scholar

Osgood, W. F., ‘A Jordan curve of positive area’, Trans. Amer. Math. Soc. 4(1) (1903), 107–112.CrossRef Google Scholar

Pennington, J., Socher, R., and Manning, C., ‘GloVe: global vectors for word representation, EMNLP (2014).Google Scholar

Shaham, U., Cloninger, A., and Coifmann, R., ‘Provable approximation properties for deep neural networks’, Appl. Comput. Harmon. Anal. 44(3) (2018), 527–557.CrossRef Google Scholar

Ylonene, T., Turner, P., Scarfone, K., and Souppaya, S. M., ‘Security of interactive and automated access management using Secure Shell (SSH)’, NIST Internal Report 7966 (2015).CrossRef Google Scholar

Article contents

Product formalisms for measures on spaces with binary tree structures: representation, visualization, and multiscale noise

Abstract

Keywords

MSC classification

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests