Hostname: page-component-78c5997874-dh8gc Total loading time: 0 Render date: 2024-11-10T13:54:12.890Z Has data issue: false hasContentIssue false

Product formalisms for measures on spaces with binary tree structures: representation, visualization, and multiscale noise

Published online by Cambridge University Press:  13 November 2020

Devasis Bassu
Affiliation:
Blackboard Insurance, Bedminster, NJ, US; E-mail: devasis_bassu@hotmail.com
Peter W. Jones
Affiliation:
Yale University, New Haven, CN, US; E-mail: peterwjones@comcast.net
Linda Ness
Affiliation:
Rutgers University, New Brunswick, NJ, US; E-mail: nesslinda@gmail.com
David Shallcross
Affiliation:
Perspecta Labs, Basking Ridge, NJ, US; E-mail: david.shallcross@perspecta.com

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

In this paper, we present a theoretical foundation for a representation of a data set as a measure in a very large hierarchically parametrized family of positive measures, whose parameters can be computed explicitly (rather than estimated by optimization), and illustrate its applicability to a wide range of data types. The preprocessing step then consists of representing data sets as simple measures. The theoretical foundation consists of a dyadic product formula representation lemma, and a visualization theorem. We also define an additive multiscale noise model that can be used to sample from dyadic measures and a more general multiplicative multiscale noise model that can be used to perturb continuous functions, Borel measures, and dyadic measures. The first two results are based on theorems in [1531]. The representation uses the very simple concept of a dyadic tree and hence is widely applicable, easily understood, and easily computed. Since the data sample is represented as a measure, subsequent analysis can exploit statistical and measure theoretic concepts and theories. Because the representation uses the very simple concept of a dyadic tree defined on the universe of a data set, and the parameters are simply and explicitly computable and easily interpretable and visualizable, we hope that this approach will be broadly useful to mathematicians, statisticians, and computer scientists who are intrigued by or involved in data science, including its mathematical foundations.

Type
Applied Analysis
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2020. Published by Cambridge University Press

References

Ahlfors, L., Lectures on Quasi-Conformal Mappings , Mathematical Studies, vol. 10 (1966), van Nostrand.Google Scholar
Astala, K., Kupiainen, A., Saksman, E., and Jones, P., ‘Random conformal weldings’, Acta Math. 207(2), (2011) 203254.CrossRefGoogle Scholar
Beurling, A. and Ahlfors, L., ‘The boundary correspondence under quasi-conformal mappings’, Acta Math. 96 (1956), 125142.CrossRefGoogle Scholar
Bassu, D., Izmailov, R., McIntosh, A., Ness, L., and Shallcross, D., ‘Centralized multi-scale singular value decomposition for feature construction in LiDAR image classification problems’, IEEE AIPR 16 (2012).Google Scholar
Belkin, M. and Niyogi, P., ‘Laplacian eigenmaps for dimensionality reduction and data representation’, Neural Computation 15 (2003), 13731396.CrossRefGoogle Scholar
Billingsley, P., Probability and Measure (Wiley, 2012).Google Scholar
Brodu, N. and Lague, D., ‘3D terrestrial LiDAR data classification of complex natural scenes using a multi-scale dimensionality criterion: applications in geomorphology’, ISPRS Journal of Photogrammetry and Remote Sensing 16 (2012), 121134.CrossRefGoogle Scholar
Bruna, J. and Mallat, S., ‘Invariant scattering convolutional networks’, IEEE Trans. on Pattern Analysis and Machine Intelligence 35(8) (2013).CrossRefGoogle Scholar
Bruna, J., Szlam, A., and LeCun, Y., ‘Learning stable group invariant representations with convolutional networks, ICLR (January 2013).Google Scholar
Campbell, J. B., Introduction to Remote Sensing, (3rd ed.) (The Guilford Press, 2002).Google Scholar
Coifman, R. and Lafon, S., ‘Diffusion maps’, Applied and Computational Harmonic Analysis 21 (2006), 530.CrossRefGoogle Scholar
Coifman, R., Lafon, S., Maggioni, M., Nadler, B., Warner, F., and Zucker, S. W., ‘Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps’, Proc. Natl. Acad. Sci. USA 102 (2005), 74267431.CrossRefGoogle ScholarPubMed
Comer, D., Internetworking with TCP/IP 4th Edition: Principles , Protocol and Architecture (Pearson, 2000), vol. 1.Google Scholar
Cybenko, G., ‘Approximation by superpositions of a sigmoidal function’, Mathematics of Control, Signals, and Systems 2, 303314.CrossRefGoogle Scholar
Fefferman, R., Kenig, C., and Pipher, J., ‘The theory of weights and the Dirichlet problem for elliptical equations’, Ann. of Math 134 (1991), 65124.CrossRefGoogle Scholar
Fowlkes, C., Belongie, S., Chung, F., and Malik, J., ‘Spectral grouping using the Nyström method’, IEEE Transactions on Pattern Analysis and Machine Intelligence 26(2) (2004), 214225.CrossRefGoogle ScholarPubMed
Golberg, Y., ‘A primer on neural network models for natural language processing’, J. Artificial Intelligence (2016).Google Scholar
Goodfellow, I., Bengio, Y., and Courville, A., Deep Learning (MIT Press, 2016), Section 6.4.1.Google Scholar
Grebenkov, D. S., Beliaev, D., and Jones, P. W., ‘A multiscale guide to Brownian motion’, Journal of Physics A: Mathematical and Theoretical 49(4) (2015), 043001.CrossRefGoogle Scholar
Hornik, K., and White, H., ‘Multilayer feedforward networks are universal approximators’, Neural Networks 2 (1989), 359366.CrossRefGoogle Scholar
Kahane, J.-P. and Peyriere, J, ‘Sur certaines martingales de B. Mandelbrot’, Adv. Math. 22 (1976), 131145.CrossRefGoogle Scholar
Kahane, J.-P., ‘Sur le chaos multiplicative’, Ann . Sci. Math. 9(2) (1985) 105150.Google Scholar
Kunin, D., Bloom, J., Goeva, A., and Seed, C., ‘Loss landscapes of regularized linear autoencoders’, ICML (2019), 35603569.Google Scholar
Jones, P. W., ‘On removable sets for Sobolev spaces’, in: Fefferman, C., et al. eds., Essays on Fourier Analysis in Honor of E.M. Stein (Princeton University Press, 1995), 250267.CrossRefGoogle Scholar
Medina, F. P., Ness, L., Weber, M., and Yacoubou Djima, K., ‘Heuristic framework for multiscale testing of the multi-manifold hypothesis’, in: Domeniconi, C. and Gasparovic, E., eds., Research in Data Science (Springer AWM Series, 2019).Google Scholar
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., and Dean, J., ‘Distributed representations of words and phrases and their compositionality’, NIPS’13 2 (2014), 31113119.Google Scholar
Mallat, S., ‘Group invariant scattering’, Comm. Pure Appl. Math. 65(10) (2012), 13311398.CrossRefGoogle Scholar
Mandelbrot, B. B., ‘A possible refinement of the lognormal hypothesis concerning the distribution of energy in intermittent turbulence’, in: Statistical Models and Turbulence, Lecture Notes in Phys. no. 12 (Springer, 1972), 333351.CrossRefGoogle Scholar
Mumford, D., and Sharon, E., ‘ $2D$ -shape analysis using conformal mapping’, Int. J. of Computer Vision 70 (2006), 5575.Google Scholar
Ness, L., ‘Dyadic product formula representations of confidence measures and decision rules for dyadic data set samples’, MISNC SI, DS ’16, August 2016, Union, NJ, USA.Google Scholar
Ness, L., ‘Inference of a dyadic measure and its simplicial geometry from binary feature data and application to data quality’, in: Domeniconi, C. and Gasparovic, E., eds., Research in Data Science (Springer AWM Series, 2019).Google Scholar
Oikawa, K., ‘Welding of polygons and the type of Riemann surfaces’, Kodai Math. Sem. Rep 13(1) (1961), 3752.CrossRefGoogle Scholar
Okikiolu, K., ‘Characterization of subsets of rectifiable curves in ${R}^n$ , J. London Math. Soc . 46(2) (1992), 336348.CrossRefGoogle Scholar
Osgood, W. F., ‘A Jordan curve of positive area’, Trans. Amer. Math. Soc. 4(1) (1903), 107112.CrossRefGoogle Scholar
Pennington, J., Socher, R., and Manning, C., ‘GloVe: global vectors for word representation, EMNLP (2014).Google Scholar
Shaham, U., Cloninger, A., and Coifmann, R., ‘Provable approximation properties for deep neural networks’, Appl. Comput. Harmon. Anal. 44(3) (2018), 527557.CrossRefGoogle Scholar
Ylonene, T., Turner, P., Scarfone, K., and Souppaya, S. M., ‘Security of interactive and automated access management using Secure Shell (SSH)’, NIST Internal Report 7966 (2015).CrossRefGoogle Scholar