In this chapter, we establish the mathematical foundation for hard computing optimization algorithms. We review the classical optimization approaches and extend the discussion to iterative methods, which hold a special role in machine learning. In particular, we review the gradient descent method, Newton’s method, the conjugate gradient method, and the quasi-Newton method. Alongside the discussion of these optimization methods, implementations in Matlab script are provided, together with considerations for their use in neural network training algorithms. Finally, the Levenberg-Marquardt method is introduced, discussed, and implemented in Matlab script to compare its behaviour with the other four iterative algorithms introduced in this chapter.
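As a rough illustration of the iterative methods this chapter covers, here is a minimal gradient descent sketch (in Python rather than the chapter's Matlab script; the quadratic objective, step size, and iteration count are illustrative choices, not taken from the text):

```python
# Minimal gradient descent sketch on f(x, y) = x^2 + 10*y^2,
# whose unique minimizer is (0, 0).

def grad_f(x, y):
    # Gradient of f(x, y) = x^2 + 10*y^2.
    return 2.0 * x, 20.0 * y

def gradient_descent(x, y, step=0.05, iters=200):
    # Repeatedly step against the gradient with a fixed step size.
    for _ in range(iters):
        gx, gy = grad_f(x, y)
        x, y = x - step * gx, y - step * gy
    return x, y

x_min, y_min = gradient_descent(5.0, 5.0)
print(x_min, y_min)  # both coordinates approach the minimizer (0, 0)
```

The slow convergence along the shallow `x` direction of this ill-conditioned quadratic is exactly what motivates the Newton-type and conjugate gradient methods discussed in the chapter.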
Several numerical methods used in the study of tensor network renormalization are introduced, including the power, Lanczos, conjugate gradient, and Arnoldi methods, as well as quantum Monte Carlo simulation.
Many machine learning methods require non-linear optimization, performed by the backward propagation of model errors, a process complicated by the presence of multiple minima and saddle points. Numerous gradient descent algorithms are available for optimization, including stochastic gradient descent, conjugate gradient, quasi-Newton, and non-linear least-squares methods such as Levenberg-Marquardt. In contrast to deterministic optimization, stochastic optimization methods repeatedly introduce randomness during the search process to avoid getting trapped in a local minimum. Evolutionary algorithms, which borrow concepts from evolution to solve optimization problems, include the genetic algorithm and differential evolution.
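The differential evolution idea mentioned above can be sketched as follows (a minimal DE/rand/1/bin loop; the sphere objective, population size, and control parameters F and CR are illustrative choices, not taken from the abstract):

```python
import random

# Minimal differential evolution (DE/rand/1/bin) sketch on the sphere
# function, whose global minimum is 0 at the origin.

def sphere(x):
    return sum(v * v for v in x)

def differential_evolution(dim=3, pop_size=20, F=0.8, CR=0.9, gens=200):
    rng = random.Random(0)  # seeded for reproducibility
    pop = [[rng.uniform(-5, 5) for _ in range(dim)] for _ in range(pop_size)]
    for _ in range(gens):
        for i in range(pop_size):
            # Mutation: combine three distinct members other than i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(dim)  # guarantees at least one mutated gene
            trial = []
            for j in range(dim):
                if rng.random() < CR or j == j_rand:
                    trial.append(pop[a][j] + F * (pop[b][j] - pop[c][j]))
                else:
                    trial.append(pop[i][j])
            # Greedy selection: keep the trial vector if it is no worse.
            if sphere(trial) <= sphere(pop[i]):
                pop[i] = trial
    return min(pop, key=sphere)

best = differential_evolution()
print(sphere(best))  # close to the global minimum 0
```

Because selection only ever replaces a member with an equal-or-better trial, the best objective value in the population is monotonically non-increasing, which is the sense in which randomness helps escape poor regions without losing progress.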
The mathematical background and formulation of the numerical minimization process are described in terms of gradient-based methods, whose ingredients include the gradient, the Hessian, directional derivatives, optimality conditions for minimization, the Hessian eigensystem, the condition number of the Hessian, and conjugate vectors. Various minimization algorithms, such as the steepest descent method, Newton’s method, the conjugate gradient method, and the quasi-Newton method, are introduced along with practical examples.
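To illustrate the role the Hessian plays in these algorithms, here is a minimal Newton's-method sketch (the quadratic objective is an illustrative choice, not an example from the text): for a quadratic, a single Newton step x − H⁻¹∇f lands exactly on the minimizer, regardless of the Hessian's condition number.

```python
# One Newton step on the quadratic f(x, y) = x^2 + 10*y^2, whose
# Hessian is the constant diagonal matrix diag(2, 20).

def newton_step(x, y):
    gx, gy = 2.0 * x, 20.0 * y          # gradient of f
    hxx, hyy = 2.0, 20.0                # diagonal Hessian entries
    return x - gx / hxx, y - gy / hyy   # x - H^{-1} grad f

x1, y1 = newton_step(5.0, -3.0)
print(x1, y1)  # (0.0, 0.0): the exact minimizer in one step
```

Steepest descent on the same function needs many iterations, with a rate governed by the Hessian's condition number (here 10), which is why that quantity appears among the ingredients listed above.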
Some optimal choices for a parameter of the Dai–Liao conjugate gradient method are proposed by conducting matrix analyses of the method. More precisely, first the $\ell _{1}$ and $\ell _{\infty }$ norm condition numbers of the search direction matrix are minimized, yielding two adaptive choices for the Dai–Liao parameter. Then we show that a recent formula for computing this parameter which guarantees the descent property can be considered as a minimizer of the spectral condition number as well as the well-known measure function for a symmetrized version of the search direction matrix. Brief convergence analyses are also carried out. Finally, some numerical experiments on a set of test problems from a constrained and unconstrained testing environment are conducted using a well-known performance profile.
We propose a new derivative-free conjugate gradient method for large-scale nonlinear systems of equations. The method combines the Rivaie–Mustafa–Ismail–Leong conjugate gradient method for unconstrained optimisation problems and a new nonmonotone line-search method. The global convergence of the proposed method is established under some mild assumptions. Numerical results using 104 test problems from the CUTEst test problem library show that the proposed method is promising.
This paper proposes improvements to the modified Fletcher–Reeves conjugate gradient method (FR-CGM) for computing $Z$-eigenpairs of symmetric tensors. The FR-CGM does not need to compute the exact gradient and Jacobian. The global convergence of this method is established. We also test the modified Polak–Ribière–Polyak conjugate gradient method (PRP-CGM) and the shifted power method (SS-HOPM). Numerical experiments with FR-CGM, PRP-CGM and SS-HOPM show the efficiency of the proposed method for finding $Z$-eigenpairs of symmetric tensors.
We consider a Robin inverse problem associated with the Laplace equation, which is severely ill-posed and nonlinear. We formulate the problem as a boundary integral equation, and introduce a functional of the Robin coefficient as a regularisation term. A conjugate gradient method is proposed for solving the consequent regularised nonlinear least squares problem. Numerical examples are presented to illustrate the effectiveness of the proposed method.
An inverse problem of reconstructing the initial condition for a time fractional diffusion equation is investigated. On the basis of the optimal control framework, the uniqueness and first order necessary optimality condition of the minimizer for the objective functional are established, and a time-space spectral method is proposed to numerically solve the resulting minimization problem. The contribution of the paper is threefold: 1) an a priori error estimate for the spectral approximation is derived; 2) a conjugate gradient optimization algorithm is designed to efficiently solve the inverse problem; 3) some numerical experiments are carried out to show that the proposed method is capable of finding the optimal initial condition, and that the convergence rate of the method is exponential if the optimal initial condition is smooth.
We address in this article the computation of the convex solutions of the Dirichlet problem for the real elliptic Monge–Ampère equation for general convex domains in two dimensions. The method we discuss combines a least-squares formulation with a relaxation method. This approach leads to a sequence of Poisson–Dirichlet problems and another sequence of low-dimensional algebraic eigenvalue problems of a new type. Mixed finite element approximations with a smoothing procedure are used for the computer implementation of our least-squares/relaxation methodology. Domains with curved boundaries are easily accommodated. Numerical experiments show the convergence of the computed solutions to their continuous counterparts when such solutions exist. On the other hand, when classical solutions do not exist, our methodology produces solutions in a least-squares sense.
In earlier work we have studied a method for discretization in time of a parabolic problem, which consists of representing the exact solution as an integral in the complex plane and then applying a quadrature formula to this integral. In application to a spatially semidiscrete finite-element version of the parabolic problem, at each quadrature point one then needs to solve a linear algebraic system having a positive-definite matrix with a complex shift. We study iterative methods for such systems, considering the basic and preconditioned versions of first the Richardson algorithm and then a conjugate gradient method.
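For reference, the basic (unpreconditioned) conjugate gradient iteration for a real symmetric positive-definite system can be sketched as follows; the small 3×3 system is an illustrative choice, whereas the systems in the paper carry a complex shift and are preconditioned:

```python
# Standard conjugate gradient iteration for A x = b with A symmetric
# positive definite; the 3x3 system below is illustrative only.

def matvec(A, x):
    return [sum(aij * xj for aij, xj in zip(row, x)) for row in A]

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def conjugate_gradient(A, b, tol=1e-12, max_iter=100):
    x = [0.0] * len(b)
    r = b[:]                 # residual b - A x for the zero initial guess
    p = r[:]                 # first search direction
    rs = dot(r, r)
    for _ in range(max_iter):
        Ap = matvec(A, p)
        alpha = rs / dot(p, Ap)                          # exact line search
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        rs_new = dot(r, r)
        if rs_new < tol:
            break
        p = [ri + (rs_new / rs) * pi for ri, pi in zip(r, p)]  # A-conjugate update
        rs = rs_new
    return x

A = [[4.0, 1.0, 0.0], [1.0, 3.0, 1.0], [0.0, 1.0, 2.0]]
b = [1.0, 2.0, 3.0]
x = conjugate_gradient(A, b)
print(x)  # satisfies A x ≈ b
```

In exact arithmetic the iteration terminates in at most n steps for an n×n system, and preconditioning, as studied in the paper, reduces the effective condition number that governs the practical convergence rate.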
Orbital-free density functional theory (OFDFT) is a quantum mechanical method in which the energy of a material depends only on the electron density and ionic positions. We examine some popular algorithms for optimizing the electron density distribution in OFDFT, explaining their suitability, benchmarking their performance, and suggesting some improvements. We start by describing the constrained optimization problem that encompasses electron density optimization. Next, we discuss the line search (including Wolfe conditions) and the nonlinear conjugate gradient and truncated Newton algorithms, as implemented in our open source OFDFT code. We finally focus on preconditioners derived from OFDFT energy functionals. Newly derived preconditioners are successful for simulation cells of all sizes without regions of low electron density, and for small simulation cells with such regions.
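The backtracking idea underlying such a line search can be sketched as follows (a minimal Armijo sufficient-decrease loop; the constants and the quadratic test function are illustrative choices, and the curvature half of the Wolfe conditions is omitted for brevity):

```python
# Backtracking line search enforcing the Armijo sufficient-decrease
# condition f(x + t d) <= f(x) + c * t * grad(x).d along direction d.

def backtracking(f, grad, x, direction, t=1.0, c=1e-4, shrink=0.5):
    fx = f(x)
    slope = sum(g * d for g, d in zip(grad(x), direction))  # directional derivative
    while f([xi + t * di for xi, di in zip(x, direction)]) > fx + c * t * slope:
        t *= shrink  # halve the step until sufficient decrease holds
    return t

f = lambda x: x[0] ** 2 + x[1] ** 2
grad = lambda x: [2 * x[0], 2 * x[1]]
x0 = [3.0, 4.0]
d = [-g for g in grad(x0)]          # steepest-descent direction
t = backtracking(f, grad, x0, d)
```

A full Wolfe line search, as used with nonlinear conjugate gradient methods, additionally rejects steps whose directional derivative has not flattened enough, which keeps the conjugacy-based direction updates well defined.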
In studying biomechanical deformation in articular cartilage, the presence of cells (chondrocytes) necessitates the consideration of inhomogeneous elasticity problems in which cells are idealized as soft inclusions within a stiff extracellular matrix. An analytical solution of a soft inclusion problem is derived and used to evaluate iterative numerical solutions of the associated linear algebraic system, based on discretization via the finite element method and use of an iterative conjugate gradient method with algebraic multigrid preconditioning (AMG-PCG). The accuracy and efficiency of the AMG-PCG algorithm are compared to those of two other conjugate gradient algorithms, with diagonal preconditioning (DS-PCG) or a modified incomplete LU decomposition (Euclid-PCG), based on comparison to the analytical solution. While all three algorithms are shown to be accurate, the AMG-PCG algorithm is demonstrated to provide significant savings in CPU time as the number of nodal unknowns is increased. In contrast to the other two algorithms, the AMG-PCG algorithm also exhibits little sensitivity of CPU time and number of iterations to variations in material properties that are known to significantly affect model variables. Results demonstrate the benefits of algebraic multigrid preconditioners for the iterative solution of assembled linear systems based on finite element modeling of soft elastic inclusion problems, and may be particularly advantageous for large scale problems with many nodal unknowns.
We present a new numerical optimal design for a redundant parallel manipulator, the eclipse, which has a geometrically symmetric workspace shape. We simultaneously consider the structural mass and design efficiency as objective functions to maximize the mass reduction and minimize the loss of design efficiency. The task-oriented workspace (TOW) and its partial workspace (PW) are considered in efficiently obtaining an optimal design, by excluding useless orientations of the end-effector and by including just one cross-sectional area of the TOW. The proposed numerical procedure is composed of coarse and fine search steps. In the coarse search step, we find the feasible parameter regions (FPR), in which the parameter sets satisfy only the marginal constraints. In the fine search step, we consider the multiobjective function in the FPR to find the optimal set of parameters. In this step, the fine search continues until it reaches the set of parameters that minimizes the proposed objective functions, with the PW continuously updated in every iteration. By applying the proposed approach to an eclipse-rapid prototyping machine, the structural mass of the machine can be reduced by 8.79% while the design efficiency is increased by 6.2%. This can be physically interpreted as a mass reduction of 49 kg (the initial structural mass was 554.7 kg) and a loss of 496 mm³/mm in the workspace volume per unit length. The proposed optimal design procedure could be applied to other serial or parallel mechanism platforms that have geometrically symmetric workspace shapes.
Modeling genetic regulatory networks is an important problem in genomic research. Boolean Networks (BNs) and their extensions, Probabilistic Boolean Networks (PBNs), have been proposed for modeling genetic regulatory interactions. In a PBN, the steady-state distribution gives very important information about the long-run behavior of the whole network. However, one is also interested in system synthesis, which requires the construction of networks. The construction of PBNs from a given transition-probability matrix and a given set of BNs is an inverse problem of huge size; it is also ill-posed and challenging, as there may be many networks, or no network, having the given properties. We propose a maximum entropy approach for this problem. Newton's method in conjunction with the Conjugate Gradient (CG) method is then applied to solve the inverse problem. We investigate the convergence rate of the proposed method. Numerical examples are also given to demonstrate the effectiveness of our proposed method.