Best Practices in Deep Learning

Mihai Surdeanu; Marco Antonio Valenzuela-Escárcega

doi:10.1017/9781009026222.007

6 - Best Practices in Deep Learning

Published online by Cambridge University Press: 01 February 2024

Affiliations:
Mihai Surdeanu: University of Arizona
Marco Antonio Valenzuela-Escárcega: University of Arizona

Summary

The previous chapter introduced feed-forward neural networks and demonstrated that, in theory, implementing the training procedure for an arbitrary feed-forward neural network is relatively simple. Unfortunately, neural networks trained this way suffer from several problems, such as instability of the training process (that is, slow convergence caused by parameters jumping around a good minimum) and overfitting. In this chapter, we describe several practical solutions that mitigate these problems. In particular, we discuss minibatching, multiple optimization algorithms, other activation and cost functions, regularization, dropout, temporal averaging, and parameter initialization and normalization.

Keywords

minibatching; optimization algorithms; activation functions; cost functions; regularization; dropout; temporal averaging; parameter initialization and normalization

Information

Type: Chapter
Information: Deep Learning for Natural Language Processing: A Gentle Introduction, pp. 87-106

DOI: https://doi.org/10.1017/9781009026222.007

Publisher: Cambridge University Press

Print publication year: 2024
