The Minimal Deterministic Finite-State Automaton for a Finite Language

Stoyan Mihov; Klaus U. Schulz

doi:10.1017/9781108756945.011

10 - The Minimal Deterministic Finite-State Automaton for a Finite Language

from Part II - From Theory to Practice

Published online by Cambridge University Press: 29 July 2019


Stoyan Mihov, Affiliation: Bulgarian Academy of Sciences
Klaus U. Schulz, Affiliation: Ludwig-Maximilians-Universität München


Summary

A fundamental task in natural language processing is the efficient representation of lexica. From a computational viewpoint, a lexicon should be represented in a way that directly supports fast access to entries while minimizing space requirements. A standard method is to represent lexica as minimal deterministic (classical) finite-state automata. One way to reach such a representation is to first build the trie of the lexicon and then minimize it. In general, however, the intermediate trie is much larger than the resulting minimal automaton. A much better strategy is therefore to use a specialized algorithm that computes the minimal deterministic automaton directly and incrementally. This chapter describes such a procedure.
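To make the idea of direct, incremental construction concrete, the following is a minimal Python sketch in the style of the well-known construction from alphabetically sorted input, where states on the path of the previous word are merged with already registered equivalent states as soon as they can no longer change. It is an illustration under stated assumptions, not necessarily the procedure given in the chapter: the names `State`, `build_minimal_dfa`, `accepts`, and `count_states` are hypothetical, and the input is assumed to be a strictly increasing list of words.

```python
class State:
    """One automaton state: a finality flag plus labelled outgoing edges."""

    def __init__(self):
        self.final = False
        self.edges = {}  # character -> State

    def signature(self):
        # Assumes all targets are already canonical (registered) states, so
        # equal signatures mean the two states accept the same suffixes.
        targets = tuple(sorted((c, id(t)) for c, t in self.edges.items()))
        return (self.final, targets)


def build_minimal_dfa(sorted_words):
    """Build a minimal acyclic DFA; words must be strictly increasing."""
    root = State()
    register = {}   # signature -> canonical State
    path = [root]   # path[i] is the state reached after prev[:i]
    prev = None

    def minimize_down_to(depth):
        # Merge states deeper than `depth` on the previous word's path,
        # deepest first, with registered equivalents.
        while len(path) - 1 > depth:
            child = path.pop()
            parent = path[-1]
            label = prev[len(path) - 1]
            parent.edges[label] = register.setdefault(child.signature(), child)

    for word in sorted_words:
        if prev is not None and word <= prev:
            raise ValueError("input must be strictly increasing")
        k = 0  # length of the common prefix with the previous word
        if prev is not None:
            while k < len(prev) and k < len(word) and prev[k] == word[k]:
                k += 1
            minimize_down_to(k)
        for ch in word[k:]:  # append fresh states for the new suffix
            nxt = State()
            path[-1].edges[ch] = nxt
            path.append(nxt)
        path[-1].final = True
        prev = word

    if prev is not None:
        minimize_down_to(0)
    return root


def accepts(root, word):
    """Return True if the automaton accepts `word`."""
    state = root
    for ch in word:
        state = state.edges.get(ch)
        if state is None:
            return False
    return state.final


def count_states(root):
    """Count the states reachable from `root`."""
    seen, stack = set(), [root]
    while stack:
        s = stack.pop()
        if id(s) not in seen:
            seen.add(id(s))
            stack.extend(s.edges.values())
    return len(seen)
```

For the sorted list `["tap", "taps", "top", "tops"]`, the trie has 8 states, while the automaton built by this sketch has 5, because the branches below `ta` and `to` accept the same suffixes and are shared. At no point does the sketch hold the full trie in memory: only the path of the most recent word is kept unminimized.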

Keywords

lexica; dictionaries; alphabetic order; minimal deterministic finite-state automata; implementation; adaptation of automaton language; two-sided dictionaries; minimal subsequential transducers

Information

Type: Chapter
Information: Finite-State Techniques: Automata, Transducers and Bimachines, pp. 253-278

DOI: https://doi.org/10.1017/9781108756945.011

Publisher: Cambridge University Press

Print publication year: 2019
