Hostname: page-component-78c5997874-s2hrs Total loading time: 0 Render date: 2024-11-13T04:03:38.020Z Has data issue: false hasContentIssue false

On some applications of finite-state automata theory to natural language processing

Published online by Cambridge University Press:  01 March 1996

MEHRYAR MOHRI
Affiliation:
AT&T Bell Laboratories, 600 Mountain Avenue, Murray Hill, NJ 07974, USA. E-mail: mohri@research.att.com

Abstract

We describe new applications of the theory of automata to natural language processing: the representation of very large scale dictionaries and the indexation of natural language texts. They are based on new algorithms that we introduce and describe in detail. In particular, we give pseudocodes for the determinisation of string to string transducers, the deterministic union of p-subsequential string to string transducers, and the indexation by automata. We report on several experiments illustrating the applications.

Type
Research Article
Copyright
© 1996 Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

This work was done while the author was an associate professor of computer science and computational linguistics at the Institut Gaspard Monge-LADL in Paris, France.