Hostname: page-component-78c5997874-j824f Total loading time: 0 Render date: 2024-11-14T05:47:11.366Z Has data issue: false hasContentIssue false

Application of finite-state transducers to the acquisition of verb subcategorization information

Published online by Cambridge University Press:  04 June 2003

I. ALDEZABAL
Affiliation:
IXA group, Department of Computer Languages and Systems, University of the Basque Country, 649 P.K., 20080-Donostia, Spain e-mail: jibalroi@si.ehu.es
M. ARANZABE
Affiliation:
IXA group, Department of Computer Languages and Systems, University of the Basque Country, 649 P.K., 20080-Donostia, Spain e-mail: jibarurm@si.ehu.es
K. GOJENOLA
Affiliation:
IXA group, Department of Computer Languages and Systems, University of the Basque Country, 649 P.K., 20080-Donostia, Spain e-mail: jipgogak@si.ehu.es
M. ORONOZ
Affiliation:
IXA group, Department of Computer Languages and Systems, University of the Basque Country, 649 P.K., 20080-Donostia, Spain e-mail: jiboranm@si.ehu.es
K. SARASOLA
Affiliation:
IXA group, Department of Computer Languages and Systems, University of the Basque Country, 649 P.K., 20080-Donostia, Spain e-mail: jipsagak@si.ehu.es
A. ATUTXA
Affiliation:
Department of Linguistics, University of Maryland, College Park, MD 20742, USA e-mail: sener@wam.umd.edu

Abstract

This paper presents the design and implementation of a finite-state syntactic grammar of Basque that has been used with the objective of extracting information about verb subcategorization instances from newspaper texts. After a partial parser has built basic syntactic units such as noun phrases, prepositional phrases, and sentential complements, a finite-state parser performs syntactic disambiguation, determination of clause boundaries and filtering of the results, in order to obtain a verb occurrence together with its associated syntactic components, either complements or adjuncts. The set of occurrences for each verb is then filtered by statistical measures that distinguish arguments from adjuncts.

Type
Research Article
Copyright
© 2003 Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)