Restricted representation of phrase structure grammar for building a tree annotated corpus of Korean

KONG JOO LEE; GIL CHANG KIM; JAE-HOON KIM; YOUNG S. HAN

doi:10.1017/S1351324997001782

Published online by Cambridge University Press: 01 September 1997

KONG JOO LEE: Korea Advanced Institute of Science and Technology, Taejon, Korea
GIL CHANG KIM: Korea Advanced Institute of Science and Technology, Taejon, Korea
JAE-HOON KIM: Electronics and Telecommunications Research Institute, Taejon, Korea
YOUNG S. HAN: Suwon University, Suwon, Korea

Abstract

In this paper, we introduce a method of representing phrase structure grammars for building a large annotated corpus of Korean syntactic trees. Korean differs from English in word order and word composition. Our study shows that these differences are significant enough to require meaningful changes in the tree annotation scheme for Korean relative to the schemes for English. A tree annotation scheme defines the grammar formalism to be assumed, the categories to be used, and the rules that determine the correct parse for unsettled issues in parse construction. Korean is partially free in word order, and essential components of a sentence, such as subjects and objects, can be omitted with greater freedom than in English. We propose a restricted representation of phrase structure grammar to handle these characteristics of Korean more efficiently. An extensive experiment shows that the proposed representation improves both parsing time and grammar size. We also describe Teb, a software environment built with the goal of constructing a tree annotated corpus of Korean containing more than one million units.

Information

Type: Research Article
Information: Natural Language Engineering, Volume 3, Issue 2, September 1997, pp. 215-230

DOI: https://doi.org/10.1017/S1351324997001782
