31 search hits
-
JACY - A Grammar for Annotating Syntax, Semantics and Pragmatics of Written and Spoken Japanese for NLP Application Purposes
(2006)
-
Melanie Siegel
- In this text, we describe the development of a broad coverage grammar for Japanese that has
been built for and used in different application contexts. The grammar is based on work done
in the Verbmobil project (Siegel 2000) on machine translation of spoken dialogues in the
domain of travel planning. The second application for JACY was the automatic email
response task. Grammar development was described in Oepen et al. (2002a). Third, it was
applied to the task of understanding material on mobile phones available on the internet, while
embedded in the project DeepThought (Callmeier et al. 2004, Uszkoreit et al. 2004).
Currently, it is being used for treebanking and ontology extraction from dictionary definition
sentences by the Japanese company NTT (Bond et al. 2004).
-
Definitheit und Numerus : Anforderungen an den Transfer Japanisch-Englisch
(1994)
-
Melanie Siegel
- Das Problem des Transfers in der maschinellen Übersetzung von Japanisch nach Englisch ist fehlende Information über Numerus und Definitheit im Japanischen, die für die Wahl der englischen Artikel und die Nomenmarkierung gebraucht wird. Obwohl dieses Problem signifikant ist, beschäftigt sich die Forschungsliteratur kaum damit. [...] Wir bsaieren unsere Untersuchungen auf experimentell erhobenen Daten aus einem Experiment über deutsch-japanische gedolmetschte Terminaushandlungsdialoge [...]. Auf diese Weise können Phänomene bestimmt werden, die für die Domäne von VERBMOBIL relevant sind. Wir sehen unser Vorgehen in Übereinstimmung mit dem 'Sublanguage'-Ansatz [...].
-
Definiteness and Number in Japanese to German Machine Translation
(1996)
-
Melanie Siegel
-
Head-Initial Constructions in Japanese
(2004)
-
Melanie Siegel
Emily M. Bender
- Japanese is often taken to be strictly head-final in its syntax. In our work on a broad-coverage, precision implemented HPSG for Japanese, we have found that while this is generally true, there are nonetheless a few minor exceptions to the broad trend. In this paper, we describe the grammar engineering project, present the exceptions we have found, and conclude that this kind of phenomenon motivates on the one hand the HPSG type hierarchical approach which allows for the statement of both broad generalizations and exceptions to those generalizations and on the other hand the usefulness of grammar engineering as a means of testing linguistic hypotheses.
-
Efficient Deep Processing of Japanese
(2002)
-
Melanie Siegel
Emily M. Bender
- We present a broad coverage Japanese grammar written in the HPSG formalism with MRS semantics. The grammar is created for use in real world applications, such that robustness and performance issues play an important role. It is connected to a POS tagging and word segmentation tool. This grammar is being developed in a multilingual context, requiring MRS structures that are easily comparable across languages.
-
Parallel Distributed Grammar Engineering for Practical Applications
(2002)
-
Stephan Oepen
Emily M. Bender
Uli Callmeier
Dan Flickinger
Melanie Siegel
- Based on a detailed case study of parallel grammar development distributed across two sites, we review some of the requirements for regression testing in grammar engineering, summarize our approach to systematic competence and performance profiling, and discuss our experience with grammar development for a commercial application. If possible, the workshop presentation will be organized around a software demonstration.
-
Annotating Honorifics Denoting Social Ranking of Referents
(2005)
-
Shigeko Nariyama
Hiromi Nakaiwa
Melanie Siegel
- This paper proposes an annotating scheme that encodes honorifics (respectful words). Honorifics are used extensively in Japanese, reflecting the social relationship (e.g. social ranks and age) of the referents. This referential information is vital for resolving zero
pronouns and improving machine translation outputs. Annotating honorifics is a complex task that involves identifying a predicate with honorifics, assigning ranks to referents of the
predicate, calibrating the ranks, and connecting referents with their predicates.
-
Zero pronoun processing : some requirements for a VERBMOBIL system
(1994)
-
Dieter Metzing
Melanie Siegel
- Some requirements for a VERBMOBIL system capable of processing Japanese dialogue input have been explored. Based on a pilot study in the VERBMOBIL domain, dialogues between 2 participants and a professional Japanese interpreter have been analyzed with respect to a very typical and frequent feature: zero pronouns. Zero pronouns in Japanese texts or dialogues as well as overt pronouns in English texts or dialogues are an important element of discourse coherence. As to translation, this difference in the use of pronouns is a case of translation mismatch: information not explicitly expressed in the source language is needed in the target language. (Verb argument positions, normally obligatoryin English, are rather frequently omitted in Japanese. Furthermore, verbs in Japanese are not marked with respect to features necessary for pronoun selection in English.)
-
An HSPG-to-CFG Approximation of Japanese
(2000)
-
Bernd Kiefer
Hans-Ulrich Krieger
Melanie Siegel
- We present a simple approximation method for turning a Head-Driven Phrase Structure Grammar into a context-free grammar. The approximation method can be seen as the construction of the least fixpoint of a certain monotonic function. We discuss an experiment with a large HPSG for Japanese.
-
An Integrated Architecture for Shallow and Deep Processing
(2002)
-
Berthold Crysmann
Anette Frank
Bernd Kiefer
Stefan Müller
Günter Neumann
Jakub Piskorski
Ulrich Schäfer
Melanie Siegel
Hans Uszkoreit
Feiyu Xu
Markus Becker
Hans-Ulrich Krieger
- We present an architecture for the integration of shallow and deep NLP components which is aimed at flexible combination of different language technologies for a range of practical current and future applications. In particular, we describe the integration of a high-level HPSG parsing system with different high-performance shallow components, ranging from named entity recognition to chunk parsing and shallow clause recognition. The NLP components enrich a representation of natural language text with layers of new XML meta-information using a single shared data structure, called the text chart. We describe details of the integration methods, and show how information extraction and language checking applications for realworld German text benefit from a deep grammatical analysis.