-
The Language Planning Situation in Ireland
(2005)
-
Muiris Ó Laoire
- Language planning for the Irish language in the Republic of Ireland has featured prominently in international language policy and planning literature over the years. Researchers in the field may not be up to date, however, with recent developments in the area of Irish language planning and their impact on the language ecology. This monograph describes the language planning situation in the Republic of Ireland in its historical and social contexts as well as delineating language policy and planning for the Irish language implemented over the past number of years, showing developments in education, community, media, religion and local politics. Keywords: Ireland, language planning, language ecology, language policy, language pedagogy
-
(Non)retroflexivity of Slavic affricates and its motivation : evidence from Polish and Czech <č>
(2005)
-
Marzena Zygis
-
The semantics of smallpox in Early Modern English
(2005)
-
Anna Zbierska-Sawała
-
Zürcher Sprachsituation mit Solothurner Muttersprache
(2005)
-
Eva Lia Wyss
-
Multiple hierarchies : new aspects of an old solution
(2005)
-
Andreas Witt
- In this paper, we present the Multiple Annotation approach, which solves two problems: the problem of annotating overlapping structures, and the problem that occurs when documents should be annotated according to different, possibly heterogeneous tag sets. This approach has many advantages: it is based on XML, the modeling of alternative annotations is possible, each level can be viewed separately, and new levels can be added at any time. The files can be regarded as an interrelated unit, with the text serving as the implicit link. Two representations of the information contained in the multiple files (one in Prolog and one in XML) are described. These representations serve as a base for several applications.
-
Beers, kaffi, and Schnaps : different grammatical options for 'restaurant talk' coercions in three Germanic languages
(2005)
-
Heike Wiese
Joan Maling
- This paper discusses constructions like “We’ll have two beers and a coffee.” that are typically used for beverage orders in restaurant contexts. We compare the behaviour of nouns in these constructions in three Germanic languages, English, Icelandic, and German, and take a closer look at the correlation of the morpho-syntactic and semantic-conceptual changes involved here. We show that even within such a closely related linguistic sample, one finds three different grammatical options for the expression of the same conceptual transition. Our findings suggest an analysis of coercion as a genuinely semantic phenomenon, a phenomenon that is located on a level of semantic representations that serves as an interface between the conceptual and the grammatical system and takes into account inter- and intralinguistic variations.
-
Stop bashing givenness : a note on Elke Kasimir's “questions-answers test and givenness”
(2005)
-
Thomas Weskott
- Elke Kasimir's paper (in this volume) argues against employing the notion of Givenness in the explanation of accent assignment. I will claim that the arguments against Givenness put forward by Kasimir are inconclusive because they beg the question of the role of Givenness. It is concluded that, more generally, arguments against Givenness as a diagnostic for information structural partitions should not be accepted offhand, since the notion of Givenness of discourse referents is (a) theoretically simple, (b) readily observable and quantifiable, and (c) bears cognitive significance.
-
Die thematische Erschließung von Sprachkorpora
(2005)
-
Christian Weiß
- The aim of the subproject is to index the corpora thematically, both in order to compile topic-specific virtual subcorpora and in order to disambiguate, for example, word senses on the basis of an analysis of domain-specific frequency distributions. The starting point is the construction of a taxonomy of subject-area topics. This is done in a semi-automatic procedure that involves applying text mining (document clustering) and manually assigning clusters to an external ontology. It is argued that the taxonomy obtained in this way is both more intuitive and more objective than existing, purely manual approaches. It is also equally suited to manual and to automatic classification. For the latter, the naive Bayes text classifier is motivated and evaluated on a classified corpus of just under two billion words.
-
Unity in diversity : integrating differing linguistic data in TUSNELDA
(2005)
-
Andreas Wagner
- This paper describes the creation and preparation of TUSNELDA, a collection of corpus data built for linguistic research. This collection contains a number of linguistically annotated corpora which differ in various aspects such as language, text types / data types, encoded annotation levels, and the linguistic theories underlying the annotation. The paper focuses on this variation on the one hand and on the way these heterogeneous data are integrated into one resource on the other.
-
Parser evaluation across text types
(2005)
-
Yannick Versley
- When a statistical parser is trained on one treebank, one usually tests it on another portion of the same treebank, partly because a comparable annotation format is needed for testing. But the user of a parser may not be interested in parsing sentences from the same newspaper all over again, and may even want syntactic annotations for a slightly different text type. Gildea (2001), for instance, found that a parser trained on the WSJ portion of the Penn Treebank performs less well on the Brown corpus (the subset that is available in the PTB bracketing format) than a parser that has been trained only on the Brown corpus, although the latter has only half as many sentences as the former. Additionally, a parser trained on both the WSJ and Brown corpora performs less well on the Brown corpus than on the WSJ one. This leads us to the following questions that we would like to address in this paper: (1) Is there a difference in the usefulness of techniques that are used to improve parser performance between the same-corpus and the different-corpus case? (2) Are different types of parsers (rule-based and statistical) equally sensitive to corpus variation? To answer these questions, we compared the quality of the parses of a hand-crafted constraint-based parser and a statistical PCFG-based parser that was trained on a treebank of German newspaper text.