This is the first monograph on the emerging area of linguistic linked data. Presenting a combination of background information on linguistic linked data and concrete implementation advice, it introduces and discusses the main benefits of applying linked data (LD) principles to the representation and publication of linguistic resources, arguing that LD does not look at a single resource in isolation but seeks to create a large network of resources that can be used together and uniformly, and so making more of the single resource. The book describes how the LD principles can be applied to modelling language resources. The first part provides the foundation for understanding the remainder of the book, introducing the data models, ontology and query languages used as the basis of the Semantic Web and LD and offering a more detailed overview of the Linguistic Linked Data Cloud. The second part of the book focuses on modelling language resources using LD principles, describing how to model lexical resources using Ontolex-lemon, the lexicon model for ontologies, and how to annotate and address elements of text represented in RDF. It also demonstrates how to model annotations, and how to capture the metadata of language resources. Further, it includes a chapter on representing linguistic categories. In the third part of the book, the authors describe how language resources can be transformed into LD and how links can be inferred and added to the data to increase connectivity and linking between different datasets. They also discuss using LD resources for natural language processing. The last part describes concrete applications of the technologies: representing and linking multilingual wordnets, applications in digital humanities and the discovery of language resources. Given its scope, the book is relevant for researchers and graduate students interested in topics at the crossroads of natural language processing / computational linguistics and the Semantic Web / linked data. It appeals to Semantic Web experts who are not proficient in applying the Semantic Web and LD principles to linguistic data, as well as to computational linguists who are used to working with lexical and linguistic resources wanting to learn about a new paradigm for modelling, publishing and exploiting linguistic resources. Inhaltsverzeichnis 1 Introduction.- 2 Preliminaries.- 3 Linguistic Linked Open Data Cloud.- 4 Modelling lexical resources as Linked Data.- 5 Representing annotated texts as RDF.- 6 Modelling linguistic annotations.- 7 Modelling metadata of language resources.- 8 Linguistic Categories.- 9 Converting language resources into Linked Data.- 10 Link Representation and Discovery.- 11 Linked Data-based NLP Workflows.- 12 Applying linked data principles to linking multilingualWordnets.- 13 Linguistic Linked Data in Digital Humanities.- 14 Discovery of language resources.- 15 Conclusion.
Christian Chiarcos Knihy



This volume explores the role of salience in discourse, offering a comprehensive examination of various perspectives and functions. It employs multidisciplinary approaches, differing in theoretical proposals and research focuses. Topics include entity-based salience, discourse-structural salience of utterances, and extra-linguistic factors influencing discourse salience. The book is organized into three sections. Part I delves into discourse referents and the selection of referring expressions, addressing themes such as salience and demonstrativity in Russian, the relationship between discourse salience and grammatical voice in Eastern Khanty, the interplay of syntactic and semantic prominence, and a computational framework for salience metrics. Part II shifts focus to linguistic structures at or above the clause level, examining the salience of discourse segments in relation to discourse relations and the positioning of verb arguments in Old High German. Part III broadens the discussion to include extra-linguistic salience in discourse processing, covering topics like visual salience in situated dialogue, salience marking through hypertextual links, and salience derived from mental representations of described situations. The concept of salience is significant across discourse studies in theoretical linguistics, computational linguistics, and psycholinguistics.
Von der Form zur Bedeutung: Texte automatisch verarbeiten
- 275 stránok
- 10 hodin čítania
Die zweijaehrig stattfindende Tagung der Gesellschaft fuer Sprachtechnologie und Computerlinguistik (GSCL) vermittelt jeweils einen ueberblick ueber den aktuellen Stand der Forschung (vorwiegend im deutschsprachigen Raum) zur automatischen Verarbeitung natuerlicher Sprache. Im Jahr 2009 findet die Tagung an der Universitaet Potsdam statt und befasst sich vor allem mit der Analyse der Text-Ebene und damit verbundenen Problemen. Fuer Wissenschaftler und fortgeschrittene Studierende vermittelt dieser Tagungsband Einblicke in zentrale Fragestellungen der automatischen Textanalyse.