Linguistics across Disciplinary Borders
The March of Data
- 264 pages
- English
- ePUB (mobile friendly)
About This Book
This volume highlights the ways in which recent developments in corpus linguistics and natural language processing can engage with topics across language studies, humanities, and social science disciplines. New approaches have emerged in recent years that blur disciplinary boundaries, facilitated by factors such as the application of computational methods, access to large data sets, and the sharing of code, as well as continual advances in technologies related to data storage, retrieval, and processing. The "march of data" denotes not only an area at the border region of linguistics, humanities, and social science disciplines, but also the inevitable development of the underlying technologies that drive analysis in these subject areas. Organized into three sections, the chapters are connected by the underlying thread of linguistic corpora: how they can be created, how they can shed light on varieties or registers, and how their metadata can be utilized to better understand the internal structure of similar resources. While some chapters in the volume make use of well-established existing corpora, others analyze data from platforms such as YouTube, Twitter, or Reddit. The volume provides insight into the diversity of methods, approaches, and corpora that inform our understanding of the "border regions" between the realms of data science, language/linguistics, and social or cultural studies.
Table of contents
- Cover
- Half-Title
- Series
- Title
- Contents
- List of Figures
- List of Tables
- Introduction
- Part I Methods for Data Collection, Analysis and Visualization
- 1 Noisy Data: Using Automatic Speech Recognition Transcripts for Linguistic Research
- 2 Low-code Data Science Tools for Linguistics: Swiss Army Knives or Pretty Black Boxes?
- 3 The Visualization and Evaluation of Semantic and Conceptual Maps
- Part II Corpus Construction, Registers and Genres
- 4 Towards Automatic Register Classification in Unrestricted Databases of Historical English
- 5 The Topical Landscape of Web Registers: Exploring the Interplay of Registers and Topicality in a Web-scale Corpus
- 6 Towards "Large and Tidy": Establishing Internal Structure in Mega-corpora
- Part III Social Media, Discourse and Meanings
- 7 Multi-modal Considerations for Social Media Discourse Analysis: A Specialized Corpus of Twitter Commentary on "Working from Home"
- 8 Exploring Self-identification and the Functions of the Identify as Construction in the LGBTQ+ Reddit Corpus
- List of Contributors
- Index
- Copyright