El proximo 19 de octubre presentrremos nuestro grupo e investigación en el Seminario de Investigación de Filología Inglesa (SI-FI) organizado por el Departamento de Filología Inglesa de la Universidad de Murcia.
Será en la sala Mariano Baquero (Facultad de Letras) entre las 17:30 y las 19:00
Antes de nuestra presentación (aprox 18:15), el Grupo LACELL, Lingüística Aplicada a LaComputación, Enseñanza deLenguas y Lexicografía también hablarán sobre su investigación.
Gaëtanelle Gilquin (Université Catholique de Louvain, Belgium) Ilmari Ivaska (Turun Yliopisto, Finland) Cristóbal Lozano (Universidad de Granada, Spain)
Event Details:
Date: 26-28 September 2024 Location: Institute of Foreign Languages and Cultures and the Institute of Estonian and General Linguistics. University of Tartu, Estonia. Topics: Areas of interest include, but are not limited to, the following: Language for academic purposes Language for specific purposes Language teaching, assessment and testing Learner corpus-based SLA studies Corpora as pedagogical resources Multimodal learner corpora Software for learner corpus analysis Corpus-based translation studies English as a Medium of Instruction (EMI) English as a Lingua Franca (ELF) Data mining and other explorative approaches to learner corpora Statistical methods in learner corpus studies Discourse analysis and pragmatics Studies related to lexis: semantics, metaphor, etc. NLP approaches Complexity, accuracy and/or fluency (CAF) analysis
Abstracts:
A short summary of the intended presentation, capturing the central idea along with the research questions, methods of research and the (possibly tentative) key conclusions, also citing any relevant previous work or theoretical background of the field. Limited to 300 words, excluding keywords and references . Anonymous: the abstract itself should hold no reference to the author or their affiliation
The Core Metadata Schema for L2 data consists in a comprehensive set of variables that encapsulate crucial information about L2 data. It is organized into several sections that describe specific aspects of a learner corpus. These include administrative details (e.g. authors or license), corpus design, text-related variables, learner-related variables, in-built annotation(e.g. details about manual or automatic annotation), information about annotators or transcribers (e.g. native language or language repertoire) and task-related details (e.g. instructions, time constraints) (Paquot et al., 2023). It is the result of extensive collaboration between learner corpus compilers at the Centre for English Corpus Linguistics (UCLouvain, Belgium) and EURAC Research (Bolzano, Italy), and a research data infrastructure expert and member of CLARIN’s metadata taskforce (König et al., 2022; Frey et al. 2023).
In this presentation, I will discuss the underlying rationale for the development of such a resource and present its second version. This will give me the opportunity to clarify in what ways we have tried to embark learner corpus researchers into this initiative and reiterate our hope that the LCR community will collaborate with us to refine the schema and align it with the evolving needs of the field.
References
Frey, J.-C., König, A., Stemle, E. & M. Paquot (2023). A core metadata schema for L2 data. Paper presented at the 32nd Conference of the European Second Language Association (EUROSLA), 30 August – 2 September 2023, University of Birmingham, UK.
König, A., Frey J.-C., Stemle, E., Glaznieks, A. & M. Paquot (2022). Towards standardizing LCR metadata. Paper presented at Learner Corpus Research 6, 22-24 September 2022, University of Padua, Italy.
Paquot, M., König, A., Stemle, E. & J.-C. Frey (2023). Core Metadata Schema for Learner Corpora, https://doi.org/10.14428/DVN/4CDX3P
Dr Magali Paquot is a permanent FNRS research associate at the Centre for English Corpus Linguistics, Institut Langage et Communication, UCLouvain, and an affiliate member of the Corpus Linguistics Lab, University of Florida. She holds a PhD in Linguistics (Université catholique de Louvain) and a degree in Natural Language Processing (Université de Liège). Her research interests include (but are not limited to) corpus linguistics, learner corpus research, vocabulary, phraseology (collocations, lexical bundles, …), pedagogical lexicography, electronic lexicography, terminology, EAP (English for Academic Purposes), ESP (English for Specific Purposes), EFL (English as a Foreign Language), SLA (Second Language Acquisition), linguistic complexity and L1 influence.
Comparability is a core criterion underpinning corpus linguistics research. From using a reference corpus to determine keywords to comparing across time, space, and language, corpus linguistics often draws on different data sets to tell us what is special about the language we are studying. This view has become so naturalised within corpus linguistics methodologies that discussions of comparability in corpus research are quite uncommon. This challenge of addressing comparability is long-standing in fields like contrastive analysis, which came to prominence and fell to decline owing to advances and limitations in methodological approaches, in part related to issues of comparability. In its most recent rise, as corpus-based contrastive linguistics, research has sought to merge contrastive and corpus linguistics approaches to address the weaknesses identified in contrastive analysis methodologies and enhance perspectives on comparability in corpus linguistics research. Merging contrastive and corpus linguistics approaches, this talk presents case studies with a view to interrogating issues of comparability in corpus analysis and establishing theoretical bases from which to draw meaningful comparisons across multilingual discourses. Specifically, the talk sheds light on the methodological pitfalls we encounter in comparing corpora representing a range of contexts and variables, the impact that our methods of analysis can have on our findings, and the importance of contextually situating contrastive studies from epistemological and ontological perspectives. The findings of the talk are intended to offer points of reflection for anyone applying contrastive approaches in corpus linguistics research, both across languages and across language varieties.
Dr Niall Curry is a Senior Lecturer in TESOL and Applied Linguistics within the Department of Languages, Information and Communications, at Manchester Metropolitan University. Currently, he is researching language relating to global crises and global issues. He is particularly interested in investigating how knowledge of these issues and crises is socially and discursively constructed across contexts, times, languages, and cultures with a view to understanding better how global issues vary across local contexts, and for international and local audiences. His areas of focus include (but are not limited to) issues such as climate, health, economics, and education. In parallel, Niall is conducting research on applied linguistics and TESOL related issues, spanning foci on register, genre, metadiscourse, materials development, and digital pedagogies.
In this talk, I will describe what Multiple Correspondence Analysis (MCA) is and how it can be used for the Multi-Dimensional Analysis of short texts, as well as for corpus-assisted discourse analysis in an approach called Keyword Co-occurrence Analysis, drawing on the results of my own research on tweets (Clarke and Grieve, 2019; Clarke, 2022) and discourses of Islam in the UK press (Clarke et al. 2021; 2022). I will then go on to demonstrate how the results can be used to track communicative functions and discourses over time in diachronic analyses. Finally, I will discuss the limitations of MCA in these tasks.
Dr Isobelle Clarke‘s research interests include corpus linguistics, forensic linguistics, sociolinguistics and news discourse and discourse analysis. Her previous research covers language variation on social media, especially Twitter, and authorship analysis. Her current research examines the representation of Islam in the press and second learner language and spoken language. Dr Clarke received a Leverhulme’s early career researcher fellowship to investigate anti-science discourses, such as anti-vaccination discourse, climate change denials, and anti-GMO discourse.
Taller práctico presencial de Lingüística de Corpus sobre AntConc: “A Corpus analysis toolkit for concordancing and text analysis” impartido por el profesor Pascual Pérez Paredes.
Organizado por el Instituto Interuniversitario de Lenguas Modernas Aplicadas de la Comunitat Valenciana
Lugar y fecha: Universidad Jaume I, Castellón, 16 diciembre 2022