INF
Data management, modeling and exploration

INF entwickelt neue Workflows zur Erstellung von Korpora und wendet diese an. INF unterstützt die Projekte des SFB in der Erstellung von Daten-Pipelines und erweitert diese um Datentests zur Sicherstellung der Zuverlässigkeit. Darüber hinaus leitet INF die Projekte bei der Einhaltung der Datenmanagementpläne an. Ein zentrales Werkzeug ist das von INF gehostete und erweiterte VIVO, eine zentrale Repräsentation der Forschungsdaten und Werkzeuge des SFB. Zudem hostet INF Annotations- und Analysetools auf Servern der HU, wartet und bespielt die Webseite des SFB sowie Social-Media-Kanäle. INF trägt zum Wissensmanagement des SFB durch Workshops, Schulungen und Dokumentation bei.

Sie können alle INF-Mitarbeiter über die E-Mailadresse sfb1412-inf@lists.hu-berlin.de erreichen.

Mitarbeiter*innen

Leitung

Mitarbeiter*innen

Martin Klotz

Institut für deutsche Sprache und Linguistik
Humboldt-Universität zu Berlin

martin.klotz@hu-berlin.de

Thomas Pfuhl

Humboldt-Universität zu Berlin

thomas.pfuhl@hu-berlin.de

Studentische Hilfskräfte

Rebecca Grunow

Ehemalige Mitarbeiter*innen

Zehui Guo

Institut für deutsche Sprache und Linguistik

zehui.guo.1@hu-berlin.de

Felix Golcher

Institut für deutsche Sprache und Linguistik
Humboldt-Universität zu Berlin

felix.golcher@alumni.hu-berlin.de

Arsenii Gulevich

Institut für deutsche Sprache und Linguistik
Humboldt-Universität zu Berlin

Kontakt

Martin Klotz

Humboldt-Universität zu Berlin

martin.klotz@hu-berlin.de

Institut für deutsche Sprache und Linguistik
SFB1412 Register
Humboldt-Universität zu Berlin
Unter den Linden 6
10099 Berlin

Projekt in Förderphase I

Titel in Phase I

Data management and statistical analysis

Beschreibung für Phase I

In enger Zusammenarbeit mit dem Computer- und Medienservice der Humboldt-Universität bietet das INF-Projekt technische und statistische Unterstützung für alle Mitglieder des SFB. Insbesondere wird INF in allen Fragen der Datenerhebung, -verarbeitung, -aufbereitung, -speicherung und -analyse beraten. Neben den Serviceaufgaben wird INF die vorhandene Infrastruktur für die Erfordernisse des SFB weiterentwickeln. Die für die Registeranalyse oft verwendeten multifaktoriellen Methoden beschreiben die Daten nicht immer angemessen und müssen verändert und durch andere Methoden ergänzt werden.

Material und Templates

Ein Template für Beamerpräsentationen mit Latex findet sich zum einen in einem öffentlichen git-Repository und auch als overleaf-Projekt, das angemeldete Benutzer kopieren können.

Ein Template für Konfernenzposter mit Latex findet sich ebenfalls in einem öffentlichen git-Repository und auch als overleaf-Projekt, das angemeldete Benutzer kopieren können.

Das Template für Einreichungen in REALIS findet sich ebenfalls in einem öffentlichen git-Repository und als overleaf-Projekt, das angemeldete Benutzer kopieren können.

Hier finden Sie unsere aktuelle Literaturdatenbank im biblatex-Format.

Veröffentlichungen und Präsentationen

Veröffentlichungen

2025

Belz, Malte; Betz, Simon; Lüdeling, Anke; Mooshammer, Christine (2025) Alles Interpretationssache? Korpuslinguistische Zugänge zu Pausen, Stille, Gaps und Schweigen In: Zeitschrift für germanistische Linguistik [DOI] [ViVo]

2024

Lüdeling, Anke; Szucsich, Luka; Zeige, Lars Erik; Alexiadou, Artemis; Adli, Aria; Belz, Malte; Bouzouita, Miriam; Bunk, Oliver; Dreyer, Malte; Egg, Markus; Feulner, Anna Helene; Fleischer, Jürg; Gagarina, Natalia; Hirsch, Aron; Jannedy, Stefanie; Knoeferle, Pia; Krause, Thomas; Kutscher, Silvia; Liu, Mingya; Lütke, Beate; Machicao y Priemer, Antonio; Maquate, Katja; Merino Hernández, Laura; Meyer, Roland; Mooshammer, Christine; Müller, Stefan; Sauerland, Uli; Sauermann, Antje; Schmitt, Viola; Serova, Dina; Solt, Stephanie; Waltereit, Richard; Weirich, Melanie; Wiese, Heike; Verhoeven, Elisabeth; Schumacher, Nicole (2024) Register: Language Users’ Knowledge of Situational-Functional Variation. Frame text of the Second Phase Proposal for the CRC 1412 In: Register Aspects of Language in Situation [DOI] [ViVo]
Lange, Robert; Sell, Bianca Maria; Terada, Megumi; Belz, Malte; Mooshammer, Christine; Lüdeling, Anke (2024) Schwa realisation in verbal inflection in two dialogue registers of German spontaneous speech In: Zeitschrift für Sprachwissenschaft [DOI] [ViVo]
Abstract Word-final schwa in German inflectional suffixes shows varying realisations in spontaneous speech – from full realisations with varying duration to no realisation. While previous research has identified numerous social, distributional, and grammatical factors influencing the variation of phonetic variables in general, it remains unclear how fine-grained functional differences in different registers specifically affect schwa realisation. In this corpus-based study, we compare schwa realisation in two dialogue registers of German spontaneous speech – free conversation and task-based dialogues – which differ only in their communicative goal and therefore have different functional requirements. We find that schwa is rarely realised, though slightly but significantly more often in free conversation than in task-based dialogue. Other factors also promoting schwa realisation across both situations are less frequent verbs and sequences, and IP-final position.
Kühnast, Milena; Lüdeling, Anke; Schulz, Konstantin (2024) Development of basic reading skills in Latin: a corpus-based tool for computer-assisted fluency training In: Cogent Education [DOI] [ViVo]
Betz, Simon; Belz, Malte; Lüdeling, Anke; Mooshammer, Christine (2024) Schweigen is something different than pause or deliberate silence In: P&P 20. Martin-Luther-Universität Halle-Wittenberg [ViVo]
Ebert, Melissa; Lange, Robert; Weißgerber, Marie-Theres; Belz, Malte; Lüdeling, Anke; Mooshammer, Christine (2024) Register. Realisation of schwa and word‑final /t d/ In: Inter-CRC-Meeting on “Face to Face Interaction”, University of Cologne [ViVo]
Weißgerber, Marie-Theres; Belz, Malte; Lüdeling, Anke; Mooshammer, Christine (2024) The realisation of German indefinite articles in formal and informal spontaneous speech In: P&P 20. Martin-Luther-Universität Halle-Wittenberg [ViVo]

2023

Lüdeling, Anke; Lukassek, Julia; Akbari, Roodabeh (2023) Guidelines for the Morphological Annotation of Nouns in the Falko Learner Corpus In: REALIS: Register Aspects of Language in Situation [DOI] [ViVo]
These guidelines describe the annotation of different types of nominal word forma-
tion in German. We also report on the evaluation of these guidelines in the Falko
corpus.
Pescuma, Valentina Nicole; Serova, Dina; Lukassek, Julia; Sauermann, Antje; Schäfer, Roland; Adli, Aria; Bildhauer, Felix; Egg, Markus; Hülk, Kristina; Ito, Aine; Jannedy, Stefanie; Kordoni, Valia; Kühnast, Milena; Kutscher, Silvia; Lange, Robert; Lehmann, Nico; Liu, Mingya; Lütke, Beate; Maquate, Katja; Mooshammer, Christine; Mortezapour, Vahid; Müller, Stefan; Norde, Muriel; Pankratz, Elizabeth; Patarroyo, Angela Giovanna; Plesca, Ana-Maria; Ronderos, Camilo R.; Rotter, Stephanie; Sauerland, Uli; Schulte, Britta; Schüppenhauer, Gediminas; Sell, Bianca Maria; Solt, Stephanie; Terada, Megumi; Tsiapou, Dimitra; Verhoeven, Elisabeth; Weirich, Melanie; Wiese, Heike; Zaruba, Kathy; Zeige, Lars Erik; Lüdeling, Anke; Knoeferle, Pia; Schnelle, Gohar (2023) Situating language register across the ages, languages, modalities, and cultural aspects: Evidence from complementary methods In: Frontiers in Psychology [DOI] [ViVo]
In the present review paper by members of the collaborative research center ‘Register: Language Users’ Knowledge of SituationalFunctional Variation’ (CRC 1412), we assess the pervasiveness of register phenomena across different time periods, languages, modalities, and cultures. We define ‘register’ as recurring variation in language use depending on the function of language and on the social situation. Informed by rich data, we aim to better understand and model the knowledge involved in situation- and function-based use of language register. In order to achieve this goal, we are using complementary methods and measures. In the review, we start by clarifying the concept of ‘register’, by reviewing the state of the art, and by setting out our methods and modeling goals. Against this background, we discuss three key challenges, two at the methodological level and one at the theoretical level: 1. To better uncover registers in text and spoken corpora, we propose changes to established analytical approaches. 2. To tease apart between-subject variability from the linguistic variability at issue (intra-individual situation based register variability), we use within-subject designs and the modeling of individuals’ social, language, and educational background. 3. We highlight a gap in cognitive modeling, viz. modeling the mental representations of register (processing), and present our first attempts at filling this gap. We argue that the targeted use of multiple complementary methods and measures supports investigating the pervasiveness of register phenomena and yields comprehensive insights into the cross-methodological robustness of register-related language variability. These comprehensive insights in turn provide a solid foundation for associated cognitive modeling.
Krause, Thomas (2023) Hexatomic: An extensible, OS-independent platform for deep multi-layer linguistic annotation of corpora In: Journal of Open Source Software [DOI] [ViVo]
Lehmann, Nico; Serova, Dina; Lukassek, Julia; Döring, Sophia; Goymann, Frank; Lüdeling, Anke; Akbari, Roodabeh (2023) Guidelines for the annotation of parameters of narration. In: REALIS: Register Aspects of Language in Situation [DOI] [ViVo]
The present guidelines describe the annotation of narrative phenomena on the clause level, using a combination of ideas and methods from linguistics and lit- erary studies. The main categories marking the discourse strategy “narration” in stretches of text have been narrowed down to mediacy, i. e. involving a narrator, and sequentiality of events. This document specifies how to define mediacy, and in turn determine whether a narrator is present, as well as how to identify events and their sequential ordering. Lastly, a functional layer annotation is proposed which allows researchers to compare different types of narrative instances. This offers a basis for investigating a potential narrative register which is said to be important for many kinds of register studies.
Schnelle, Gohar; Hennig, Mathilde; Hennig, Mathilde (2023) Historische Korpora in sprachhistorisch orientierter germanistischer Hochschullehre In: Beiträge zur Geschichte der deutschen Sprache und Literatur [DOI] [ViVo]
Abstract This paper argues for incorporating corpus data into the teaching of historical linguistics. While deeply annotated historical corpora are becoming available and corpus data is already widely used to answer various research questions, corpora are as yet rarely used in teaching. We believe they are ideally suited to make the variation in historical data transparent and help students to explore contexts and parameters. In our first study, we show how the KaJuK corpus and its more elaborated version, the GiesKaNe corpus, can be exploited to study adverbial sentences. Using the RIDGES corpus, the second study deals with phrasal and lexical development. Both studies focus on explaining the method and its extension to other corpora and research questions.
Belz, Malte; Sell, Bianca Maria; Lange, Robert; Mooshammer, Christine; Lüdeling, Anke (2023) BeDiaCo (L1-L1) und CoNNAR (L1-L1/L2). Freie und aufgabenorientierte spontansprachliche Dialoge in direkter und videobasierter Kommunikation In: Neue Entwicklungen in der Korpuslandschaft der Germanistik. [ViVo]
Lüdeling, Anke; Mooshammer, Christine; Lange, Robert; Sell, Bianca Maria; Terada, Megumi (2023) Corpus of Non-Native Addressee Register (CoNNAR). Version 1[ViVo]
Veenstra, Tonjes; Krifka, Manfred; Akbari, Roodabeh; Buchmüller, Olga; Chark, Jordan; Döring, Sophia; Golcher, Felix; Schmidt, Peter (2023) Podcast: Sprachen aus dem Schnellkochtopf: Register in Kreols (Teil 1) [ViVo]
Kreolsprachen sind ein Wunder der Linguistik. Innerhalb weniger Generationen entstehen diese Sprachen dort, wo Menschen ohne gemeinsame Sprache miteinander kommunizieren müssen. Unser Projekt A02 "Speaker's choices in a creole context: Bislama and Morisien" untersucht zwei Kreolsprachen aus Melanesien und Mauritius. Wir sprechen mit Manfred Krifka und Tonjes Veenstra.
Lüdeling, Anke; Akbari, Roodabeh; Buchmüller, Olga; Chark, Jordan; Döring, Sophia; Golcher, Felix; Schmidt, Peter (2023) Podcast: Was ist ein Register? [ViVo]
Was ist damit gemeint, wenn wir in der Linguistik von "Registern" sprechen und warum ist das überhaupt interessant? Wir zeigen Beispiele dafür, wie Sprecher*innen zwischen Registern wechseln und was passiert, wenn das falsche Register gewählt wird. Anke Lüdeling erzählt uns in einem Interview, wie die Idee zu dem Sonderforschungsbereich entstanden ist.
(Vielen Dank an Onur Özsoy, der uns das Telefon-Beispiel eingesprochen hat und an Andreas Nolda für die Orgeleinspielungen!)

2022

Adli, Aria; Lüdeling, Anke; Alexiadou, Artemis; Donhauser, Karin; Dreyer, Malte; Egg, Markus; Feulner, Anna Helene; Gagarina, Natalia; Hock, Wolfgang; Jannedy, Stefanie; Kammerzell, Frank; Knoeferle, Pia; Krause, Thomas; Krifka, Manfred; Kutscher, Silvia; Lütke, Beate; McFadden, Thomas; Meyer, Roland; Mooshammer, Christine; Müller, Stefan; Maquate, Katja; Norde, Muriel; Sauerland, Uli; Solt, Stephanie; Waltereit, Richard; Wolfsgruber, Anne; Zeige, Lars Erik; Verhoeven, Elisabeth; Szucsich, Luka (2022) Register: Language Users’ Knowledge of Situational-Functional Variation. Frame text of the First Phase Proposal for the CRC 1412 In: REALIS: Register Aspects of Language in Situation [DOI] [ViVo]
Wisniewski, Katrin; Lüdeling, Anke; Czinglar, Christine (2022) Zu Umgang mit Variation in der Lernersprachenanalyse. Perspektiven aus und für DaF/DaZ In: Deutsch als Fremdsprache [DOI] [ViVo]
In this article, we argue that learner language analysis in German as a Second / Foreign Language should give greater consideration to variation. Looking at three exemplary research topics (verb placement, complexity, and register in German as a Second / Foreign Language), we demonstrate where variation plays a role and how it can be integrated into acquisition models both theoretically and methodologically. The field of learner corpus linguistics provides excellent methods for analysing variability in learner data.
Lüdeling, Anke; Alexiadou, Artemis; Adli, Aria; Donhauser, Karin; Dreyer, Malte; Egg, Markus; Feulner, Anna Helene; Gagarina, Natalia; Hock, Wolfgang; Jannedy, Stefanie; Kammerzell, Frank; Knoeferle, Pia; Krause, Thomas; Krifka, Manfred; Kutscher, Silvia; Lütke, Beate; McFadden, Thomas; Meyer, Roland; Mooshammer, Christine; Müller, Stefan; Maquate, Katja; Norde, Muriel; Sauerland, Uli; Szucsich, Luka; Verhoeven, Elisabeth; Waltereit, Richard; Wolfsgruber, Anne; Zeige, Lars Erik (2022) Register: Language Users’ Knowledge of Situational-Functional Variation In: REALIS: Register Aspects of Language in Situation [DOI] [ViVo]
The Collaborative Research Center 1412 “Register: Language Users’ Knowledge of Situational-Functional Variation” (CRC 1412) investigates the role of register in language, focusing in particular on what constitutes a language user’s register knowledge and which situational-functional factors determine a user’s choices. The following paper is an extract from the frame text of the proposal for the CRC 1412, which was submitted to the Deutsche Forschungsgemeinschaft in 2019, followed by a successful onsite evaluation that took place in 2019. The CRC 1412 then started its work on January 1, 2020. The theoretical part of the frame text gives an extensive overview of the theoretical and empirical perspectives on register knowledge from the viewpoint of 2019. Due to the high collaborative effort of all PIs involved, the frame text is unique in its scope on register research, encompassing register-relevant aspects from variationist approaches, psycholinguistics, grammatical theory, acquisition theory, historical linguistics, phonology, phonetics, typology, corpus linguistics, and computational linguistics, as well as qualitative and quantitative modeling. Although our positions and hypotheses since its submission have developed further, the frame text is still a vital resource as a compilation of state-of-the-art register research and a documentation of the start of the CRC 1412. The theoretical part without administrative components therefore presents an ideal starter publication to kick off the CRC’s publication series REALIS. For an overview of the projects and more information on the CRC, see https://sfb1412.hu-berlin.de/.
Schnelle, Gohar; Fisher, Catharina; Lüdeling, Anke; Perlitz, Laura (2022) "Die Blumenzeit der Frau": A corpus-based study of the development of medical references to menstruation in historical texts on herbology In: Corpus Pragmatic Studies on the History of Medical Discourse [ViVo]

2021

Lüdeling, Anke; Hirschmann , Hagen; Shadrova, Anna; Wan, Shujun (2021) Tiefe Analyse von Lernerkorpora In: Deutsch in Europa [DOI] [ViVo]
Die Sprache von Lerner/-innen einer Fremdsprache unterscheidet sich auf allen linguistischen Ebenen von der Sprache von Muttersprachler/-innen. Seit einigen Jahrzehnten werden Lernerkorpora gebaut, um Lernersprache quantitativ und qualitativ zu analysieren. Hier argumentieren wir anhand von drei Fallbeispielen (zu Modifikation, Koselektion und rhetorischen Strukturen) für eine linguistisch informierte, tiefe Phänomenmodellierung und Annotation sowie für eine auf das jeweilige Phänomen passende formale und quantitative Modellierung. Dabei diskutieren wir die Abwägung von tiefer, mehrschichtiger Analyse einerseits und notwendigen Datenmengen für bestimmte quantitative Verfahren andererseits und zeigen, dass mittelgroße Korpora (wie die meisten Lernerkorpora) interessante Erkenntnisse ermöglichen, die große, flacher annotierte Korpora so nicht erlauben würden.
Shadrova, Anna; Lukassek, Julia; Lukassek, Julia; Lüdeling, Anke (2021) A Challenge for Contrastive L1/L2 Corpus Studies: Large Inter- and Intra-Individual Variation Across Morphological, but Not Global Syntactic Categories in Task-Based Corpus Data of a Homogeneous L1 German Group In: Frontiers in Psychology [DOI] [ViVo]
In this paper, we present corpus data that questions the concept of native speaker homogeneity as it is presumed in many studies using native speakers (L1) as a control group for learner data (L2), especially in corpus contexts. Usage-based research on second and foreign language acquisition often investigates quantitative differences between learners, and usually a group of native speakers serves as a control group, but often without elaborating on differences within this group to the same extent. We examine inter-personal differences using data from two well-controlled German native speaker corpora collected as control groups in the context of second and foreign language research. Our results suggest that certain linguistic aspects vary to an extent in the native speaker data that undermines general statements about quantitative expectations in L1. However, we also find differences between phenomena: while morphological and syntactic sub-classes of verbs and nouns show great variability in their distribution in native speaker writing, other, coarser categories, like parts of speech, or types of syntactic dependencies, behave more predictably and homogeneously. Our results highlight the necessity of accounting for inter-individual variance in native speakers where L1 is used as a target ideal for L2. They also raise theoretical questions concerning a) explanations for the divergence between phenomena, b) the role of frequency distributions of morphosyntactic phenomena in usage-based linguistic frameworks, and c) the notion of the individual adult native speaker as a general representative of the target language in language acquisition studies or language in general.

2020

Guescini, Rolf Borgen; Krause, Thomas; Guescini, Rolf Borgen (2020) Laudatio Repository - Long-term Access and Usage of Deeply Annotated Information Docker Images[DOI] [ViVo]
This is the dockerized images of the Laudatio Repository software described by the following: The management and archiving of digital research data is an overlapping field for linguistics, library and information science (LIS) and computer science: The departments of Corpus Linguistics and the Computer and Media Service (CMS) at Humboldt-Universität zu Berlin and The National Institute for Research in Computer Science and Control (INRIA France) are project partners cooperating with the Berlin School of Library and Information Science (BSLIS). LAUDATIO has developed an open access research data repository for historical corpora. For the access and (re-)use of historical corpora, the LAUDATIO repository uses a flexible and appropriate documentation schema with a subset of TEI customized by TEI ODD. The extensive metadata schema contains information about the preparation and checking methods applied to the data, tools, formats and annotation guidelines used in the project, as well as bibliographic metadata, and information on the research context (e.g. the research project). To provide complex and comprehensive search in the annotation data, the search and visualization tool ANNIS is integrated in the LAUDATIO-Repository.

Präsentationen

2026

Lüdeling, Anke; C05 Development of specialized knowledge in linguistics and register flexibility in early adulthood (2026) Register vs. text type. Why parameters matter In: "Storytelling as a window to language, cognition and societal participation" of the COST Action 22139 Justice to youth language needs [ViVo]
Haden, Lena; Lüdeling, Anke; Fleischer, Jürg (2026) Welch Relativsatz! Entwicklung von freien, eingeleiteten und weiterführenden Relativsätzen mit welch- & N (in RIDGES und RegiL) In: Kolloquium Korpuslinguistik und Phonetik (2026), Humboldt-Universität zu Berlin [ViVo]
Lüdeling, Anke; Haden, Lena; Fleischer, Jürg (2026) Welch Relativsatz! Development and distribution of determining welch- in Early New High German relative clauses In: Wh-elements in Relative and Adverbial clauses: Perspectives on Polyfunctionality (WRAPP) [ViVo]
Krause, Thomas; Lüdeling, Anke; Odebrecht, Carolin; Klotz, Martin (2026) Large Language Models and the Process of Corpus Linguistic Research In: Corpus Linguistics in the AI Era [ViVo]

2025

Weller, Samantha; Schumacher, Nicole; Lüdeling, Anke; Lütke, Beate; Kühnast, Milena (2025) Mittelgroße Lernerkorpora: Feingranulare Registervariation - Parameter, Phänomene, Designentscheidungen In: Große Lernerkorpora - Möglichkeiten und Grenzen; Universität Leipzig [ViVo]
Schumacher, Nicole; Hirschmann , Hagen; Lüdeling, Anke (2025) Erwerb von syntaktischer und morphologischer Komplexität in der Nominaldomäne bei Lernenden des Deutschen als Fremdsprache In: Chinesisch-deutsches linguistisches Kolloquium, TU Berlin [ViVo]
Weller, Samantha; Schumacher, Nicole; Kühnast, Milena; Lütke, Beate; Lüdeling, Anke (2025) Grasping register appropriateness: A ranking study of metalinguistic awareness In: 30th LIPP Symposium "Variation and Cognition in Language", München [ViVo]
Lüdeling, Anke; Zeige, Lars Erik (2025) Modellierung sprachexterner Parameter mit dem Nähe/Distanz-Modell und dem Register-Modell In: 40 Jahre ‚Nähe und Distanz‘ [ViVo]
Hirschmann , Hagen; Schumacher, Nicole; Lüdeling, Anke (2025) Untersuchung von Registerkompetenz und Merkmalen von Nominalstil im Falko-Korpus In: Große Lernerkorpora - Möglichkeiten und Grenzen; Universität Leipzig [ViVo]
Poster

2024

Schumacher, Nicole; Lüdeling, Anke; Hirschmann , Hagen (2024) Stand Falko-Korpus. Auswertungen zum Nominalstil In: Kolloquium Korpuslinguistik und Phonetik (SoSe 2024), Humboldt-Universität zu Berlin [ViVo]

2023

Lüdeling, Anke (2023) Registers and Individual Differences in Heritage Data In: Workshop on Immigrant Languages in the Americas (WILA 14), Flensburg [ViVo]
Krause, Thomas (2023) The four elements of achieving research software sustainability for long tail projects In: deRSE23 - Conference for Research Software Engineering in Germany [ViVo]
At deRSE19 we presented Hexatomic, a project to investigate what small research software projects need to make their software more sustainable. This talk reports our results. Developing annotation software for multi-layer linguistic corpora, we found that there is a minimal infrastructure that needs to be in place to activate a potential for sustainability in the first place, and that four elements play key roles in reducing the risk of software collapse. These four elements are: a clearly defined, resourced maintainer role; multi-modal documentation to guide maintenance and use; automated tests; code review and triage processes supported by static code analysis. We have tested this by performing changes in maintainership and observing success in reviving development and maintenance activities. Our results furthermore clearly point to the necessity for RSEs to be involved in software projects as part of research projects. In this talk, we describe our research project, the minimal infrastructure as well as development, maintenance, and release and publication workflows we implemented, and how the four elements helped maintainers take up their roles.

2022

Renis, André; Golcher, Felix (2022) Introduction to VIVO In: Kolloquium SFB1412 (2022) [ViVo]
Temme, Anne; Golcher, Felix (2022) Contrast Coding In: CRC/MGK Methodschool 2022 [ViVo]
Lüdeling, Anke; Krause, Thomas (2022) What is a good corpus? In: SFB 1412 Colloquium, Humboldt-Universität zu Berlin [ViVo]
Lüdeling, Anke (2022) Zum Umgang mit Variation in der Lernersprachenanalyse. Perspektiven aus und für DaF/DaZ In: LCR2022 6th Learner Corpus Research Conference, Padua, September [ViVo]
Lüdeling, Anke (2022) Variability in Grammatical Categories and Structures: The Case of Word Formation, Ghent, Belgium In: Grammar and Corpora (GaC) [ViVo]
Shadrova, Anna; Wan, Shujun; Lukassek, Julia; Lüdeling, Anke (2022) Complex nouns as markers of academic register in L1-and L2-authored essays In: Word formation and discourse structure [ViVo]
Temme, Anne; Renis, André (2022) Jupyter Notebooks for Linguists: Introduction to FLAIR NLP In: CRC/MGK Methodschool 2022 [ViVo]
Mooshammer, Christine; Oliviera, Maggie Bullock; Lüdeling, Anke; Lange, Robert; Sell, Bianca Maria (2022) Non-Native Addressee Register in German In: 11th International Conference on Language Variation in Europe - ICLaVE|11 [ViVo]
Lüdeling, Anke; Lukassek, Julia; Shadrova, Anna (2022) Variation and Productivity in German L1 and L2 Nominal Word Formation In: DGfS 2022, University of Tübingen [ViVo]
Mooshammer, Christine; Belz, Malte; Sell, Bianca Maria; Lange, Robert; Lüdeling, Anke; Terada, Megumi (2022) Das Berlin Dialogue Corpus (BeDiaCo) und das Corpus of Non-Native Addressee Register (CoNNAR) – themen- und aufgabenorientierte spontansprachliche Dialoge in direkter und videobasierter Kommunikation In: IDS-Methodenmesse 2022 [ViVo]

2021

Temme, Anne; Golcher, Felix (2021) R Markdown In: CRC/MGK Methodschool 2021 [ViVo]
Lüdeling, Anke; Belz, Malte; Mooshammer, Christine; Sell, Bianca Maria; Lange, Robert (2021) The phonetic realisation of verbal inflection in two dialogue registers of German spontaneous speech In: Kolloquium SFB1412 (2021) [ViVo]
Lüdeling, Anke; Lukassek, Julia (2021) Untersuchung von Registern in Texten von Lerner:innen des Deutschen als Fremdsprache In: Colloquium RWTH Aachen [ViVo]
Lukassek, Julia; Lüdeling, Anke (2021) Zum Erwerb von Registerwissen bei Lerner:innen des Deutschen als Fremdsprache. Registerstudien in Lernerkorpora In: Colloquium Uni Gießen [ViVo]
Lukassek, Julia; Lüdeling, Anke (2021) Registerwissen und morphologische Struktur. Eine Studie zu komplexen Wörtern bei Lerner:innen des Deutschen als Fremdsprache und Muttersprachler:innen In: SPIGL [ViVo]
Temme, Anne; Renis, André (2021) Python for Linguists (and Introduction to spaCy NLP) In: CRC/MGK Methodschool 2021 [ViVo]

2020

Renis, André; Temme, Anne; Krause, Thomas; Golcher, Felix (2020) Working with corpora and corpus data In: CRC/MGK Summerschool 2020 [ViVo]
Kühnast, Milena; Lüdeling, Anke; Schumacher, Nicole; Weller, Samantha; Lütke, Beate () Registerflexibilität und metasprachliches Wissen: Wie erklären Deutsch-Lehramtsstudierende grammatische Unterschiede zwischen Sätzen? In: 48. Jahrestagung der Deutsche Gesellschaft für Sprachwissenschaft, Trier [ViVo]

Mitarbeiter*innen

Leitung

Malte Dreyer

Dr. Thomas Krause

Prof. Dr. Anke Lüdeling

Mitarbeiter*innen

Martin Klotz

Thomas Pfuhl

Studentische Hilfskräfte

Rebecca Grunow

Ehemalige Mitarbeiter*innen

Zehui Guo

Felix Golcher

Arsenii Gulevich

Kontakt

Projekt in Förderphase I

Titel in Phase I

Beschreibung für Phase I

Material und Templates

Veröffentlichungen und Präsentationen

Veröffentlichungen

2025

2024

2023

2022

2021

2020

Präsentationen

2026

2025

2024

2023

2022

2021

2020