Resources

Several types of data beyond traditional publications are used and created throughout the CRC. We  are committed to publish our research data and software wherever possible under an Open Access license. By sharing our resources, we want to enable reproducibility and re-use by the research community.

Corpora

BeDiaCo – Berlin Dialogue Corpus

The corpus consists of acoustic recordings of spontaneous dialogues of German native speakers with both task-free and task-based parts and additional read word lists.

Available from: https://rs.cms.hu-berlin.de/phon
Documentation: https://doi.org/10.18452/21361
Cite as: Malte Belz & Christine Mooshammer (2020): Berlin Dialogue Corpus (BeDiaCo). Version 1.0. Humboldt-Universität zu Berlin: DOI: 10.18452/21361

Software

ANNIS

A web browser-based search and visualization architecture for complex multilayer linguistic corpora with diverse types of annotation.

Available from: https://corpus-tools.org/annis/
Documentation: https://corpus-tools.org/annis/documentation.html
Cite as: Krause, Thomas & Zeldes, Amir (2016): ANNIS3: A new architecture for generic corpus query and visualization. in: Digital Scholarship in the Humanities 2016 (31). http://dsh.oxfordjournals.org/content/31/1/118