gos/data/corpus/00README.txt
2022-07-06 21:35:05 +02:00

11 lines
611 B
Plaintext

Spoken corpus Gos 1.1 (transcription)
Citation, documentation, download, and licence available from
http://hdl.handle.net/11356/1438
The Gos.TEI directory contains the following files:
* gos.xml: The root TEI files giving the corpus header and XIncluding its texts
* gos123.xml: The TEI file of one lecture; the files have been automatically PoS tagged and lemmatised
* Gos-speeches.txt: TSV list of speeches with metadata
* Gos-speakers.txt: TSV list of speakers with metadata
* schema/: Directory with the TEI schema for the corpus