Pipeline for parsing a file of arbitrary Slovene strings and assigning
structure_ids to each string, creating them first if necessary.

Example usage:
$ cd scripts
$ ./setup.sh
$ echo "velika miza" > ../tmp/strings.txt
$ echo "kdo ne more mimo česa" >> ../tmp/strings.txt
$ echo "pazi, avto!" >> ../tmp/strings.txt
$ echo "počitnice" >> ../tmp/strings.txt
$ source ../venv/bin/activate
$ python pipeline.py ../tmp/strings.txt ../tmp/dictionary.xml
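
The echo commands above build the input file one string per line. For longer lists, the same file could be written from Python instead. This is only a sketch: it assumes the pipeline expects UTF-8 text with one string per line (which the example suggests but does not state), and write_strings and EXAMPLE_STRINGS are hypothetical names, not part of this repository.

```python
from pathlib import Path

# Hypothetical sample data: the same four strings as in the echo commands.
EXAMPLE_STRINGS = [
    "velika miza",
    "kdo ne more mimo česa",
    "pazi, avto!",
    "počitnice",
]


def write_strings(path, strings):
    """Write one string per line as UTF-8 and return the resulting Path.

    Assumption: the pipeline reads UTF-8, one string per line.
    """
    path = Path(path)
    # Create the target directory (e.g. ../tmp) if it does not exist yet.
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text("".join(s + "\n" for s in strings), encoding="utf-8")
    return path


if __name__ == "__main__":
    # Run from the scripts directory, like the commands above.
    write_strings("../tmp/strings.txt", EXAMPLE_STRINGS)
```

Writing the file with an explicit encoding keeps characters such as "č" intact regardless of the terminal's locale settings.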