structure_assignment/README

Pipeline for parsing a file of arbitrary Slovene strings and assigning a
structure_id to each string (creating it first, if necessary).
Example usage:
$ cd scripts
$ ./setup.sh
$ echo "velika miza" > ../tmp/strings.txt
$ echo "kdo ne more mimo česa" >> ../tmp/strings.txt
$ echo "pazi, avto!" >> ../tmp/strings.txt
$ echo "počitnice" >> ../tmp/strings.txt
$ source ../venv/bin/activate
$ python pipeline.py ../tmp/strings.txt ../tmp/dictionary.xml
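
The four echo commands above can also be collapsed into a single step.
The following is only a minimal sketch: it assumes the pipeline expects
one string per line, as the echo commands suggest, and
write_example_strings is a hypothetical helper that is not part of this
repository.

```shell
# Hypothetical helper (not part of the repository): writes the four
# example strings to the file given as the first argument, one per line.
# Assumption: the pipeline reads one string per line.
write_example_strings() {
    # The quoted 'EOF' delimiter keeps the text literal (no expansion).
    cat > "$1" <<'EOF'
velika miza
kdo ne more mimo česa
pazi, avto!
počitnice
EOF
}
```

Once the function is defined in the current shell, it would replace the
echo lines like this:
$ write_example_strings ../tmp/strings.txt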