README
Pipeline for parsing a file of arbitrary Slovene strings and assigning a structure_id to each string, creating new structure_ids first where necessary.

Example usage:

$ cd scripts
$ ./setup.sh
$ echo "velika miza" > ../tmp/strings.txt
$ echo "kdo ne more mimo česa" >> ../tmp/strings.txt
$ echo "pazi, avto!" >> ../tmp/strings.txt
$ echo "počitnice" >> ../tmp/strings.txt
$ source ../venv/bin/activate
$ python pipeline.py ../tmp/strings.txt ../tmp/dictionary.xml
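
The echo commands in the example usage build the input file by hand, one string per line. For longer lists it may be easier to do this from Python. The following is only a minimal sketch: the helper names write_strings, pipeline_command and run_pipeline are hypothetical and not part of this repository, and it assumes that pipeline.py takes exactly the two positional arguments shown in the example usage and that the input file is UTF-8 encoded.

```python
import subprocess
import sys
from pathlib import Path


def write_strings(path, strings):
    """Write one string per line (UTF-8 assumed) and return the path."""
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text("".join(s + "\n" for s in strings), encoding="utf-8")
    return path


def pipeline_command(input_path, output_path, script="pipeline.py"):
    """Build the argument list mirroring the README invocation."""
    return [sys.executable, script, str(input_path), str(output_path)]


def run_pipeline(input_path, output_path):
    """Run pipeline.py; assumes the working directory is scripts/
    and that the virtual environment created by setup.sh is active."""
    subprocess.run(pipeline_command(input_path, output_path), check=True)


# Hypothetical usage, equivalent to the shell example:
#   infile = write_strings("../tmp/strings.txt",
#                          ["velika miza", "kdo ne more mimo česa",
#                           "pazi, avto!", "počitnice"])
#   run_pipeline(infile, "../tmp/dictionary.xml")
```

Using sys.executable keeps the call inside whichever interpreter is running the script, so activating ../venv beforehand has the same effect as in the shell example.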