Pipeline which combines scripts and resources from other repositories to parse strings and assign them to standard CJVT structures, creating new structures if necessary.
resources | ||
scripts | ||
tmp | ||
.gitignore | ||
README | ||
requirements.txt |
Pipeline for parsing a file of arbitrary Slovene string and assigning (first creating, if necessary) structure_ids for each string. Example usage: $ cd scripts $ ./setup.sh $ echo "velika miza" > ../tmp/strings.txt $ echo "kdo ne more mimo česa" >> ../tmp/strings.txt $ echo "pazi, avto!" >> ../tmp/strings.txt $ echo "počitnice" >> ../tmp/strings.txt $ source ../venv/bin/activate $ python pipeline.py ../tmp/strings.txt ../tmp/dictionary.xml