Redmine #2619: removes reference to obsolete wani in readme

This commit is contained in:
Cyprian Laskowski 2023-08-30 18:59:34 +02:00
parent 6daaae1725
commit 7ae0b06510

View File

@ -6,16 +6,13 @@ provisional new structures if necessary.
## Installation ## Installation
Installation requires the [CLASSLA](https://github.com/clarinsi/classla) standard_jos models, as Installation requires the [CLASSLA](https://github.com/clarinsi/classla) standard_jos models:
well as (for now) the wani.py script from
[luscenje_struktur](https://gitea.cjvt.si/ozbolt/luscenje_struktur):
pip install . pip install .
python -c "import classla; classla.download('sl', dir='resources/classla', type='standard_jos')" python -c "import classla; classla.download('sl', dir='resources/classla', type='standard_jos')"
curl -o resources/wani.py https://gitea.cjvt.si/ozbolt/luscenje_struktur/raw/branch/master/wani.py
The classla directory and wani.py file do not necessarily need to be placed under resources/, but The classla directory does not necessarily need to be placed under resources/, but the wrapper
the wrapper script scripts/process.py assumes that they are. script scripts/process.py assumes that it is.
## Usage ## Usage
@ -44,12 +41,11 @@ $ python process.py -mode strings_to_parse -infile /tmp/strings.txt -outfile /tm
The input should be a TEI XML file (in the same particular format as The input should be a TEI XML file (in the same particular format as
the output of strings_to_parse) and an xml file of structure the output of strings_to_parse) and an xml file of structure
specifications. The script first uses the MWE extraction script specifications. The script first uses the MWE extraction script to
[wani.py](https://gitea.cjvt.si/ozbolt/luscenje_struktur) to find and find and assign all matches for collocation structures. For units
assign all matches for collocation structures. For units without such without such matches, it then finds (creating, if necessary) and
matches, it then finds (creating, if necessary) and assigns assigns single-component or other structures. Finally the TEI is
single-component or other structures. Finally the TEI is converted to converted to CJVT dictionary XML format. Example:
CJVT dictionary XML format. Example:
``` ```
$ python process.py -mode parse_to_dictionary -infile /tmp/parsed.xml -instructs /tmp/structures_old.xml -outfile /tmp/dictionary.xml -outstructs /tmp/structures_new.xml $ python process.py -mode parse_to_dictionary -infile /tmp/parsed.xml -instructs /tmp/structures_old.xml -outfile /tmp/dictionary.xml -outstructs /tmp/structures_new.xml