As per redmine#687
- for word_form all, now removing duplicates
- for word_form msd, now taking word forms from the collocation, not from the whole corpus
- determining a more specific msd for agreements, so that it gets a better match
- when using the backup-lemma representation for agreements, now ordered by the collocation's own number of occurrences, not the global one
- removed a bit of debug code
| File |
|---|
| .gitignore |
| msd_translate.py |
| README.md |
| wani.py |
Instructions
Requirements:
- a corpus in "ssj500k format"
- structure definitions
- Python 3.5+
Recommended: the pypy3 package, for faster execution.
Example usage: python3 wani.py ssj500k.xml Kolokacije_strukture.xml izhod.csv
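
The invocation above can also be wrapped in a small launcher that picks pypy3 when it is installed. This is only a sketch: `build_command` and `run_wani` are hypothetical helper names that are not part of this repository, and it assumes that the PyPy interpreter is available on PATH under the name `pypy3` and that wani.py takes exactly the three positional arguments shown in the example (corpus, structure definitions, output file).

```python
import shutil
import subprocess


def build_command(corpus, structures, output, interpreter=None):
    """Build the argument list for running wani.py.

    If no interpreter is given, prefer pypy3 when it is found on PATH
    (assumed executable name) and fall back to python3 otherwise.
    """
    if interpreter is None:
        interpreter = "pypy3" if shutil.which("pypy3") else "python3"
    return [interpreter, "wani.py", corpus, structures, output]


def run_wani(corpus, structures, output):
    """Run wani.py from the current directory; raises if it exits with an error."""
    subprocess.run(build_command(corpus, structures, output), check=True)
```

Calling `run_wani("ssj500k.xml", "Kolokacije_strukture.xml", "izhod.csv")` from the repository root would then be equivalent to the example command, run with whichever interpreter was found.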