As per redmine#687
Ozbolt Menegatti
307007218d
for word_form all, now removing duplicates for word_form msd, now word_forms from the collocation, not from whole corpus determening more specific msd for agreements, so that it gets better match when using backup-lemma representation for agreements, now ordered by colocation's own number of occurances, not global removed a bit of debug code |
||
---|---|---|
.gitignore | ||
msd_translate.py | ||
README.md | ||
wani.py |
Navodila
Potrebne datoteke:
- korpus v "ssj500k obliki"
- definicije struktur
- Python 3.5+
Priporocam: pypy3 paket za hitrejse poganjanje.
Primer uporabe: python3 wani.py ssj500k.xml Kolokacije_strukture.xml izhod.csv