Commit Graph

191 Commits (master)
 

Author SHA1 Message Date
Luka 7c735e33f7 Db and syntactic_structures fixes
2 years ago
Luka 598ab102b3 Adding uncommited changes
3 years ago
Luka d67976c3d9 Modified prints + sqlalchemy and psycopg2cffi made optional
3 years ago
Luka 39692e839f Extended recalculate statistics to filtered output
3 years ago
Luka f1366548b6 White reset at paragraphs not sentences + progress bar updates on paragraphs not sentences.
3 years ago
Luka 552f2e4bd0 Changed whitespace aspect from document to sentence based.
3 years ago
Luka 361331515e Ignoring @type=single and added option for --new-tei
3 years ago
Luka fa4479af60 Fixed repeating words bug
3 years ago
Luka 25db8eeb7a Adding --fixed-restriction-order parameter
4 years ago
Luka dd5fa4a1b8 Changed spaces settings - both swiched with neither and left with right.
4 years ago
Luka c63a9d47da Adding restriction on spaces on punctuations.
4 years ago
Luka 6dd97838b4 Added fix for when two restrictions are satisfied with the same word.
4 years ago
Luka 8c87d07b8a Scripts adapted to changes of new structures.xml format
4 years ago
Luka 09c4277ebe Modified error signal + Fixed no_stat
4 years ago
Luka 06435aa3a2 Added options for "modra"
4 years ago
Luka 1ea454f63c Added fix for punctuations
4 years ago
Luka d5668c8b68 Moved wani.py + Added ignore of .zstd files for valency
4 years ago
Luka 412d0c0f62 Changing file structure
4 years ago
Luka c19c95ad97 Renaming src to luscenje struktur
4 years ago
Luka 5bff3e370f Added setup.py
4 years ago
Luka 01b08667d2 Added some functions for compatibility with valency, fixed readme and fixed some minor bugs.
4 years ago
Luka 1b0e6a27eb Modified readme.md + Removed obligatory sloleks_db + Added frequency_limit and sorted parameters in recalculate_statistics.py
4 years ago
Luka 41952738ed Added support for valency
4 years ago
Luka e38ff4c7b0 Added limit to minimum frequency = 10 + Ordered by frequency
4 years ago
Luka edea80e6e0 Added script for file extension
4 years ago
lkrsnik e8fdbfdb6a Merge branch 'master' of https://gitea.cjvt.si/ozbolt/luscenje_struktur
4 years ago
lkrsnik 49a8d5123e Quick fix for missing dispersions
4 years ago
Luka 8cf9083421 Removing results
4 years ago
Luka 23b062cc1b Adding issue992 fixes
4 years ago
Luka f330a37764 Improved representations speed + Fixed bug in representations
4 years ago
lkrsnik 4c84873ff5 Fixing for run.sh and adding run.sh
4 years ago
Luka 14951e8422 Added multi file reading
4 years ago
Luka eb86a6bb1c Added collocation_sentence_map_dest
4 years ago
Luka 9a9d344510 Created new column "Joint_representative_form_variable" + Fixed collocation structures + Fixed bug with wrong lemma_fallback msds
4 years ago
Luka de3e52c57c Changed output document to reflect most frequent word order
4 years ago
Luka 777791ad1e Added s/z, k/h + fixed bug 90 + connecting with sloleks on lemma_fallback
4 years ago
ozbolt ec113f9cd2 Merge branch 'sql-join-test' of ozbolt/luscenje_struktur into master
4 years ago
Ozbolt Menegatti 9e8cd2a2ec Issue #1000
4 years ago
Ozbolt Menegatti 1d4c0238a6 fixing how min_freq is used and more verbose writer
5 years ago
Ozbolt Menegatti 8fee3f8a8e Testing delayed insertions of representations
5 years ago
Ozbolt Menegatti 6bb3586051 Attempt at speed optimization with sql-join
5 years ago
Ozbolt Menegatti 4124036474 match_num now loaded from database
5 years ago
Ozbolt Menegatti 07242f74c8 Also remember representations step.
5 years ago
Ozbolt Menegatti 33528f1495 step_done now implemented in database.py
5 years ago
Ozbolt Menegatti 3ea62ed242 dispersions now loaded into database and stored/loaded.
5 years ago
Ozbolt Menegatti dedc031696 Step recorded: generate_renders
5 years ago
Ozbolt Menegatti 046aef031f adding timeinfo
5 years ago
Ozbolt Menegatti 2018745d52 files loaded now in database
5 years ago
Ozbolt Menegatti 8cca761b91 min frequecy now part of writer
5 years ago
Ozbolt Menegatti 3f1c154705 can now load csv files
5 years ago