Ozbolt Menegatti
|
d497749c78
|
better database commiting
|
5 years ago |
Ozbolt Menegatti
|
b25e3de76b
|
adding total keyword to progress and total time spent
|
5 years ago |
Ozbolt Menegatti
|
771547b7e4
|
progress for dispersions
|
5 years ago |
Ozbolt Menegatti
|
f9bfac6430
|
If no output, then just commit stuff to database and exit.
|
5 years ago |
Ozbolt Menegatti
|
ec02242f47
|
num-words now part of database
|
5 years ago |
Ozbolt Menegatti
|
ea92b44d71
|
Removing parallel stuff
|
5 years ago |
Ozbolt Menegatti
|
d771137dc7
|
removing pickled structures
|
5 years ago |
Ozbolt Menegatti
|
a07d14011d
|
simplifying progress, because I will remove the parallel stuff
|
5 years ago |
Ozbolt Menegatti
|
577983427e
|
Better error reporting in parsing syntactic structures
|
5 years ago |
Ozbolt Menegatti
|
48795c6227
|
common msd now calculated per colocation id and not for whole corpus
|
5 years ago |
Ozbolt Menegatti
|
2f789e6550
|
last agreement now confirms some matches even if not all matches are ok
|
5 years ago |
Ozbolt Menegatti
|
1401b82324
|
Adding msd to out formatter
|
5 years ago |
Ozbolt Menegatti
|
47340fe80c
|
common msd now based on (lemma,msd0) not only lemma #757-127
|
5 years ago |
Ozbolt Menegatti
|
8c20295adf
|
Adding dispersions to sqlite, finished moving to it.
|
5 years ago |
Ozbolt Menegatti
|
b5e281bdf4
|
adding indexes for speed and set_representations via database
|
5 years ago |
Ozbolt Menegatti
|
188763c06a
|
Incorporating database also in MatchStore
|
5 years ago |
Ozbolt Menegatti
|
c25844a335
|
adding separate database class
|
5 years ago |
Ozbolt Menegatti
|
fa8a5e55f8
|
Merge branch 'sqlite'
|
5 years ago |
Ozbolt Menegatti
|
c2c2ce7ff8
|
making sorted words sorted a bit more non-randomly.
|
5 years ago |
Ozbolt Menegatti
|
8b06c4ec38
|
Skipping already used abailable words, stupid refactoring bug
|
5 years ago |
Ozbolt Menegatti
|
11706b6f81
|
word stats on sqlite now, not yet really working.
|
5 years ago |
Ozbolt Menegatti
|
1256a4de40
|
Fixing loading bad gz files and progress showing
|
5 years ago |
Ozbolt Menegatti
|
049f5ca3dc
|
Adding new N* msds
|
5 years ago |
Ozbolt Menegatti
|
cfdb36b894
|
Adding ability to load gz files.
|
5 years ago |
Ozbolt Menegatti
|
d2f6f8dac8
|
adding new Nw msd
|
5 years ago |
Ozbolt Menegatti
|
70b05e8637
|
New progress bar
|
5 years ago |
Ozbolt Menegatti
|
3552f14b81
|
Loader to its own module
|
5 years ago |
Ozbolt Menegatti
|
51cf3e7064
|
Improving debugging ouptut
|
5 years ago |
Ozbolt Menegatti
|
dc285ce265
|
Saving memory in word-stats
|
5 years ago |
Ozbolt Menegatti
|
37acabc076
|
able to load pickled structures
|
5 years ago |
Ozbolt Menegatti
|
f0109771aa
|
chunk size now handled in file-sentence-generator
|
5 years ago |
Ozbolt Menegatti
|
0d8aeb2282
|
load_files now returns a generator of senteces, not a generator of the whole file
This makes it much slower, but more adaptable for huge files.
|
5 years ago |
Ozbolt Menegatti
|
a8183cf507
|
word stats now collected more memory-efficient
|
5 years ago |
Ozbolt Menegatti
|
90dbbca5d5
|
HUGE refactor, creating lots of modules, no code changes though!
|
5 years ago |
Ozbolt Menegatti
|
43c6c9151b
|
Simplifying and also improving the speed (less regex comparisons!)
|
5 years ago |
Ozbolt Menegatti
|
09bdd0fe3f
|
Adding gitignore
|
5 years ago |
Ozbolt Menegatti
|
c0939fbbd4
|
fixed performance bug for representations
No more creating millions of namedtuple classes. Works about 15x faster
|
5 years ago |
Ozbolt Menegatti
|
3be4118dc0
|
Refactoring lexis/morphology matchers, now "pickable".
|
5 years ago |
Ozbolt Menegatti
|
ad0f9b0956
|
Fixing logdice all stat (and mini refactoring)
|
5 years ago |
Ozbolt Menegatti
|
d30f8c1980
|
Dynamically calculated max num components
|
5 years ago |
Ozbolt Menegatti
|
c0a22a4ef3
|
float formatting for stats
|
5 years ago |
Ozbolt Menegatti
|
bf0ed35e00
|
removing old unused commented out code
|
5 years ago |
Ozbolt Menegatti
|
68c22d4e27
|
deprecating output to stdout
|
5 years ago |
Ozbolt Menegatti
|
b819d9953f
|
using new formatters via --out and --out-no-stat
|
5 years ago |
Ozbolt Menegatti
|
432dc87a5f
|
new outformatter, old is not outnostatformatter
|
5 years ago |
Ozbolt Menegatti
|
cb53a9c7b3
|
moving delta_p12/21 to the end of stats formatter
|
5 years ago |
Ozbolt Menegatti
|
9ccbd02603
|
Implementing the rest of stats. Maybe ok?
|
5 years ago |
Ozbolt Menegatti
|
d7f97ba9b3
|
implementing but commenting out distinct_2w_forms
|
5 years ago |
Ozbolt Menegatti
|
ca0d6f0f55
|
num_words now proper dict
|
5 years ago |
Ozbolt Menegatti
|
865351b3f6
|
Turns out previous commit was OK. Proceeding with stats work
|
5 years ago |