Ozbolt Menegatti
c6440162b8
NOT WORKING in-between commit
5 years ago
Ozbolt Menegatti
dff9643edf
Simplifying main writing stuff
5 years ago
Ozbolt Menegatti
89f35f5259
handling writers for when we don't need outputs (no --all for example)
5 years ago
Ozbolt Menegatti
5929004c44
now using new formatters, simplifies the code nicely
5 years ago
Ozbolt Menegatti
111b088c6c
defining formatter for --output
5 years ago
Ozbolt Menegatti
2a437b1703
Defining writer for --all
5 years ago
Ozbolt Menegatti
96e61d2f64
Defining Formatter parent class for out/all/stats output files
5 years ago
Ozbolt Menegatti
2387bd7cb7
Stats flag
5 years ago
Ozbolt Menegatti
6a9ee516a3
EMPTY COMMIT - fixing some pylint warnings
5 years ago
Ozbolt Menegatti
9117734b91
EMPTY COMMIT - assert statement vs function call
and one if statement simplified and unused variable
5 years ago
Ozbolt Menegatti
46e169095c
EMPTY COMMIT - removing too long lines
5 years ago
Ozbolt Menegatti
797060f619
EMPTY COMMIT - removing trailing whitespace
5 years ago
Ozbolt Menegatti
3a22cd91c3
determining jppb (for 2 word statistics)
5 years ago
Ozbolt Menegatti
30a5e80569
determine polnopomenska-beseda (content word) components in structure (for now only type='main')
5 years ago
Ozbolt Menegatti
9ae7e1e9f6
Determine distinct matches for one collocation id.
5 years ago
Ozbolt Menegatti
2773a8b9e9
Getters for number of lemmas and number of all words
5 years ago
Ozbolt Menegatti
2167e4b6fe
Restrictions now always a list, removes/simplifies a bit of code
5 years ago
Ozbolt Menegatti
d83d619dc0
removing old __str__ and __repr__ debugging code
5 years ago
Ozbolt Menegatti
b2baedca52
determining dispersions
5 years ago
Ozbolt Menegatti
57c0ff6f85
Removing prints from slimmer
5 years ago
Ozbolt Menegatti
3263125898
Also need to check msd for agreements in the whole corpus.
5 years ago
Ozbolt Menegatti
44d532808d
tqdm now optional
5 years ago
Ozbolt Menegatti
ed27e549b7
Adding slimming script
5 years ago
Ozbolt Menegatti
08c8050f3f
Removing old logging.debug calls, makes matching stuff much faster :)
5 years ago
Ozbolt Menegatti
2c8a9f0ed0
Whitespace fixes
5 years ago
Ozbolt Menegatti
460a55cb6c
Improving representation speed ~5%
5 years ago
Ozbolt Menegatti
5f226d0cd4
fixing matching of agreements with msd
5 years ago
Ozbolt Menegatti
5b9859af3e
Removing dead code
5 years ago
Ozbolt Menegatti
44f0a6762e
Improving speed of matching ~40%
5 years ago
Ozbolt Menegatti
fe4c95939f
Removing deprecated commented out code.
5 years ago
Ozbolt Menegatti
ed83b2b9c4
implementing multiple agreements to one cid.
5 years ago
Ozbolt Menegatti
0249ef1523
Correct order for wordform any/msd rendering
(most frequent first)
5 years ago
Ozbolt Menegatti
119b85568f
actually not showing components without representation
5 years ago
Ozbolt Menegatti
7d1bfbf73e
wordform all only lowercase
5 years ago
Ozbolt Menegatti
ad7ba8c0b2
removing debugging/dead code
5 years ago
Ozbolt Menegatti
09bd4f55ef
mor->more typo
5 years ago
Ozbolt Menegatti
bfd4d4a747
Refactoring representations. Now muuuuch nicer code, not yet working though :)
Added: multiple representations per component id
5 years ago
Ozbolt Menegatti
307007218d
Work to fix #757-104 and #757-89
for word_form all, now removing duplicates
for word_form msd, now word_forms from the collocation, not from whole corpus
determining more specific msd for agreements, so that it gets a better match when using backup-lemma representation
for agreements, now ordered by the collocation's own number of occurrences, not global
removed a bit of debug code
5 years ago
Ozbolt Menegatti
4c2b5f2b13
Updating for lemma representation of word_form. Also cleaning code, adding tqdm,...
5 years ago
Ozbolt Menegatti
3c669c7901
looking for agreements from the whole corpus
5 years ago
Ozbolt Menegatti
e99ba59908
lemma/msd representations now global! Need to also use for agreements
5 years ago
Ozbolt Menegatti
d14efff709
Intermediate UGLY CODE commit. Working more on representations
5 years ago
Ozbolt Menegatti
dce55d04a3
Does not yet work, agreements in representation
5 years ago
Ozbolt Menegatti
5bd0b4a064
correct representation when rep_failed
5 years ago
Ozbolt Menegatti
111512a901
no more structureselection enum
5 years ago
Ozbolt Menegatti
d2f1e95a8f
continued work on representation, almost there...
5 years ago
Ozbolt Menegatti
84a184c44d
I think this is the way to set representations, all info is available
... just have to actually use it
5 years ago
Ozbolt Menegatti
6eefd9c9f6
redid representation storage (as prev commit: to make it easier to use)
find_next does not collect representations, no separate class to parse representation features
5 years ago
Ozbolt Menegatti
19067e4135
Moving matches into collocation ids, now easier for representation
5 years ago
Ozbolt Menegatti
87712128be
joint representation form
5 years ago