Commit Graph

19 Commits

Author SHA1 Message Date
792c0b03fd processing kres files in chunks to reduce number of DB connections 2019-04-21 21:50:56 +02:00
3276619e6f close dbclient after usage 2019-04-21 19:35:39 +02:00
bcc64c767c separate parsing of ssj and kres (mem management) 2019-04-21 19:18:19 +02:00
00d9192993 moved parse_ssj into a subprocess 2019-04-21 17:04:20 +02:00
14c607c106 Merge branch 'master' of gitea.cjvt.si:kristjan/cjvt-corpusparser into my-fix 2019-04-21 16:07:15 +02:00
94d5a6cd73 added exception to parsing ssj 2019-04-21 16:07:08 +02:00
5ae1a9783c fixed some logging 2019-04-21 13:11:05 +02:00
voje c6b8426fb3 added adjective handling (appending _ to headwords) 2019-04-19 07:41:50 +02:00
af4f6045bb prevent duplicate entries in DB 2019-04-15 20:48:10 +02:00
f0b0abac1b added functors and headwords to db entry 2019-04-15 02:34:53 +02:00
86e56767dd added parallel processing 2019-04-15 00:25:26 +02:00
cce83045e8 adding per-file parsing, for parallel use 2019-04-14 17:16:45 +02:00
19945a9dd9 changed default mongo auth mechanism 2019-04-14 16:50:54 +02:00
c17361fbda added more logging 2019-04-14 04:18:52 +02:00
voje 2b7339ac5a update instead of insert, fixing sentence duplication in db 2019-04-11 07:55:44 +02:00
77c599dded bug fix 2019-03-24 13:44:47 +01:00
e599740ac9 some updates 2019-03-17 17:25:19 +01:00
5dc1b8825a added .gitignore 2019-03-17 13:22:46 +01:00
f87eafd9f2 corpusparser as standalone project 2019-03-17 13:22:01 +01:00