You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Luka c1ecc4cdbc
Big changes
2 years ago
bilateral-srl@86642e1866 added some data and updated 5 years ago
data sending some pipe-breaking files 5 years ago
dockerfiles Big changes 2 years ago
parser some changes on server 5 years ago
tools Big changes 2 years ago
.gitignore Big changes 2 years ago
.gitmodules added some data and updated 5 years ago
Makefile added parallel json output creation 5 years ago migrated to cjvt-gitea 5 years ago
data_format.xml 5 years ago


We'll be using mate-tools to perform SRL on Kres.


The tools require Java. Go to ./dockerfiles/python-java/ and run make.
You should get a docker environment, mounting this repo.


Check out ./tools/srl-20131216/


Check all possible xml tags (that occur after the tag.

cat F0006347.xml.parsed.xml | grep -A 999999999999 -e '<body>' | grep -o -e '<[^" "]*' | sort | uniq


  • Parser for reading both SSJ500k 2.1 TEI xml and Kres F....xml.parsed.xml" files found in ./tools/parser/
  • fillpred_model for creating a yes/no model for preditcing the predicate (based on ssj500k data).


$ cd ./dockerfiles/python-java`
$ make
# you should be inside a container now
$ cd ./cjvt-srl-tagging
$ make

If you want to run it on a server overnight, you might want to use nohup, so you can close the ssh connection without closing the process.

$ nohup make &

See progress in generated logfile (check git root).


The Makefile follows certain steps:

  1. Create a fillpred model.
  2. Parse .xml files and create .tsv files.
  3. Run mate-tools srl-tagger on the created .tsv files.