You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
voje a6cee3d459 migrated to cjvt-gitea 10 months ago
bilateral-srl @ 86642e1866 added some data and updated README.md 11 months ago
data sending some pipe-breaking files 10 months ago
dockerfiles added parallel json output creation 10 months ago
parser some changes on server 10 months ago
tools Setup that SRL tagged kres 10 months ago
.gitignore parsing... 10 months ago
.gitmodules added some data and updated README.md 11 months ago
Makefile added parallel json output creation 10 months ago
README.md migrated to cjvt-gitea 10 months ago
data_format.xml msdmap.py 11 months ago

README.md

cjvt-srl-tagging

We’ll be using mate-tools to perform SRL on Kres.

workspace

The tools require Java. Go to ./dockerfiles/python-java/ and run make.
You should get a docker environment, mounting this repo.

mate-tools

Check out ./tools/srl-20131216/README.md.

Scripts

Check all possible xml tags (that occur after the tag.

cat F0006347.xml.parsed.xml | grep -A 999999999999 -e '<body>' | grep -o -e '<[^" "]*' | sort | uniq

Tools

  • Parser for reading both SSJ500k 2.1 TEI xml and Kres F....xml.parsed.xml" files found in ./tools/parser/parser.py.
  • fillpred_model for creating a yes/no model for preditcing the predicate (based on ssj500k data).

Usage

$ cd ./dockerfiles/python-java`
$ make
# you should be inside a container now
$ cd ./cjvt-srl-tagging
$ make

If you want to run it on a server overnight, you might want to use nohup, so you can close the ssh connection without closing the process.

$ nohup make &

See progress in generated logfile (check git root).

Makefile

The Makefile follows certain steps:

  1. Create a fillpred model.
  2. Parse .xml files and create .tsv files.
  3. Run mate-tools srl-tagger on the created .tsv files.

Sources