BioNLP'09 Shared Task on Event Extraction
in conjunction with BioNLP, a NAACL-HLT 2009 workshop, June 4-5 2009, Boulder, Colorado


Sample Data

bionlp09_shared_task_sample_data_rev3.tar.gz (8631 bytes)

It contains sample files of shared task data for training and evaluation. The data is in the following file types:

Training Data

bionlp09_shared_task_training_data_rev2.tar.gz (721906 bytes)

Development Data

bionlp09_shared_task_development_data_rev1.tar.gz (146695 bytes)

Test Data

bionlp09_shared_task_test_data_without_gold_annotation.tar.gz (183696 bytes)

Evaluation Tools

bionlp09_shared_task_evaluation_tools_v1.tar.gz (14180 bytes)

a2 file converter

generate-task-specific-a2-file_pl (4161 bytes)
*Please rename generate-task-specific-a2-file_pl to after download.

The *.a2 files in the sample training data has all the annotations required to fullfill the all three tasks. For those who do not participate in Task 2 and/or Task 3, some of the annotations are unnecessary, i.e. the 'M' type annotations are only necessary to those who participate in Task 3. With this script, participants can filter out such unnecessary annotations by specifying tasks they are interested in. Note that only the following task specifications are allowed as Task 1 is mandatory:

standoff format checker

standoff-check_pl (11940 bytes)
*Please rename standoff-check_pl to after download.

It performs format checking for the task specific a2 files, /.a2.t12?3?/. For the detail of its usage and the format, please excuete it without parameters.

event viewer

eventview_pl (4866 bytes)
*Please rename eventview_pl to after download.

It is a simple text-based event annotation viewer. It is not a fancy viewer, but rather developed to support a quick collection of event patterns in a readable shape. The output is designed based on the assumption that it will be used together with the unix command 'grep'.