EDNIL (2020)

Event Detection from News in Indian Languages

Dataset


Datset Descreption

The dataset is annotated at the word level. Each word is enclosed in <W> tag. These <W> tags are enclosed together in <P> tags. Words which are related to any of the events, reason, place, casualties, time are enclosed in separate tags. The tags are listed below.

<MANMADE_EVENT TYPE = “Subtype”>: This tag contains the words that are related to a manmade disaster. The TYPE contain the subtypes that have been mentioned for manmade disaster in Task 2.

<NATURAL_EVENT TYPE = “Subtype”>: This tag contains the words that are related to a natural disaster. The TYPE contain the subtypes that have been mentioned for natural disaster in Task 2.

<REASON-ARG>: This tag contains the words that are the reason due to which the event has occurred.

<TIME-ARG>: This tag contains the words that are time at which the event has occurred.

<CASUALTIES-ARG>: This tag contains the words that are casualties that have occurred due to an event.

<PLACE_ARG>: This tag contains the words that is the place at which the event has occurred.


Train and Test Data


Language

Train Data

Test Data

Bengali

Download

Download

English

Download

Download

Hindi

Download

Download

Marathi

Download

Download

Tamil

Download

Download

Decryption key for the datasets can be obtained by registering for the task.