Open Dutch FrameNet
Welcome to the Open Dutch FrameNet! This page contains pointers to our lexicon and corpus:
Dutch FrameNet lexicon: https://github.com/cltl/OpenDutchFrameNetLexicon/tree/main
Dutch FrameNet data: https://github.com/cltl/OpenDutchFrameNetData
We also present an overview of the resources below. You can read a detailed description in the following publication:
Piek Vossen, Pia Sommerauer, Levi Remijnse. From incidents to framing: a Dutch and English frame-semantic corpus and lexicon. LREC 2026. [link to be added].
Data-to-text approach
We collected texts covering different types of events to capture a large spectrum of variation in framing. We used Wikidata and other event registeries to collect texts written about specific events.
Lexicon overview
The lexicon derived from the data-to-text corpus has a total of 6,964 entries. More detailed information is shown below:
| Dutch FrameNet v1.0 | Counts |
|---|---|
| Total entries | 6,964 |
| Nouns | 2,883 |
| Names | 218 |
| Verbs | 2,621 |
| Adjectives | 530 |
| Frames per entry | 3.01 |
| Annotation per entry | 12.41 |
| Total annotations | 28,670 |
| Frame coverage | 2,311 |
| Manual annotations | 27,841 |
| Lexical baseline | 829 |
| Data-to-text annotations | 22,435 |
| RBN-Wordnet-FrameNet mappings | 954 |
| Sonar annotations | 5,281 |
Corpus overview
Using the data-to-text approach, we have collected reference texts in English and Dutch. A subset of the texts has been human-annotated for frames related to the incident of interest. Another subset of the English texts has been machine-labeled with OpenSesame (for all frames).
| total | en | nl | both | |
|---|---|---|---|---|
| types | 38 | 37 | 28 | 27 |
| incidents | 22,058 | 22,002 | 2,333 | 2,277 |
| texts | 28,642 | 25,648 | 2,993 | n.a. |
| human-annotated | 1,312 | 975 | 337 | n.a. |
| system-annotated | 24,670 | 24,670 | 0 | n.a. |
| primary reference texts | 7,903 | 6,729 | 1,173 | n.a. |
| human-annotated | 1,312 | 975 | 337 | n.a. |
| system-annotated | 5,751 | 5,751 | 0 | n.a. |
| secondary reference texts | 20,739 | 18,919 | 1,820 | n.a. |
| human-annotated | 0 | 0 | 0 | n.a. |
| system-annotated | 18,919 | 18,919 | 0 | n.a. |
| word-count | 4,839,844 | 4,431,000 | 408,251 | n.a. |
| human-annotated | 699,876 | 583,683 | 116,193 | n.a. |
| system-annotated | 3,844,778 | 3,844,778 | 0 | n.a. |