Differences
This shows you the differences between two versions of the page.
wg1:wg1:task1.2:call-for-language-leaders [2025/02/14 18:29] – agata.savary | wg1:wg1:task1.2:call-for-language-leaders [2025/02/18 08:04] (current) – agata.savary | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Call for expressions of interest in PARSEME/ | ====== Call for expressions of interest in PARSEME/ | ||
- | The [[https:// | + | The [[https:// |
Three past PARSEME annotation campaigns were dedicated exclusively to __verbal__ MWEs (VMWEs) and resulted in 4 editions of the [[https:// | Three past PARSEME annotation campaigns were dedicated exclusively to __verbal__ MWEs (VMWEs) and resulted in 4 editions of the [[https:// | ||
- | The current annotation campaign will cover MWEs of **all syntactic types**. It follows the spirit of **universality**. Namely, the [[https:// | + | The current annotation campaign will cover MWEs of **all syntactic types** |
For the languages already present in the PARSEME corpus, the agenda is to: | For the languages already present in the PARSEME corpus, the agenda is to: | ||
* Re-annotate the existing corpus with MWEs other than verbal ones. Annotating only part of the existing corpus is an option. In this case we recommend a minimum of 3500 annotated MWEs (so that each selected text is exhaustively annotated for all syntactic types of MWEs). A lower number of annotations is acceptable, but system results are then unlikely to be representative. | * Re-annotate the existing corpus with MWEs other than verbal ones. Annotating only part of the existing corpus is an option. In this case we recommend a minimum of 3500 annotated MWEs (so that each selected text is exhaustively annotated for all syntactic types of MWEs). A lower number of annotations is acceptable, but system results are then unlikely to be representative. | ||
- | * Add some new texts annotated from scratch (to counterbalance language model contamination from previously published data) | + | * Add some new texts annotated from scratch (to counterbalance language model contamination from previously published data) |
- | For new languages, corpora will be annotated for all syntactic types at once. | + | For new languages, corpora will be annotated for all syntactic types at once. |
+ | Conversions from other MWE annotation schemes are fine, if curated so as to fit the PARSEME guidelines. | ||
A language team should consist of **at least 2 annotators** (including the Language Leader), for the sake of inter-annotator agreement estimation. It is possible to start annotating alone and recruit more annotators at a later stage (May at the latest). | A language team should consist of **at least 2 annotators** (including the Language Leader), for the sake of inter-annotator agreement estimation. It is possible to start annotating alone and recruit more annotators at a later stage (May at the latest). | ||
- | Centralized [[https:// | + | Centralized [[https:// |
- | | | + | |
We propose the following timeline: | We propose the following timeline: | ||
- | * [language leaders: 27 February] Expression of interest from Language Leaders | + | |
- | * [task leaders: late-February] Creating FLAT accounts | + | |
- | * [language leaders: mid-March] Reading guidelines, reading the Language Leader guide, filling in MWE examples, recruiting annotators, selecting corpora | + | |
- | * [all: March] Pilot annotation | + | |
- | * [shared task leaders: 31 March] SEMEVAL shared task proposal | + | |
- | * [language teams: April-May] Annotation (including a double-annotated sample for inter-annotator agreement estimation) | + | |
- | * [SEMEVAL: 19 May] Notification about the selected shared task | + | |
- | * [language leaders: June] Consistency checks and inter-annotator agreement estimation | + | |
- | * [shared task leaders: 15 July] Sample data ready | + | |
- | * [task leaders: July-August] Consolidating and splitting the corpora | + | |
- | * [WG3 shared task leaders: 1 September] Training data for SEMEVAL | + | |
More details about the role of the Language Leader can be found in the PARSEME [[https:// | More details about the role of the Language Leader can be found in the PARSEME [[https:// | ||
Feel free to contact us with any questions you might have. | Feel free to contact us with any questions you might have. | ||
- | + | UniDive task 1.2 co-leaders: Voula Giouli, Stella Markantonatou, | |
- | UniDive task 1.2 co-leaders< | + | |
- | Voula Giouli, Stella Markantonatou, | + | |
wg1/wg1/task1.2/call-for-language-leaders.1739554156.txt.gz · Last modified: by agata.savary