The UniDive COST action (task 1.2) and the PARSEME community are happy to announce the upcoming multilingual corpus annotation campaign dedicated to multiword expressions (MWEs). We call for expression of interest from current or future Language Leaders, who wish to propose a language team. If you are interested, please, fill in the EoI form, best before 27 February 2025.
Three past PARSEME annotation campaigns were dedicated exclusively to verbal MWEs (VMWEs) and resulted in 4 editions of the PARSEME corpus, which jointly covers 26 languages. Three PARSEME shared tasks on automatic identification of VMWEs have been organized on the basis of this corpus and set the state of the art in the task.
The current annotation campaign will cover MWEs of all syntactic types (including nominal, adjectival, adverbial and functional MWEs). It follows the spirit of universality. Namely, the annotation guidelines are unified across all participating languages, whenever possible, still leaving room for truly language-specific phenomena. This approach is expected to promote meaningful cross-language comparisons. The resulting corpus will be used in a PARSEME/UniDive shared task on identifying and understanding MWEs, to be proposed for SemEval 2026.
For the languages already present in the PARSEME corpus, the agenda is to:
For new languages, corpora will be annotated for all syntactic types at once. Conversions from other MWE annotation schemes are fine, if curated so as to fit the PARSEME guidelines.
A language team should consist of at least 2 annotators (including the Language Leader), for the sake of inter-annotator agreement estimation. It is possible to start annotating alone and recruit more annotators at a later stage (May at latest).
Centralized documentation and tools (including the online FLAT annotation platform) are available.
We propose the following timeline:
More details about the role of the Language Leader can be found in the PARSEME Language Leader guide.
Feel free to contact us for any questions you might have.
UniDive task 1.2 co-leaders: Voula Giouli, Stella Markantonatou, Carlos Ramisch, Agata Savary, Sara Stymne