other-events:parseme-st
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
other-events:parseme-st [2025/04/15 10:02] – agata.savary | other-events:parseme-st [2025/04/15 15:17] (current) – agata.savary | ||
---|---|---|---|
Line 2: | Line 2: | ||
* **Event title**: culminating workshop of the PARSEME 2.0 shared task | * **Event title**: culminating workshop of the PARSEME 2.0 shared task | ||
- | * **Proposal**: | + | * **Proposal**: |
* **Location**: | * **Location**: | ||
* **Dates**: TBA | * **Dates**: TBA | ||
+ | * **Data**: to be provided by the ongoing [[: | ||
* **Shared task organizers**: | * **Shared task organizers**: | ||
* Manon Scholivet, Université Paris-Saclay, | * Manon Scholivet, Université Paris-Saclay, | ||
* Takuya Nakamura, Université Paris-Saclay, | * Takuya Nakamura, Université Paris-Saclay, | ||
- | * [[https:// | + | * [[https:// |
* Eric Bilinski, Université Paris-Saclay, | * Eric Bilinski, Université Paris-Saclay, | ||
* [[https:// | * [[https:// | ||
- | |[[https:// | + | |[[https:// |
+ | |||
+ | ===== Subtask 1: MWE identification ===== | ||
+ | This subtask is an extension of [[https:// | ||
+ | * Task: Given a raw text, automatically underline MWEs in it | ||
+ | * Data: PARSEME 2.0 annotated corpora (not necessarily all the texts from release 1.3) | ||
+ | * Language teams willing to participate with PARSEME data | ||
+ | * Albanian | ||
+ | * Egyptian (ca. 2700-2000 BC): MWEs from the UD-EUJA treebank. | ||
+ | * Georgian | ||
+ | * Greek (Modern) | ||
+ | * Greek (Ancient) | ||
+ | * Hebrew | ||
+ | * Japanese | ||
+ | * Lithuanian | ||
+ | * Persian (Farsi) | ||
+ | * Polish | ||
+ | * Romanian | ||
+ | * Serbian | ||
+ | * Slovene | ||
+ | * Swedish | ||
+ | * Ukrainian | ||
+ | * Minimum annotation effort: 2000 annotated MWEs | ||
+ | |||
+ | ===== Subtask 2: MWE paraphrasing ===== | ||
+ | * Task: Given a sentence with a MWE, rephrase a sentence so that there is no MWEs but the meaning is the same | ||
+ | * Examples: | ||
+ | * //She made up her mind to…// => //She finally decided to…// | ||
+ | * //He kicked the bucket// ===> //He died// (But __not__ //He passed away//) | ||
+ | * Data: | ||
+ | * Selected sentences form PARSEME annotated corpora | ||
+ | * The same sentences manually paraphrased | ||
+ | * One to several hundred examples per language | ||
+ | * Language teams willing to participate | ||
+ | * Albanian | ||
+ | * French | ||
+ | * Greek (Modern) | ||
+ | * Japanese | ||
+ | * Hebrew | ||
+ | * Lithuanian | ||
+ | * Persian (Farsi) | ||
+ | * Polish | ||
+ | * Brazilian Portuguese | ||
+ | * Romanian | ||
+ | * Serbian | ||
+ | * Slovene | ||
+ | * Swedish | ||
+ | * Ukrainian | ||
+ | |||
+ | ===== Timeline ===== | ||
+ | * < | ||
+ | * 19 May 2025 - SemEval notification | ||
+ | * 15 July 2025: Sample data ready | ||
+ | * 1 September 2025: Training data ready | ||
+ | * 1 December 2025: Evaluation data ready (internal deadline; not for public release) | ||
+ | * 10 January 2026: Evaluation start | ||
+ | * 31 January 2026: Evaluation end (latest date; task organizers may choose an earlier date) | ||
+ | * February 2026: Paper submission | ||
+ | * March 2026: Notification to authors | ||
+ | * April 2026: Camera ready | ||
+ | * Summer 2026: SemEval workshop (co-located with a major NLP conference) | ||
other-events/parseme-st.1744704169.txt.gz · Last modified: by agata.savary