wg1:wg1
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionNext revisionBoth sides next revision | ||
wg1:wg1 [2024/03/28 10:03] – kaja.dobrovoljc | wg1:wg1 [2024/06/10 15:37] – [WG1 Tasks] bruno.guillaume | ||
---|---|---|---|
Line 30: | Line 30: | ||
==== Upcoming meetings ==== | ==== Upcoming meetings ==== | ||
- | * WG Meeting 8 (online) - 11 April 2024, 09:00 CET | + | * WG Meeting 8 (online) - 11 April 2024, 09:00 **CEST** |
- | * WG Meeting 9 (online) - 11 June 2024, 13:30 CET | + | * WG Meeting 9 (online) - 11 June 2024, 13:30 **CEST** |
Line 47: | Line 47: | ||
==== WG1 Tasks ==== | ==== WG1 Tasks ==== | ||
* **Task 1.1: Linguistic typology and multilingual corpus annotation** | * **Task 1.1: Linguistic typology and multilingual corpus annotation** | ||
+ | * __Leaders / Contacts__: André Coneglian, A. Seza Doğröuz | ||
+ | * __Objectives__: | ||
+ | * __Work plan__: | ||
+ | - Determine ways in which linguistic typology can help in the trade-off between universality and language specific phenomena in corpus annotation (Systematic overview of problematic (or difficult) phenomena for annotation (e.g., noun incorporation, | ||
+ | - Take into account less-resourced languages in corpus annotation so as to create new annotated corpora | ||
+ | - More broadly, assess how annotated treebanks (particularly UD treebanks) can figure in typological research | ||
+ | * __How can I contribute: | ||
+ | * __Documents / Links__: | ||
* [[https:// | * [[https:// | ||
* [[https:// | * [[https:// | ||
Line 55: | Line 63: | ||
* __Workplan__: | * __Workplan__: | ||
* __How can I contribute: | * __How can I contribute: | ||
- | * __Documents__ | + | * __Documents / Links__ |
* [[https:// | * [[https:// | ||
* White paper proposition of the [[https:// | * White paper proposition of the [[https:// | ||
| | ||
* **Task 1.3: Extensions and updates to morphosyntactic annotation guidelines** | * **Task 1.3: Extensions and updates to morphosyntactic annotation guidelines** | ||
+ | * __Leaders / Contacts:__ Atul Kr. Ojha, Daniel Zeman | ||
+ | * __Objectives: | ||
+ | * Subtask **A:** Issues in the [[https:// | ||
+ | * Subtask **B:** Construction-oriented guidelines. The UD website is relatively good as a reference manual, with separate pages for individual part-of-speech tags, morphological features and relations in individual languages. It is not so good in providing the big picture with a wholistic solution for individual constructions and strategies, although there is a growing number of documentation pages that attempt to close this gap. Since 2018, there is also an [[https:// | ||
+ | * __How can I contribute: | ||
+ | * Join the ongoing discussions on GitHub (UD issue tracker, see the link above). | ||
+ | * If you can write part of the construction-oriented documentation, | ||
+ | * __Documents / Links:__ | ||
+ | * [[https:// | ||
* [[https:// | * [[https:// | ||
* [[https:// | * [[https:// | ||
* **Task 1.4: Sharing tools, formats, and infrastructure** | * **Task 1.4: Sharing tools, formats, and infrastructure** | ||
- | * [[https:// | + | * __Leaders / Contacts__: František Forgáč, Bruno Guillaume |
+ | * __Objectives__: | ||
+ | | ||
+ | * Subtask **B**: Evaluate the pros and cons of tabular formats (such as CoNNL-U) currently used in the UD and Parseme projects | ||
+ | * __Workplan__: | ||
+ | * Subtask **A**: The specific objective is to create a comparison table of available manual annotation tools morpho-syntactic and multiword expression annotations. A survey will be propose in the upcoming weeks, to collect feedback adn to produce the final version of the table. | ||
+ | * Subtask **B**: Conduct a detailed analysis of the advantages and disadvantages of the tabular annotation formats, specifically CoNLL-U, as utilized in the Universal Dependencies (UD) and PARSEME projects. A first draft of an evolution of the formats currently used will be proposed for dicussions and for testing. | ||
+ | * __How can I contribute? | ||
+ | * Join to the ongoing discussions on GitHub (links above) | ||
+ | * Stay tuned for the call to complete the survey | ||
+ | * Join the task co-leaders team | ||
+ | * __Documents__ | ||
+ | | ||
+ | * GitHub discussions about [[https:// | ||
+ | * Document used in the Task 1.4 session at the WG1 meeting in Naples (February 2024): | ||
+ | * **Task 1.5: Annotation of Spoken data** | ||
+ | * __Leaders / Contacts__: Kaja Dobrovoljc, Sylvain Kahane | ||
+ | * __Objectives__: | ||
==== Training ==== | ==== Training ==== | ||
* [[https:// | * [[https:// |
wg1/wg1.txt · Last modified: 2024/06/11 15:38 by dan.zeman