wg1:wg1
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionLast revisionBoth sides next revision | ||
wg1:wg1 [2024/03/27 14:20] – kaja.dobrovoljc | wg1:wg1 [2024/06/11 14:20] – [Upcoming meetings] added Task 1.3 meeting atul.kumar.ojha | ||
---|---|---|---|
Line 25: | Line 25: | ||
==== Members and organisation ==== | ==== Members and organisation ==== | ||
* [[https:// | * [[https:// | ||
- | * Expression of interest in WG1 tasks [[https:// | + | * Activities are currently structured around four primary |
- | * Task 1.1: Linguistic typology and multilingual corpus annotation | + | |
- | * Task 1.2: Extensions and updates to MWE annotation guidelines and UD-PARSEME unification | + | |
- | * Task 1.3: Extensions and updates to morphosyntactic annotation guidelines | + | |
- | * Task 1.4: Sharing tools, formats, and infrastructure | + | |
==== Upcoming meetings ==== | ==== Upcoming meetings ==== | ||
- | * WG Meeting 8 (online) - 11 April 2024, 09:00 CET | + | |
- | * WG Meeting 9 (online) - 11 June 2024, 13:30 CET | + | |
+ | * WG Meeting 9 (online) - 11 June 2024, 13:30 **CEST** | ||
Line 48: | Line 47: | ||
==== WG1 Tasks ==== | ==== WG1 Tasks ==== | ||
- | * **Task 1.1:** Linguistic typology and multilingual | + | * **Task 1.1: Linguistic typology and multilingual corpus annotation** |
+ | * __Leaders / Contacts__: André Coneglian, A. Seza Doğröuz | ||
+ | * __Objectives__: | ||
+ | * __Work plan__: | ||
+ | - Determine ways in which linguistic | ||
+ | - Take into account less-resourced languages in corpus annotation so as to create new annotated corpora | ||
+ | - More broadly, assess how annotated treebanks (particularly UD treebanks) can figure in typological research | ||
+ | * __How can I contribute: | ||
+ | * __Documents / Links__: | ||
* [[https:// | * [[https:// | ||
* [[https:// | * [[https:// | ||
- | * **Task 1.2** on MWE annotation guidelines and UD-PARSEME unification | + | * **Task 1.2 on MWE annotation guidelines and UD-PARSEME unification** |
- | * __Task leaders__: Agata Savary, Voula Giouli, Stella Markanotatou, | + | * __Leaders / Contacts__: Agata Savary, Voula Giouli, Stella Markanotatou, |
* __Objectives__: | * __Objectives__: | ||
- | * __Workplan__: | + | * __Workplan__: |
- | * __How can I contribute: | + | * __How can I contribute: |
- | * __Documents__ | + | * __Documents / Links__ |
- | * [[https:// | + | * [[https:// |
- | * White paper proposition the [[https:// | + | * White paper proposition |
| | ||
- | * **Task 1.3:** Extensions and updates to morphosyntactic annotation guidelines | + | * **Task 1.3: Extensions and updates to morphosyntactic annotation guidelines** |
+ | * __Leaders / Contacts:__ Atul Kr. Ojha, Daniel Zeman | ||
+ | * __Objectives: | ||
+ | * Subtask **A:** Issues in the [[https:// | ||
+ | * Subtask **B:** Construction-oriented guidelines. The UD website is relatively good as a reference manual, with separate pages for individual part-of-speech tags, morphological features and relations in individual languages. It is not so good in providing the big picture with a wholistic solution for individual constructions and strategies, although there is a growing number of documentation pages that attempt to close this gap. Since 2018, there is also an [[https:// | ||
+ | * __How can I contribute: | ||
+ | * Join the ongoing discussions on GitHub (UD issue tracker, see the link above). | ||
+ | * If you can write part of the construction-oriented documentation, | ||
+ | * __Documents / Links:__ | ||
+ | * [[https:// | ||
* [[https:// | * [[https:// | ||
* [[https:// | * [[https:// | ||
- | * **Task 1.4:** Sharing tools, formats, and infrastructure | + | * **Task 1.4: Sharing tools, formats, and infrastructure** |
- | * [[https:// | + | * __Leaders / Contacts__: František Forgáč, Bruno Guillaume |
+ | * __Objectives__: | ||
+ | * Subtask **A**: Provide an overview of existing software and/or tools that support manual linguistic annotation | ||
+ | * Subtask **B**: Evaluate the pros and cons of tabular formats (such as CoNNL-U) currently used in the UD and Parseme projects | ||
+ | * __Workplan__: | ||
+ | * Subtask **A**: The specific objective is to create a comparison table of available manual annotation tools morpho-syntactic and multiword expression annotations. A survey will be propose in the upcoming weeks, to collect feedback adn to produce the final version of the table. | ||
+ | * Subtask **B**: Conduct a detailed analysis of the advantages and disadvantages of the tabular annotation formats, specifically CoNLL-U, as utilized in the Universal Dependencies (UD) and PARSEME projects. A first draft of an evolution of the formats currently used will be proposed for dicussions and for testing. | ||
+ | * __How can I contribute? | ||
+ | * Join to the ongoing discussions on GitHub (links above) | ||
+ | * Stay tuned for the call to complete the survey | ||
+ | * Join the task co-leaders team | ||
+ | * __Documents__ | ||
+ | * [[https:// | ||
+ | * GitHub discussions about [[https:// | ||
+ | * Document used in the Task 1.4 session at the WG1 meeting in Naples (February 2024): | ||
+ | * **Task 1.5: Annotation of Spoken data** | ||
+ | * __Leaders / Contacts__: Kaja Dobrovoljc, Sylvain Kahane | ||
+ | * __Objectives__: | ||
==== Training ==== | ==== Training ==== | ||
* [[https:// | * [[https:// |
wg1/wg1.txt · Last modified: 2024/06/11 15:38 by dan.zeman