Differences
This shows you the differences between two versions of the page.
wg4 [2022/12/15 16:14] – olesea.caftanov | wg4 [2023/03/21 13:46] – [Documents] agata.savary | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Working Group 4: Quantifying and promoting diversity ====== | ====== Working Group 4: Quantifying and promoting diversity ====== | ||
+ | |||
+ | * **Leader**: [[https:// | ||
+ | * **Vice-leader**: | ||
+ | |||
+ | ==== Workplan ==== | ||
This WG is transversal to WGs 1-3 and will focus on how the Action serves inter- and intra-linguistic | This WG is transversal to WGs 1-3 and will focus on how the Action serves inter- and intra-linguistic | ||
diversity. Its activities will overlap with the 3 other WGs in: | diversity. Its activities will overlap with the 3 other WGs in: | ||
- | - Networking for diversity: | + | - **Networking** for diversity: |
* bringing together pre-existing groups dedicated to NLP-applicable universality, | * bringing together pre-existing groups dedicated to NLP-applicable universality, | ||
* integrating experts of (notably low-resourced) languages not yet covered by these groups, | * integrating experts of (notably low-resourced) languages not yet covered by these groups, | ||
* integrating experts in linguistic typology; | * integrating experts in linguistic typology; | ||
- | - Quantifying diversity: | + | - **Quantifying** diversity: |
- | * designing measures of inter- and intra-linguistic diversity in language resources and tools, | + | * designing |
* using these measures to quantify diversity in UD and PARSEME corpora; | * using these measures to quantify diversity in UD and PARSEME corpora; | ||
- Promoting diversity: | - Promoting diversity: | ||
- | * procedures for better use of the existing resources, based on their estimated diversity, | + | * procedures for **better use of the existing resources**, based on their estimated diversity, |
- | * selecting new data to be annotated, so as to favour intra-linguistic diversity, | + | * **selecting new data** to be annotated, so as to favour intra-linguistic diversity, |
- | * designing evaluation scenarios which favour tools performing well on rare and diverse phenomena and low-resourced languages, | + | * designing | ||
- | * integrating and training new experts dedicated to low-resourced and endangered languages, | + | * integrating and **training** new experts dedicated to low-resourced and endangered languages, |
- | * validating the unified annotation guidelines (WG1) and lexicon formats (WG2) against newly included languages and defining new language-specific categories and extensions, if needed, | + | * validating the unified annotation |
- | * coordinating the creation and enhancement of annotated corpora and lexica for low-resourced languages, | + | * coordinating the creation and enhancement of annotated | ||
- | * discovering and analysing rare linguistic phenomena, and describing them in resources and tools, | + | * discovering and analysing |
- | * coordination of the development of NLP tools (WG3) for low-resourced and endangered languages. | + | * coordination of the development of NLP **tools** (WG3) for low-resourced and endangered languages. |
+ | |||
+ | ==== Documents ==== | ||
+ | |||
+ | TBA | ||
wg4/wg4.txt · Last modified: 2024/04/22 15:30 by marie-catherine.de-marneffe