wg4:wg4
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | Next revisionBoth sides next revision | ||
wg4 [2022/12/15 16:04] – olesea.caftanov | wg4 [2022/12/15 16:10] – olesea.caftanov | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== Working Group 2: Lexicon-corpus interface ====== | + | ====== Working Group 4: Lexicon-corpus interface ====== |
+ | |||
+ | This WG is transversal to WGs 1-3 and will focus on how the Action serves inter- and intra-linguistic | ||
+ | diversity. Its activities will overlap with the 3 other WGs in: | ||
+ | 20 | ||
+ | Networking for diversity: (i) bringing together pre-existing groups dedicated to NLP-applicable | ||
+ | universality, | ||
+ | groups, (iii) integrating experts in linguistic typology; | ||
+ | Quantifying diversity: (i) designing measures of inter- and intra-linguistic diversity in language | ||
+ | resources and tools, (ii) using these measures to quantify diversity in UD and PARSEME corpora; | ||
+ | Promoting diversity: (i) procedures for better use of the existing resources, based on their | ||
+ | estimated diversity, (ii) selecting new data to be annotated, so as to favour intra-linguistic diversity, | ||
+ | (iii) designing evaluation scenarios which favour tools performing well on rare and diverse | ||
+ | phenomena and low-resourced languages, (iv) integrating and training new experts dedicated to | ||
+ | low-resourced and endangered languages, (v) validating the unified annotation guidelines (WG1) | ||
+ | and lexicon formats (WG2) against newly included languages and defining new language-specific | ||
+ | categories and extensions, if needed, (vi) coordinating of the creation and enhancement of annotated | ||
+ | corpora and lexica for low-resourced languages, (vii) discovering and analysing rare linguistic | ||
+ | phenomena, and describing them in resources and tools, (viii) coordination of the development of | ||
+ | NLP tools (WG3) for low-resourced and endangered languages. | ||
wg4/wg4.txt · Last modified: 2024/04/22 15:30 by marie-catherine.de-marneffe