User Tools

Site Tools


wg4:wg4

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
wg4 [2022/12/15 16:10] olesea.caftanovwg4 [2023/03/21 13:40] agata.savary
Line 1: Line 1:
-====== Working Group 4: Lexicon-corpus interface ======+====== Working Group 4: Quantifying and promoting diversity ====== 
 + 
 +  * **Leader**: [[https://www.asc.ohio-state.edu/demarneffe.1/|Marie-Catherine de Marneffe]] (Belgium) 
 +  * **Vice-leader**: [[https://www.adaptcentre.ie/experts/abigail-walsh/|Abigail Walsh]] (Ireland) 
 + 
 +==== Workplan ====
  
 This WG is transversal to WGs 1-3 and will focus on how the Action serves inter- and intra-linguistic  This WG is transversal to WGs 1-3 and will focus on how the Action serves inter- and intra-linguistic 
 diversity. Its activities will overlap with the 3 other WGs in:  diversity. Its activities will overlap with the 3 other WGs in: 
- 20 + 
- Networking for diversity: (i) bringing together pre-existing groups dedicated to NLP-applicable  +  - **Networking** for diversity: 
-universality, (ii) integrating experts of (notably low-resourced) languages not yet covered by these  +     bringing together pre-existing groups dedicated to NLP-applicable universality,  
-groups, (iii) integrating experts in linguistic typology;  +     integrating experts of (notably low-resourced) languages not yet covered by these groups,  
- Quantifying diversity: (i) designing measures of inter- and intra-linguistic diversity in language  +     integrating experts in linguistic typology;  
-resources and tools, (ii) using these measures to quantify diversity in UD and PARSEME corpora;  +  - **Quantifying** diversity: 
- Promoting diversity: (i) procedures for better use of the existing resources, based on their  +     designing **measures of inter- and intra-linguistic diversity** in language resources and tools, 
-estimated diversity, (ii) selecting new data to be annotated, so as to favour intra-linguistic diversity,  +     using these measures to quantify diversity in UD and PARSEME corpora;  
-(iii) designing evaluation scenarios which favour tools performing well on rare and diverse  +  Promoting diversity: 
-phenomena and low-resourced languages, (iv) integrating and training new experts dedicated to  +     procedures for **better use of the existing resources**, based on their estimated diversity, 
-low-resourced and endangered languages, (v) validating the unified annotation guidelines (WG1)  +     * **selecting new data** to be annotated, so as to favour intra-linguistic diversity,  
-and lexicon formats (WG2) against newly included languages and defining new language-specific  +     * designing **evaluation scenarios** which favour tools performing well on rare and diverse phenomena and low resourced languages,  
-categories and extensions, if needed, (vi) coordinating of the creation and enhancement of annotated  +     integrating and **training** new experts dedicated to low-resourced and endangered languages, 
-corpora and lexica for low-resourced languages, (vii) discovering and analysing rare linguistic  +     validating the unified annotation **guidelines** (WG1) and **lexicon formats** (WG2) against newly included languages and defining new language-specific categories and extensions, if needed,  
-phenomena, and describing them in resources and tools, (viii) coordination of the development of  +     coordinating of the creation and enhancement of annotated **corpora** and **lexica** for low-resourced languages, 
-NLP tools (WG3) for low-resourced and endangered languages. +     discovering and analysing **rare** linguistic **phenomena**, and describing them in resources and tools, 
 +     coordination of the development of NLP **tools** (WG3) for low-resourced and endangered languages.  
 + 
 +==== Documents ==== 
 + 
 +TBA
  
wg4/wg4.txt · Last modified: 2024/04/22 15:30 by marie-catherine.de-marneffe