User Tools

Site Tools


wg4:wg4

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
wg4 [2022/12/15 16:04] olesea.caftanovwg4 [2023/03/21 13:40] agata.savary
Line 1: Line 1:
-====== Working Group 2Lexicon-corpus interface ======+====== Working Group 4Quantifying and promoting diversity ====== 
 + 
 +  * **Leader**: [[https://www.asc.ohio-state.edu/demarneffe.1/|Marie-Catherine de Marneffe]] (Belgium) 
 +  * **Vice-leader**: [[https://www.adaptcentre.ie/experts/abigail-walsh/|Abigail Walsh]] (Ireland) 
 + 
 +==== Workplan ==== 
 + 
 +This WG is transversal to WGs 1-3 and will focus on how the Action serves inter- and intra-linguistic  
 +diversity. Its activities will overlap with the 3 other WGs in:  
 + 
 +  - **Networking** for diversity: 
 +     * bringing together pre-existing groups dedicated to NLP-applicable universality,  
 +     * integrating experts of (notably low-resourced) languages not yet covered by these groups,  
 +     * integrating experts in linguistic typology;  
 +  - **Quantifying** diversity: 
 +     * designing **measures of inter- and intra-linguistic diversity** in language resources and tools, 
 +     * using these measures to quantify diversity in UD and PARSEME corpora;  
 +  - Promoting diversity: 
 +     * procedures for **better use of the existing resources**, based on their estimated diversity, 
 +     * **selecting new data** to be annotated, so as to favour intra-linguistic diversity,  
 +     * designing **evaluation scenarios** which favour tools performing well on rare and diverse phenomena and low resourced languages,  
 +     * integrating and **training** new experts dedicated to low-resourced and endangered languages, 
 +     * validating the unified annotation **guidelines** (WG1) and **lexicon formats** (WG2) against newly included languages and defining new language-specific categories and extensions, if needed,  
 +     * coordinating of the creation and enhancement of annotated **corpora** and **lexica** for low-resourced languages, 
 +     * discovering and analysing **rare** linguistic **phenomena**, and describing them in resources and tools, 
 +     * coordination of the development of NLP **tools** (WG3) for low-resourced and endangered languages.  
 + 
 +==== Documents ==== 
 + 
 +TBA
  
wg4/wg4.txt · Last modified: 2024/04/22 15:30 by marie-catherine.de-marneffe