User Tools

Site Tools


wg4:wg4

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
Next revisionBoth sides next revision
wg4 [2022/12/12 19:37] – created agata.savarywg4:wg4 [2023/03/22 11:57] – [Documents] agata.savary
Line 1: Line 1:
-====== Page under construction ======+====== Working Group 4: Quantifying and promoting diversity ====== 
 + 
 +  * **Leader**: [[https://www.asc.ohio-state.edu/demarneffe.1/|Marie-Catherine de Marneffe]] (Belgium) 
 +  * **Vice-leader**: [[https://www.adaptcentre.ie/experts/abigail-walsh/|Abigail Walsh]] (Ireland) 
 + 
 +==== Workplan ==== 
 + 
 +This WG is transversal to WGs 1-3 and will focus on how the Action serves inter- and intra-linguistic  
 +diversity. Its activities will overlap with the 3 other WGs in:  
 + 
 +  - **Networking** for diversity: 
 +     * bringing together pre-existing groups dedicated to NLP-applicable universality,  
 +     * integrating experts of (notably low-resourced) languages not yet covered by these groups,  
 +     * integrating experts in linguistic typology;  
 +  - **Quantifying** diversity: 
 +     * designing **measures of inter- and intra-linguistic diversity** in language resources and tools, 
 +     * using these measures to quantify diversity in UD and PARSEME corpora;  
 +  - Promoting diversity: 
 +     * procedures for **better use of the existing resources**, based on their estimated diversity, 
 +     * **selecting new data** to be annotated, so as to favour intra-linguistic diversity,  
 +     * designing **evaluation scenarios** which favour tools performing well on rare and diverse phenomena and low resourced languages,  
 +     * integrating and **training** new experts dedicated to low-resourced and endangered languages, 
 +     * validating the unified annotation **guidelines** (WG1) and **lexicon formats** (WG2) against newly included languages and defining new language-specific categories and extensions, if needed,  
 +     * coordinating of the creation and enhancement of annotated **corpora** and **lexica** for low-resourced languages, 
 +     * discovering and analysing **rare** linguistic **phenomena**, and describing them in resources and tools, 
 +     * coordination of the development of NLP **tools** (WG3) for low-resourced and endangered languages.  
 + 
 +==== Documents ==== 
 + 
 +  * WG4 Meeting 1 [[https://docs.google.com/presentation/d/1n-yKDOws4zipOG8DekxhZygO5WZYP6AAwIYFcG6hf8o/edit?usp=sharing|slides and discussion summary]] - **16-17 March 2023**, Paris-Saclay University, France (co-located with [[meetings:general_meetings:1st_unidive_general_meeting|UniDive 1st general meeting]],
  
wg4/wg4.txt · Last modified: 2024/04/22 15:30 by marie-catherine.de-marneffe