Differences

This shows you the differences between two versions of the page.

wg4 [2022/12/15 16:14] olesea.caftanov
wg4:wg4 [2024/04/22 15:30] (current) – [Documents] marie-catherine.de-marneffe
Line 1: Line 1:
 ====== Working Group 4: Quantifying and promoting diversity ====== ====== Working Group 4: Quantifying and promoting diversity ======
 +
 +  * **Leader**: [[https://cental.uclouvain.be/team/mcdm/|Marie-Catherine de Marneffe]] (Belgium)
 +  * **Vice-leader**: [[https://www.ukr.uni-jena.de/members/olha-kanishcheva|Olha Kanishcheva]] (Germany/Ukraine)
 +
 +==== Workplan ====
  
 This WG is transversal to WGs 1-3 and will focus on how the Action serves inter- and intra-linguistic  This WG is transversal to WGs 1-3 and will focus on how the Action serves inter- and intra-linguistic 
 diversity. Its activities will overlap with the 3 other WGs in:  diversity. Its activities will overlap with the 3 other WGs in: 
  
-  - Networking for diversity:+  - **Networking** for diversity:
      * bringing together pre-existing groups dedicated to NLP-applicable universality,       * bringing together pre-existing groups dedicated to NLP-applicable universality, 
      * integrating experts of (notably low-resourced) languages not yet covered by these groups,       * integrating experts of (notably low-resourced) languages not yet covered by these groups, 
      * integrating experts in linguistic typology;       * integrating experts in linguistic typology; 
-  - Quantifying diversity: +  - **Quantifying** diversity: 
-     * designing measures of inter- and intra-linguistic diversity in language resources and tools,+     * designing **measures of inter- and intra-linguistic diversity** in language resources and tools,
      * using these measures to quantify diversity in UD and PARSEME corpora;       * using these measures to quantify diversity in UD and PARSEME corpora; 
   - Promoting diversity:   - Promoting diversity:
-     * procedures for better use of the existing resources, based on their estimated diversity, +     * procedures for **better use of the existing resources**, based on their estimated diversity, 
-     * selecting new data to be annotated, so as to favour intra-linguistic diversity,  +     * **selecting new data** to be annotated, so as to favour intra-linguistic diversity,  
-     * designing evaluation scenarios which favour tools performing well on rare and diverse phenomena and low resourced languages,  +     * designing **evaluation scenarios** which favour tools performing well on rare and diverse phenomena and low-resourced languages,  
-     * integrating and training new experts dedicated to low-resourced and endangered languages, +     * integrating and **training** new experts dedicated to low-resourced and endangered languages, 
-     * validating the unified annotation guidelines (WG1) and lexicon formats (WG2) against newly included languages and defining new language-specific categories and extensions, if needed,  +     * validating the unified annotation **guidelines** (WG1) and **lexicon formats** (WG2) against newly included languages and defining new language-specific categories and extensions, if needed,  
-     * coordinating of the creation and enhancement of annotated corpora and lexica for low-resourced languages, +     * coordinating the creation and enhancement of annotated **corpora** and **lexica** for low-resourced languages, 
-     * discovering and analysing rare linguistic phenomena, and describing them in resources and tools, +     * discovering and analysing **rare** linguistic **phenomena**, and describing them in resources and tools, 
-     * coordination of the development of NLP tools (WG3) for low-resourced and endangered languages. +     * coordinating the development of NLP **tools** (WG3) for low-resourced and endangered languages.  
 + 
 +==== Current Subtasks ==== 
 + 
 +  * **Task 4.1**: Promoting low-resourced/endangered languages [co-leaders: Lucia Amoros, Abigail Walsh] 
 +  * **Task 4.2**: Survey of diversity measures [leader: Louis Estève] 
 + 
 +==== Documents ==== 
 + 
 +  * WG4 in-person meeting 1 [[https://docs.google.com/presentation/d/1n-yKDOws4zipOG8DekxhZygO5WZYP6AAwIYFcG6hf8o/edit?usp=sharing|slides and discussion summary]] - **16-17 March 2023**, Paris-Saclay University, France (co-located with the [[meetings:general_meetings:1st_unidive_general_meeting|UniDive 1st general meeting]]) 
 +  * WG4 online meetings [[https://docs.google.com/document/d/1tBQ9oeV0tN1h5uYL3xraWItGf9Ox1TWk1vXRWKQiRaI/edit?usp=sharing|summaries]]: September 4 2023 (full day), September 29 2023 (1h30), November 10 2023 (1h30) 
 +  * WG4 in-person meeting 2 [[https://docs.google.com/document/d/10Q8CrYrTnvEieQ1lwWLkWC0aMQKnaVXw_V_Fa4ltxsk/edit?usp=sharing|summary]] - **9 February 2024**, University of Naples L’Orientale, Italy (co-located with the [[meetings:general_meetings:2nd_unidive_general_meeting|UniDive 2nd general meeting]]) 
 +  * [[http://wp.dldp.eu/digital-language-survival-kits/|Digital Language Survival Kits]] by the [[http://wp.dldp.eu/|DLDP project]] 
 +  * WG4 online meetings [[https://docs.google.com/document/d/1ClJUY3d8WgmOclFKb-i9CrfH0ppCseXFNKqcDExYRJM/edit?usp=sharing|summaries]]: April 22 2024 (1h)
  
wg4/wg4.txt · Last modified: 2024/04/22 15:30 by marie-catherine.de-marneffe