User Tools

Site Tools


wg4:wg4

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Last revisionBoth sides next revision
wg4:wg4 [2023/03/21 14:15] – removed - external edit (Unknown date) 127.0.0.1wg4:wg4 [2024/04/22 15:29] – [Documents] marie-catherine.de-marneffe
Line 1: Line 1:
 +====== Working Group 4: Quantifying and promoting diversity ======
 +
 +  * **Leader**: [[https://cental.uclouvain.be/team/mcdm/|Marie-Catherine de Marneffe]] (Belgium)
 +  * **Vice-leader**: [[https://www.ukr.uni-jena.de/members/olha-kanishcheva|Olha Kanishcheva]] (Germany/Ukraine)
 +
 +==== Workplan ====
 +
 +This WG is transversal to WGs 1-3 and will focus on how the Action serves inter- and intra-linguistic 
 +diversity. Its activities will overlap with the 3 other WGs in: 
 +
 +  - **Networking** for diversity:
 +     * bringing together pre-existing groups dedicated to NLP-applicable universality, 
 +     * integrating experts of (notably low-resourced) languages not yet covered by these groups, 
 +     * integrating experts in linguistic typology; 
 +  - **Quantifying** diversity:
 +     * designing **measures of inter- and intra-linguistic diversity** in language resources and tools,
 +     * using these measures to quantify diversity in UD and PARSEME corpora; 
 +  - Promoting diversity:
 +     * procedures for **better use of the existing resources**, based on their estimated diversity,
 +     * **selecting new data** to be annotated, so as to favour intra-linguistic diversity, 
 +     * designing **evaluation scenarios** which favour tools performing well on rare and diverse phenomena and low resourced languages, 
 +     * integrating and **training** new experts dedicated to low-resourced and endangered languages,
 +     * validating the unified annotation **guidelines** (WG1) and **lexicon formats** (WG2) against newly included languages and defining new language-specific categories and extensions, if needed, 
 +     * coordinating the creation and enhancement of annotated **corpora** and **lexica** for low-resourced languages,
 +     * discovering and analysing **rare** linguistic **phenomena**, and describing them in resources and tools,
 +     * coordination of the development of NLP **tools** (WG3) for low-resourced and endangered languages. 
 +
 +==== Current Subtasks ====
 +
 +  * **Task 4.1**: Promoting low-resourced/endangered languages [co-leaders: Lucia Amoros, Abigail Walsh]
 +  * **Task 4.2**: Survey of diversity measures [leader: Louis Estève]
 +
 +==== Documents ====
 +
 +  * WG4 Meeting 1 [[https://docs.google.com/presentation/d/1n-yKDOws4zipOG8DekxhZygO5WZYP6AAwIYFcG6hf8o/edit?usp=sharing|slides and discussion summary]] - **16-17 March 2023**, Paris-Saclay University, France (co-located with the [[meetings:general_meetings:1st_unidive_general_meeting|UniDive 1st general meeting]])
 +  * WG4 online meetings [[https://docs.google.com/document/d/1tBQ9oeV0tN1h5uYL3xraWItGf9Ox1TWk1vXRWKQiRaI/edit?usp=sharing|summaries]]: September 4 2023 (full day), September 29 2023 (1h30), November 10 2023 (1h30)
 +  * WG4 Meeting 2 [[https://docs.google.com/document/d/10Q8CrYrTnvEieQ1lwWLkWC0aMQKnaVXw_V_Fa4ltxsk/edit?usp=sharing|summary]] - **9 February 2024**, University of Naples L’Orientale, Italy (co-located with the [[meetings:general_meetings:2nd_unidive_general_meeting|UniDive 2nd general meeting]])
 +  * [[http://wp.dldp.eu/digital-language-survival-kits/|Digital Language Survival Kits]] by the [[http://wp.dldp.eu/|DLDP project]]
 +  * WG4 online meetings [[https://docs.google.com/document/d/1ClJUY3d8WgmOclFKb-i9CrfH0ppCseXFNKqcDExYRJM/edit?usp=sharing|summaries]]: April 22 204 (1h)
  
wg4/wg4.txt · Last modified: 2024/04/22 15:30 by marie-catherine.de-marneffe