User Tools

Site Tools


wg4:wg4

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revisionBoth sides next revision
wg4 [2022/12/15 16:04] olesea.caftanovwg4 [2022/12/15 16:10] olesea.caftanov
Line 1: Line 1:
-====== Working Group 2: Lexicon-corpus interface ======+====== Working Group 4: Lexicon-corpus interface ====== 
 + 
 +This WG is transversal to WGs 1-3 and will focus on how the Action serves inter- and intra-linguistic  
 +diversity. Its activities will overlap with the 3 other WGs in:  
 + 20 
 + Networking for diversity: (i) bringing together pre-existing groups dedicated to NLP-applicable  
 +universality, (ii) integrating experts of (notably low-resourced) languages not yet covered by these  
 +groups, (iii) integrating experts in linguistic typology;  
 + Quantifying diversity: (i) designing measures of inter- and intra-linguistic diversity in language  
 +resources and tools, (ii) using these measures to quantify diversity in UD and PARSEME corpora;  
 + Promoting diversity: (i) procedures for better use of the existing resources, based on their  
 +estimated diversity, (ii) selecting new data to be annotated, so as to favour intra-linguistic diversity,  
 +(iii) designing evaluation scenarios which favour tools performing well on rare and diverse  
 +phenomena and low-resourced languages, (iv) integrating and training new experts dedicated to  
 +low-resourced and endangered languages, (v) validating the unified annotation guidelines (WG1)  
 +and lexicon formats (WG2) against newly included languages and defining new language-specific  
 +categories and extensions, if needed, (vi) coordinating of the creation and enhancement of annotated  
 +corpora and lexica for low-resourced languages, (vii) discovering and analysing rare linguistic  
 +phenomena, and describing them in resources and tools, (viii) coordination of the development of  
 +NLP tools (WG3) for low-resourced and endangered languages. 
  
wg4/wg4.txt · Last modified: 2024/04/22 15:30 by marie-catherine.de-marneffe