wg3:wg3_meeting_2023-03-17
Differences
wg3:wg3_meeting_2023-03-17 [2023/03/21 16:37] – gulsen.eryigit | wg3:wg3_meeting_2023-03-17 [2023/03/21 16:49] – [Session 1] gulsen.eryigit | ||
---|---|---|---|
Line 2: | Line 2: | ||
- | === Session 1 === | + | ==== Session 1 ==== |
- | /10.45–11.00 Introduction to WG3 (slides) | + | * 10.45–11.00 Introduction to WG3 (slides) |
- | /11.00–11.30 Brainstorming on ideas and expectations | + | * 11.00–11.30 Brainstorming on ideas and expectations |
+ | |||
+ | == Discussion questions: == | ||
+ | |||
+ | * What is most important for you in multilingual and cross-lingual NLP? | ||
+ | * What activities do you think we should prioritize? | ||
+ | * How can we work together to make progress towards our goals? | ||
+ | |||
+ | == Points raised: == | ||
+ | |||
+ | * Large language models are most important | ||
+ | * Articulating linguistic theories underlying tools | ||
+ | * Defining idiosyncrasy and diversity | ||
+ | * The user perspective is important | ||
+ | * Supporting low-resource languages through cross-lingual technology | ||
+ | * Supporting low-resource languages through annotation tools | ||
+ | * Supporting low-resource languages through data collection | ||
+ | * Supporting low-resource languages with semantics | ||
+ | * Tools for all languages – start with morphology | ||
+ | * "Low-resource language" is not a homogeneous concept | |
+ | * Building resources for specific languages (Serbian) | ||
+ | * Linking corpus resources between languages | ||
+ | * Standardized tools applicable to different languages | ||
+ | * Evaluation of tools – coordinate with other WGs | ||
+ | * Tracking evaluation status for different types of tools | ||
+ | * Improved benchmarking and experimental design | ||
+ | * Organize shared tasks | ||
| | ||
+ | * 11.30–12.00 Initial discussion on documentation of tools | ||
+ | |||
+ | == Discussion questions: == | ||
+ | |||
+ | * Which types of tools do we want to include? | ||
+ | * Where do we want to keep the documentation? | ||
+ | * How do we create this documentation/ | ||
+ | |||
+ | == Points raised: == | ||
+ | |||
+ | * A huge multidimensional matrix | ||
+ | * A shared repository | ||
+ | * Tools shared between typologically similar languages | ||
+ | * Consider end users | ||
+ | * Too many languages have nothing – document what is missing rather than what exists | ||
+ | * Connect to CLARIN | ||
+ | * Flagship project on MWE | ||
+ | * Include all tools or be selective? | ||
+ | * What about commercial tools? | ||
+ | * What about tools without documentation? | ||
+ | |||
+ | == WG tasks emerging from the discussion: == | ||
+ | |||
+ | * Define multidimensional taxonomy of tools for documentation | ||
+ | * Define infrastructure and procedure for creating documentation | ||
wg3/wg3_meeting_2023-03-17.txt · Last modified: 2023/09/20 10:28 by joakim.nivre