wg1:wg1
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | |||
wg1:wg1 [2024/05/07 10:31] – [WG1 Tasks] bruno.guillaume | wg1:wg1 [2024/05/07 10:41] (current) – [WG1 Tasks] bruno.guillaume | ||
---|---|---|---|
Line 77: | Line 77: | ||
* Subtask **B**: Evaluate the pros and cons of tabular formats (such as CoNNL-U) currently used in the UD and Parseme projects | * Subtask **B**: Evaluate the pros and cons of tabular formats (such as CoNNL-U) currently used in the UD and Parseme projects | ||
* __Workplan__: | * __Workplan__: | ||
- | * Subtask **A**: The specific objective is to create a comparison table of available manual annotation tools, with a focus on UD and Parseme interests (i.e. morpho-syntactic and multiword expression annotations). The next steps are: | + | * Subtask **A**: The specific objective is to create a comparison table of available manual annotation tools morpho-syntactic and multiword expression annotations. |
- | * Consolidate the set of features to be used in the comparison (the rows of the tables) | + | * Subtask **B**: Conduct a detailed analysis of the advantages and disadvantages of the tabular annotation formats, specifically CoNLL-U, as utilized in the Universal Dependencies (UD) and PARSEME projects. |
- | * Create a survey | + | |
- | * Analyse the results of the survey and produce the final version of the table. | + | |
- | * Subtask **B**: Conduct a detailed analysis of the advantages and disadvantages of the tabular annotation formats, specifically CoNLL-U, as utilized in the Universal Dependencies (UD) and PARSEME projects. | + | |
- | * Develop a Schema/ | + | |
- | * Refine Data Encoding Standards: Currently, UD prescribes both WHAT to encode (the content) and HOW to encode it (the format). Ideally, these aspects should | + | |
- | * The format should dictate HOW to encode data, providing the structural means. | + | |
- | * Guidelines like UD or others should specify WHAT can be encoded, focusing on content restrictions. This separation would enhance the format' | + | |
- | * Generate Initial Working Examples | + | |
- | * Convert existing datasets to test the new format. | + | |
- | * Evaluate and compare these results with those of CoNLL-U and possibly enhanced formats such as CoNLL-U Plus. | + | |
* __How can I contribute? | * __How can I contribute? | ||
* Join to the ongoing discussions on GitHub (links above) | * Join to the ongoing discussions on GitHub (links above) |
wg1/wg1.txt · Last modified: 2024/05/07 10:41 by bruno.guillaume