User Tools

Site Tools


wg1:wg1

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
wg1:wg1 [2024/05/07 10:31] – [WG1 Tasks] bruno.guillaumewg1:wg1 [2024/05/07 10:41] (current) – [WG1 Tasks] bruno.guillaume
Line 77: Line 77:
        * Subtask **B**: Evaluate the pros and cons of tabular formats (such as CoNNL-U) currently used in the UD and Parseme projects        * Subtask **B**: Evaluate the pros and cons of tabular formats (such as CoNNL-U) currently used in the UD and Parseme projects
     * __Workplan__:      * __Workplan__: 
-       * Subtask **A**: The specific objective is to create a comparison table of available manual annotation tools, with a focus on UD and Parseme interests (i.e. morpho-syntactic and multiword expression annotations)The next steps are: +       * Subtask **A**: The specific objective is to create a comparison table of available manual annotation tools morpho-syntactic and multiword expression annotations. A survey will be propose in the upcoming weeks, to collect feedback adn to produce the final version of the table. 
-          * Consolidate the set of features to be used in the comparison (the rows of the tables) +       * Subtask **B**: Conduct a detailed analysis of the advantages and disadvantages of the tabular annotation formats, specifically CoNLL-U, as utilized in the Universal Dependencies (UD) and PARSEME projects. A first draft of an evolution of the formats currently used will be proposed for dicussions and for testing.
-          * Create a survey to collect information about each annotation tool +
-          * Analyse the results of the survey and produce the final version of the table. +
-       * Subtask **B**: Conduct a detailed analysis of the advantages and disadvantages of the tabular annotation formats, specifically CoNLL-U, as utilized in the Universal Dependencies (UD) and PARSEME projects. The next steps are: +
-          * Develop a Schema/Definition for Structured Data Format: Consider framing this as part of a shared task in the future +
-          * Refine Data Encoding Standards: Currently, UD prescribes both WHAT to encode (the content) and HOW to encode it (the format). Ideally, these aspects should be decoupled: +
-             * The format should dictate HOW to encode data, providing the structural means. +
-             * Guidelines like UD or others should specify WHAT can be encoded, focusing on content restrictions. This separation would enhance the format's flexibility and adaptability to new types of annotations, while the guidelines ensure relevance of data. +
-          * Generate Initial Working Examples +
-             * Convert existing datasets to test the new format. +
-             * Evaluate and compare these results with those of CoNLL-U and possibly enhanced formats such as CoNLL-U Plus.+
     * __How can I contribute?__      * __How can I contribute?__ 
       * Join to the ongoing discussions on GitHub (links above)       * Join to the ongoing discussions on GitHub (links above)
wg1/wg1.txt · Last modified: 2024/05/07 10:41 by bruno.guillaume