User Tools

Site Tools


meetings:other-events:1st_unidive_training_school:courses

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
meetings:other-events:1st_unidive_training_school:courses [2024/04/29 12:35] – [Dependency syntax, Surface-Syntactic UD, and UD] agata.savarymeetings:other-events:1st_unidive_training_school:courses [2024/05/30 15:22] (current) – [Corpus annotation infrastructure] agata.savary
Line 28: Line 28:
     * ideally, having some data you want to annotate (please take contact before the summer school for the preparation of the data)     * ideally, having some data you want to annotate (please take contact before the summer school for the preparation of the data)
  
-  * **Preparatory work**: +  * **Preparatory work** (offered in a parallel course by Bruno Guillaume)
-    * looking at treebanks on Grew-Match (this will be offered in a parallel course by Bruno Guillaume)  +    * looking at treebanks on Grew-Match   
-    * comparing UD and SUD annotation (possible with Grew-Match) +    * comparing UD and SUD annotation
-    * reading Gerdes et al. 2018 +
-    * reading a book or a tutorial on dependency syntax: Mel’cuk 1988,  Tesnière 2015, Osborne 2019, Kahane 2013+
  
-  * **Bibliography**: +  * **Further readings**: 
-    * Lucien Tesnière (2015) [[https://benjamins.com/catalog/z.185|Elements of structural syntax]], Benjamins. ebook in open access. +    * Lucien Tesnière (2015)[[https://benjamins.com/catalog/z.185|Elements of structural syntax]], Benjamins. ebook in open access. 
-    * Igor Mel’cuk (1988)Dependency syntax: theory and practice. SUNY press. +    * Igor Mel’cuk (1988)Dependency syntax: theory and practice. SUNY press. 
-    * Timothy Osborne (2019) A Dependency Grammar of English. Benjamins.+    * Timothy Osborne (2019)A Dependency Grammar of English. Benjamins.
     * Sylvain Kahane, 2003, [[https://kahane.fr/wp-content/uploads/2017/01/mtt-handbook2003.pdf|The Meaning-Text Theory]], in Dependency and Valency, Handbooks of Linguistics and Communication Sciences, 25 : 1-2, Berlin/NY: De Gruyter, 32 p.     * Sylvain Kahane, 2003, [[https://kahane.fr/wp-content/uploads/2017/01/mtt-handbook2003.pdf|The Meaning-Text Theory]], in Dependency and Valency, Handbooks of Linguistics and Communication Sciences, 25 : 1-2, Berlin/NY: De Gruyter, 32 p.
 +    * De Marneffe, M. C., Manning, C. D., Nivre, J., & Zeman, D. (2021). [[https://aclanthology.org/2021.cl-2.11/|Universal dependencies]]. Computational linguistics, 47(2), 255-308.
 +    * Gerdes K., Guillaume B., Kahane S., Perrier G. (2018) [[https://aclanthology.org/W18-6008/|SUD or Surface-Syntactic Universal Dependencies: An annotation scheme near-isomorphic to UD]], Proceedings of the Universal Dependencies Workshop (UDW), EMNLP.
 +    * Gerdes K., Guillaume B., Kahane S, Perrier G. (2021) [[https://aclanthology.org/2021.depling-1.4.pdf|Starting a new treebank? Go SUD!]], Proceedings of 6th international conference on Dependency Linguistics (DepLing), SyntaxFest, ACL.
  
 =====Annotation of multiword expressions for newcomers===== =====Annotation of multiword expressions for newcomers=====
Line 82: Line 83:
     * mostly practical exercises in corpus querying and processing     * mostly practical exercises in corpus querying and processing
  
-  * **Contents (not necessarily in chronological order)** +  * **Contents** 
-    * Session 1 (by Bruno Guillaume), joined with Sylvain's course in dependency syntax +    * Session 1 (by Daniel Zeman & Agata Savary & Bruno Guillaume): **Git infrastructure** 
-      * Storage formats of dataConNLL-U, CUPT +      * Git for beginners
-      Basic usage of Grew-match of morpho-syntactic treebanks +
-       Hands-on: observe main difference between UD and SUD +
-      ArboratorGrew basic usage: users roles, graphical edition, conllu edition, metadata +
-    Sessions 2-3 (by Bruno Guillaume) +
-      * Advanced usages of Grew-match +
-        * On PARSEME data +
-        * Usage of clustering / tables for corpus maintenance, error mining and checking annotation consistency +
-      * Advanced usage of ArboratorGrew +
-        * usage of rewriting rules for corpus pre-annotation / maintenance +
-        * usage of Parser for pre-annotation +
-        * usage of Github synchronisation +
-    * Session 4 (by Agata Savary) +
-      * Git for beginners:  +
-        * a repository, a clone, a commit +
-        * Git operations: clone, pull, add, commit, push +
-        * branches +
-        *Gitlab vs. Github +
-      * PARSEME Git infrastructure +
-        * PARSEME project on Git and its repositories +
-        * Managing language repositories +
-        * PARSEME utilities +
-        * PARSEME/UD consistency +
-    * Sessions 5-6 (by Daniel Zeman)+
       * UD GitHub repositories       * UD GitHub repositories
-        Branches, push access, pull requests +      PARSEME Gitlab repositories  
-        How to uploadUse git diff before committing and pushing +      Github synchronisation in Grew" 
-        TortoiseGit +    * Session 2 (by Bruno Guillaume jointly with Sylvain's course on dependency syntax)**Basics of treebank querying and annotation** 
-      * Prescribed structure of the dev branch +      Corpus queries with Grew-Match  
-        Do not pull history from the master branch +      * UD vs. SUD  
-        The docs repository, language-specific documentation +      Corpus annotation with Arborator Grew 
-        Working with personal UD repositories +    Session 3 (by Daniel Zeman & Agata Savary): **Corpus format validation** 
-      * Validator +      * File formats (CoNLL-U, CUPT) 
-        On-line report after uploading data +      CoNLL-U validator 
-        How to run locally (there are two scripts!) +      * PARSEME validator 
-        How to locate and fix the error +      UD/PARSEME consistency 
-          Demonstrate some common errors, validation levels +    Session 4 (by Bruno Guillaume)**Advanced treebank querying and annotation** 
-        How to register language-specific features, relation subtypes, auxiliaries +      querrying PARSEME data 
-        How to fix documentation errors (demonstrate+      corpus pre-annotation 
-      * Fixing the errors +    Session 5 (by Daniel Zeman & Bruno Guillaume): **Corpus quality** 
-        Annotation tool (cf. Grew) +      * error mining and correcting with Grew-match 
-        Text editor (do not use Word!+      fixing errors in text editors 
-        Udapi +    Session 6 (by Daniel Zeman & Agata Savary): **Documentation and discussion on Git** 
-      * UD Github issues: asking for linguistic help in docs, reporting bugs in treebank-specific repos +      Documenting a corpus in README 
-        Referring to particular commits, files and lines in the repo.+      * UD Github issues  
 +      PARSEME Gitlab issues
  
meetings/other-events/1st_unidive_training_school/courses.1714386908.txt.gz · Last modified: 2024/04/29 12:35 by agata.savary