User Tools

Site Tools


meetings:other-events:1st_unidive_training_school:courses

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
meetings:other-events:1st_unidive_training_school:courses [2024/04/29 10:16] agata.savarymeetings:other-events:1st_unidive_training_school:courses [2024/04/29 13:47] (current) – [Dependency syntax, Surface-Syntactic UD, and UD] agata.savary
Line 18: Line 18:
  
   * **Exercises**:    * **Exercises**: 
-    * understanding the SUD (and UD) annotation scheme by exploring some treebanks with Grew-match (SUD_English, converted from UD; SUD_Naija, a native SUD treebank of a pidgincreole of English; mSUD_Beja, a native morph-based SUD treebank glossed in English) (joint session with Bruno Guillaume?)+    * understanding the [[https://surfacesyntacticud.github.io/|SUD]] (and [[https://universaldependencies.org/guidelines.html|UD]]) annotation scheme by exploring some treebanks with [[https://match.grew.fr/|Grew-match]] (SUD_English, converted from UD; [[https://universal.grew.fr/?corpus=SUD_Naija-NSC@2.13|SUD_Naija]], a native SUD treebank of a pidgincreole of English; [[https://universal.grew.fr/?corpus=mSUD_Beja-NSC@2.13|mSUD_Beja]], a native morph-based SUD treebank glossed in English) (joint session with Bruno Guillaume?)
     * example of a SUD annotation from scratch based on data from the participants which are glossed and translated in English     * example of a SUD annotation from scratch based on data from the participants which are glossed and translated in English
       * creation of a project on ArboratorGrew       * creation of a project on ArboratorGrew
Line 28: Line 28:
     * ideally, having some data you want to annotate (please take contact before the summer school for the preparation of the data)     * ideally, having some data you want to annotate (please take contact before the summer school for the preparation of the data)
  
-  * **Preparatory work**: +  * **Preparatory work** (offered in a parallel course by Bruno Guillaume)
-    * looking at treebanks on Grew-Match (this will be offered in a parallel course by Bruno Guillaume)  +    * looking at treebanks on Grew-Match   
-    * comparing UD and SUD annotation (possible with Grew-Match) +    * comparing UD and SUD annotation 
-    * reading Gerdes et al2018 + 
-    * reading a book or a tutorial on dependency syntax: Mel’cuk 1988,  Tesnière 2015, Osborne 2019, Kahane 2013+  * **Further readings**: 
 +    * Lucien Tesnière (2015), [[https://benjamins.com/catalog/z.185|Elements of structural syntax]], Benjamins. ebook in open access
 +    * Igor Mel’cuk (1988)Dependency syntax: theory and practice. SUNY press. 
 +    * Timothy Osborne (2019)A Dependency Grammar of English. Benjamins. 
 +    * Sylvain Kahane, 2003, [[https://kahane.fr/wp-content/uploads/2017/01/mtt-handbook2003.pdf|The Meaning-Text Theory]], in Dependency and Valency, Handbooks of Linguistics and Communication Sciences, 25 : 1-2, Berlin/NY: De Gruyter, 32 p. 
 +    * De Marneffe, M. C., Manning, C. D., Nivre, J., & Zeman, D. (2021). [[https://aclanthology.org/2021.cl-2.11/|Universal dependencies]]. Computational linguistics, 47(2), 255-308. 
 +    * Gerdes K., Guillaume B., Kahane S., Perrier G. (2018) [[https://aclanthology.org/W18-6008/|SUD or Surface-Syntactic Universal Dependencies: An annotation scheme near-isomorphic to UD]], Proceedings of the Universal Dependencies Workshop (UDW), EMNLP. 
 +    * Gerdes K., Guillaume B., Kahane S, Perrier G. (2021) [[https://aclanthology.org/2021.depling-1.4.pdf|Starting a new treebank? Go SUD!]], Proceedings of 6th international conference on Dependency Linguistics (DepLing), SyntaxFest, ACL.
  
 =====Annotation of multiword expressions for newcomers===== =====Annotation of multiword expressions for newcomers=====
Line 76: Line 83:
     * mostly practical exercises in corpus querying and processing     * mostly practical exercises in corpus querying and processing
  
-  * **Contents** +  * **Contents (not necessarily in chronological order)** 
-    * Session 1 (by Bruno Guillaume), joined with Sylvain's course in dependency sytax+    * Session 1 (by Bruno Guillaume), joined with Sylvain's course in dependency syntax
       * Storage formats of data: ConNLL-U, CUPT       * Storage formats of data: ConNLL-U, CUPT
       * Basic usage of Grew-match of morpho-syntactic treebanks       * Basic usage of Grew-match of morpho-syntactic treebanks
       *  Hands-on: observe main difference between UD and SUD       *  Hands-on: observe main difference between UD and SUD
       * ArboratorGrew basic usage: users roles, graphical edition, conllu edition, metadata       * ArboratorGrew basic usage: users roles, graphical edition, conllu edition, metadata
- +    Sessions 2-3 (by Bruno Guillaume)
-    Session 2 (by Bruno Guillaume)+
       * Advanced usages of Grew-match       * Advanced usages of Grew-match
         * On PARSEME data         * On PARSEME data
Line 91: Line 97:
         * usage of Parser for pre-annotation         * usage of Parser for pre-annotation
         * usage of Github synchronisation         * usage of Github synchronisation
 +    * Session 4 (by Agata Savary)
 +      * Git for beginners: 
 +        * a repository, a clone, a commit
 +        * Git operations: clone, pull, add, commit, push
 +        * branches
 +        *Gitlab vs. Github
 +      * PARSEME Git infrastructure
 +        * PARSEME project on Git and its repositories
 +        * Managing language repositories
 +        * PARSEME utilities
 +        * PARSEME/UD consistency
 +    * Sessions 5-6 (by Daniel Zeman)
 +      * UD GitHub repositories
 +        * Branches, push access, pull requests
 +        * How to upload: Use git diff before committing and pushing
 +        * TortoiseGit
 +      * Prescribed structure of the dev branch
 +        * Do not pull history from the master branch
 +        * The docs repository, language-specific documentation
 +        * Working with personal UD repositories
 +      * Validator
 +        * On-line report after uploading data
 +        * How to run locally (there are two scripts!)
 +        * How to locate and fix the error
 +          * Demonstrate some common errors, validation levels
 +        * How to register language-specific features, relation subtypes, auxiliaries
 +        * How to fix documentation errors (demonstrate)
 +      * Fixing the errors
 +        * Annotation tool (cf. Grew)
 +        * Text editor (do not use Word!)
 +        * Udapi
 +      * UD Github issues: asking for linguistic help in docs, reporting bugs in treebank-specific repos
 +        * Referring to particular commits, files and lines in the repo.
 +
meetings/other-events/1st_unidive_training_school/courses.1714378578.txt.gz · Last modified: 2024/04/29 10:16 by agata.savary