====== UniDive software ======

This page lists software whose development or release was coordinated by, or benefited from, UniDive.

===== Universal Dependencies tools =====

  * [[https://universaldependencies.org/tools.html#ud-maintained-tools|UD maintained tools]]

===== Grew suite =====

  * [[https://grew.fr/|Grew]] - graph rewriting tool for Natural Language Processing
  * [[https://match.grew.fr/|Grew-match]] - corpus browser for (S)UD treebanks, PARSEME corpora and many others
  * [[https://arboratorgrew.elizia.net/#/|Arborator Grew]] - a collaborative annotation tool for treebank development

===== PARSEME tools =====

  * [[https://gitlab.com/parseme/utilities|PARSEME utilities]] - for language leaders and corpus release experts
  * new [[https://gitlab.com/parseme/corpora/-/wikis/parseme-tools#file-format-validation|cupt format validator]] - fully compatible with the UD validator
  * new tools for [[https://gitlab.com/parseme/corpora/-/wikis/Updating-morphosyntactic-annotations|updating morphosyntactic annotations]] in cupt files, to synchronize them with UD corpora
  * [[https://flat.lisn.upsaclay.fr/|FLAT annotation platform]] - used for manual annotation of multiword expressions in PARSEME; recently migrated to a new server and extended to cover non-verbal MWEs
  * [[https://gitlab.com/parseme/corpora/-/wikis/parseme-tools#annotation-flat|FLAT documentation]]
  * [[https://github.com/empiriker/mwe-detector|MWE detector]] - spaCy pipeline component for MWE identification

===== Libraries for the quantification of diversity in language resources and tools =====

  * [[https://github.com/estevelouis/WG4|diversutils]] - library for measuring the diversity of linguistic resources and of system predictions //(France)//
  * [[https://gitlab.lisn.upsaclay.fr/esteve/stark_diversity|DUST]] - library for measuring syntactic diversity in treebanks //(France, Slovenia)//
  * [[https://github.com/ICEF-NLP/jmm_diversity/tree/langdive-lib?tab=readme-ov-file|LangDive]] - library for measuring the level of linguistic diversity in multilingual NLP datasets //(Serbia, Switzerland)//

===== Tools for language learning =====

  * [[https://linguse.com/|Linguse]] - reading application for language learners. Its latest beta version integrates a module for idiom identification, explanation and translation.
  * [[https://www.youtube.com/watch?v=q016Sn3aIjk|Multiword Expressions: The Spice of Language Learning]] - a talk at the [[https://www.polyglotgathering.com/2023/en/|Polyglot Gathering 2023]]
  * [[https://nlp4call2024.sciencesconf.org/data/pages/Programme_NLP4CALL_2029.pdf|Sailing through multiword expression identification with Wiktionary and Linguse: A case study of language learning]] - a paper at the [[https://nlp4call2024.sciencesconf.org/|NLP4CALL 2024 Workshop]]