This is the central repository for the SynSemClass project, which aggregates the source code from all related repositories created under this initiative.
SynSemClass is an event-type ontology in multiple languages. The project aims to create specifications and definitions of a hierarchical event-type ontology, populated with words denoting events or states (primarily verbs, verbal nouns, and adjectives, but also any other single- or multiword units denoting events or states). It links its entries or "classes" (and the words that evoke them) to several existing lexical resources that have, to some extent, similar goals; such linking allows for both theoretical and practical comparison and use of the resources.
- To cite a specific tool, please see its individual repository for the corresponding publication.
- Alternatively, please select the publication most relevant to your work from the list on the documentation page.
- If unsure, the canonical publication for this project is Urešová et al. (2020):
@inproceedings{uresova-etal-2020-synsemclass,
title = "{S}yn{S}em{C}lass Linked Lexicon: Mapping Synonymy between Languages",
author = "Uresova, Zdenka and
Fucikova, Eva and
Hajicova, Eva and
Hajic, Jan",
editor = "Kernerman, Ilan and
Krek, Simon and
McCrae, John P. and
Gracia, Jorge and
Ahmadi, Sina and
Kabashi, Besim",
booktitle = "Proceedings of the 2020 Globalex Workshop on Linked Lexicography",
month = may,
year = "2020",
address = "Marseille, France",
publisher = "European Language Resources Association",
url = "https://aclanthology.org/2020.globalex-1.2",
pages = "10--19",
language = "English",
ISBN = "979-10-95546-46-7",
}
Copyright 2024 Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University, Czech Republic.
This Source Code Form is subject to the terms of the Mozilla Public License, v. 2.0. If a copy of the MPL was not distributed with this file, You can obtain one at http://mozilla.org/MPL/2.0/.
Please see the individual repositories for additional licensing.
The data is released under CC BY-NC-ND 4.0.
You can either clone the related repository/repositories individually, for example, to clone just the SynSemClassSearch:
git clone https://github.com/ufal/SynSemClassSearch
Or you can clone the entire project, including all the related repositories as submodules, each in its own subdirectory (please note the --recurse-submodules option, which checks out the submodules into their subdirectories):
git clone --recurse-submodules https://github.com/ufal/SynSemClass/
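If the project was already cloned without --recurse-submodules, the submodule subdirectories will be present but empty. A minimal sketch of filling them in afterwards with standard git commands follows; the helper name fetch_submodules is hypothetical and not part of this project, and the directory name passed to it is an assumption about where the clone lives:

```shell
# Hypothetical helper (not part of SynSemClass): populate the submodule
# directories of an existing clone located at the path given as $1.
fetch_submodules() {
    # --init registers submodules that were never initialized;
    # --recursive also descends into any nested submodules.
    git -C "$1" submodule update --init --recursive
}
```

For example, assuming the clone lives in a directory named SynSemClass, run: fetch_submodules SynSemClass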
- SynSemClassML: machine learning for the SynSemClass project. Also accompanies Straková et al. (2023).
- SynSemClassMulti: a tool for manual annotation of SynSemClass lexicons; this is the version with a separate lexicon for each language.
- SynSemClassPipeline: a toolchain for pre-annotation of a new language in a semantic ontology.
- SynSemClassSearch: a search tool for the SynSemClass ontology.
This work has been supported by the Grant Agency of the Czech Republic under the EXPRO program as project “LUSyD” (project No. GX20-16819X), and by the UMR project No. LUAUS23283, supported by the Ministry of Education, Youth and Sports of the Czech Republic (MŠMT ČR). It has used data provided by the LINDAT/CLARIAH-CZ Research Infrastructure, Projects No. LM2018101 and LM2023062, supported by the Ministry of Education, Youth and Sports of the Czech Republic (MŠMT ČR).
- Zdeňka Urešová ([email protected]) is in charge of the SynSemClass project.
- Jan Hajič ([email protected]) is the PI of LINDAT/CLARIAH-CZ and of the LuSyD project and coordinates the expansion work on the lexicon.
- Jana Straková ([email protected]) is in charge of this repository.