Commit 4541b33

bibtex
1 parent aea212a commit 4541b33

1 file changed, +46 -0 lines changed

CITATION.cff

@@ -0,0 +1,46 @@
+authors:
+  - family-names: Wiechetek
+    given-names: Linda
+    orcid: "https://orcid.org/0000-0002-5171-0841"
+  - family-names: Unhammer
+    given-names: Kevin Brubeck
+    orcid: "https://orcid.org/0000-0002-2883-1899"
+  - family-names: Moshagen
+    given-names: Sjur Nørstebø
+    orcid: "https://orcid.org/0000-0003-3771-9521"
+cff-version: 1.2.0
+identifiers:
+  - description: Workshop on the Use of Computational Methods in the Study of Endangered Languages
+    type: url
+    value: https://computel-workshop.org/wp-content/uploads/2019/02/CEL3_book_papers_draft.pdf#page=58
+keywords:
+  - Sámi
+  - Saami
+  - North Saami
+  - proofing
+  - grammar checking
+  - grammar checker
+  - spellcheck
+  - tokenisation
+  - FST
+  - HFST
+message: If you use this software, please cite it using these metadata.
+repository-code: "https://github.com/divvun/libdivvun"
+title: Divvun gramcheck
+version: 0.3.10
+preferred-citation:
+  authors:
+    - family-names: Wiechetek
+      given-names: Linda
+    - family-names: Unhammer
+      given-names: Kevin Brubeck
+    - family-names: Moshagen
+      given-names: Sjur Nørstebø
+  title: "Seeing more than whitespace—Tokenisation and disambiguation in a North Sámi grammar checker"
+  type: article
+  year: 2019
+  url: "https://computel-workshop.org/wp-content/uploads/2019/02/CEL3_book_papers_draft.pdf#page=58"
+abstract: "Communities of lesser resourced languages like North Sámi benefit from language tools such as spell checkers and grammar checkers to improve literacy. Accurate error feedback is dependent on well-tokenised input, but traditional tokenisation as shallow preprocessing is inadequate to solve the challenges of real-world language usage. We present an alternative where tokenisation remains ambiguous until we have linguistic context information available. This lets us accurately detect sentence boundaries, multiwords and compound error detection. We describe a North Sámi grammar checker with such a tokenisation system, and show the results of its evaluation."
+license: GPL-3.0-or-later
+url: https://github.com/divvun/libdivvun
+