Skip to content

Commit 587892d

Browse files
add labse annotation (#2182)
* add labse annotation * Update mteb/models/sentence_transformers_models.py Co-authored-by: Kenneth Enevoldsen <[email protected]> --------- Co-authored-by: Kenneth Enevoldsen <[email protected]>
1 parent 7af37d4 commit 587892d

File tree

1 file changed

+5
-1
lines changed

1 file changed

+5
-1
lines changed

mteb/models/sentence_transformers_models.py

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -215,7 +215,11 @@
215215
use_instructions=False,
216216
superseded_by=None,
217217
adapted_from=None,
218-
training_datasets=None,
218+
training_datasets={
219+
# CommonCrawl
220+
# wiki 05-21-2020 dump
221+
# The translation corpus is constructed from web pages using a bitext mining system
222+
},
219223
# scraped and mined webdata including CC, wiki, see section 3.1 https://aclanthology.org/2022.acl-long.62.pdf
220224
public_training_code="https://www.kaggle.com/models/google/labse/tensorFlow2/labse/2?tfhub-redirect=true",
221225
public_training_data=None,

0 commit comments

Comments
 (0)