GitHub - umanlp/webisadb-extractor: Relation Extractor for WebIsADb

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
org/webdatacommons/isadb		org/webdatacommons/isadb
readme.txt		readme.txt

Repository files navigation

Web Data Commons - WebIsA Database
http://webdatacommons.org/isadb/

Christian Bizer
Kai Eckert
Stefano Faralli
Robert Meusel
Heiko Paulheim
Simone Paolo Ponzetto


WebIsADb is a publicly available database containing more than 400 million hypernymy relations we extracted from the CommonCrawl web corpus. This collection of relations represents a rich source of knowledge and may be useful for many researchers. We offer the tuple dataset for public download and an application programming interface to help other researchers programmatically query the database.

This software is meant to be part of the CommonCrawl framework: http://commoncrawl.org/ 
to re-build a new WebIsADb from fresh CommonCrawl dumps.

The WebIsADb and the API are licensed under a Creative Commons Attribution-Non Commercial-Share Alike 3.0 License: 
http://creativecommons.org/licenses/by-nc-sa/3.0/.

Acknowledgements
This work was partially funded by the Deutsche Forschungsgemeinschaft within the JOIN-T project (research grant PO 1900/1-1). Part of the computational resources used for this work were provide by an Amazon AWS in Education Grant award.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

umanlp/webisadb-extractor

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages