Cross_Domain_Hedge_Detection

We implement the cross domain Chinese hedge detection using Keras.

DataSet

This dataset contains four domains: wiki, biomedical abstract, discuss and result.

For example, we use the abstract to train and test for the wiki, this is called wiki_by_abstract (test_by_train).

We use five fold test to evaluate our model.

word embedding

We train our word embedding on a small corpus downloaded from the Medline, we will make our word embedding public as soon as possible.
We would like to make our data public as soon as possible.

Use

python 2.7
Keras 2.0.1
Tensorflow 1.0.1
nltk 3.2.2
tqdm

Run

To creature the data and features

python hedge_process.py

To process the data into the matrix and use for learning

python process_data.py

To run the BiLSTM model

python main.py

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.md		README.md
hedge_process.py		hedge_process.py
main.py		main.py
process_data.py		process_data.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LICENSE

LICENSE

README.md

README.md

hedge_process.py

hedge_process.py

main.py

main.py

process_data.py

process_data.py

Repository files navigation

Cross_Domain_Hedge_Detection

DataSet

word embedding

Use

Run

About

Releases

Packages

Languages

License

Xls1994/Cross_Domain_Hedge_Detection

Folders and files

Latest commit

History

Repository files navigation

Cross_Domain_Hedge_Detection

DataSet

word embedding

Use

Run

About

Topics

Resources

License

Stars

Watchers

Forks

Languages