This project builds on the "LexiconAugmentNER" project. We incorporate lexicon and dictionary information into the character representations to improve performance.
Python 3.6
PyTorch 0.4.1
Input files contain one character per line, with the character and its label separated by whitespace. The "BIOES" tag scheme is preferred.
发 B-SYM
热 E-SYM
汗 B-SYM
多 E-SYM
者 O
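As a rough illustration of the format above, the following sketch reads character/label lines and collects the labeled spans under the BIOES scheme. It is not part of this project's code: the function names `read_sentences` and `bioes_spans` are hypothetical, and the use of a blank line to separate sentences is an assumption, since the format description does not say how sentences are delimited.

```python
from typing import Iterable, List, Tuple

Token = Tuple[str, str]  # (character, label)


def read_sentences(lines: Iterable[str]) -> List[List[Token]]:
    """Group 'char label' lines into sentences.

    Assumption: a blank line ends a sentence.
    """
    sentences: List[List[Token]] = []
    current: List[Token] = []
    for line in lines:
        line = line.strip()
        if not line:
            if current:
                sentences.append(current)
                current = []
            continue
        char, label = line.split()  # character and label split by whitespace
        current.append((char, label))
    if current:
        sentences.append(current)
    return sentences


def bioes_spans(sentence: List[Token]) -> List[Tuple[str, str]]:
    """Return (text, type) for every well-formed B..E or S span."""
    spans: List[Tuple[str, str]] = []
    buffer: List[str] = []
    buffer_type = None
    for char, label in sentence:
        prefix, _, ent_type = label.partition("-")
        if prefix == "S":
            # Single-character entity.
            spans.append((char, ent_type))
            buffer, buffer_type = [], None
        elif prefix == "B":
            # Start a new multi-character entity.
            buffer, buffer_type = [char], ent_type
        elif prefix in ("I", "E") and buffer and ent_type == buffer_type:
            buffer.append(char)
            if prefix == "E":
                spans.append(("".join(buffer), ent_type))
                buffer, buffer_type = [], None
        else:
            # "O" or an inconsistent tag sequence: drop any open span.
            buffer, buffer_type = [], None
    return spans


sample = ["发 B-SYM", "热 E-SYM", "汗 B-SYM", "多 E-SYM", "者 O"]
sentences = read_sentences(sample)
print(bioes_spans(sentences[0]))
```

For the sample lines above, this prints the two SYM spans, 发热 and 汗多. Inconsistent tag sequences are silently dropped here; a stricter loader might raise an error instead.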
The best-performing model can be downloaded from the links below:
Download TCM_DSET (extraction code: l3y3) and put it in the data folder.
Download the best model (extraction code: jl93) and put it in the save_model folder.