seq2annotation

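A general-purpose sequence labeling library built on TensorFlow and PaddlePaddle. It currently includes BiLSTM+CRF and IDCNN+CRF, and more algorithms are being added. It supports sequence labeling tasks such as Chinese word segmentation (tokenization), part-of-speech (POS) tagging, and named entity recognition (NER).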

Features

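General-purpose sequence labeling: handles sequence labeling problems in general; word segmentation, POS tagging, and entity recognition are only special cases.
Tag schema free: you can use any tagset you like. This relies on the encoding and decoding functions provided by tokenizer_tools.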
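
To illustrate what tag encoding and decoding mean here, the following is a minimal sketch that converts entity spans to BIO tags and back. It is not the tokenizer_tools API: the names `encode_bio` and `decode_bio`, the `(start, end, label)` span layout, and the choice of BIO as the schema are all assumptions made for this example.

```python
from typing import List, Tuple

# Hypothetical span layout: (start index, exclusive end index, entity label).
Span = Tuple[int, int, str]


def encode_bio(length: int, spans: List[Span]) -> List[str]:
    """Turn entity spans into one BIO tag per token (assumes no overlaps)."""
    tags = ["O"] * length
    for start, end, label in spans:
        tags[start] = f"B-{label}"
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"
    return tags


def decode_bio(tags: List[str]) -> List[Span]:
    """Recover entity spans from a BIO tag sequence."""
    spans: List[Span] = []
    start, label = None, None
    # A trailing "O" sentinel flushes a span that reaches the last token.
    for i, tag in enumerate(tags + ["O"]):
        prefix, _, tag_label = tag.partition("-")
        if prefix == "I" and start is not None and tag_label == label:
            continue  # still inside the current span
        if start is not None:
            spans.append((start, i, label))
            start, label = None, None
        if prefix in ("B", "I"):  # a stray "I-" is treated as a span start
            start, label = i, tag_label
    return spans


text = "王小明在北京"  # one token per character
spans = [(0, 3, "PER"), (4, 6, "LOC")]
tags = encode_bio(len(text), spans)
print(tags)
print(decode_bio(tags))
```

Because the model only sees tag strings, swapping BIO for another schema (for example BILUO) would only require a different encoder and decoder pair, which is the idea behind being tag schema free.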

TODO

TF Metrics is not currently published on PyPI, but seq2annotation depends on it, so seq2annotation cannot yet be packaged as a Python package on PyPI.

More Algorithms To Do

Credits

Heavily influenced by Guillaume Genthial's tf_ner project.

Add an NER evaluation scheme

From http://www.davidsbatista.net/blog/2018/05/09/Named_Entity_Evaluation/
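
As a rough illustration of entity-level NER evaluation, the sketch below computes precision, recall, and F1 under a strict criterion, where a predicted entity counts as correct only if both its boundaries and its type match a gold entity. This is an assumption about one possible scheme, not a description of what seq2annotation implements; the function name `strict_prf` and the `(start, end, label)` span layout are hypothetical.

```python
from typing import Dict, Set, Tuple

# Hypothetical span layout: (start index, exclusive end index, entity label).
Span = Tuple[int, int, str]


def strict_prf(gold: Set[Span], pred: Set[Span]) -> Dict[str, float]:
    """Entity-level precision/recall/F1 with exact boundary and type match."""
    correct = len(gold & pred)
    precision = correct / len(pred) if pred else 0.0
    recall = correct / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


gold = {(0, 3, "PER"), (4, 6, "LOC")}
pred = {(0, 3, "PER"), (4, 6, "ORG")}  # right boundaries, wrong type on the second
print(strict_prf(gold, pred))
```

Under this strict criterion the second prediction earns no credit even though its boundaries are right; more lenient schemes would score boundary-only or type-only matches separately.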

howl-anderson/seq2annotation
