GitHub - svjack/Sbert-ChineseExample: Sentence-Transformers Information Retrieval example on Chinese

* 1 这个工程使用自定义的 es-pandas 的重载接口 (支持向量存储) 来使用pandas对于elasticsearch实现简单的操作。
* 2 try_sbert_neg_sampler.py 抽取困难样本(模型识别困难的样本)的功能来自于 https://guzpenha.github.io/transformer_rankers/, 也可以使用 elasticsearch 生成困难样本, 相应的功能在 valid_cross_encoder_on_bi_encoder.py 中定义。
* 3 上面在 cross_encoder 上训练的功能, 需要预先在不同的句子间检查语义区别程度, 组合相似语义的样本对于模型训练是有帮助的。
* 4 增加了一些对Sentence-Transformers多类别结果比较的工具。

贡献

Contributing

License

Distributed under the MIT License. See LICENSE for more information.

Contact

svjack - svjackbt@gmail.com ehangzhou@outlook.com

Project Link: https://github.com/svjack/Sbert-ChineseExample

Acknowledgements

About

Sentence-Transformers Information Retrieval example on Chinese

Releases

No releases published

Packages

No packages published

Languages

Python 100.0%

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

svjack/Sbert-ChineseExample

Folders and files

Latest commit

History

Repository files navigation

Sbert-ChineseExample

内容提要

关于这个工程

About The Project

构建信息

Built With

开始

Getting Started

安装

Installation

使用

Usage

引导

Roadmap

贡献

Contributing

License

Contact

Acknowledgements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Languages

svjack/Sbert-ChineseExample

Folders and files

Latest commit

History

Repository files navigation

Sbert-ChineseExample

内容提要

关于这个工程

About The Project

构建信息

Built With

开始

Getting Started

安装

Installation

使用

Usage

引导

Roadmap

贡献

Contributing

License

Contact

Acknowledgements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages