Name	Name	Last commit message	Last commit date
Latest commit History 140 Commits
.github	.github
docs	docs
rapid_table	rapid_table
tests	tests
.gitignore	.gitignore
.pre-commit-config.yaml	.pre-commit-config.yaml
LICENSE	LICENSE
README.md	README.md
cliff.toml	cliff.toml
demo.py	demo.py
requirements.txt	requirements.txt
setup.py	setup.py

📊 Rapid Table

🌟 简介

RapidTable库是专门用来文档类图像的表格结构还原,表格结构模型均属于序列预测方法,结合RapidOCR,将给定图像中的表格转化对应的HTML格式。

slanet_plus是paddlex内置的SLANet升级版模型,准确率有大幅提升

unitable是来源unitable的transformer模型,精度最高,暂仅支持pytorch推理,支持gpu推理加速,训练权重来源于 OhMyTable项目

📅 最近动态

2025年08月29日 update: 发布v3.0.0,支持batch推理,更改了返回值参数,大家可debug单条查看使用
2025年06月22日 update: 发布v2.x,适配rapidocr v3.x
2025年01月09日 update: 发布v1.x,全新接口升级
2024年12月30日 update:支持Unitable模型的表格识别,使用pytorch框架
2024年11月24日 update:支持gpu推理,适配 rapidOCR 单字识别匹配,支持逻辑坐标返回及可视化
2024年10月13日 update:补充最新paddlex-SLANet-plus 模型(paddle2onnx原因暂不能支持onnx)

📸 效果展示

Demo

🖥️ 支持设备

通过ONNXRuntime推理引擎支持(rapid_table>=2.0.0):

DirectML
昇腾NPU

具体使用方法:

安装(需要卸载其他onnxruntime):

# DirectML
pip install onnxruntime-directml
# 昇腾NPU
pip install onnxruntime-cann

使用:

from rapidocr import RapidOCR
from rapid_table import ModelType, RapidTable, RapidTableInput
# DirectML
ocr_engine = RapidOCR(params={"EngineConfig.onnxruntime.use_dml": True})
input_args = RapidTableInput(
 model_type=ModelType.SLANETPLUS, engine_cfg={"use_dml": True}
)
# 昇腾NPU
ocr_engine = RapidOCR(params={"EngineConfig.onnxruntime.use_cann": True})
input_args = RapidTableInput(
 model_type=ModelType.SLANETPLUS,
 engine_cfg={"use_cann": True, "cann_ep_cfg.gpu_id": 1},
)
table_engine = RapidTable(input_args)
img_path = "https://raw.githubusercontent.com/RapidAI/RapidTable/refs/heads/main/tests/test_files/table.jpg"
ori_ocr_res = ocr_engine(img_path)
ocr_results = [ori_ocr_res.boxes, ori_ocr_res.txts, ori_ocr_res.scores]
results = table_engine(img_path, ocr_results=ocr_results)
results.vis(save_dir="outputs", save_name="vis")

🧩 模型列表

`model_type`	模型名称	推理框架	模型大小	推理耗时(单图 60KB)
`ppstructure_en`	`en_ppstructure_mobile_v2_SLANet.onnx`	onnxruntime	7.3M	0.15s
`ppstructure_zh`	`ch_ppstructure_mobile_v2_SLANet.onnx`	onnxruntime	7.4M	0.15s
`slanet_plus`	`slanet-plus.onnx`	onnxruntime	6.8M	0.15s
`unitable`	`unitable(encoder.pth,decoder.pth)`	pytorch	500M	cpu(6s) gpu-4090(1.5s)

模型来源
PaddleOCR 表格识别
 PaddleX-SlaNetPlus 表格识别
 Unitable

模型下载地址:link

🛠️ 安装

版本依赖关系如下:

`rapid_table`	OCR
v2.x & v3.x	`rapidocr>=3.0.0`
v1.0.x	`rapidocr>=2.0.0,<3.0.0`
v0.x	`rapidocr_onnxruntime`

由于模型较小,预先将slanet-plus表格识别模型(slanet-plus.onnx)打包进了whl包内。其余模型在初始化RapidTable类时,会根据model_type来自动下载模型到安装包所在models目录下。

当然也可以通过RapidTableInput(model_path='')来指定自己模型路径(v1.0.x 参数变量名使用model_path, v2.x 参数变量名变更为model_dir_or_path)。注意仅限于我们现支持的model_type。

⚠️注意:rapid_table>=v1.0.0之后,不再将rapidocr依赖强制打包到rapid_table中。使用前,需要自行安装rapidocr包。

⚠️注意:rapid_table>=v0.1.0,<1.0.0之后,不再将rapidocr依赖强制打包到rapid_table中。使用前,需要自行安装rapidocr_onnxruntime包。

pip install rapidocr
pip install rapid_table
# 基于torch来推理unitable模型
pip install rapid_table[torch] # for unitable inference
# onnxruntime-gpu推理
pip uninstall onnxruntime
pip install onnxruntime-gpu # for onnx gpu inference

🚀 使用方式

🐍 Python脚本运行

⚠️注意:在rapid_table>=1.0.0之后,模型输入均采用dataclasses封装,简化和兼容参数传递。输入和输出定义如下:

ModelType支持已有的4个模型 (source):

class ModelType(Enum):
 PPSTRUCTURE_EN = "ppstructure_en" # onnxruntime
 PPSTRUCTURE_ZH = "ppstructure_zh" # onnxruntime
 SLANETPLUS = "slanet_plus" # onnxruntime
 UNITABLE = "unitable" # torch推理引擎

Batch推理

from pathlib import Path
from rapid_table import ModelType, RapidTable, RapidTableInput
input_args = RapidTableInput(model_type=ModelType.PPSTRUCTURE_ZH)
table_engine = RapidTable(input_args)
img_list = list(Path("images").iterdir())
results = table_engine(img_path, batch_size=3) # 这里,batch_size默认为1
# indexes:指定可视化的图像索引。默认为0
results.vis(save_dir="outputs", save_name="vis", indexes=(0, 1, 2))

CPU使用

from rapid_table import ModelType, RapidTable, RapidTableInput
input_args = RapidTableInput(model_type=ModelType.UNITABLE)
table_engine = RapidTable(input_args)
img_path = "https://raw.githubusercontent.com/RapidAI/RapidTable/refs/heads/main/tests/test_files/table.jpg"
ori_ocr_res = ocr_engine(img_path)
results = table_engine(img_path)
results.vis(save_dir="outputs", save_name="vis")

GPU使用

engine_cfg中参数是和engine_cfg.yaml相对应的。

from rapid_table import ModelType, RapidTable, RapidTableInput
# onnxruntime-gpu
input_args = RapidTableInput(
 model_type=ModelType.SLANETPLUS,
 engine_cfg={"use_cuda": True, "cuda_ep_cfg.gpu_id": 1}
)
# torch gpu
# input_args = RapidTableInput(
# model_type=ModelType.UNITABLE,
# engine_cfg={"use_cuda": True, "gpu_id": 1},
# )
table_engine = RapidTable(input_args)
img_path = "https://raw.githubusercontent.com/RapidAI/RapidTable/refs/heads/main/tests/test_files/table.jpg"
results = table_engine(img_path)
results.vis(save_dir="outputs", save_name="vis")

📦 终端运行

rapid_table https://raw.githubusercontent.com/RapidAI/RapidTable/refs/heads/main/tests/test_files/table.jpg -v

📝 结果

📎 返回结果

Details

<html>
<body>
<table>
 <tr>
 <td>Methods</td>
 <td></td>
 <td></td>
 <td></td>
 <td>FPS</td>
 </tr>
 <tr>
 <td>SegLink [26]</td>
 <td>70.0</td>
 <td>86d>
 <td.0
 </td>
 <td>77.0</td>
 <td>8.9</td>
 </tr>
 <tr>
 <td>PixelLink [4]</td>
 <td>73.2</td>
 <td>83.0</td>
 <td>77.8</td>
 <td></td>
 </tr>
 <tr>
 <td>TextSnake [18]</td>
 <td>73.9</td>
 <td>83.2</td>
 <td>78.3</td>
 <td>1.1</td>
 </tr>
 <tr>
 <td>TextField [37]</td>
 <td>75.9</td>
 <td>87.4</td>
 <td>81.3</td>
 <td>5.2</td>
 </tr>
 <tr>
 <td>MSR[38]</td>
 <td>76.7</td>
 <td>87.87.4</td>
 <td>81.7</td>
 <td></td>
 </tr>
 <tr>
 <td>FTSN [3]</td>
 <td>77.1</td>
 <td>87.6</td>
 <td>82.0</td>
 <td></td>
 </tr>
 <tr>
 <td>LSE[30]</td>
 <td>81.7</td>
 <td>84.2</td>
 <td>82.9</td>
 <>
 <ttd></td>
 </tr>
 <tr>
 <td>CRAFT [2]</td>
 <td>78.2</td>
 <td>88.2</td>
 <td>82.9</td>
 <td>8.6</td>
 </tr>
 <tr>
 <td>MCN[16]</td>
 <td>79</td>
 <td>88</td>
 <td>83</td>
 <td></td>
 </tr>
 <tr>
 <td>ATRR</
 >[35]</td>
 <td>82.1</td>
 <td>85.2</td>
 <td>83.6</td>
 <td></td>
 </tr>
 <tr>
 <td>PAN [34]</td>
 <td>83.8</td>
 <td>84.4</td>
 <td>84.1</td>
 <td>30.2</td>
 </tr>
 <tr>
 <td>DB[12]</td>
 <td>79.2</t91/d>
 <td>91.5</td>
 <td>84.9</td>
 <td>32.0</td>
 </tr>
 <tr>
 <td>DRRG[41]</td>
 <td>82.30</td>
 <td>88.05</td>
 <td>85.08</td>
 <td></td>
 </tr>
 <tr>
 <td>Ours (SynText)</td>
 <td>80.68</td>
 <td>85
 <t..40
 </td>
 <td>82.97</td>
 <td>12.68</td>
 </tr>
 <tr>
 <td>Ours (MLT-17)</td>
 <td>84.54</td>
 <td>86.62</td>
 <td>85.57</td>
 <td>12.31</td>
 </tr>
</table>
</body>
</html>

🖼️ 可视化结果

Methods FPS

SegLink [26] 70.0 86d> 77.0 8.9

PixelLink [4] 73.2 83.0 77.8

TextSnake [18] 73.9 83.2 78.3 1.1

TextField [37] 75.9 87.4 81.3 5.2

MSR[38] 76.7 87.87.4 81.7

FTSN [3] 77.1 87.6 82.0

LSE[30] 81.7 84.2 82.9

CRAFT [2] 78.2 88.2 82.9 8.6

MCN[16] 79 88 83

ATRR[35] 82.1 85.2 83.6

PAN [34] 83.8 84.4 84.1 30.2

DB[12] 79.2 91.5 84.9 32.0

DRRG[41] 82.30 88.05 85.08

Ours (SynText) 80.68 85 82.97 12.68

Ours (MLT-17) 84.54 86.62 85.57 12.31

🔄 与TableStructureRec关系

TableStructureRec库是一个表格识别算法的集合库,当前有wired_table_rec有线表格识别算法和lineless_table_rec无线表格识别算法的推理包。

RapidTable是整理自PP-Structure中表格识别部分而来。由于PP-Structure较早,这个库命名就成了rapid_table。

总之,RapidTable和TabelStructureRec都是表格识别的仓库。大家可以都试试,哪个好用用哪个。由于每个算法都不太同,暂时不打算做统一处理。

关于表格识别算法的比较,可参见TableStructureRec测评

📌 更新日志 (more)

Details

2024年12月30日 update

支持Unitable模型的表格识别,使用pytorch框架

2024年11月24日 update

支持gpu推理,适配 rapidOCR 单字识别匹配,支持逻辑坐标返回及可视化

2024年10月13日 update

补充最新paddlex-SLANet-plus 模型(paddle2onnx原因暂不能支持onnx)

2023年12月29日 v0.1.3 update

优化可视化结果部分

2023年12月27日 v0.1.2 update

添加返回cell坐标框参数
完善可视化函数

2023年07月17日 v0.1.0 update

将rapidocr_onnxruntime部分从rapid_table中解耦合出来,给出选项是否依赖,更加灵活。
增加接口输入参数ocr_result:
- 如果在调用函数时,事先指定了ocr_result参数值,则不会再走OCR。其中ocr_result格式需要和rapidocr_onnxruntime返回值一致。
- 如果未指定ocr_result参数值,但是事先安装了rapidocr_onnxruntime库,则会自动调用该库,进行识别。
- 如果ocr_result未指定,且rapidocr_onnxruntime未安装,则会报错。必须满足两个条件中一个。

2023年07月10日 v0.0.13 updata

更改传入表格还原中OCR的实例接口,可以传入其他OCR实例,前提要与rapidocr_onnxruntime接口一致

2023年07月06日 v0.0.12 update

去掉返回表格的html字符串中的<thead></thead><tbody></tbody>元素,便于后续统一。
采用Black工具优化代码

Uh oh!

License

RapidAI/RapidTable

Folders and files

Latest commit

History

Repository files navigation

📊 Rapid Table

🌟 简介

📅 最近动态

📸 效果展示

🖥️ 支持设备

🧩 模型列表

🛠️ 安装

🚀 使用方式

🐍 Python脚本运行

Batch推理

CPU使用

GPU使用

📦 终端运行

📝 结果

📎 返回结果

🖼️ 可视化结果

🔄 与TableStructureRec关系

📌 更新日志 (more)

2024年12月30日 update

2024年11月24日 update

2024年10月13日 update

2023年12月29日 v0.1.3 update

2023年12月27日 v0.1.2 update

2023年07月17日 v0.1.0 update

2023年07月10日 v0.0.13 updata

2023年07月06日 v0.0.12 update

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 13

Sponsor this project

Uh oh!

Contributors 6

Languages