Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

提交blog:小语种OCR标注效率提升10+倍:PaddleOCR+ERNIE 4.5自动标注实战解析 #175

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
openvino-book wants to merge 8 commits into PFCCLab:main
base: main
Choose a base branch
Loading
from openvino-book:main

Conversation

@openvino-book
Copy link
Contributor

@openvino-book openvino-book commented Aug 22, 2025

提交blog:小语种OCR标注效率提升10+倍:PaddleOCR+ERNIE 4.5自动标注实战解析

其它commit请自动忽略

Copy link

netlify bot commented Aug 22, 2025
edited
Loading

Deploy Preview for pfccblog failed.

Name Link
🔨 Latest commit 41a5738
🔍 Latest deploy log https://app.netlify.com/projects/pfccblog/deploys/68a81ea5cbba4100087cf654

Copilot AI review requested due to automatic review settings December 7, 2025 08:14
Copy link

netlify bot commented Dec 7, 2025
edited
Loading

Deploy Preview for pfccblog failed.

Name Link
🔨 Latest commit 57136ef
🔍 Latest deploy log https://app.netlify.com/projects/pfccblog/deploys/693537650282a20008a569a5

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR submits a new technical blog post that demonstrates how to achieve 10x+ efficiency improvement in OCR annotation for minority languages using PaddleOCR combined with ERNIE 4.5. The solution addresses the critical bottleneck of scarce and expensive labeled data for minority language OCR development.

Key Changes:

  • Introduces an automated annotation workflow that uses PaddleOCR for text detection/cropping and ERNIE 4.5 for dual-prediction with consistency verification
  • Reduces data preparation cycle from weeks to hours while improving annotation accuracy from 92.1% to 96.3%
  • Provides complete implementation code examples and performance benchmarks demonstrating 22.5x speed improvement and 95%+ cost reduction

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

Copilot code review Copilot Copilot left review comments

At least 1 approving review is required to merge this pull request.

Assignees

No one assigned

Labels

None yet

Projects

None yet

Milestone

No milestone

Development

Successfully merging this pull request may close these issues.

1 participant

AltStyle によって変換されたページ (->オリジナル) /