Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Web & CLI tool to fix noisy OCR text using a fine-tuned T5 model

License

Notifications You must be signed in to change notification settings

stelaras36/OCRfixer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

1 Commit

Repository files navigation

OCRfixer — OCR text cleaner with T5

Web & CLI εργαλείο που διορθώνει θορυβώδες OCR κείμενο (π.χ. 0->o, 1->l, 5->s) χρησιμοποιώντας fine-tuned T5.
Περιλαμβάνει annotation UI, data generation, training script, και web inference με πολλαπλούς υποψήφιους και confidence scores.


Features

  • Διόρθωση OCR λαθών σε Αγγλικά κείμενα
  • Web UI και Command-line interface
  • Training pipeline με synthetic data
  • Πολλαπλοί υποψήφιοι με confidence score

Demo

Web UI (Fix OCR)

Web demo

CLI

Enter noisy text: th1s 1s a t3xt w1th 0cr err0rs! Fixed: This pipeline exports and trains a model.

Enter noisy text: we1c0me t0 the neW pr0ject Fixed: We are testing our OCR fixer project.

Enter noisy text: the qu1ck br0wn f0x jump5 0ver the l4zy d0g Fixed: The quick brown fox jumps over the lazy dog. Installation

git clone https://github.com//OCRfixer.git cd OCRfixer pip install -r requirements.txt Usage CLI

python demo.py Web UI

python app.py Then open http://127.0.0.1:5000/fix in your browser.

About

Web & CLI tool to fix noisy OCR text using a fine-tuned T5 model

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

AltStyle によって変換されたページ (->オリジナル) /