Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Comments

PDFBOX-5747: Surrogate pairs with combining diacritics are incorrectly ordered on text extraction#200

Open
reckart wants to merge 1 commit intoapache:trunk from
reckart:bugfix/PDFBOX-5747-Surrogate-pairs-with-combining-diacritics-are-incorrectly-ordered-on-text-extraction
Open

PDFBOX-5747: Surrogate pairs with combining diacritics are incorrectly ordered on text extraction #200
reckart wants to merge 1 commit intoapache:trunk from
reckart:bugfix/PDFBOX-5747-Surrogate-pairs-with-combining-diacritics-are-incorrectly-ordered-on-text-extraction

Conversation

@reckart
Copy link
Member

@reckart reckart commented Jan 26, 2025

  • Changed TextPosition.insertDiacritic() to preserve surrogate pairs
  • Added unit test
  • Included example test PDF file attached to PDFBOX-5747

...y ordered on text extraction
- Changed TextPosition.insertDiacritic() to preserve surrogate pairs
- Added unit test
- Included example test PDF file attached to PDFBOX-5747
@reckart reckart force-pushed the bugfix/PDFBOX-5747-Surrogate-pairs-with-combining-diacritics-are-incorrectly-ordered-on-text-extraction branch from a7e4da0 to 0841f61 Compare January 26, 2025 10:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

No reviews

Assignees

No one assigned

Labels

None yet

Projects

None yet

Milestone

No milestone

Development

Successfully merging this pull request may close these issues.

1 participant

AltStyle によって変換されたページ (->オリジナル) /