Skip to content

normstrngs: add more hyphens and quotes#4195

Merged
stweil merged 2 commits intotesseract-ocr:mainfrom
bertsky:patch-2
Mar 5, 2024
Merged

normstrngs: add more hyphens and quotes#4195
stweil merged 2 commits intotesseract-ocr:mainfrom
bertsky:patch-2

Conversation

@bertsky
Copy link
Contributor

@bertsky bertsky commented Mar 5, 2024

IIUC, this is used when a language model / dict / dawg is in place to normalise the characters before applying hyphenation and quoting rules (which are not all covered by punctuation dawgs etc).

Copy link
Member

@stweil stweil left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you.

@stweil stweil merged commit a08a6ac into tesseract-ocr:main Mar 5, 2024
@stweil
Copy link
Member

stweil commented Mar 5, 2024

The 2nd commit was not strictly necessary. It's quite common that lists are terminated like that because adding more entries at the end is easier.

@bertsky
Copy link
Contributor Author

bertsky commented Mar 5, 2024

sorry, I forgot to increase the constants for the number of initialisers, hence the failing build.

will make a new PR with that fix right now

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants