snyders | Ideas for free: 100% correct OCR or an error-correcting watermark

Present-day OCR is good, but it still has an error rate of 1-2%. A simple solution would be to embed an error-correcting code on each page as a small, faint image that does not distract from the main content. This would make it possible to achieve 100% correct OCR of the text.
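To make the idea concrete, here is a minimal sketch using a very simple code: row and column XOR parity over the characters of a page. It assumes the parity data (the "watermark") is read back without errors, that OCR only substitutes characters (never drops or inserts them), and that there is at most one wrong character per block. The names `make_parity`, `correct` and `WIDTH` are made up for this illustration; a real scheme would need a much stronger code (Reed-Solomon, for example) to cope with a 1-2% error rate.

```python
from functools import reduce
from operator import xor

WIDTH = 8  # characters per row of the parity grid (an arbitrary choice)


def make_parity(text, width=WIDTH):
    """Return (row_parity, col_parity) for text laid out in rows of `width`."""
    codes = [ord(c) for c in text]
    rows = [codes[i:i + width] for i in range(0, len(codes), width)]
    row_parity = [reduce(xor, row, 0) for row in rows]
    col_parity = [0] * width
    for row in rows:
        for j, code in enumerate(row):
            col_parity[j] ^= code
    return row_parity, col_parity


def correct(ocr_text, row_parity, col_parity, width=WIDTH):
    """Repair at most one substituted character using the stored parity."""
    got_rows, got_cols = make_parity(ocr_text, width)
    if len(got_rows) != len(row_parity):
        raise ValueError("length changed; this sketch only handles substitutions")
    bad_rows = [i for i, (g, p) in enumerate(zip(got_rows, row_parity)) if g != p]
    bad_cols = [j for j, (g, p) in enumerate(zip(got_cols, col_parity)) if g != p]
    if not bad_rows and not bad_cols:
        return ocr_text  # parity matches, no error detected
    if len(bad_rows) != 1 or len(bad_cols) != 1:
        raise ValueError("more than one error; a stronger code is needed")
    i, j = bad_rows[0], bad_cols[0]
    pos = i * width + j
    syndrome = got_rows[i] ^ row_parity[i]  # XOR of the wrong and right character
    if pos >= len(ocr_text) or syndrome != got_cols[j] ^ col_parity[j]:
        raise ValueError("inconsistent parity; cannot repair")
    fixed = chr(ord(ocr_text[pos]) ^ syndrome)
    return ocr_text[:pos] + fixed + ocr_text[pos + 1:]


page = "Present day OCR is good"
rows, cols = make_parity(page)        # this is what the watermark would carry
scanned = page.replace("OCR", "0CR")  # simulated OCR mistake
print(correct(scanned, rows, cols))
```

The failing row and the failing column together locate the bad character, and the XOR difference in the row parity restores its value. Two or more errors in one block are detected but not repaired, which is why this is only an illustration of the principle.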

One may ask: why bother, if the text was prepared electronically in the first place? Counter-argument: in many cases we do not have access to the electronic original, which is why we need OCR at all.

Update: this turned out not to be a new idea; see the comments.

From: dimrub.livejournal.com

A real-life example is how the PGP source code made it (legally) out of the US.

From: snyders.livejournal.com

It was probably OCRed, but was there error correction?

From: dimrub.livejournal.com

There was :)

From: snyders.livejournal.com

You are right. Thanks!
