FRESH

Hacker News

Show HN: Local-first fast CPU image to text for screenshots, PDFs, webpages

17 points by mrkn1

by KetoManx64

1 subcomments

What's the performance like compared to tesseract? I don't see tesseract mentioned anywhere in the readme, which is surprising considering that's the number one tool most go to for Image > text OCR.

by lavaman131

0 subcomment

This is awesome! Been needing something like this for some research paper diagrams I've been indexing.

by abstract257

1 subcomments

Curious how it does on multi-page scanned PDFs vs. single screenshots? The ORT vision/decoder split is the part that usually makes or breaks CPU VLM OCR...

by vivzkestrel

1 subcomments

- how well do you think this ll work with code? i mean take code screenshots and convert it into actual code for vscode

by monosma

1 subcomments

What was the reason for adopting PaddleOCR? Can other OCR models be used as well?

by kouru225

1 subcomments

by garrett2558

1 subcomments

by BIGFOOT_EXISTS

1 subcomments