Show HN: Local-first fast CPU image to text for screenshots, PDFs, webpages
17 points by mrkn1
by KetoManx64
1 subcomments
What's the performance like compared to tesseract?
I don't see tesseract mentioned anywhere in the readme, which is surprising considering that's the number one tool most go to for Image > text OCR.
by lavaman131
0 subcomment
This is awesome! Been needing something like this for some research paper diagrams I've been indexing.
by abstract257
1 subcomments
Curious how it does on multi-page scanned PDFs vs. single screenshots? The ORT vision/decoder split is the part that usually makes or breaks CPU VLM OCR...
by vivzkestrel
1 subcomments
- how well do you think this ll work with code? i mean take code screenshots and convert it into actual code for vscode
by monosma
1 subcomments
What was the reason for adopting PaddleOCR?
Can other OCR models be used as well?
by kouru225
1 subcomments
Roman alphabet only or does this work with other alphabets?
by garrett2558
1 subcomments
Very cool, I'm building my own local-first product as well