This is neat. I wonder… Is there a more comprehensive analysis of how well the Apple Vision framework compares to other multi-modal AI? Would there be any benefit to pre-processing images via Auge before handing them to Claude, GPT?
by thealistra
0 subcomment
The ocr example says it recognizes Chinese, but output ignores it - maybe just AI bug in generated examples