Hacker News

1-Bit Bonsai Image 4B Image Generation for Local Devices

98 points by modinfo

by captainregex

0 subcomment

What trade-off would one need to clear to justify the hardware and the work to get this running locally as part of a broader system? It's a lot of work setting up and maintaining a production harness/system on a local device. I don't personally generate images at a scale where using a lab's app burns through all my tokens. I like the idea of local AI, but I don't see widespread adoption of it happening in commercial or customer situations anytime soon, no matter how small or good-enough the models get. Even with Uber's token-burn whiplash, I doubt their answer will be "run some of it locally". IT nightmare, I'd imagine.

by jeroenhd

0 subcomment

Couldn't try it because the demo app is iOS-only and the web version just crashes my browser. The small model is impressive, but if you have to front-load a 1.8GB text encoder model, the savings aren't quite as useful.
I do wonder how these compare to existing image generation models. I've tried https://github.com/alichherawalla/off-grid-mobile-ai for a while but I find the image generation models rather lacking.
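To put rough numbers on the text-encoder point above, here is a back-of-the-envelope sketch. It assumes, purely for illustration, that all 4B image-model weights are stored at exactly 1 bit each (the thread does not confirm the actual packing or any overhead); the 1.8 GB text encoder figure is the one quoted in the comment. The helper name `packed_size_gb` is made up for this example.

```python
def packed_size_gb(num_params: float, bits_per_weight: float) -> float:
    """Size in decimal gigabytes of num_params weights at bits_per_weight bits each."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: 4B weights at exactly 1 bit each, no metadata or scale factors.
image_model_gb = packed_size_gb(4e9, 1.0)
# Figure quoted in the comment above.
text_encoder_gb = 1.8

total_gb = image_model_gb + text_encoder_gb
encoder_share = text_encoder_gb / total_gb

print(f"image model ~{image_model_gb:.2f} GB, total ~{total_gb:.2f} GB, "
      f"text encoder share {encoder_share:.0%}")
```

Under those assumptions the text encoder would account for most of the download, which is the sense in which the savings from the 1-bit model get diluted.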

by lumost

2 subcomments

I actually can't wait for the future where I upgrade hardware in order to upgrade my AI, as an alternative to an expensive subscription.
There are many problems I want to work on that require billions of tokens. These are completely inaccessible without corporate project sponsorship at the moment. An ASIC generation machine that can pump out a few tens of thousands of tokens per second at Opus 4.6 quality would be more than sufficient.
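As a rough sanity check on those figures, the sketch below estimates how long a billion tokens would take at a couple of throughputs in the "few tens of thousands per second" range. The specific rates (10,000 and 30,000 tokens/s) and the one-billion-token budget are illustrative assumptions, not numbers from any real device, and `generation_time_hours` is a hypothetical helper.

```python
def generation_time_hours(total_tokens: float, tokens_per_second: float) -> float:
    """Wall-clock hours to generate total_tokens at a constant tokens_per_second."""
    return total_tokens / tokens_per_second / 3600


# Assumed budget: one billion tokens. Assumed rates: 10k and 30k tokens/s.
budget_tokens = 1e9
estimates = {rate: generation_time_hours(budget_tokens, rate)
             for rate in (10_000, 30_000)}

for rate, hours in estimates.items():
    print(f"{rate:,} tok/s -> about {hours:.1f} hours per billion tokens")
```

At these assumed rates a billion tokens is roughly a day or less of continuous generation on a single machine, which is why that throughput range would be plausible for the workloads described.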

by sorenjan

0 subcomment

They call it a diffusion model, but it's based on Flux.2, which is a rectified flow model.

by wiradikusuma

0 subcomment

Is there a benchmark of local image generation models? Local = can run on a 16 GB MacBook or 8 GB+ NVIDIA card.

by a1o

1 subcomments

Could anyone pick up the minimum hardware requirements for this? Both RAM and storage?

by iJohnDoe

1 subcomments

by sudb

0 subcomment

Very interested to see where this kind of work goes for on-device video generation!

by potatoman22

0 subcomment

by janniks

1 subcomments

by MitPitt

6 subcomments

Lately I've noticed posts with barely 10 points getting to the HN front page. Was it always like this?

by SilentM68

0 subcomment

Question:
Is it compatible with Ollama or ComfyUI, or are those providers unneeded? Is it compatible with low-end hardware?
Also, where does "./setup.sh" drop the components on Linux?
Thank you, Sol

by yieldcrv

0 subcomment

Impressive, it combines a couple of techniques that I always wanted the frontier models to have.
Having trouble loading the WebGL browser demo on my phone, but no biggie.