by captainregex
0 subcomment
- what trade off would one need to clear to justify the hardware and the work to get this running locally as part of a broader system? It’s a lot of work setting up and maintaining a production harness/system on a local device. I don’t personally repeatedly generate images at a scale where using a lab’s app somehow burns all my tokens. I like the ideas of local ai but I don’t see widespread adoption of it happening in commercial or customer situations anytime soon no matter how little/good enough they get. Even Uber- token burn whiplash but I doubt their answer will be “run some of it local”. IT nightmare, I’d imagine.
- Couldn't try it because the demo app is iOS only and the web version just crashes my browser. The small model is impressive but if you front load a 1.8GB text encoder model, the savings aren't quite as useful.
I do wonder how these compare to existing image generation models. I've tried https://github.com/alichherawalla/off-grid-mobile-ai for a while but I find the image generation models rather lacking.
- I actually can’t wait for the future where I upgrade hardware in order to upgrade my ai as an alternative to an expensive subscription.
There are many problems I want to work on which require billions of tokens. These are completely inaccessible without corporate project sponsorship at the moment. An asic generation machine which can pump out a few 10s of thousands of tokens per second at opus4.6 quality is more than sufficient.
- They call it a diffusion model, but it's based on Flux.2 which is a rectified flow model.
by wiradikusuma
0 subcomment
- Is there a benchmark of local image generation models? Local = can run on a 16 GB MacBook or 8 GB+ NVIDIA card.
- Anyone could pickup the minimal hardware requirements for this? Like both RAM and Storage?
- Does anyone ever get their stuff to actually work. Like actually load?
- Very interested to see where this kind of work goes for on-device video generation!
by potatoman22
0 subcomment
- I wonder why they didn't use a Bonsai model as the text encoder
- I was expecting to see images of Bonsai trees when I clicked this
- Lately I've noticed posts with barely 10 points getting to HN frontpage. Was it always like this?
- Question,
Is it compatible with Ollama, ComfyUI or are those providers unneeded, compatible with low-end hardware?
Also, where does "./setup.sh/ drop the components in Linux?
Thank you,
Sol
- impressive, combines a couple techniques that I always wanted the frontier models to have
having trouble loading the webgl browser demo on my phone but no biggy