I've been using the simpler but not as flexible alternative: I'm running Claude Code for web (Anthropic's version of Codex Cloud) via the Claude iPhone app, with an environment I created called "Everything" which allows all network access.
(This is moderately unsafe if you're working with private source code or environment variables containing API keys and other secrets, but most of my stuff is either open source or personal such that I don't care if the source code leaks.)
Anthropic run multiple ~21GB VMs for me on-demand to handle sessions that I start via the app. They don't charge anything extra for VM time which is nice.
I frequently have 2-3 separate Claude Code for web sessions running at once, often prompted from my phone, some of them started while I'm out walking the dog. Works really well!
Thanks to tailscale and ssh I can vibecode on the go from my phone with this setup.
While it’s great to leave a task running, no matter what I do I can’t achieve the type of high quality work on the go that I can when I’m sitting at my desk.
For me working on a full SaaS.. I just can’t do quality work on my phone.
The only way I can do quality work is to sit at my desk where I’m focused on the work. To play with the result of a prompt, take copious notes, feed them back to the agent, not ship until the thing is polished to a shine. To feature flag the changes, review all code in excruciating detail as though it was written by a dyslexic intern, add all the metrics and logs one can think of (VictoriaMetrics), add user-behavior logging (Amplitude/Posthog) and monitor the thing like your livelihood depends on it. Because it’s a product and you have pride in your work.
All of that needs loads of screen real estate and a keyboard.
So I guess the more things change, the more they stay the same.
This blog is super sus too. All the posts are about Claude. I suspect it's run by Anthropic, just read the About page: https://granda.org/en/about/
It won't matter if I'm washing the dishes, walking the dog, driving to the supermarket, picking up my kids from school. I'll always be switched on, on my phone, continuously talking to an LLM, delivering questionable features and building meaningless products, destroying in the process the environment my kids are going to have to grow in.
I'm a heavy LLM user. On a daily basis, I find LLMs extremely useful both professionally and personally. But the cognitive dissonance I feel when I think about what this means over a longer time horizon is really painful.
On the $210/month VM cost - fair point. For me it's worth it because I'm running multiple parallel agents throughout the day, but you could definitely do this cheaper with a smaller instance or spot pricing.
The real bottleneck isn't typing on mobile - it's reviewing the output. I've found 2-3 parallel agents is my sweet spot before I can't keep up with reviewing PRs. Git worktrees help a lot here since each agent works in isolation with its own containers (including db).
And to the work-life balance concerns - totally valid. For me this isn't about working more, it's about capturing ideas when they strike (usually on walks or waiting in line). The Poke notifications let me stay async rather than glued to a terminal.
Although a little late to the HN post, happy to answer any questions about the setup!
But relevant to this article here, it also has a super sick web based agent, Shelley, that is quite adequate for using from the phone.
I used it to build a little guestbook thing in ~2 hours, late night in bed in my phone. Link to submission, and my post on it there, and the guestbook I wrote. https://news.ycombinator.com/item?id=46397609 https://news.ycombinator.com/item?id=46398115 https://nan-falcon.exe.xyz
I'd also note that OpenCode is a solidjs app, that can run in tui (how most folks know it) or the web. And it has an excellent excellent plugin architecture. The work in this post to build workflows is great!
This works reasonably well, but there is a gap for small messages or review comments. I am waiting for Anthropic to shop a feature where the Claude mobile app is able to mirror Claude Code (not the Claude desktop app) and lets me see the diffs of the changes it made and send commands. I'd use this to steer the conversation while on the go with short commands or prompts so that when I'm back at the computer I can focus on the important feedback that I can jot down quickly on the computer keyboard.
I feel like I'm limited by writing specs for agents, then by reviewing their work.
I typically spend 30-60 minutes writing a spec, then the agent runs for 30-60 minutes, then I spend 30-60 minutes refining code/ui/etc before putting up a PR, then another 30-60 minutes waiting for CI + addressing automated + human code review feedback.
Unless it's a totally vibe coded side project without any tests or quality control of some sort.
I'm just curious what you can build with this setup. It just seems to be the way to create a mountain of sloppy, unmaintainable code.
Essentially you run a server on some machine. Sessions are created in Docker containers, K8s pods, or via Zellij (an app similar to tmux). You can:
- Directly attach to sessions via Docker attach (built-in via a TUI). You get a normal Claude Code experience, but multiplexed. The switcher/UI shows you the status of Claude and the PR (pushed, merge conflicts, CI status, review status, etc.)
- Manage sessions via a web UI. Connect to Claude Code directly via your browser. You have access to the usual Claude Code terminal or a native chat view.
- Manage sessions via an app. You have access to a native chat view.
It achieves isolation via Git worktrees + a proxy so that containers have access to zero credentials (there aren't even any Claude code creds in the container), which allows you to more safely use bypass all permissions mode.
This works better for me that Claude Code on Web because I have control over the environment Claude is running in. I can give it any Docker image I want, I can have it connect to my local network, etc.
It's still a WIP (the core bits are there, but it's not polished yet), but I'm hoping it provides a friendlier UX with a similar goal for what the OP has in mind.
> hash_val = sum(ord(c) for c in branch_name)
> django_port = 8001 + (hash_val % 99)
> Six agents, six features, one phone.
What do you mean, no conflicts? The probability of a collision with six branches and 99 ports slots is ~14% assuming optimal hashing (which this decidedly isn't).
I have a tweak that allows pasting images to claude code over SSH:
How it works:
PTY Interception: It creates a pseudo-terminal (PTY) to wrap the SSH process, allowing it to sit as a "man-in-the-middle" between your keyboard and the remote shell.
Bracketed Paste Detection: It monitors stdin for "bracketed paste" sequences (the control codes terminals send when you Cmd+V or drag-and-drop a file).
The "Hook": When a paste occurs, it pauses execution and scans the text for local macOS file paths.
Auto-Sync: If a local path is found, it immediately syncs that file to the remote server (using the provided SSH key) in the background.
Transparent Forwarding: Once the sync is complete, it forwards the original text to the shell.
You can drag and drop a file from your local Finder into a remote SSH session, and the file is automatically uploaded to the server before the path appears on the command line. Also works with copy paste, screnshots.
E.g. a Terragonlabs subscription is 25/month for 3 concurrent tasks and 50/month for 10.
I kick off a prompt as a GitHub issue, Claude fires away on this issue, provides updates as comments and a PR is created for me at the end for review.
It also notifies me throughout, and I can look at the pipelines to see the thinking behind the action.
Before that I used Happy (https://happy.engineering/) which is also open source and a lot more sophisticated. It has a voice assistant that can chat with Claude Code on your behalf in the mobile app. However, it wasn't very reliable, and there are other reasons to use Hapi instead (documented in the Hapi repo).
Before that, Omnara (https://www.omnara.com/) a YC company and seemingly a proprietary Happy fork (?) but it never worked properly for me.
Long story short, there are a few of the around, and frankly I really like to use them. Unlike other commenters, I don't find that they wreck my work-life balance. Rather, I can go out and have a walk in the park, only checking in on long-running tasks every once in a while. The diff view is pretty good too. There are many tasks where I'd rather not stare at my PC all day and instead do other things, and these tools allow me to do that.
I am also hoping / trying to put Claude code on top of a personal zettlekasten to automate more of my “personal life” tasks and get more stuff done for me. Haven’t gotten it really singing yet but I think that could also be really cool.
I also used Web based coding environment like Glitch (R.I.P.) for years.
You can do that with your virtual keyboard, voice or a even a physical keyboard via BT, e.g. Corne-ish Zen.
That's how I travel.
That's really nothing AI specific or novel. It's cool though.
FWIW I even coined a related term https://fabien.benetou.fr/Languages/OwnConcepts#ResponsivePr... "extending responsive design to be able to program on the device, any device from eink to mobile phone to device, one is currently using not just to "consume" content, e.g read a Website that is then properly formatted for it, but rather program back that very device"
That being said, if you do want to go that route check out CloudInit as it will help you (or whatever tool you rely on) to spawn new instance on your favorite cloud provider to boot specific instances and e.g. setup Docker/Podman then services, etc with no interaction. Also ntfy can help you manage notifications across devices on your own infrastructure, no 3rd parties.
One thing to note: the VM seems like an absolute waste of money. If you are using tailscale, might as well connect back to bare metal VMs you can run at home. Save yourself some coin.
I've not tried that myself since I've only been using it from my laptop, but I do prefer chatting with their coding agent in a browser tab to using Claude Code in a terminal window.
Ofc if you have demo deployments etc on branches that you could open on mobile it works for longer.
Another issue is that I often need to sit down and think about the next prompt going back and forth with the agent on a plan. Try out other product features, do other research before I even know what exactly to build. Often doing some sample implementations with Claude code and click around these days. Doing this on a phone feels... limiting.
I also can't stand the constant context switching. Doing multiple feature in parallel already feels dumb because every time I come from feature B to A or worse from feature G to E it takes me some time to adjust to where I was, what Claude last did and how to proceed from here. Doing more tasks than 2 max. 3 in parallel often ends up slowing me down. Now you add ordering coffee and small talk to the mix and I definitely can't effectively prompt without rereading all history for minutes before sending the next prompt. At which point I might have also opened up my laptop.
Ofc if you truly vibe code and just add feature on feature and pray nothing breaks, the validation overhead and bar for quality goes down a lot so it works a lot better but the output is also just slop by then.
I typed this on my phone and it took 20 minutes, a laptop might have been faster.
I love that as we go through our GenAI development journey, we're all finding success in the same patterns.
I find myself planning and jotting down things into a notebook while juggling adult/parent responsibilities. On little longer gaps I research. Then when the occasional longer gap happens I'm ready to start cracking on my desktop. I've been only dabbling with AI but have found that writing prompts by hand in the notebook and using the desk time to execute them works well. This also keeps me in the free tier.
Anyone have better results?
The limitation is that some Typescript builds run out of RAM (even with swap) and I can't use playwright, but still it's been useful.
It's fun writing an agent, too.
It is tolerable on an iPad with Blink with commands to maximise and minimise panes using vim-style keyboard bindings, kind of like an iOS sway.
QUESTION=$(echo "$EVENT_DATA" | jq -r'.tool_input.questions[0].question')I've not tried Claude Code for Web but assume it would be similar. https://code.claude.com/docs/en/claude-code-on-the-web
The thing I'm missing is a phone that makes it comfy. I could just SSH feom my standard S23, but what I've got my eye on is one of those foldable things.
Has anyone used one like a laptop? Keyboard on the bottom half, terminal on the top? Does it work decently?
1.Install Tailscale on WSL2 and your iPhone 2.Install openssh-server on WSL2 3.Get an SSH terminal app (Blink, Termius, etc.). I use blink ($20/yr). 4.SSH from Blink to your WSL2’s Tailscale IP 5. Run claude code inside tmux on your phone.
Tailscale handles the networking from anywhere. tmux keeps your session alive if you hit dead spots. Full agentic coding from your phone.
Step 2: SSH server In WSL2:
sudo apt install openssh-server sudo service ssh start
Run tailscale ip to get your WSL2’s IP (100.x.x.x). That’s what you’ll connect to from your phone.
Step 3: Passwordless login In Blink, type config → Keys → + → create an Ed25519 key. Copy the public key. On WSL2:
echo "your-public-key" >> ~/.ssh/authorized_keys
Then in Blink: Hosts → + → add your Tailscale IP, username, and select your key. Now it’s one tap to connect.
Step 4: tmux keeps you alive iOS kills background SSH connections. tmux solves this.
sudo apt install tmux tmux claude
Switch apps, connection dies, no problem. Reconnect: I can just type `ssh dev` in blink and I'm in my workstation, then `tmux attach`, you’re right back in your session.
Pro tip: multiple Claude sessions Inside tmux: •Ctrl+b c — new window •Ctrl+b 0/1/2 — switch windows I run different repos or multiple agents in the same repo, in different windows and jump between them. Full multi-project workflow from my phone.
Free and seamless setup!
But anthropic has since launched the ability to “teleport” sessions to mobile. (Claude Code is baked into the app). The iOS experience has been smooth for the most part.
People keep saying things like “2026 is the year of background agents, sandboxes, etc” but imo the harness will eat the entire platform stack. It already is. It will only get better.
.. with a valid SSH key unless I’m reading it wrong?
Except that you are doing anything else but coding here. Coding involves writing code, which isn't actually done by the author here.