by vadepaysa
3 subcomments
- I did a Show HN[0] a few days back with my CLI agent called cook[1] and for a moment I was ecstatic my tool made it to the front page. haha.
[0]: https://news.ycombinator.com/item?id=47262711
[1]: https://getcook.dev
- Can someone explain what this is to my n00b brain. I don't get what claude-cli is missing that this adds in?
- I dunno. I just let Claude build a python script that calls Claude code though subprocess.run().
I recently made a sort of Autoresearch with that approach. The script calls Claude Code to create a hyphotesis, then code based on that, evaluate- rinse and repeat. I am still trying to figure out if I am actually on to something or just burning tokens. Jury is still out.
- A noob question: is there a tool that automatically instructs Claude Code to "continue" when the token quota is reset after 5h? I am interested in that more than some rather fancy loops.
- There is a skill installation option. The skill markdown has 180 lines [1].
My take? I like it. It's concise enough for me to try it out. And I love the webpage.
[1] https://github.com/rjcorwin/cook/blob/main/no-code/SKILL.md
- Looks pretty nice. I think a lot of devs have been making similar tools, I've written my own thing that does a work review loop. I like the interface you've made. I'll probably give it a go, but I'm also reluctant to relinquish the control I have when it's my own code doing orchestration.
- I wrote https://jsr.io/@cdaringe/ralphmania which has a lot of feature overlap. cook looks more polished.
1. How do you handle worktree merge conflicts and/or integration validation issues?
2. Can i work straight from a list of requirements? I think i saw you support it…
3. I have my variant write a minimal explainer for every satisfied spec, aka receipts. Its pretty great, because i often review the receipts, and if imperfect, mark as NEEDS_REWORK + notes, and it’ll eventually just pick that up on a future iter
- The composability here is really elegant — review v3 pick as a pipeline that "just works" is the kind of DX that makes agent orchestration feel tractable rather than overwhelming.
- A lot of people coming up with these tools to orchestrate agents to do a one-shot implementation in a sandbox (in a server/container/pipeline).
I also have a similar - yet different approach - with a Mother Agent (MoMa) planner-reviewer-implementer multi agent pattern that orchestrates a feedback loop using Claude memory between agents.
https://news.ycombinator.com/item?id=47437012#47437013
I understood that you have a Judge agent that evaluates independent subagent solo executions and chooses best solution based on ralph algorithm. Did you play with limits on how many solo agents it is sufficient to spawn vs. not getting a better solution? and what is the limit of soloagent solutions that the Judge can compare? (obviously must depend on the complexity and context cost of a solution)
- Nice tool.
- AI agent orchestration is future. That's where workflow engine shines. I'm doing the same thing using Dagu.sh and I don't use terminal so much anymore.
by kasperstorgaard
1 subcomments
- How heavy on tokens is this? I don't use these style workflows and am fairly new to claude code, so I assume it's better than 3x tokens when doing 3 passes?
- One thing i wish more CLI tools did: non-interactive mode. i build bash tools that have interactive prompts for first-time users, but everything the prompt asks also has a CLI flag. makes scripting and CI/CD so much simpler - you can test the exact same code path without mock stdin.
- I really like the idea but my gut says it would be hard to trust. In the last example... "cleanest result" is not a great definition of done (that's only sort of a nitpick).
In general, I feel that removing the decision process (or relegating it to a language model) is not a good idea.
- How does this handle when Claude needs user input? To choose an option, grant tool permission, clarify questions…
- claude> "We want to add a title section that shows what page we are currently on, use cook to manage the development process"
* coolers whirring, gpus on fire, tokens flying, investors happy, developer goes for 6th break of the day
- Very nice. However, I do like to read every agent summary before letting them move on. I'm not sure I'd be able to apply this level of automation to many tasks.
- Semi-on-topic: Anyone know a way to get a good alternative UI on top of Cursor?
My company’s tracking how much we use the damn thing (its autocomplete is literally less-useful than standard VSCode, only time it’s consistently good is when it sees me do one thing to a line, sees repeated similar lines after that, and suggests I do it on the next one too, one at a time, and that’s only useful to me because I’ve never actually bothered to learn how to properly use a text editor) so I can’t avoid it, but even on codebases in the hundreds of lines it’s OOM killing things on my 16GB laptop (it, plus goddamn Teams, were eating half the memory by themselves the other day… with Cursor sitting at almost 6GB alone. JFC. On the plus side if this is what software from a company that should be full of experts at using these things looks like, guess our jobs are safe from them… though not from recession and ZIRP unwinding)
- Is the form factor what makes this different amongst the 500 that exist? Like CLI vs UIs?
by NetOpWibby
2 subcomments
- Dull colors and a display font used for copy makes this website incredibly unpleasant to read.
- Surprised that there's no discussion about the prevalence of using TypeScript for developing these CLI agent harnesses. To me it seems concerning that such CLI programs are so chunky and commonly have to use over 1GB of RAM. Might be boomer-talk, but I am used to CLI apps being extremely lightweight and fast (think Total Commander, which has a UI, but is still very lightweight and responsive)
I understand that part of the reason is because many of these harnesses are vibe-coded, so plenty is lost in terms of optimization. And, well, because LLMs code best in TypeScript
- can we integrate it with Devin too? seems like it doable
by studio-m-dev
0 subcomment
- [dead]
- [dead]
- [dead]
- [dead]
by eddie-wang
0 subcomment
- [flagged]
by BANRONFANTHE
0 subcomment
- [dead]
by derodero24
0 subcomment
- [dead]
by olivercoleai
0 subcomment
- [dead]
by shablulman
0 subcomment
- [dead]
by uxmaniik52
1 subcomments
- [dead]
by gethwhunter34
0 subcomment
- [dead]
by erdmozkn62
0 subcomment
- [dead]
- [dead]
by catlover76
0 subcomment
- [dead]
by NikitaCometa65
0 subcomment
- [dead]
by panditaditya21
0 subcomment
- [dead]
by pissedoffadmin
0 subcomment
- [dead]
- [flagged]
by rafaamaral
2 subcomments
- [flagged]