We ran into this when evaluating browser automation frameworks at AgDex. The ones that wrap CDP in deterministic helpers are slower to add features but much easier to debug in production. The "agent wrote its own helper" moment is magical in demos, but in prod you want a diff you can review.
Probably the right answer is what you're implicitly building: a minimal harness with good logging, so you can replay the CDP calls post-mortem. Is that something you're planning to add?
Is a bit like saying I'll never watch a movie again because LLMs can summarise it for me. For many tasks and activities the UI or experience in the browser is actually the end goal of what I am doing.
It's called "agentic coding" for all I know, and isn't a new paradigm, the whole purpose with agentic coding is that it uses tools to do their thing, then those tools could be structured as the good old JSON schema tools next to the implemented runtime, or as MCP, or HTTP API or whatever, the "paradigm" is the same: Have a harness, have a LLM, let the harness define tools that the LLM can use those.
Anyway, of course this will be superseded by a harness that provides freedom to complete any task within the OS.
I call it Terms of Service Violation. :)
> Set up https://github.com/browser-use/browser-harness for me.
> Read `install.md` first to install and connect this repo to my real browser. Then read `SKILL.md` for normal usage. Always read `helpers.py` because that is where the functions are. When you open a setup or verification tab, activate it so I can see the active browser tab. After it is installed, open this repository in my browser and, if I am logged in to GitHub, ask me whether you should star it for me as a quick demo that the interaction works — only click the star if I say yes. If I am not logged in, just go to browser-use.com.
Is the the new "curl {url} | sh"?
2. Can you publish a tabular comparison on your README?
3. What information gets sent to your API server?
I'm struggling to see why I should use this over agent-browser; I have not yet run into the "cross origin iframes" problem. Is this more for the 'claw crowd?
One issue I have is the pricing. The API is straightforward and easy to deploy, but it seems the API is restricted to a paid tier. Using the inline agent sessions seems possible via the free plan.
Happy to accept corrections if I'm wrong.