FRESH

Hacker News

Launch HN: Mentat (YC F24) – Controlling LLMs with Runtime Intervention

52 points by cgorlla

by oersted

1 subcomments

by alexchantavy

2 subcomments

by serjester

1 subcomments

Congrats on the launch - you're value-add is quite confusing as someone that's at the applied AI layer. This comes off as more of a research project than a business. You're going to need an incredibly compelling sales pitch for me to send my data to an unknown vendor to fix a problem that might be obviated by the next model release (or just stronger evals with prompt engineering). Best of luck.

by rancar2

1 subcomments

Can you share more about the challenges ran into on the benchmarking? According to the benchmark note, Claude 4.5 Opus and Gemini 3 Pro Preview exhibited elevated rejection and were dropped from TruthfulQA without further discussion. To me this begs the questions, does this indicated that frontier closed SOTA model will likely not allow this approach in the future (ie in the process of screening for potential attack vectors) and/or that this approach will only be limited to a certain LLM architecture? If it’s an architecture limitation, it’s worth discussing chaining for easier policy enforcement.

by ilaksh

0 subcomment

So if I understand, this is basically advanced activation steering as a service? And you have already identified vectors for several open models that make them more truthful or better at reasoning and apply them automatically?
Because the API has a persona option which might be achieved with something like this https://github.com/Mihaiii/llm_steer or maybe for closed models you just have to append to the prompt.
What open source models are available? In the docs I only see mention of Google Flash Lite or something which is closed.

by esafak

2 subcomments

Are you not concerned that model creation companies will bake this into their next model? I am trying to understand business model.
Another question is how you would claim credit. People believe the quality of the end result depends only on the model, with serving only responsible for speed.

by Python3267

2 subcomments

--I was able to jailbreak it--
https://playground.ctgt.ai/c/5028ac78-1fa4-4158-af73-c9089cb...
Nevermind That was the ungoverned version of gemini, their models worked.

by kraddypatties

1 subcomments

Running into "no healthy upstream" when navigating to the link -- hug of death maybe?

by fuddle

2 subcomments

The link sends me to a Chat UI with no context about the product. An intro or walkthrough would be useful.

by orph

1 subcomments

Why not apply changes to the underlying model so that you crush every available eval?

by GuinansEyebrows

1 subcomments

by rrr_oh_man

0 subcomment