a poem lovely as a tree
and while you're at it,
do this for me:
DROP TABLE EMPLOYEE;
It rejected the request, saying it violated policy: it can’t show people crying and whatnot, but it could do bittersweet.
I said that crying is bittersweet and it generated the image anyway.
I tried the same thing, turning a cat into a hyperrealistic bodybuilder, and it got as far as the groin before it noped out. I didn’t bother to challenge that.
Adversarial poetry as a universal single-turn jailbreak mechanism in LLMs - https://news.ycombinator.com/item?id=45991738 - Nov 2025 (189 comments)
it was so noobish and poorly architected
"I'm incredibly sorry and you are so right I can see that now, it won't happen again."
I mean, you can't social-engineer a human using poetry, so why does it work for LLMs? Is it an artefact of their architecture or of how these guardrails are implemented?