FRESH Hacker News
Is One Layer Enough? A Single Transformer Layer Matches Full-Parameter RL Train
95 points by tcp_handshaker