Reinforcement Learning Strikes Back
·28 words·1 min
Back in circa 2016 we all were RL people, then we turned into LLM pre trainers and soon we might be back to RL again :). https://x.com/nrehiew_/status/1836761662231400948