The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A" Paper • 2309.12288 • Published Sep 21, 2023 • 3
Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure Paper • 2311.07590 • Published Nov 9, 2023 • 16