John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges
- Video
- Apr 20, 2023
- #Naturallanguageprocessing #ArtificialIntelligence
John discusses the issue of hallucination and factual accuracy with large language models. He argues that behavior cloning or supervised learning is not enough to avoid the hallucin...
Show More