←
Hacker News
Reinforcement Learning from Human Feedback
41 points
2 comments
3 hours ago
klelatti
Web version with links, etc:
https://rlhfbook.com/
show comments
Web version with links, etc:
https://rlhfbook.com/