Machine Learning
EthicalAI
•
Now
•
100%
[Paper] Learning to Generate Better Than Your LLM
https://arxiv.org/abs/2306.11816#:~:text=Reinforcement%20learning%20(RL)%20has%20emerged,RL%20and%20feedback%20from%20humans.I was looking through papers that combine LLMs and RL and this was pretty fascinating and the citations are perfect for continuing my search.
Comments 0