Article Details
Retrieved on: 2024-01-13 19:16:29
Excerpt
Past work has often relied on Reinforcement Learning from Human Feedback (RLHF), which optimizes the language model using reward scores assigned from ...
Article found on: medium.com
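The excerpt mentions RLHF optimizing a language model with reward scores. Below is a minimal, hypothetical sketch of a reward-weighted update step; all names, the HuggingFace-style model interface, and the simplified REINFORCE-style loss are assumptions for illustration, not the article's actual method (which typically involves a learned reward model, PPO, and a KL penalty).

```python
# Hypothetical sketch of an RLHF-style update: scale the response log-likelihood
# by a reward score. Assumes a HuggingFace-style causal LM that returns .logits.
import torch

def rlhf_step(policy_model, optimizer, input_ids, response_ids, reward_scores):
    """One simplified update: maximize reward-weighted log-likelihood of the response."""
    # Concatenate prompt and response, then score the response tokens under the policy.
    full_ids = torch.cat([input_ids, response_ids], dim=1)
    logits = policy_model(full_ids).logits
    # Logits at position t predict token t+1, so slice the positions that predict the response.
    response_logits = logits[:, input_ids.size(1) - 1:-1, :]
    log_probs = torch.log_softmax(response_logits, dim=-1)
    token_logp = log_probs.gather(-1, response_ids.unsqueeze(-1)).squeeze(-1)

    # REINFORCE-style objective: higher-reward responses get their likelihood pushed up more.
    loss = -(reward_scores.unsqueeze(-1) * token_logp).mean()

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```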