  • Course

Reinforcement Learning from Human Feedback (RLHF)

In this course, we explore one corner of the expanding AI universe and review some of the basic principles of reinforcement learning from human feedback (RLHF), the technique underlying AI tools such as ChatGPT and Bard.

Beginner
40m
(12)

Created by Jerry Kurata

Last Updated Oct 31, 2023



This course is included in the libraries shown below:

  • AI
  • Data

What you'll learn

Have you ever wondered how tools like ChatGPT and Bard generate great responses to the questions we pose? How can they take a prompt like “Plan a trip to Italy this fall and suggest great things to see” and produce a full itinerary with places to see, the best time to visit, and the sites you shouldn't miss?

In this course, Reinforcement Learning from Human Feedback (RLHF), you’ll learn what goes on behind the scenes when these tools create responses to your prompts.

First, you’ll explore why having all the information available is not enough to create a great response.

Next, you’ll discover how we teach a machine learning model to handle all that data and craft a response that people like.

Finally, you’ll learn that none of it is magic, just some really great engineering by some bright people.

When you’re finished with this course, you’ll have the skills and knowledge of reinforcement learning from human feedback needed to understand how this engineering works and produces its results.


About the author
Jerry Kurata
9 courses, 4.5 author rating, 1,650 ratings

Jerry Kurata is a Solutions Architect at InStep Technologies.
