Course

Context Evaluation and Safety

This course will give you the skills and knowledge to conduct the context evaluation and safety validations needed to build your own LLM-as-a-judge.

Intermediate

41m

Created by Darryl Brown

Last Updated Jan 29, 2026

Get started today

Access this course and other top-rated tech content with one of our business plans.

Start a free team trial

Buy now

Try this course for free

Access this course and other top-rated tech content with one of our individual plans.

Start a free trial

Buy now

This course is included in the libraries shown below:

Course

Context Evaluation and Safety

This course will give you the skills and knowledge to conduct the context evaluation and safety validations needed to build your own LLM-as-a-judge.

Intermediate

41m

Created by Darryl Brown

Last Updated Jan 29, 2026

Get started today

Access this course and other top-rated tech content with one of our business plans.

Start a free team trial

Buy now

Try this course for free

Access this course and other top-rated tech content with one of our individual plans.

Start a free trial

Buy now

This course is included in the libraries shown below:

What you'll learn

Working with large language models (LLMs) does not always yield the best results. In this course, Context Evaluation and Safety, you’ll learn to build metrics-based tests and interpret the differences between hallucination and faithfulness. First, you’ll explore using Python and evaluation libraries like DeepSeek to test your model by evaluating output against ground truths. Next, you’ll discover how to set up a scoring system to evaluate the ratio of the supported claims as a test of fairness. Finally, you’ll learn how to build automated processes to periodically test your model. When you’re finished with this course, you’ll have the skills and knowledge of context evaluation and safety validations needed to build your own LLM-as-a-judge.

Context Evaluation and Safety

Intermediate

41m

Table of contents

About the author

Darryl Brown

4 courses

0.0 author rating

0 ratings

With over 20 years of experience in software design and development, I am a cloud engineering expert and leader at my company, where I manage complex and innovative solutions for the global insurance provider. As an Associate Vice President and Senior Principal Cloud Engineer, I oversee the architecture, design, infrastructure as code, DevSecOps, leadership, and innovation of cloud-based applications and services, using Microsoft Azure as the main platform. I love to share my knowledge, and I am a lifelong learner.

More Courses by Darryl

Context Evaluation and Safety

Context Evaluation and Safety

Get started today

Try this course for free

Context Evaluation and Safety

What you'll learn

Context Evaluation and Safety

Model Output Testing 14m

Evaluation Automation 17m

Context Safety and Governance 9m

2025 Forrester Wave™ names Pluralsight as a Leader among tech skills dev platforms