Featured resource
2026 Tech Forecast
2026 Tech Forecast

Stay ahead of what’s next in tech with predictions from 1,500+ business leaders, insiders, and Pluralsight Authors.

Get these insights
  • Course

Understanding Probabilistic Data Structures with 112,092 UFO Sightings

Like most developers, you probably have no idea what probabilistic data structures are. In fact, I did a super-scientific poll on Twitter and found that of 119 participants, 58% had never heard of them and 22% had heard the term but nothing more.

Beginner
54m
(5)

Created by JavaScript & Friends

Last Updated Jan 26, 2024

Course Thumbnail
  • Course

Understanding Probabilistic Data Structures with 112,092 UFO Sightings

Like most developers, you probably have no idea what probabilistic data structures are. In fact, I did a super-scientific poll on Twitter and found that of 119 participants, 58% had never heard of them and 22% had heard the term but nothing more.

Beginner
54m
(5)

Created by JavaScript & Friends

Last Updated Jan 26, 2024

Get started today

Access this course and other top-rated tech content with one of our business plans.

Try this course for free

Access this course and other top-rated tech content with one of our individual plans.

This course is included in the libraries shown below:

  • Core Tech
What you'll learn

There are three reactions to the title of this talk: 1. What the heck’s a probabilistic data structure? 2. UFO Sightings… wha? 3. 112,092 is an oddly specific number.

This is a talk about the first reaction with the second thrown in just for fun. I like weird stuff—UFOs, Bigfoot, peanut butter and bologna on toast—maybe you do too? As far as the third bullet point, well, that’s how many sightings I have.

Now, if you’re like most developers, you probably have no idea what probabilistic data structures are. In fact, I did a super-scientific poll on Twitter and found that out of 119 participants, 58% had never heard of them and 22% had heard the term but nothing more. I wonder what percentage of that 22% heard the term for the first time in the poll. We’re a literal-minded lot at times.

Anyhow. That’s 4 out of 5 developers or, as I like to call it, the Trident dentist ratio. (It’s actually a manifestation of the Pareto principle but I’m a 70s kid). That’s a lot of folks that need to be educated. So, let’s do that.

A probabilistic data structure is, well, they’re sort of like the TARDIS—bigger on the inside—and JPEG compression—a bit lossy. And, like both, they are fast, accurate enough, and can take you to interesting places of adventure. That last one might not be something a JPEG does.

More technically speaking, most probabilistic data structures use hashes to give you faster and smaller data structures in exchange for precision. If you’ve got a mountain of data to process, this is super useful. In this talk, we’ll briefly go over some common probabilistic data structures; dive deep into a couple (Bloom Filter, MinHash, and Top-K); and show a running application that makes use of Top-K to analyze the most commonly used words in all 112,092 of my UFO sightings.

When we’re done, you’ll be ready to start using some of these structures in your own applications. And, if you use the UFO data, maybe you’ll discover that the truth really is out there.

Understanding Probabilistic Data Structures with 112,092 UFO Sightings
Beginner
54m
(5)
Table of contents

About the author
JavaScript & Friends - Pluralsight course - Understanding Probabilistic Data Structures with 112,092 UFO Sightings
JavaScript & Friends
20 courses 3.3 author rating 115 ratings

The JavaScript and Friends Conference (JS&Friends) is a not-for-profit conference, organized by a team of volunteers.

2025 Forrester Wave™ names Pluralsight as a Leader among tech skills dev platforms

See how our offering and strategy stack up.

forrester wave report