Featured resource
Forrester Wave Report 2025
Pluralsight named a Leader in the Forrester Wave™

Our tech skill development platform earned the highest scores possible across 11 criteria.

Learn more
  • Labs icon Lab
  • Cloud
Labs

Generating Text Using TensorFlow

One fun thing we can do with machine learning is generating text! We can build a model that will predict the next word in a sequence. This combines interesting aspects of both sequence processing and natural language processing. This lab will give you more practice in both of these important areas of machine learning. ### Prerequisites This lab is designed to be completed in PyCharm running on your machine. You should have PyCharm and TensorFlow installed before attempting this lab. We will not be covering this setup in the lab.

Labs

Path Info

Level
Clock icon Advanced
Duration
Clock icon 1h 0m
Published
Clock icon Aug 13, 2021

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.

Table of Contents

  1. Challenge

    Retrieve and Load the Tiny Frankenstein Data

    1. Download the Tiny Frankenstein dataset: https://storage.googleapis.com/acg-datasets/tiny_frankenstein.tgz.
    2. Load the raw text into the program.
    3. Convert the text to lowercase.
  2. Challenge

    Turn the Text Data into Token Sequences

    1. Train a tokenizer on the Frankenstein text using every available word.
    2. Find the total number of learned words.
    3. Convert the Frankenstein text to tokens.
    4. Convert the tokens to sequences.
  3. Challenge

    Build a Model to Predict Tokens

    1. Create a recurrent neural network to learn the token sequences.
      • Hint: Bidirectional LSTMs work really well for this task!
    2. Compile the model with an appropriate loss function and optimizer.
      • Hint: You might want to increase the learning rate of the optimizer. This model can take a very long time to converge with the default learning rate.
    3. Train your model for 10-20 epochs.
  4. Challenge

    Generate Text!

    1. Create a reverse token lookup dictionary from the tokenizer's word_index.
    2. Create tokenized text to start the predictions.
    3. Use the model to predict the next token.
    4. Append the predicted token to the text, truncating earlier values.
    5. Continue using the model to predict as many tokens as you'd like!

Pluralsight Skills gives leaders confidence they have the skills needed to execute technology strategy. Technology teams can benchmark expertise across roles, speed up release cycles and build reliable, secure products. By leveraging our expert content, skill assessments and one-of-a-kind analytics, keep up with the pace of change, put the right people on the right projects and boost productivity. It's the most effective path to developing tech skills at scale.

What's a lab?

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.

Provided environment for hands-on practice

We will provide the credentials and environment necessary for you to practice right within your browser.

Guided walkthrough

Follow along with the author’s guided walkthrough and build something new in your provided environment!

Did you know?

On average, you retain 75% more of your learning if you get time for practice.