Model Confusion: Insights from the Confusion Matrix

Evaluate the model's confusion matrix to identify prediction errors and class imbalances.

Analyze model accuracy and errors using a confusion matrix, highlighting where predictions succeeded and failed. Understand the limitations that caused significant inaccuracies, especially within specific categories of predictions.

Key Insights

  • The confusion matrix shows that the model correctly predicted 8,538 individuals who stayed and 734 individuals who left; these correct predictions sit on the matrix's diagonal.
  • Despite an overall accuracy of about 77%, the model struggled significantly with individuals who actually left, correctly identifying only about 25% of them (734 out of approximately 2,800).
  • Building the confusion matrix with sklearn and labeling it as a pandas DataFrame made the errors easy to read: 2,100 individuals who actually left were incorrectly predicted to stay, leaving substantial room for model improvement.

Note: These materials offer prospective students a preview of how our classes are structured. Students enrolled in this course will receive access to the full set of materials, including video lectures, project-based assignments, and instructor feedback.

Let's take a look at which categories we got right or wrong. We're going to make a confusion matrix, which has one of my favorite names. What a good name.

So a confusion matrix shows where we got confused, essentially. Where our model got confused, rather. We're not confused—we've got this.

But our model got pretty confused. Despite this 77%, we're going to see that there was a specific category that really confused it. So let's make that confusion matrix.

We're going to say cm, which is a standard name for a confusion matrix, and set it equal to confusion_matrix, a function provided by sklearn.metrics. We pass it our actual correct answers from the test data and our predictions.
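Here's a minimal sketch of that call. The tiny label lists are made up so the snippet runs on its own; in the lesson, the test labels and predictions come from the train/test split and the fitted model, and the names y_test and predictions are assumptions.

```python
# A minimal sketch of the confusion_matrix call described above.
# The tiny label lists are placeholders so the snippet runs standalone.
from sklearn.metrics import confusion_matrix

y_test = [0, 0, 1, 1, 0, 1]       # actual answers (0 = stayed, 1 = left)
predictions = [0, 0, 1, 0, 0, 0]  # the model's guesses
cm = confusion_matrix(y_test, predictions)
print(cm)
```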

Now we make a DataFrame from that matrix so we can take a look at it. cm_df is a new pandas DataFrame, and we'll pass it the confusion matrix as the data.


We'll name the columns "Predicted Stayed" and "Predicted Left." And then we'll also pass it the row names, the index: "Actually Stayed" and "Actually Left."
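Putting those pieces together, here's a rough sketch of the labeled DataFrame, continuing from the cm computed in the snippet above:

```python
# Wrap the confusion matrix (cm, from the snippet above) in a pandas
# DataFrame with the column and row labels used in the lesson.
import pandas as pd

cm_df = pd.DataFrame(
    cm,
    columns=["Predicted Stayed", "Predicted Left"],
    index=["Actually Stayed", "Actually Left"],
)
print(cm_df)
```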

Now let's take a look at that DataFrame. All right. We got quite a lot right, but also quite a lot wrong.

Let's take a look. Now, we predicted that 8,538 stayed and they did. That's great.

We predicted that 734 left and they actually left. These upper-left and lower-right entries are where we got things right. Predicted stayed—they stayed.

Predicted left—they left. That's pretty good. If we look at our "Predicted Stayed" column, we got about 80% of them right, maybe a little more.

Out of the 10,000 or so that we predicted stayed, about 80% were right. But when we predicted left, we got slightly more right than wrong, though not by much.

For the people we predicted left, we were somewhere around 55% to 60% correct. I'm not doing the exact math here.

Because guess what? The computer is going to do the math for us. But let's look at it differently, considering how many people actually left: here's the "Actually Left" row.

That's around 2,800 people, and we only correctly identified about 25% of them.

That's not good at all. For that 25%, we predicted they left and they actually left. Great.

But for everyone else in that row, we predicted they stayed and they actually left. That's a huge number.

Of the people who actually left, we got 2,100 wrong and only 700 right. So that's pretty bad. We definitely have some errors here, and there's a way to measure that, which we'll look at next.
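As a quick preview of that idea, here's a rough sketch of letting pandas do the percentage math, assuming the cm_df built above: dividing each row by its row total shows what fraction of each actual group we predicted correctly, and dividing each column by its column total shows how often each prediction was right.

```python
# What fraction of each actual group did we predict correctly?
# (Divide each row of cm_df by that row's total.)
row_percentages = cm_df.div(cm_df.sum(axis=1), axis=0) * 100
print(row_percentages.round(1))

# How often was each prediction correct?
# (Divide each column of cm_df by that column's total.)
col_percentages = cm_df.div(cm_df.sum(axis=0), axis=1) * 100
print(col_percentages.round(1))
```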

Colin Jaffe

Colin Jaffe is a programmer, writer, and teacher with a passion for creative code, customizable computing environments, and simple puns. He loves teaching code, from the fundamentals of algorithmic thinking to the business logic and user flow of application building—he particularly enjoys teaching JavaScript, Python, API design, and front-end frameworks.

Colin has taught code to a diverse group of students since learning to code himself, including young men of color at All-Star Code, elementary school kids at The Coding Space, and marginalized groups at Pursuit. He also works as an instructor for Noble Desktop, where he teaches classes in the Full-Stack Web Development Certificate and the Data Science & AI Certificate.

Colin lives in Brooklyn with his wife, two kids, and many intricate board games.
