The confusion matrix is a tool for measuring the performance of classification models, which helps data scientists analyze and optimize them. Anyone learning machine learning should therefore learn how to use the confusion matrix. This article also introduces four metrics derived from it: accuracy, recall, precision, and F1 score.
Confusion Matrix
A confusion matrix consists of four conditions, arranged by actual and predicted class:

| | Predicted Positive | Predicted Negative |
| --- | --- | --- |
| Actual Positive | True Positive (TP) | False Negative (FN) |
| Actual Negative | False Positive (FP) | True Negative (TN) |

The meaning of each condition is as follows:
- True Positive (TP): It is actually positive and predicted to be positive.
- For example, an object in the picture is a dog, and the model identifies it as a dog.
- False Negative (FN): It is actually positive but predicted to be negative.
- For example, an object in the picture is a dog, but the model identifies that it is not a dog.
- False Positive (FP): It is actually negative but predicted to be positive.
- For example, an object in the picture is not a dog, but the model identifies it as a dog.
- True Negative (TN): It is actually negative and predicted to be negative.
- For example, an object in the picture is not a dog, and the model identifies that it is not a dog.
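To make these conditions concrete, here is a minimal Python sketch that counts all four for a binary classifier. The labels and predictions are made-up illustrative values, where 1 means "dog" and 0 means "not a dog"; in practice, a library function such as scikit-learn's `confusion_matrix` computes the same counts.

```python
# Toy ground-truth labels and model predictions (illustrative values only).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]  # 1 = dog, 0 = not a dog
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]

# Count each of the four conditions by comparing actual and predicted labels.
tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)  # True Positive
fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)  # False Negative
fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)  # False Positive
tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)  # True Negative

print(tp, fn, fp, tn)  # 3 1 1 3
```

These four counts (`tp`, `fn`, `fp`, `tn`) are reused in the metric examples below.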
Accuracy
Accuracy measures the ratio of correct predictions (TP and TN) to all predictions. That is, accuracy measures how often the model is correct overall. So, accuracy answers the question: What is the chance that the model predicts correctly?
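In formula form, Accuracy = (TP + TN) / (TP + TN + FP + FN). Continuing the illustrative sketch above, where TP = 3, FN = 1, FP = 1, and TN = 3:

```python
# Accuracy: correct predictions over all predictions.
accuracy = (tp + tn) / (tp + tn + fp + fn)
print(accuracy)  # (3 + 3) / 8 = 0.75
```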
Recall
Recall measures the ratio of true positives to actual positives. That is, recall measures the rate at which actual positives are correctly identified. Therefore, recall answers the question: When a case is actually positive, how often does the model predict it correctly?
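In formula form, Recall = TP / (TP + FN). With the same illustrative counts:

```python
# Recall: true positives over all actual positives.
recall = tp / (tp + fn)
print(recall)  # 3 / (3 + 1) = 0.75
```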
Precision
Precision measures the ratio of true positives to predicted positives. That is, precision measures the rate at which predicted positives are actually correct. So, precision answers the question: When the model predicts positive, how often is it correct?
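In formula form, Precision = TP / (TP + FP). Again with the illustrative counts:

```python
# Precision: true positives over all predicted positives.
precision = tp / (tp + fp)
print(precision)  # 3 / (3 + 1) = 0.75
```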
F1-Score
F1-score is the harmonic mean of recall and precision, combining the two into a single measure of a model's performance. When a model's recall is high, its precision may be low, and vice versa. A good model balances recall and precision, keeping both as high as possible. Therefore, we need a metric, like the F1-score, that considers recall and precision together.
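In formula form, F1 = 2 × (Precision × Recall) / (Precision + Recall), which simplifies to 2TP / (2TP + FP + FN). Using the recall and precision computed in the illustrative sketch above:

```python
# F1-score: harmonic mean of precision and recall.
f1 = 2 * precision * recall / (precision + recall)
print(f1)  # 2 * 0.75 * 0.75 / (0.75 + 0.75) = 0.75
```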
Conclusion
In addition to the accuracy, recall, precision, and F1 score introduced in this article, many other metrics can be derived from the confusion matrix. With these metrics, we can measure the performance of classification models.