In machine learning, before using data to train a model, we usually need to perform feature scaling. This article introduces several common feature scaling methods.
Feature Scaling and Gradient Descent
When the value ranges of the features differ greatly, gradient descent may run slowly. If we perform feature scaling on each feature so that all features fall within a comparable range, gradient descent converges much faster.
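As a rough illustration, here is a minimal sketch (using a small synthetic dataset with made-up feature ranges, not data from this article) comparing batch gradient descent on raw features against the same model trained on z-score-scaled features:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: feature 1 in [0, 1], feature 2 in [0, 2000].
X = np.column_stack([rng.uniform(0, 1, 100), rng.uniform(0, 2000, 100)])
y = 3 * X[:, 0] + 0.05 * X[:, 1] + rng.normal(0, 0.1, 100)

def gradient_descent_mse(X, y, lr, steps):
    """Run batch gradient descent for linear regression; return the final MSE."""
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(steps):
        err = X @ w + b - y
        w -= lr * X.T @ err / len(y)
        b -= lr * err.mean()
    return np.mean((X @ w + b - y) ** 2)

# Unscaled: the learning rate must be tiny to avoid divergence along the
# large-range feature, so progress on the small-range feature crawls.
print(gradient_descent_mse(X, y, lr=1e-7, steps=1000))

# Scaled: with comparable ranges, the same number of steps at a much
# larger learning rate converges well.
X_scaled = (X - X.mean(axis=0)) / X.std(axis=0)
print(gradient_descent_mse(X_scaled, y, lr=0.1, steps=1000))
```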
Rescaling (Min-Max Normalization)
Rescaling is also called min-max normalization. It subtracts the minimum from each value and divides by the difference between the maximum and the minimum, that is, x' = (x - min) / (max - min). This scales the values into the range [0, 1].
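A minimal NumPy sketch of this transformation (the sample matrix below is made up for illustration); scikit-learn's MinMaxScaler implements the same scaling:

```python
import numpy as np

def min_max_normalize(X):
    """Scale each column of X into [0, 1] via (x - min) / (max - min)."""
    x_min = X.min(axis=0)
    x_max = X.max(axis=0)
    return (X - x_min) / (x_max - x_min)

# Hypothetical example: one feature in [0, 5], another in [0, 2000].
X = np.array([[1.0, 400.0], [3.0, 1200.0], [5.0, 2000.0], [0.0, 0.0]])
print(min_max_normalize(X))  # every entry now lies in [0, 1]
```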
Mean Normalization
Mean normalization subtracts the mean from each value and divides by the difference between the maximum and the minimum, that is, x' = (x - mean) / (max - min). This scales the values into the range [-1, 1].
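A minimal sketch, reusing the same made-up sample matrix as above:

```python
import numpy as np

def mean_normalize(X):
    """Scale each column of X into [-1, 1] via (x - mean) / (max - min)."""
    return (X - X.mean(axis=0)) / (X.max(axis=0) - X.min(axis=0))

X = np.array([[1.0, 400.0], [3.0, 1200.0], [5.0, 2000.0], [0.0, 0.0]])
print(mean_normalize(X))            # entries lie in [-1, 1]
print(mean_normalize(X).mean(axis=0))  # each column now has mean ~0
```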
Standardization (Z-score Normalization)
Standardization is also called z-score normalization. It subtracts the mean from each value and divides by the standard deviation, that is, x' = (x - mean) / std. After standardization, each feature has a mean of 0 and a standard deviation of 1. This method is widely used in many machine learning algorithms.
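A minimal sketch comparing a manual z-score computation against scikit-learn's StandardScaler, which implements this transformation:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

X = np.array([[1.0, 400.0], [3.0, 1200.0], [5.0, 2000.0], [0.0, 0.0]])

# Manual z-score: (x - mean) / std, computed per column.
X_manual = (X - X.mean(axis=0)) / X.std(axis=0)

# StandardScaler computes the same transformation.
X_sklearn = StandardScaler().fit_transform(X)

print(np.allclose(X_manual, X_sklearn))             # True
print(X_manual.mean(axis=0), X_manual.std(axis=0))  # ~[0, 0] and [1, 1]
```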
Conclusion
Although feature scaling is simple, it can significantly speed up gradient descent. Therefore, after collecting the data and before training the model, first check whether the value ranges of the features differ greatly. If they do, remember to apply feature scaling before training.