Try This If Your Linear Regression Model is Underperforming
Change the model or alter the data.
Skewness in the data can hurt the predictive power of a linear regression model.
If your model is underperforming, a skewed variable may be one of the reasons.
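Before changing anything, it can help to measure how skewed a variable actually is. The snippet below is a minimal sketch of one way to do that with NumPy. The helper name sample_skewness and the synthetic data are assumptions made for illustration and are not from the original post.

```python
import numpy as np


def sample_skewness(values):
    """Fisher-Pearson skewness: mean of the cubed standardized deviations.

    Hypothetical helper for illustration. Values near 0 suggest a roughly
    symmetric distribution; large positive values suggest a long right tail.
    """
    values = np.asarray(values, dtype=float)
    centered = values - values.mean()
    std = values.std()  # population standard deviation (ddof=0)
    if std == 0:
        return 0.0  # a constant variable has no skew
    return float(np.mean(centered ** 3) / std ** 3)


# Synthetic data (an assumption): one symmetric and one right-skewed variable.
rng = np.random.default_rng(0)
symmetric = rng.normal(loc=50, scale=5, size=10_000)
right_skewed = rng.lognormal(mean=0.0, sigma=1.0, size=10_000)

print(f"symmetric:    {sample_skewness(symmetric):.2f}")
print(f"right-skewed: {sample_skewness(right_skewed):.2f}")
```

A value close to zero suggests skewness is probably not the main problem, while a large positive or negative value marks the variable as a candidate for one of the fixes discussed here.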
There are two broad ways to handle this:
Try a different model
Transform the skewed variable
The most commonly used transformations are:
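Log transform: Apply the logarithmic function (requires strictly positive values).
Sqrt transform: Apply the square root function (requires non-negative values).
Box-Cox transform: Apply a power transform whose exponent is estimated from the data (requires strictly positive values). SciPy provides an implementation as scipy.stats.boxcox: SciPy docs.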
The effectiveness of the three transformations is clear from the image above.
Applying them transforms the skewed data into a (somewhat) normally distributed variable.
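As a rough illustration of the three transformations, here is a minimal sketch using NumPy and SciPy. The log-normal sample is synthetic and chosen only for demonstration; it is an assumption and not the data behind the image in the original post.

```python
import numpy as np
from scipy import stats

# Synthetic right-skewed, strictly positive data (an assumption for this demo).
rng = np.random.default_rng(42)
skewed = rng.lognormal(mean=0.0, sigma=1.0, size=5_000)

# Log transform: needs strictly positive values.
log_t = np.log(skewed)

# Sqrt transform: needs non-negative values.
sqrt_t = np.sqrt(skewed)

# Box-Cox: needs strictly positive values. With no lambda supplied, SciPy
# estimates it by maximum likelihood and returns it with the transformed data.
boxcox_t, fitted_lambda = stats.boxcox(skewed)

results = {
    "original": skewed,
    "log": log_t,
    "sqrt": sqrt_t,
    "box-cox": boxcox_t,
}
for name, values in results.items():
    print(f"{name:<8} skewness = {stats.skew(values):.2f}")
print(f"fitted Box-Cox lambda = {fitted_lambda:.3f}")
```

Skewness closer to zero indicates a more symmetric distribution. Which transformation works best depends on the data, so it is worth comparing them on your own variable rather than assuming one always wins.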
👉 Over to you: What are some other methods to handle skewness that you are aware of?