This project aims to build an effective machine learning solution for Walmart, one of the leading retail stores in the US, to accurately predict sales and demand.
I applied 5(five) Regression Modelsβ> Multiple Linear Regression (MLR), Ridge Regression, Lasso Regression, Elastic-Net Regression, and Polynomial Regression for predicting Sales and evaluated the model scores for comparison (R2, RMSE, RSS and MSE).
Scope: Historical sales data from 45 Walmart stores across various regions in the US.
Key Factors Influencing Sales:
Holidays & Events: Prominent holidays like Super Bowl, Labor Day, Thanksgiving, and Christmas.
Economic Indicators: CPI (Consumer Price Index), Unemployment Index.
Markdown Events: Seasonal promotional markdowns preceding major holidays.
Holiday Weighting: Weeks including major holidays are weighted five times higher than non-holiday weeks.
The dataset is taken from Kaggle (https://www.kaggle.com/datasets/yasserh/walmart-dataset).