Skip to content

Predicting Late Delivery Risk in Supply Chains using Machine Learning with EDA, feature engineering, and model explainability

License

Notifications You must be signed in to change notification settings

Pegah-Ardehkhani/SupplyChain-LateDeliveryRisk

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

11 Commits
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Supply Chain Late Delivery Risk ๐Ÿ”ดโš ๏ธ license Open In Colab nbviewer

Dataset ๐Ÿ“”

DataCo SMART SUPPLY CHAIN FOR BIG DATA ANALYSIS

About

This project analyzes and predicts late deliveries in a supply chain dataset using machine learning models. The goal is to identify the risk of late delivery for orders based on features like customer, product, shipping, and sales data.

Key components of the project:

  • Exploratory Data Analysis (EDA): Understand the dataset, feature distributions, and relationships.
  • Data Preprocessing: Handle missing values, encode categorical variables, and extract features from dates.
  • Predictive Modeling: Implemented multiple classifiers including Random Forest, Logistic Regression, XGBoost, Decision Tree, and K-Nearest Neighbors to predict Late_delivery_risk.
  • Evaluation: Used metrics such as accuracy, recall, F1-score, confusion matrix, and ROC-AUC for performance comparison.
  • Feature Importance & Explainability: Analyzed feature contributions using Random Forest importance, LIME, SHAP, and counterfactual explanations with DiCE.

This project provides a complete pipeline for risk prediction in supply chain management, helping stakeholders identify potential late deliveries and optimize operations.