Diabetes Prediction using Machine Learning

This project uses machine learning algorithms to predict whether a patient has diabetes from diagnostic measurements. The dataset contains measurements such as glucose concentration, blood pressure, and insulin level for each patient.

The aim of this project is to build several machine learning models (K-Nearest Neighbors, Logistic Regression, Decision Tree, Random Forest, and Support Vector Machine) and compare how accurately they predict the presence of diabetes in patients.

View Demo · Documentation · Report Bug · Request Feature

📶 Dataset

The dataset used in this project is the Pima Indians Diabetes Dataset, which is available on Kaggle. It consists of 768 samples with 8 features plus a target variable indicating the presence of diabetes. It includes the following health criteria:

Pregnancies: Number of times pregnant
Glucose: Plasma glucose concentration at 2 hours in an oral glucose tolerance test
BloodPressure: Diastolic blood pressure (mm Hg)
SkinThickness: Triceps skin fold thickness (mm)
Insulin: 2-Hour serum insulin (mu U/ml)
BMI: Body mass index (weight in kg/(height in m)^2)
DiabetesPedigreeFunction: Diabetes pedigree function
Age: Age (years)
Outcome: Class variable (0 or 1)

Details

Number of Instances: 768
Number of Attributes: 8 plus class
Missing Attribute Values: Yes
Class Distribution: (class value 1 is interpreted as "tested positive for diabetes")
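
The details above note that the dataset has missing attribute values. In commonly distributed copies of the Pima Indians Diabetes Dataset, missing measurements appear as zeros in columns where zero is not physiologically plausible. The sketch below assumes that convention holds for this CSV and that median imputation is acceptable; the column list and the helper name `impute_zero_placeholders` are illustrative, and the notebook may handle missing values differently.

```python
import numpy as np
import pandas as pd

# Assumption: these columns use 0 as a placeholder for "not measured".
ZERO_AS_MISSING = ["Glucose", "BloodPressure", "SkinThickness", "Insulin", "BMI"]


def impute_zero_placeholders(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy in which placeholder zeros are replaced by the column median."""
    cleaned = df.copy()
    cols = [c for c in ZERO_AS_MISSING if c in cleaned.columns]
    # Mark zeros as missing so they do not distort the median.
    cleaned[cols] = cleaned[cols].replace(0, np.nan)
    # Fill each column with the median of its observed values.
    cleaned[cols] = cleaned[cols].fillna(cleaned[cols].median())
    return cleaned
```

Columns such as Pregnancies are left untouched, since zero is a valid value there.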

🧰 Dependencies

Python 3.8.3

pandas

numpy

scikit-learn (imported as sklearn)

matplotlib

seaborn

pickle (part of the Python standard library)

⚙️ Installation

Clone the repository using the following commands, then install the dependencies listed above (for example, with pip):

git clone https://github.com/Priyanshu88/Diabetes-Prediction.git

cd Diabetes-Prediction

⏯️ Usage

The Jupyter notebook Diabetes Prediction.ipynb contains the code for loading and preprocessing the dataset, as well as implementing and evaluating the KNN, Logistic Regression, Random Forest, Support Vector Machine and Decision Tree models. To run the notebook, simply open it in Jupyter and run each cell in order.
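
As a rough illustration of the workflow the notebook covers, the sketch below fits the five model types on a single train/test split and reports test accuracy for each. It is not the notebook's code: the function name `compare_models`, the 80/20 split, the feature scaling, and all hyperparameters are assumptions chosen for the example.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier


def compare_models(X, y, test_size=0.2, random_state=0):
    """Fit five classifiers on one train/test split; return test accuracy per model."""
    # Assumption: a stratified 80/20 split; the notebook's split may differ.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=random_state, stratify=y
    )
    # Distance- and margin-based models are scaled; tree-based models are not.
    models = {
        "K-Nearest Neighbors": make_pipeline(StandardScaler(), KNeighborsClassifier()),
        "Logistic Regression": make_pipeline(
            StandardScaler(), LogisticRegression(max_iter=1000)
        ),
        "Random Forest": RandomForestClassifier(random_state=random_state),
        "Support Vector Machine": make_pipeline(StandardScaler(), SVC()),
        "Decision Tree": DecisionTreeClassifier(random_state=random_state),
    }
    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        # score() returns mean accuracy on the held-out test set.
        scores[name] = model.score(X_test, y_test)
    return scores
```

With the dataset loaded into a pandas DataFrame `df`, a call might look like `compare_models(df.drop(columns="Outcome"), df["Outcome"])`. Accuracies from this sketch will not necessarily match the notebook's, since the split and settings are assumed.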

🚀 Results

Model	Accuracy
K-Nearest Neighbors	79.22%
Logistic Regression	81.82%
Random Forest	79.22%
Support Vector Machine	83.12%
Decision Tree	81.82%
Logistic Regression with hyperparameter tuning (GridSearchCV)	83.12%
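
The last row of the table comes from tuning Logistic Regression with GridSearchCV. The search space used in the notebook is not given here, so the sketch below is only one plausible setup: the function name `tune_logistic_regression`, the grid over the regularization strength `C`, the scaling step, and the 5-fold cross-validation are all assumptions.

```python
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler


def tune_logistic_regression(X_train, y_train, cv=5):
    """Grid-search the regularization strength of a scaled logistic regression."""
    pipeline = Pipeline([
        ("scale", StandardScaler()),
        ("clf", LogisticRegression(max_iter=1000)),
    ])
    # Hypothetical grid: smaller C means stronger regularization.
    param_grid = {"clf__C": [0.01, 0.1, 1.0, 10.0, 100.0]}
    search = GridSearchCV(pipeline, param_grid, cv=cv, scoring="accuracy")
    # Each candidate is evaluated by cross-validation on the training data only.
    search.fit(X_train, y_train)
    return search
```

After fitting, `search.best_params_` holds the selected setting and `search.best_estimator_` is refit on the full training data, so it can be scored on the held-out test set.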

🚧 Conclusion

In this project, we compared the accuracy of five machine learning models, and of Logistic Regression with tuned hyperparameters, for predicting diabetes from diagnostic measurements. Support Vector Machine was the most accurate untuned model at 83.12%. Logistic Regression and Decision Tree also performed well at 81.82% each, and hyperparameter tuning with GridSearchCV raised Logistic Regression's accuracy by about 1.3 percentage points to 83.12%, matching the Support Vector Machine. The project could be improved further by testing additional models or by including additional health criteria in the dataset.

🚩 Deployment

Check out the deployment repository here.

⚖️ License

This project is licensed under the MIT License - see the LICENSE file for details.

🤝 Contact

Your Name - @twitter_handle - 2040020@sliet.ac.in

Project Link: https://github.com/Priyanshu88/Diabetes-Prediction.git

Don't forget to leave a star ⭐️
