Added HIV prediction using Machine Learning #334
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Resolves #320
This project aims to develop a web application for predicting HIV disease status using a machine learning model. The application leverages the powerful XGBoost algorithm, which has been pre-trained to analyze a variety of health-related and behavioral inputs to determine the likelihood of an individual being HIV positive. The inputs include factors such as age, marital status, history of sexually transmitted diseases (STDs), educational background, HIV testing history, AIDS education, places of seeking sex partners, sexual orientation, drug-taking habits, and awareness before marriage.
The application is built using Streamlit, a popular framework for creating interactive web applications in Python. The user interface is designed to be both functional and visually appealing, featuring stylish fonts, custom CSS for enhanced aesthetics, and relevant images to provide an engaging experience. Users can input their data through sliders and select boxes, ensuring an intuitive and user-friendly interaction.
One of the standout features of the application is the dynamic and responsive design. The app includes a background image related to HIV/AIDS awareness, adding context and visual depth. Additionally, a moving HIV ribbon image is positioned in the corner, symbolizing the cause and drawing attention to the importance of awareness and education. The input and output sections are enclosed in stylish containers with shadows and rounded borders, enhancing readability and focus.
The model's prediction process involves label encoding categorical variables to convert them into numerical format suitable for the model. Upon clicking the 'Predict' button, the app processes the input data, applies the XGBoost model, and displays the prediction result. If the model predicts a positive HIV status, a highlighted error message is shown; otherwise, a success message indicates a negative result.
This project not only demonstrates the practical application of machine learning in healthcare but also highlights the importance of accessible tools for early detection and awareness. By providing a seamless and informative user experience, the application aims to contribute to the ongoing efforts in HIV prevention and education. The integration of relevant imagery and user-friendly design principles ensures that the tool is not only functional but also engaging and supportive of the overall mission to combat HIV/AIDS.
Algorithms used :
XGBOOST , Random Forest , Logistic Regression , Gradient Boosting Machine
(also performed entire data analysis ,EDA ,feature engineering , model comparison and evaluation)
deployed with the help of Streamlit application as follows :
webapp.mp4