Team_AI1_8

Titanic Survival Prediction using Logistic Regression

πŸ‘₯ Team Members


🎯 Problem Statement

The objective of this project is to predict whether a passenger survived the Titanic disaster or not using machine learning techniques. This is a binary classification problem where the output is either:


πŸ“Š Dataset Description

We used the Titanic dataset from Kaggle, which contains information about passengers such as:

The dataset is divided into:


🧹 Data Preprocessing

Before training the model, we cleaned and prepared the data:


βš™οΈ Feature Engineering

We created new features to improve model performance:


πŸ”’ Data Encoding

Categorical variables like gender and embarked location were converted into numerical values using:

This is required because machine learning models only understand numerical data.


βœ‚οΈ Train-Test Split

The dataset was divided into:

This helps evaluate how well the model performs on unseen data.


πŸ“ Feature Scaling

We applied StandardScaler to normalize the data so that all features have similar scale, which improves model performance.


πŸ€– Model Used

We used Logistic Regression, which is suitable for classification problems.

Reasons:


πŸ“Š Model Training & Validation


πŸ“ˆ Model Evaluation

We evaluated the model using:


πŸ“‰ Results

πŸ‘‰ This indicates the model performs well and generalizes properly.


πŸ”— GitHub Collaboration


🧠 Conclusion

This project demonstrates how machine learning can be used to solve real-world classification problems. By applying preprocessing, feature engineering, and Logistic Regression, we successfully built a model that predicts Titanic passenger survival with good accuracy.


πŸš€ Future Improvements