Hello, I'm Mazen Elsafty

Data Scientist | Data Analyst | Supervised ML Engineer

This is my portfolio.

My Projects

Ames House Price Prediction

Data Science And ML

Charity Donor Predictor

Data Science And ML

My Resume

Education

September 2021 - October 2025

Bachelor's Degree in Information Technology

Pursuing a Bachelor's Degree in Information Technology, specializing in Information Technology and Data Management.

GPA: 3.69

Egyptian e-Learning University (EELU)

Experience

April 2024 - December 2024

IBM Data Scientist, Soft Skills, English

The Digital Egypt Pioneers Initiative (DEPI), launched by the Egyptian Ministry of Communications and Information Technology, aims to cultivate a skilled workforce in digital technology and data science through advanced training programs and certifications.

  • IBM Data Science Grant Program:
    • Completed a comprehensive grant-supported program by IBM under DEPI.
    • Focused on mastering tools and techniques for data analysis, machine learning, data visualization, data cleaning, exploratory data analysis (EDA), statistics, web scraping, Python programming, and SQL.
  • Hands-on Tools and Libraries:
    • Gained proficiency in Anaconda and Jupyter Notebook environments.
    • Worked extensively with libraries such as Pandas, NumPy, Scikit-learn, Beautiful Soup, Requests, and SQL Magic.
    • Applied machine learning techniques, including supervised and unsupervised models, cross-validation, grid search, and model tuning.
  • Real-World Project Experience:
    • Ames House Price Prediction: Built predictive models to estimate house prices based on key features.
    • Finding Donors Predictor: Developed models to identify potential donors for charity organizations.
    • Chicago Web Scraping Project: Scraped real estate data from Chicago listings and created a structured dataset.
    • Medical Representative Analysis: Analyzed performance metrics and trends for medical representatives.
    • SMS Spam Classifier: Created a machine learning model to classify SMS messages as spam or legitimate.
  • Technologies and Techniques:
    • Employed advanced data preprocessing, cleaning, and feature engineering techniques.
    • Utilized machine learning pipelines to streamline workflows.
    • Worked on model evaluation and tuning to optimize performance.
  • Professional Development:
    • Received specialized training in freelancing, soft skills development, and English proficiency to support career growth and communication skills.
  • Through this initiative, I have built a robust foundation in data science and honed my ability to apply theoretical concepts to solve practical problems effectively. My work exemplifies a commitment to leveraging data-driven insights to address real-world challenges.

DEPI
August 2024 - September 2024

Data Scientist

CodeAlpha is a leading technology and software development company specializing in innovative solutions in data science, machine learning, and software engineering.

  • Machine Learning Models:
    • Developed and implemented a linear regression model to predict housing prices, working on feature engineering, data scaling, and model evaluation to enhance prediction accuracy.
  • A/B Testing Analysis:
    • Analyzed and compared product performance in an A/B testing scenario using Bayesian analysis and Chi-Square tests.
    • Delivered actionable insights that guided the marketing team in optimizing strategies for better customer engagement.
  • Stock Price Prediction:
    • Built and trained a Long Short-Term Memory (LSTM) model to predict Microsoft stock prices.
    • Utilized temporal dependencies in stock market data to improve prediction accuracy.
    • Employed time-series data processing techniques, such as sequence generation, lag features, and rolling statistics.
  • Data Preparation and EDA:
    • Collaborated in data collection and preprocessing, including handling missing values, outlier treatment, and feature selection.
    • Conducted exploratory data analysis (EDA) to identify patterns and trends, ensuring data quality and insight generation.
  • Technologies and Tools:
    • Programming Languages: Python.
    • Libraries: Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, TensorFlow, and Keras.
    • Data Analysis Techniques: Bayesian analysis, Chi-Square tests, time-series analysis, and data preprocessing.
    • Development Environments: Jupyter Notebook and Anaconda.
    • Machine Learning Techniques: Regression models, deep learning (LSTM), cross-validation, and hyperparameter tuning.
  • At CodeAlpha, I worked on diverse projects, leveraging cutting-edge tools and techniques to deliver impactful solutions. This experience significantly enhanced my technical expertise and my ability to contribute effectively to collaborative, data-driven environments.
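
The time-series preparation steps listed above (lag features, rolling statistics, and sequence generation) can be sketched roughly as follows. This is a minimal illustration in plain Python, not the code from the actual project: the function names are hypothetical and the prices are made-up numbers.

```python
from statistics import mean


def lag_feature(series, lag):
    """Shift the series by `lag` steps; the first `lag` entries have no history."""
    if lag < 1:
        raise ValueError("lag must be at least 1")
    return [None] * lag + list(series[:-lag])


def rolling_mean(series, window):
    """Mean of the last `window` values at each step; None until a full window exists."""
    return [
        None if i + 1 < window else mean(series[i + 1 - window : i + 1])
        for i in range(len(series))
    ]


def make_sequences(series, length):
    """Slide a window of `length` values; the value right after each window is the target."""
    inputs, targets = [], []
    for start in range(len(series) - length):
        inputs.append(list(series[start : start + length]))
        targets.append(series[start + length])
    return inputs, targets


# Made-up closing prices, for illustration only.
prices = [10.0, 11.0, 12.0, 13.0, 14.0]

lag_1 = lag_feature(prices, 1)    # yesterday's price as a feature
roll_3 = rolling_mean(prices, 3)  # 3-step moving average
X, y = make_sequences(prices, 3)  # windows of 3 prices and the next price
```

Each window in `X` paired with its target in `y` is the kind of input/output pair a sequence model such as an LSTM is trained on; in practice the values would usually be scaled first.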

CodeAlpha
August 2024 - September 2024

Data Scientist

Bharat Intern is a premier internship platform that bridges the gap between education and industry. It provides students and young professionals with meaningful opportunities in diverse fields such as technology, data science, marketing, and more.

  • Titanic Survival Prediction Project:
    • Developed machine learning models, including Logistic Regression, Decision Trees, and Random Forests, to predict passenger survival on the Titanic.
    • Conducted data preprocessing, feature engineering, and model evaluation to improve predictive accuracy.
  • CNN-Based Image Classification Project:
    • Built a Convolutional Neural Network (CNN) to classify images of cats and dogs, gaining hands-on experience with deep learning.
    • Utilized frameworks such as TensorFlow and Keras, focusing on model architecture, optimization, and validation.
  • Skill Development:
    • Strengthened expertise in data preprocessing, including handling missing values, normalization, and data augmentation for image data.
    • Gained experience in model training and fine-tuning, and in working collaboratively to deliver high-quality outcomes.
  • Technologies and Tools:
    • Programming Languages: Python.
    • Libraries: Pandas, NumPy, Matplotlib, Scikit-learn, TensorFlow, and Keras.
    • Machine Learning Techniques: Supervised learning, deep learning, and neural network optimization.
    • Development Environments: Jupyter Notebook and Anaconda.
  • At Bharat Intern, I had the opportunity to work on impactful projects, enhancing my technical expertise and teamwork abilities in a dynamic, learning-oriented environment.

BharatIntern
August 2024 - September 2024

Data Analyst

NeuronetiX is an innovative technology company specializing in data analytics and artificial intelligence solutions. The company leverages cutting-edge technology to transform data into actionable insights, driving decision-making across diverse industries.

  • Data Analysis and Insights:
    • Performed comprehensive data cleaning, preprocessing, and exploratory analysis to identify patterns and trends in large datasets.
    • Applied statistical techniques to derive actionable insights and inform strategic decision-making.
  • Data Visualization and Reporting:
    • Designed interactive dashboards and reports using Power BI and Excel, enabling stakeholders to explore key metrics effectively.
    • Created compelling visualizations to communicate complex data trends and findings clearly.
  • Technical Expertise:
    • Utilized Python libraries such as Pandas, NumPy, and Matplotlib for data manipulation and visualization.
    • Automated repetitive data tasks to improve efficiency and accuracy in data workflows.
  • Technologies and Tools:
    • Programming Languages: Python.
    • Libraries: Pandas, NumPy, Matplotlib, and Seaborn.
    • Data Visualization Tools: Power BI and Excel.
    • Techniques: Data cleaning, statistical analysis, and exploratory data visualization.
  • At NeuronetiX, I honed my analytical and technical skills, contributing to data-driven projects that empowered businesses to make informed decisions. This experience deepened my understanding of real-world data challenges and solutions.

NeuronetiX

About Me

Data Scientist, Data Analyst and Supervised Machine Learning Engineer.

My name is Mazen Elsafty, and I am a data enthusiast with a Bachelor’s degree in Computer Science, specializing in Information Technology. My academic background has provided me with a robust technical foundation, but my journey into the world of data ignited a deeper passion for its transformative potential. I am fascinated by how data can drive innovation, optimize business operations, and influence strategic decision-making on a global scale.

To cultivate this passion, I have dedicated myself to mastering data science, data analysis, and machine learning, supported by a strong understanding of statistics, probability, and mathematics. Over the course of my learning, I have developed a data-driven mindset, enabling me to extract valuable insights and translate them into actionable solutions.

My experience includes working on diverse projects, completing internships with reputable companies, and applying my knowledge to real-world challenges. These opportunities have honed my technical expertise and deepened my understanding of the practical applications of data in solving complex problems. I am committed to leveraging data to create meaningful impact, continuously expanding my skills, and contributing to innovative solutions that drive progress in businesses and beyond.

My Services

Data Science and Machine Learning Services

Unlock the potential of your data with our expertise in data science and machine learning. We offer:

  • Exploratory Data Analysis: Identify trends and insights through advanced visualization and statistical analysis.
  • Data Cleaning and Preprocessing: Professionally handle missing values, outliers, feature engineering, and scaling to prepare your data for analysis.
  • Custom Model Development: Build tailored machine learning models, including Decision Trees, Random Forest, SVM, and KNN, tuned with GridSearchCV and cross-validation.
  • Data Analysis: Dive deep into your data to uncover patterns and insights, aiding in strategic decision-making.
  • Model Optimization: Fine-tune models for accuracy and reliability, utilizing ensemble methods and clustering techniques.
  • Deployment Support: Assist with implementation and maintenance in production environments using MLflow, Flask, and cloud services.
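
As a rough sketch of what a cross-validated grid search over KNN does under the hood, the example below scores each candidate number of neighbours on held-out folds and keeps the best one. It is written in plain Python rather than with scikit-learn's GridSearchCV, and the function names and toy data are hypothetical, for illustration only.

```python
from collections import Counter


def knn_predict(train_X, train_y, point, k):
    """Classify `point` by majority vote among its k nearest training points."""
    by_distance = sorted(
        zip(train_X, train_y),
        key=lambda pair: sum((a - b) ** 2 for a, b in zip(pair[0], point)),
    )
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


def cross_val_accuracy(X, y, k, n_folds):
    """Mean accuracy of k-NN over n_folds interleaved folds."""
    scores = []
    for fold in range(n_folds):
        test_idx = [i for i in range(len(X)) if i % n_folds == fold]
        train_idx = [i for i in range(len(X)) if i % n_folds != fold]
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        correct = sum(
            knn_predict(train_X, train_y, X[i], k) == y[i] for i in test_idx
        )
        scores.append(correct / len(test_idx))
    return sum(scores) / n_folds


def grid_search(X, y, candidate_ks, n_folds):
    """Score every candidate k with cross-validation; return the best k and all scores."""
    results = {k: cross_val_accuracy(X, y, k, n_folds) for k in candidate_ks}
    best_k = max(results, key=results.get)  # on ties, the first candidate wins
    return best_k, results


# Toy two-cluster data, for illustration only.
X = [[0.0, 0.0], [5.0, 5.0], [0.0, 1.0], [5.0, 6.0],
     [1.0, 0.0], [6.0, 5.0], [1.0, 1.0], [6.0, 6.0]]
y = ["a", "b", "a", "b", "a", "b", "a", "b"]

best_k, results = grid_search(X, y, candidate_ks=[1, 3, 5], n_folds=4)
```

On real data the same idea would normally be run through a library implementation, which also handles shuffling, stratified folds, and scoring options.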

Web Scraping

Enhance your data collection with our Web Scraping Service. We specialize in:

  • Custom Web Scraping: Tailored solutions to extract data from any website according to your requirements.
  • Data Cleaning: Ensure scraped data is organized and ready for analysis.
  • Data Storage: Options for storing scraped data in various formats such as CSV or databases.
  • Scalability: Handle large volumes of data across multiple websites efficiently.
  • Support and Maintenance: Ongoing support to ensure scrapers are running smoothly.

Posts on LinkedIn

Ames Insights

I used a Decision Tree model to explore the Ames Housing dataset and predict house prices from key features.

Charity Donors Insights

I built a machine learning model to help charities identify potential donors by predicting whether individuals earn more than $50K annually.

California Insights

Oceanfront houses are highly desirable and command higher prices, often owned by wealthier individuals seeking prime locations.

Get In Touch

Freelance Platforms

Upwork

My Contact Details

  • Email: mazensafty2003@gmail.com
  • Phone: +201016203122
  • Address: Elmalik Street, Montazah, Alexandria, Egypt