Hesham Hatem
Data Engineer & AI Enthusiast
Computer Science & AI student passionate about data engineering, machine learning, and building scalable data pipelines with modern big data technologies.
About Me
I'm a passionate Computer Science and AI student at Menoufia National University with strong skills in programming, data structures, and problem-solving. Currently maintaining a GPA of (Very Good), I'm dedicated to applying my technical knowledge in real-world scenarios and expanding my expertise in emerging technologies.
My journey in tech has led me through valuable internships at the National Telecommunication Institute (NTI) as a Data Analysis Intern and the Information Technology Institute (ITI) as an AI & Machine Learning Trainee. These experiences have strengthened my skills in data analysis, machine learning, and big data technologies.
I'm particularly interested in data engineering, building scalable ETL pipelines, and leveraging technologies like Apache Spark, Kafka, and Hadoop to process and analyze large datasets. I'm always eager to learn new technologies and take on challenging projects that push the boundaries of what's possible with data.
Experience
Data Analysis Intern
National Telecommunication Institute (NTI)
- •Cleaned and analyzed 10,000+ marketing records using Excel/Power Query, improving data accuracy and preparation efficiency
- •Developed interactive Power BI dashboards to visualize key insights, enhancing decision-making
AI & Machine Learning Trainee
Information Technology Institute (ITI)
- •Learned and applied supervised learning (classification) and unsupervised learning (clustering) techniques to solve real-world problems
- •Studied deep learning fundamentals (CNN, RNN, ANN) with hands-on experience in TensorFlow
Projects
Flink, Kafka & HDFS Data Pipeline
Built a real-time data pipeline that reads from Kafka topics, processes data with Apache Flink, and writes output to HDFS. Orchestrated using Docker Compose with simplified HDFS connectivity configuration.
Airflow COVID-19 Data Pipeline
Automated ETL pipeline for COVID-19 case data cleansing and modeling using Apache Airflow. Ingests raw data, cleanses and preprocesses datasets, and transforms data into structured format for analysis.
Bank Marketing Analysis
Analyzed 50,000+ records of bank marketing data, improving data quality and built predictive model for customer response prediction using machine learning techniques.
ETL for Nested JSON
Designed and implemented ETL pipeline processing 1M+ nested JSON records using PySpark, reducing processing time significantly and simplifying analysis workflows.
Google Play Store Analysis
Analyzed 1M+ app records from Google Play Store, identifying that 70% of downloads came from free apps, providing actionable insights for marketing strategies.
Technical Skills
Programming Languages
Databases
Tools & Platforms
Operating Systems
Education & Activities
Bachelor of Computer Science & Artificial Intelligence
Menoufia National University
Certifications
- •Google Data Analytics: Foundations: Data, Data, Everywhere
- •Google Data Analytics: Ask Questions to Make Data-Driven Decisions
Activities
Junior Student Activities
Organized sessions and welcomed new students, helping them transition into their first year.
Enactus Menoufia University (HR Member)
Conducted evaluations and supported team development initiatives to enhance performance and collaboration.
Get In Touch
I'm currently seeking internship opportunities to apply my technical knowledge in real-world scenarios. Feel free to reach out if you'd like to connect!
© 2025 Hesham Hatem. Built with Next.js and Tailwind CSS.