🎓 EDUCATION
EPITA - École d'Ingénieurs en Informatique (Le Kremlin-Bicêtre, France)
Feb 2023 - Feb 2024
MSc in Computer Science: Data Science & Analytics
- Machine Learning & Statistical Analysis, Neural Networks & Deep Learning, Natural Language Processing, Computer Vision
- Data Science in Production, Relational Databases, NoSQL Databases, Data Visualization, Predictive Analytics & Data Mining
- Advanced Algorithms, OOA-UML-Java, Big Data & Cloud Computing, Network Protocols & Architecture, UNIX OS
University of Technology - Vietnam National University (Ho Chi Minh City, Vietnam)
Sep 2015 - May 2019
Bachelor of Aerospace Engineering - Classification: Good
Le Hong Phong High School for the Gifted (Ho Chi Minh City, Vietnam)
Sep 2012 - May 2015
Major in Mathematics - Classification: Excellent
🏢 WORK EXPERIENCE
AREKA CONSULTING
Feb 2024 - presentData Engineer
- Data Engineering
- Built and maintained pipelines in Azure cloud to consolidate traveling data for multiple clients.
- Improved processes in cleaning hotel, flight routes, data validation using Machine Learning techniques.
- Developed ETL pipeline for a new client in GCP BigQuery and Looker Studio by integrating with reference databases in Azure.
- Business Intelligence
- Integrated CSS and JavaScript into Spotfire dashboards to show User guides in video format.
- Managed KPIs in dashboards, analyzed ad-hoc BI problems raised by consultants and clients.
- R&D Data Projects
- AREKA Carbon Track: Conducted research to calculate aircraft CO2 emission by AREKA method, collected flight data from the industry using APIs and scraping, and developed a complete pipeline to perform emission prediction for each flight.
- Air Fare Collection: Built a pipeline to collect all flight prices by Web scraping and stored in Blob Storage.
FPT SOFTWARE - Skywise Data Capture Team, AIRBUS
Jun 2022 - Dec 2023Data Engineer | Software Developer
- Data Engineering
- Built and maintained data pipelines for airlines, improved various data products.
- Analyzed data problems to solve ad-hoc issues, optimized code performance.
- Deployed new architecture for airline time series pipelines.
- Web Software Development
- Developed front-end Airbus Skywise Store for the New Offer project to customers.
- Built new Foundry application for back-end sensor time series processing.
- Fixed and improved other web apps according to internal and airlines requirements
- Scaled Agile Framework
- Presented solutions to business stakeholders in Europe.
- Collaborated with other outsourcing teams from France, India, and Vietnam.
- Worked and organized Agile meetings with the technical team.
HUU TOAN GROUP
Sep 2021 - May 2022Data & IT Engineer
- Data Processing & Analysis
- Built integrating tools in different systems to help users access data fast and easily.
- Cleaned customers & products raw data to gain insights in industrial market.
- Visualized dashboards and built dashboard pipelines in Power BI.
- ERP system (Dynamics 365): master-data management & module implementation
- Created automation tools for importing and editing master-data information.
- Maintained & standardized name rules for all types of company products used by all departments.
- Implemented new module by ERP & Power Automate to help company reduce workflows and paper tasks.
- IT & network administration
- Built and found solution for new office IT system including network server, domain cotroller, Teams & SharePoint, UPS, IP cameras & phones.
- Migrated company data from on-premises file server to Microsoft Office 365 ecosystem.
BAA TRAINING VIETNAM
Oct 2019 - Dec 2021Flight Simulator Engineer
- Operated and maintained flight simulator systems including electrical, mechanical, and computer systems (Windows Server, Linux, simulation software).
- Designed and improved the systems and process according to customers and manufacturers requirements.
- Worked with partners and flight instructors, managed company's data systems and infrastructures.
⚙️ TECHNICAL SKILLS
- Programming: Python, SQL, JavaScript, HTML, CSS, Java
- Python: PySpark, Scikit-learn, TensorFlow, Streamlit, SciPy, NumPy, Pandas, Seaborn, Matplotlib, Selenium, BeautifulSoup
- Cloud: Foundry (Palantir), Azure, AWS, GCP
- Tools: Git, Docker, Conda, Airflow, FastAPI, PostgreSQL, MongoDB, Grafana, Orange, Dynamics 365, Power Automate
- Business Intelligence: TIBCO Spotfire, Power BI, Looker Studio
- Machine Learning: Linear Regression, Logistic Regression, SVM, XGBoost, Random Forest, Decision Tree, K-means, PCA, SVD
- Deep Learning: Convolution Neural Networks, Recurrent Neural Networks, Attention, Transformers, Transfer Learning
📂 CERTIFICATES
- Data Scientist Nanodegree - Udacity
- Data Engineering Nanodegree - Udacity
- Palantir Foundry Data Engineer Associate - Palantir
- Microsoft Certified: Azure Data Engineer Associate - Microsoft
- Microsoft Certified: Azure AI Fundamentals - Microsoft
- Microsoft Certified: Azure Data Fundamentals - Microsoft
- Python Developer for AI Course - VTC Academy, Vietnam
- Databases and SQL for Data Science with Python - IBM, Coursera
- Python for Data Science, AI & Development - IBM, Coursera
- Simulator Operation & Maintenance Training Certificate - SIM International, Netherlands
- Simulator Training Certificate - CAE, Canada
🌏 LANGUAGES
- English: Professional working proficiency
- French: Limited working proficiency (niveau B1)
- Vietnamese: Native language