About Me

Education

  • Master in Master's Programme in Machine Learning, Data Science and Artificial Intelligence at Aalto University
    GPA: 4.5
    Year of graduation: 2027
  • Bachelor in Data Science at Aalto University
    GPA: 4.72
    Year of graduation: 2025
  • Standard SAT
    English: 790
    Mathematics: 790

Experience

  • Freelance Game Developer at HypeHype Inc.
    Duration: Sep 2023 - Present
  • Intern at HypeHype Inc.
    Duration: Jun 2023 - Sep 2023
  • Game Developer at Dyscordion Entertainment
    Duration: Sep 2022 - Dec 2023
  • Study Coordinator and Treasurer for DataGuild
    Duration: Jan 2023 - Dec 2024

Skills

  • Programming
    C#, C, Python, R, JavaScript, Scala, SQL, HTML
    Version control with Github and Gitlab
  • Languages
    English (Full working proficientcy)
    Vietnamese (Native)
  • Soft skills
    Communication, Teamwork, Creativity and Taking initiative and responsibility

Projects

Live Data Lab: Finnish Energy Grid

Event-driven AWS pipeline ingesting real-time energy and weather data from Fingrid and Meteo APIs into partitioned Parquet storage on S3. A Dockerised Lambda runs ARIMA forecasting on each ingestion, predicting 2 hours ahead for 5 energy sources. Fully automated with a GitHub Actions CI/CD pipeline covering unit tests, Docker builds and Lambda deployments. Visualised on a live JavaScript/Chart.js dashboard via a REST API backed by Amazon Athena. Download the PowerBI Full Grid Analytics Report (PDF).

Economic Forecasting With Machine Learning Models

Finnish regional econimic indicators were predicted using a variety of statistical models. The research project was done in collaboration with the OP Group as part of the Aalto University's Data Science Project course.

Cookie Cats A/B Test For Player Retention Analysis

A full A/B test analysis on 90,000+ mobile game players evaluating whether moving a progression gate improved retention using chi-squared testing, bootstrap resampling, and player segmentation.

Multilingual Toxicity Detection

Benchmarked five machine learning and deep learning approaches from Logistic Regression and LinearSVC to CNN and transformer models (mBERT, XLM-RoBERTa) for cross-lingual toxicity classification across English, German and Finnish text. Evaluated the tradeoff between computational efficiency and classification performance in a low-resource multilingual setting.

Student Stress Factors

Multivariate analysis was done in order to determine the relationship between Engineering student's level of stress and various life style and social factors using Multiple Correspondence Analysis.

Dried Beans Classification Using Machine Learning

Machine learning classification methods have been trained on a dataset of dried bean data in order to determine which species a dried bean is given its physical characteristics.

Predicting Car Insurance Premiums Using Bayesian Modelling

Baysian data analysis was done on a set of real world insurance data in order to model a vehicles insurance premium with several relevant factors such as vehicle type, number of drivers, number of previous complaints etc.

PDFStudio: In-Browser PDF Editor

A lightweight PDF editor that runs entirely in the browser. Supports deleting, cropping, merging, and rearranging pages, all client-side using PDF.js and Canvas APIs.

Contact Me


Download CV