Marco Lanfranchi

Fourth-year Data Science student at SFU with experience in databases, software development, and data analysis.

📍 Burnaby, BC | GitHub | LinkedIn

Experience

Work

Database Engineer Intern, Samsung R&D Canada

📅 Jan 2025 - Aug 2025

Worked on the Cloud Engineering team.

Data Analyst Intern, Nettwerk Music Group

📅 Sept 2023 - May 2024

Developed dashboards and data pipelines, and applied data science techniques to analyze global streaming data for hundreds of artists, as part of the Analytics team at an independent music label.

Education

BSc in Data Science, Simon Fraser University

📅 Sept 2022 - Apr 2026

Open Source Contributions

Streamlit

📅 Oct 2025

Contributed to Streamlit, a popular open-source framework for building data and ML apps, by implementing tooltip support for st.badge().

Other Experience

Volunteer Jr. Data Scientist, Industrio AI

📅 Jan 2023 - Apr 2023

Helped develop data applications for business clients, building interactive visualizations with Python and JavaScript.

Research Associate, Dr. Matt Lowe, UBC School of Economics

📅 Jan 2021 - Aug 2021

Collected data as an undergraduate research associate for one of Dr. Matt Lowe’s research studies in behavioral economics.

Projects

lisa (Labeled Identification of Speech Audio)

ML model for speaker identification from audio clips. Pipeline includes data processing, audio cleaning, feature extraction, and model training. Also developed a demo interface that takes live audio input and identifies speakers.

spotify-history

Automated background service that archives your Spotify listening history in a SQLite database and emails daily summaries.

iammusic-template

Web app that lets users create custom versions of the 'I AM MUSIC' album cover. Reached over 250k visitors in one month and processed 500k+ submissions via a custom API and NoSQL cloud database.

written-digit-recognition

Interactive Streamlit app for handwritten digit classification, backed by a K-nearest neighbors model implemented from scratch in Python.

aita-predictor

A machine learning model that classifies r/AmItheA-hole Reddit posts using an ensemble of classifiers built on vector embeddings and large-scale PySpark text processing.
