Marco Lanfranchi

4th year data science student at SFU with experience in databases, software development, and data science.

📍 Burnaby, BC GitHub LinkedIn

Experience

Work

Data Scientist I, Mastercard

Aug 2026, Vancouver, BC

Upcoming post-grad role on the ML Tooling/Operations team.

Database Engineer Intern, Samsung R&D Canada

Jan 2025 - Aug 2025, Vancouver, BC

Cloud Engineering team.

Data Analyst Intern, Nettwerk Music Group

Sept 2023 - May 2024, Vancouver, BC

Developed dashboards, data pipelines, and applied data science techniques analyzing global streaming data for hundreds of artists as part of the Analytics team at an independent record label.

Education

BSc in Data Science, Simon Fraser University

Sept 2022 - Apr 2026, Burnaby, BC

BA (transferred before completion), University of British Columbia

Sept 2020 - Jul 2022, Vancouver, BC

Open Source Contributions

Streamlit

Oct 2025

Contributed to Streamlit, a popular open-source framework for building data and ML apps, by implementing tooltip support for st.badge().

Other Experience

Volunteer Jr. Data Scientist, Industrio AI

Jan 2023 - Apr 2023, Vancouver, BC

Contributed to the development of data applications for business clients, developing interactive visualizations with Python and JavaScript.

Research Associate, Dr. Matt Lowe, UBC School of Economics

Jan 2021 - Aug 2021, Vancouver, BC

Collaborated as an undergraduate research associate collecting data for one of Dr. Matt Lowe's research studies in behavioral economics.

Projects

lisa (Labeled Identification of Speech Audio)

ML model for speaker identification from audio clips. Pipeline includes data processing, audio cleaning, feature extraction, and model training. Also developed a demo interface that takes live audio input and identifies speakers.

lisa (Labeled Identification of Speech Audio) demo

spotify-history

Automated background service that archives your Spotify listening history in an SQLite db and emails daily summaries.

spotify-history demo

iammusic-template

Web app that lets users create custom versions of the 'I AM MUSIC' album cover. Reached over 250k visitors in one month and processed 500k+ submissions via a custom API and NoSQL cloud database.

iammusic-template demo

written-digit-recognition

Interactive Streamlit app for handwritten digit classification, backed by a K-nearest neighbors model implemented from scratch in Python.

written-digit-recognition demo

aita-predictor

A machine learning model that classifies r/AmItheA-hole Reddit posts using an ensemble of classifiers built on vector embeddings and large-scale PySpark text processing.

aita-predictor demo