Marco Lanfranchi

4th year data science student at SFU with database, software development, and data science experience. Looking for opportunities in machine learning engineering.

📍 Burnaby, BC GitHub LinkedIn

Experience

Work

Database Engineer Intern, Samsung R&D Canada

📅 Jan 2025 - Aug 2025

Developed and deployed a platform that automated database account management across PostgreSQL, MySQL, Redshift, and MongoDB databases. Implemented all app functionality and introduced automated account lifecycles with password rotations and account expirations which eliminated over a quarter of DBA tickets.

🛠️ MongoDB, PostgreSQL, MySQL, Redshift, Python, Bash, AWS, Boto3, Terraform, Docker, GitHub Actions

Data Analyst Intern, Nettwerk Music Group

📅 Sept 2023 - May 2024

Applied statistical analysis and machine learning techniques to streaming and social media data for 100s of artists under an independent label. Developed dashboards for geospatial audience streaming analytics, fraudulent stream detection, and pipelines that transformed raw streaming data into reports and visualizations.

🛠️ Snowflake, SQL, Tableau, Python, Scikit-learn, Data Visualization, Geospatial Analysis, Regression Analysis

Education

BSc in Data Science, Simon Fraser University

📅 Sept 2022 - Apr 2026

Relevant courses include: Data Structures & Algorithms, Computational Data Science, Database Systems, Computer Systems, Intro to AI, Linear Algebra, Statistical Learning & Prediction, Linear Optimization, and Discrete Mathematics.

Other Experience

Volunteer Jr. Data Scientist, Industrio AI

📅 Jan 2023 - Apr 2023

Worked with a small team of data scientists and developers to build full-stack applications for fuel cell engineering clients, contributing front-end features and interactive visualizations using Python, Streamlit, Plotly, TypeScript, and Vue.js.

🛠️ Azure, PostgreSQL, Python, Plotly, Streamlit, TypeScript, Vue.js, Figma

Research Associate, Dr. Matt Lowe, UBC School of Economics

📅 Jan 2021 - Aug 2021

Collaborated as an undergraduate research associate collecting data for Dr. Matt Lowe’s research project: 'Do Virtue Signals Signal Virtue?'.

Projects

LISA (Labeled Identification of Speaker's Audio Model)

End-to-end machine learning project that identifies who's speaking from audio clips. Built a data pipeline with speaker diarization, audio preprocessing, and feature extraction. Working on model training, evaluation, and a real-time speaker identification demo interface.

🛠️ Python, Librosa, FFmpeg, pyannote.audio, Scikit-learn, Streamlit
LISA (Labeled Identification of Speaker's Audio Model) demo

spotify-history

Background service that archives your Spotify listening history into a local SQLite database and sends you daily listening summaries by email. Designed for easy set-up and to run indefinitely (I'm running it from an old Raspberry Pi).

🛠️ Python, SQLite3, Spotify API, Cron, Shell Scripting
spotify-history demo

iammusic-template

Web app that lets users generate custom versions of a popular album cover. At its peak, it reached over 200k visitors in a single month and has processed over 500k submissions through a custom API and NoSQL cloud database.

🛠️ React.js, Next.js, Firebase, Vercel, GCP
iammusic-template demo

written-digit-recognition

Interactive app for handwritten digit classification, built with a custom K-nearest neighbors implementation from scratch in Python.

🛠️ Python, NumPy, Plotly, Streamlit
written-digit-recognition demo

aita-predictor

A machine learning model that classifies r/AmItheA-hole Reddit posts using an ensemble of classifiers built on vector embeddings and large-scale PySpark text processing. Includes a Streamlit UI for interactive exploration and testing.

🛠️ Python, PyTorch, PySpark, Scikit-learn, Streamlit, OpenAI API
aita-predictor demo