Marco Lanfranchi
Fourth-year data science student at SFU with database, software development, and data science experience. Looking for opportunities in machine learning engineering.
Experience
Work
Database Engineer Intern, Samsung R&D Canada
Developed and deployed a platform that automated database account management across PostgreSQL, MySQL, Redshift, and MongoDB databases. Implemented all app functionality and introduced automated account lifecycles with password rotations and account expirations, which eliminated over a quarter of DBA tickets.
Data Analyst Intern, Nettwerk Music Group
Applied statistical analysis and machine learning techniques to streaming and social media data for hundreds of artists under an independent label. Developed dashboards for geospatial audience streaming analytics and fraudulent stream detection, and built pipelines that transformed raw streaming data into reports and visualizations.
Education
BSc in Data Science, Simon Fraser University
Relevant courses include Data Structures & Algorithms, Computational Data Science, Database Systems, Computer Systems, Intro to AI, Linear Algebra, Statistical Learning & Prediction, Linear Optimization, and Discrete Mathematics.
Other Experience
Volunteer Jr. Data Scientist, Industrio AI
Worked with a small team of data scientists and developers to build full-stack applications for fuel cell engineering clients, contributing front-end features and interactive visualizations using Python, Streamlit, Plotly, TypeScript, and Vue.js.
Research Associate, Dr. Matt Lowe, UBC School of Economics
Collected data as an undergraduate research associate for Dr. Matt Lowe’s research project 'Do Virtue Signals Signal Virtue?'
Projects
LISA (Labeled Identification of Speaker's Audio Model)
End-to-end machine learning project that identifies who's speaking from audio clips. Built a data pipeline with speaker diarization, audio preprocessing, and feature extraction. Working on model training, evaluation, and a real-time speaker identification demo interface.
spotify-history
Background service that archives your Spotify listening history into a local SQLite database and sends you daily listening summaries by email. Designed for easy setup and to run indefinitely (I'm running it from an old Raspberry Pi).
iammusic-template
Web app that lets users generate custom versions of a popular album cover. At its peak, it reached over 200k visitors in a single month and has processed over 500k submissions through a custom API and NoSQL cloud database.
written-digit-recognition
Interactive app for handwritten digit classification, built on a k-nearest neighbors implementation written from scratch in Python.
aita-predictor
A machine learning model that classifies r/AmItheA-hole Reddit posts using an ensemble of classifiers built on vector embeddings and large-scale PySpark text processing. Includes a Streamlit UI for interactive exploration and testing.