You are on page 1of 2

Carlo Mazzaferro

carlo.mazzaferro@gmail.com ● +1 (858) 232-4926 ● Blog: mazzafi.sh ● GitHub: github.com/carlomazzaferro

EDUCATION University of California, San Diego, La Jolla, California, USA Upper Level GPA: 3.46 / 4.0
■ B.S in Bioengineering, minor in Mathematics Aug 2012 – Jun 2017
● Biosystems Engineering specialization
● Focus: statistics, probability, and machine learning

WORK AND ByteCubed - Machine Learning Engineer Sep 2018 – Present


RESEARCH ● Consulting for a large car dealearship, I built and deployed machine learning models and the corresponding server side
code and API for RESTful access to internal users resulting in 12% reduction in lot time of newly purchased cars
● Built and scaled our internal platform for machine learning model management, including integration with CI pipelines,
continuous training and deployment
ByteCubed - Jr Data Scientist Aug 2017 – Sep 2018
● Recommendations: company recommendation systems written in python for proprietary BI tool based on textual
descriptions and financial information deployed with docker to over 5k users
● Analytics and Machine Learning: ad-hoc analysis using a variety of statistical methodologies for customer
segmentation using Jupyter Notebooks, pandas, scikit-learn
ByteCubed - Data Science Intern Jun 2017 – Aug 2017
● Relevance: Elasticsearch set up, configuration, and data ingestion pipeline in Scala
● Operations: deployment of machine learning solutions with Docker, Flask, and TensorFlow
UCSD CCBB - Bioinformatics Developer Intern Jan 2016 – Jun 2017
● Developed a novel high-performance cloud-based genomic data analysis pipeline with Apache Spark
● Developed predictive pipelines for potential new anti-cancer drug discovery implementing state-of-the-art machine
learning algorithms
● Refactor, optimize, and document legacy code under the supervision and review of Sr. Bioinformatics Engineers
The Scripps Research Institute - Software Developer Intern Jun 2015 – Feb 2016
● Queried data from internal databases for custom-made analysis
● Built web application for data visualization of data using Python, SQL, D3

PROJECTS Kryptoflow - Machine Learning deployment that scales Dec 2017- Present
● Framework for real-time analysis and prediction live time-series data focused on continuous training and deployment
● Kafka, TensorFlowServing, ReactJS and Flask compose part of the tech stack enabling real-time analysis and scalabilty
● Fully open source, actively developed at https://github.com/carlomazzaferro/kryptoflow
Effortless Genomic Data Annotation, Storage and Filtering in Python Jun 2016 – Apr 2018
● Developed a novel method to store, query, and filter genomic data currently being used internally by our team
(https://github.com/ucsd-ccbb/VAPr/). Code reviewed by Sr. Engineer and publication under review
● Implemented using MongoDB, resulting in significant speedup as compared to usual SQL-based approaches for data
retrieval
● Allows the storage of widely sparse data with orders of magnitude savings in disk utilization (10-15x)

CONFERENCES [1] Carlo Mazzaferro “RESTful Machine Learning with Flask and TensorFlow Serving”, in. PyData,
Washington DC Oct 2018.
● The presentation was paired with the development of an open source library called racket (see more at
github.com/carlomazzaferro/racket) aimed at enabling users to quickly get from model building to a dockerized,
production-grade deployment in minutes

PUBLICATIONS [2] Carlo Mazzaferro, “Predicting Protein Binding Affinity With Word Embeddings And Recurrent Neural
Networks”, Biorxiv. Pre-print. Submitted Mar 2017.
[3] Carlo Mazzaferro, Amanda Birmingham and Kathleen M. Fisch, “Effortless variant analysis in Python with
VAPr”, Bioinformatics. Apr 2018.
[4] Ana M. Moreno*, Sara Brin Rosenthal*, Carlo Mazzaferro*, Dhruva Katrekar, Amanda Birmingham,
Kathleen M. Fisch, Prashant Mali, “Defining Cas9 orthogonality in Immune Space”, Nature. Under Review.
Submitted Apr 2017.

Page 1 of 2
AWARDS & ■ Dean’s List, Fall 2015, Spring, Winter, Fall 2016. Jacobs School of Engineering 2014 – 2016
SCHOLARSHIPS For attaining a semester GPA of at least 3.75.

LANGUAGES ■ Fluent: English, Portuguese, Italian


■ Advanced: Spanish

PROGRAMMING ■ Advanced (4+ years): Python, SQL, JavaScript (React + Redux, D3)
LANGUAGES ■ Intermediate (2+ years): Scala, C++/C, R
■ Other: CSS, HTML (2 years)
■ Special Interest for functional programming, especially strongly typed ones

Page 2 of 2

You might also like