Jeremy Stanley

Jeremy Stanley headshot

I am the co-founder and Chief Scientist of Anomalo, an AI platform that helps enterprises monitor and improve data quality. Before founding Anomalo, I was the VP of Data Science at Instacart, where I led machine learning efforts and drove key initiatives on the path to profitability and sustainable growth.

Stepping Back

Sep 10, 2024 • Independent • LinkedIn

Announcing my decision to step back from an operating role at Anomalo due to health changes. Reflects on my career through the lens of proving a well known VC wrong - both about making Instacart profitable and about creating Anomalo.

Automating Data Quality (O’Reilly)

Feb 13, 2024 • Anomalo • O’Reilly • with Paige Schwartz

A practical guide to building and maintaining automated data quality monitoring systems that scale across cloud data platforms. It explains how to design and test unsupervised learning models for issue detection, implement effective alerting and resolution workflows, and manage these systems enterprise wide.

An All-In Founder Forum

May 17, 2023 • Independent • Medium

Recounts the origin and structure of a peer forum of venture-backed founders focused on vulnerability, accountability, and personal growth. Shares operating norms, meeting cadence, and an open invite for a new member.

Build Data Factories, Not Data Warehouses

Apr 12, 2022 • Anomalo • The New Stack

Modern organizations operate data factories (not warehouses) that transform raw inputs into dynamic, customized data products. To ensure these factories produce trustworthy outputs, data teams must invest in scalable, automated quality control systems that empower data consumers, minimize false alerts, and validate data at every stage, especially before delivery.

Detecting Extreme Data Events

Nov 16, 2021 • Anomalo • Anomalo (Medium)

Introduces Anomalo’s entity outlier check to catch rare, high-impact events by pairing time-series anomaly detection with automated root-cause analysis. Demonstrates the approach with NYC 311 data and contrasts it with generic outlier methods that fail in real-world settings.

Effective Data Monitoring: Steps to Minimize False Alerts

Mar 17, 2021 • Anomalo • Anomalo Blog

Offers ten actionable steps to reduce false positives and negatives in data monitoring systems and to calibrate alerting thresholds. It guides teams in balancing sensitivity and signal-to-noise to maintain trust in monitoring systems.

Trust Your Data with Unsupervised Data Monitoring

Jan 26, 2021 • Anomalo • Anomalo Blog

Presents how unsupervised learning can detect unexpected anomalies in data without requiring labeled incidents. It shows how combining forecasting with unsupervised signals helps improve coverage over simple rules-based monitoring.

Airbnb Quality Data For All

Dec 02, 2020 • Anomalo • Anomalo Blog

Airbnb’s growth demanded strong automated systems to keep its data complete, timely, and reliable. The article shows how companies can achieve similar data quality using tools like Anomalo without massive budgets.

Dynamic Data Testing

Nov 18, 2020 • Anomalo • Anomalo (Medium)

Defines a framework from static rules to dynamic, model-based tests and unsupervised detection for higher coverage with less maintenance. Uses EU COVID-19 data to show how predicted ranges outperform hand-tuned thresholds and avoid missed anomalies.

When Data Disappears

Nov 10, 2020 • Anomalo • Anomalo (Medium)

Explains why missing or partially missing data is the most common—and dangerous—data quality failure, often invisible in aggregate metrics. Outlines practical tests to detect staleness, shortfalls, and segment drop-offs, with alerting workflows for rapid triage.

700 Women Founders

Jun 24, 2017 • Independent • Medium

Analyzes 2009–2013 Crunchbase data to estimate the share of venture-backed companies with women founders and ranks VC portfolios by representation. Argues for transparency and accountability by using data to surface diversity gaps in venture funding.

Space, Time and Groceries

Jun 13, 2017 • Instacart • Instacart Tech Blog

Frames Instacart grocery delivery as a spatiotemporal logistics problem and describes the architecture used to optimize it. Uses Datashader to visualize massive Instacart GPS datasets that reveal how shoppers move through cities, stores, and delivery routes.

How Instacart Uses Data to Craft A Bespoke Comp Strategy

Jun 01, 2017 • Instacart • First Round Review • with Udi Nir, Guissu Baier

Details Instacart’s compensation methodology combining survey data and regression modeling, mapped to precise leveling to ensure fairness and competitiveness. Shares outcomes and tactics for market positioning, equity education, and calibration across roles and seniority.

3 Million Instacart Orders, Open Sourced

May 03, 2017 • Instacart • Instacart Tech Blog

Instacart released an anonymized public dataset of over 3 million grocery orders from more than 200,000 users to support machine learning research on consumer purchasing behavior. The article introduces the dataset, highlights privacy protections, and shares some initial insights from the data.

Deep Learning with Emojis (not Math)

Mar 29, 2017 • Instacart • Instacart Tech Blog

Offers an intuition-first explanation of deep learning concepts (using emojis) aimed at non-technical readers, connected to Instacart’s efforts in ranking and route optimization for shopping in store. It frames how deep learning ideas can be communicated without math-heavy exposition.

Doing Data Science Right — Your Most Common Questions Answered

Apr 07, 2016 • Independent • First Round Review • with Daniel Tunkelang

Provides concise, high-leverage answers to recurring challenges in building and scaling data science teams, from scope setting to stakeholder alignment. It distills practical wisdom from leaders across many companies into a Q&A format.

Data Science at Instacart

Feb 17, 2016 • Instacart • Instacart Tech Blog

How the data science organization at Instacart works in partnership with product and engineering to drive key decisions and outcomes. Outlines the data opportunities in forecasting, ads, recommendations, and search optimization.

How to Consistently Hire Remarkable Data Scientists

Mar 26, 2015 • Sailthru • First Round Review

Describes a structured approach to recruiting, evaluating, calibrating, and retaining exceptional data science talent, with emphasis on projects that reflect real work. It highlights how to design take-homes and interviewing loops that predict long-term success.

Cover: Pieces of the Action

Pieces of the Action

2022 • Vannevar Bush

Vannevar was FDR's science advisor during WWII, and shepherded tech from penicillin to proximity fuses to the atomic bomb. His advice on aligning scientists with the hierarchical war effort applies to any tech org. The challenges he saw the US facing in the 1970s, compared to the progress we have made and what we face today, is heartening. His vision for the future of personal computing (the Memex) was prophetic if incomplete. Written in 1970 and republished in 2022, it could have used a firmer edit, but Bush's voice and anecdotes shine through. I gained an entirely new perspective on innovation and its messy deployment in practice.

Causal Inference in Python: Applying Causal Inference in the Tech Industry

2023 • Matheus Facure

Teasing causality from data is one of the most subjective and challenging tasks that data scientists face. This book provides a thoughtful, fun to read, and practical introduction to causal inference. It covers a lot of ground, from causal graphs to synthetic controls to ML estimates for user level treatment effects. By weaving together theory and code, Facure turns what’s often an abstract topic into something directly usable in industry settings. The math notation is not always clear, but the code examples and reasoning are fantastic. Much of what's here I had to figure out over the course of my career. Other ideas were entirely new to me - and I can't wait to apply them.

Abundance: What It Takes to Build

2025 • Ezra Klein, Derek Thompson

I found this book to be timely and exciting, as it is written by respected journalists from the left and clearly diagnoses the failures to support the supply side required for achieving Abundance as a society. They convincingly argue that homelessness stems from a simple fact: we don’t build enough housing. They identify many of the factors crippling our ability to build in liberal cities. They illustrate how well-meaning special interests on the left can cripple legislation like the CHIPS act. They explain the history of legal tools created to fight abuse of the commons in the 60s and then explain how those same tools are now used by all to obstruct progress. They critique our federal science apparatus for being too risk averse and bureaucratic. I didn't leave the book with a clear blueprint for what should come next, but I hope this book inspires the start of one.

Cover: The Alchemy of Air

The Alchemy of Air

2008 • Thomas Hager

Cover: The Lessons of History

The Lessons of History

1968 • Will Durant, Ariel Durant

The Beginning of Infinity

2011 • David Deutsch

Thinking, Fast and Slow

2011 • Daniel Kahneman

Cover: Benjamin Franklin: An American Life

Benjamin Franklin: An American Life

2004 • Walter Isaacson

Deep Learning

2016 • Ian Goodfellow, Yoshua Bengio, Aaron Courville

An Introduction to Statistical Learning (2nd ed.)

2021 • Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani

The Elements of Statistical Learning

2009 • Trevor Hastie, Robert Tibshirani, Jerome Friedman