ML Platforms & Data Quality
Centralized platforms that turn ad-hoc detection into a reliable, governed capability across an org.
- Anomaly detection at warehouse scale
- Continuous data-quality monitoring
- InnerSource governance models (PMC)
Available for ML strategy & advisory
Machine Learning & Engineering at Apple. I design and lead ML platforms — anomaly detection, data quality, search ranking — that quietly run in production at the scale of hundreds of millions of users.
San Francisco Bay Area · 13+ years · Senior Member IEEE
Shipped at

01 / About
For 13+ years I've sat on the seam between software engineering and applied machine learning — from intrusion-detection research, to recommendation systems at Yahoo Finance, to learning-to-rank at Walmart, to platform-scale anomaly detection at Apple.
Today I lead the technical roadmap for a centralized anomaly-detection and data-quality ML platform now used by 250+ engineers across half a dozen Apple divisions. The work I'm proudest of is the quiet kind — the platform that prevents a launch from blowing up, the privacy scanner that keeps compliance continuous, the ranker whose effects only show up in revenue.
I mentor early-career engineers, speak at industry events (most recently the Databricks Data+AI Summit 2025), and am a Senior Member of IEEE.
250+
Engineers on platforms I've built
100M+
Emails / day classified in production
1M+
Daily users served by recommenders
9-figure
Revenue impact at Walmart Search
Stanford University
Artificial Intelligence Professional Program
2025 — 2026
Arizona State University
M.S., Computer Science
GPA 4.0 / 4.0 · First in class
2015 — 2017
Indian Institute of Technology, Delhi
B.Tech + M.Tech (5 yr), Mathematics & Computer Science
2008 — 2013
02 / Expertise
Centralized platforms that turn ad-hoc detection into a reliable, governed capability across an org.
Learning-to-rank, semantic matching, and recommender systems that move revenue and engagement.
Detection systems that make compliance and security continuous instead of audit-driven.
Bridging research and production — explainability, reasoning-based detection, and peer-reviewed work.
03 / Selected Work
Apple
Case studyCentralized ML platform for detecting anomalies and data quality issues across Apple's data warehouse — the kind of platform that prevents a launch from blowing up.
Apple
Case studyTwo-year cross-functional initiative to make privacy compliance continuous instead of audit-driven.
Walmart Global Tech
Case studyFounding ML engineer on Walmart.com's reranking team. Built the systems and the science that made search feel relevant.
Yahoo Finance
Case studyImplicit-feedback recommendation system based on page visits — turning behavioral signal into personalized stock picks.
Yahoo Mail
Case studyProductionized a hierarchical classifier mapping coupon and order emails to the Google Product Taxonomy, plus a CNN event classifier and a real-time model-serving service.
Arizona State University
Case studyResearch platform correlating heterogeneous security feeds for intrusion detection — produced multiple peer-reviewed publications.
04 / Experience
Lead the technical roadmap for a centralized anomaly-detection & data-quality ML platform; founded an InnerSource PMC; led PII detection across 100% of warehouse columns; designed an enterprise anti-fraud system; spoke at Databricks Data+AI Summit 2025.
Founding ML engineer & research lead for Walmart.com's reranking team. Invented the Rerank Micro Service (RMS), Walmart's first microservice for reranking. Shipped Learning-to-Rank and embedding-based semantic matching tied to nine-figure revenue impact.
Built a stock recommendation system for 1M+ daily users (Spark MLlib), an analyst credibility scoring system, and a CNN-based event classifier for Yahoo Mail. Productionized a hierarchical classifier handling 100M+ emails/day, and led Resource-as-a-Service for real-time model fetching.
Designed Threat Intelligence Analytics — a platform correlating security data across heterogeneous feeds using data-mining and ML. Yielded multiple peer-reviewed publications on intuitionistic-fuzzy clustering for intrusion detection.
Summer internship building internal engineering tooling.
Big-data security analyst — captured incident vectors from IDS / endpoint products, stored them in HBase, and classified malicious activity with data-mining and ML on Hadoop, Hive, and IBM InfoSphere Streams.
Built tooling and diagnostics for EMC Documentum / xCP. Owned the Documentum Mobile 1.2.4 iOS release end-to-end. Bronze Award, Value Creation, Q1 2014.
05 / Research & Speaking
Speaking
Databricks Data+AI Summit 2025
Selected from a record number of submissions
Recognition
Senior Member, IEEE
Plus EMC Bronze Award · Hack Arizona winner
Mentorship
Career mentor
Onboarded 7+ engineers; multiple to feature ownership
Anomaly detection · LLM reasoning · 2025
Threat intelligence · multi-source correlation
Network forensics · intrusion detection
Network forensics · empirical evaluation
Medical imaging · segmentation
Fuzzy systems · neural networks
06 / Contact
I'm always happy to hear from folks working on hard ML or data-platform problems — whether it's a role, a collaboration, or just a conversation about anomaly detection, search ranking, or applied AI. I also mentor early-career engineers; if that's you, please don't hesitate to reach out.
anupam.p.iitd@gmail.comUsually reply within a day or two