The Billion-Request Content Recommendation System Challenge
11-02, 10:55–11:35 (America/New_York), Radio City (Room 6604)

For online publications and media sites, recommendation systems play an important role in engaging readers. At Chartbeat, we are actively developing a recommendation system that caters to billions of daily pageviews across thousands of global websites. While conventional discussions frequently highlight the data science and machine learning facets of the system, the cornerstone of a successful application is its system architecture. In this presentation, we will dissect our architectural decisions designed to meet high-performance requirements and share insights gleaned from our journey in scaling up the recommendation system.


This talk is intended for data scientists and engineers interested in building a large-scale recommendation system, with a specific focus on real-time delivery. Building a real-time recommendation system for publishers poses distinct challenges and requirements, including managing a constantly changing item space, balancing contextual relevance with article performance, and accommodating custom curation of recommendations while handling tens of thousands of requests per second.

We will introduce you to our approach for addressing these architectural challenges. Beginning with a comprehensive overview of our architecture, which incorporates a batch-processing component for generating candidates and a real-time component for serving recommendations, we will then zero in on the architectural decisions and lessons learned from our experience scaling up the system.

We will delve into a detailed exploration of how we utilize the Web-Queue-Worker architecture to generate recommendations for each request while simultaneously maintaining a service level objective of sub-one-second request latency. Additionally, we will guide you through our iterative development process, which relies on regular load testing to meet the demands of ever-increasing traffic. We will share the metrics employed for tracking during load testing and demonstrate how these metrics aided us in pinpointing performance bottlenecks and selecting the appropriate technology and approach to overcome design challenges.


Prior Knowledge Expected

No previous knowledge expected

Machine Learning Engineer @ Chartbeat