Part II. Retrieval

How do we get all the data in the right place to train a recommendation system? How do we build and deploy systems for real-time inference?

Reading research papers about recommendation systems will often give the impression that they’re built via a bunch of math equations, and all the really hard work of recommendation systems is connecting these equations to the features of your problem. More realistically, the first several steps of building a production recommendation system all fall under systems-engineering. Understanding how your data will make it into your system, be manipulated into the correct structure, then available in each of the relevant steps of the training flow often constitutes the bulk of the initial recommendation systems work. But even beyond this initial phase, ensuring all of the necessary components are fast enough and robust enough for production environments, requires yet another significant investment in platform infrastructure.

Often times, you’ll build a component responsible for processing the various types of data and storing them in a convenient format. Next, you’ll construct a model that takes that data and encodes it in a latent-space or other representation model. Finally, you’ll need something to turn an input request into the representation form as a query in this space. These usually take the form of jobs in some workflow management platform, or services deployed as endpoints. The next few chapters will step you through the relevant technologies and concepts necessary to build and deploy these systems – and the awareness of important aspects of reliability, scability, and efficiency.

Just in case you’re thinking “I’m a data scientist! I don’t need to know all this!”; you should know that RecSys has an inconvenient duality. Model architecture changes, often effect the systems architecture. Interested in trying out those fancy transformers? Your deployment strategy is going to need a new design. Maybe your clever feature embeddings can solve the cold-start problem! Those feature embeddings will need to serve your encoding layers, and integrate with your new NoSql feature store. Don’t panic! This chapter is a walk through the Big Data Zoo.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.15.4.244