Part I. Warming Up

“How do we get all the data in the right place to train a recommendation system, and for real-time inference?”

Reading research papers in recommendation systems will often give the impression that they’re built via a bunch of math equations, and all the really hard work of recommendation systems is connecting these equations to the features of your problem. More realistically, the first several steps of building a production recommendation system all fall under systems-engineering. The bulk of the initial recommendation systems work understanding how your data will make it into your system, be manipulated into the correct structure, then become available in each of the relevant steps of the training flow. But even beyond this initial phase, ensuring all of the necessary components are fast enough and robust enough for production environments, requires yet another significant investment in platform infrastructure.

Often times, you’ll build a component responsible for processing the various types of data and storing them in a convenient format. Next, you’ll construct a model that takes that data and encodes it in a latent-space or other representation model. Finally, you’ll need something to turn an input request into the representation form as a query in this space. These usually take the form of jobs in some workflow management platform, or services deployed as endpoints. The next few chapters will step you through the relevant technologies and concepts.

Just in case you’re thinking “I’m a data scientist! I don’t need to know all this!”; RecSys has an inconvenient duality, when the model architecture changes, often so too does the systems architecture. Wanna try out those fancy transformers? Your deployment strategy is going to need a new design. Maybe your clever feature embeddings can solve the cold-start problem! Prepare to serve your encoding layers and integrate with your new NoSql feature store. Don’t panic! This chapter is a walk through the Big Data Zoo.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.118.26.90