For handling requests through the use of multiple application instances, we use the concept of horizontal scaling, where we launch more than one instance of the same application behind a load balancer. The load balancer is then responsible for distributing the incoming requests across this pool of application instances.