Figure 7-1 displays the API gateway in between the external and the internal network, which is a very typical setup. It can look differently, too. It is not uncommon to find at least two API gateways. Many systems do not allow access to certain components from within the DMZ. For example, if data needs to be retrieved from a database, this database can only be accessed from within the internal network.
In those cases, one API gateway within the DMZ validates incoming messages by static rules only. These rules are configured once and do not need any additional information at runtime. After successful validation, that API gateway forwards the message to a second API gateway within the internal network. This one has access to all other components including databases and is able to complete a set of necessary validations.
Another trend that I have noticed during discussions with customers of large enterprises indicates that the distinction between the external and internal network has become less of a differentiator. These customers assume an attacker could be located anywhere. They are starting to place an API gateway as an Ingress gateway where traffic is received at the edge of their network and less powerful API gateways throughout different network sections. Each of them have very specific tasks and are usually responsible for securing a group of dedicated backend APIs (more on this design pattern in Chapter 8 about APIs and microservices).
The next sections are similar to each other, but they focus on different topics. Each one can be read on its own if that is preferred.
Why Do API Gateways Exist?
The first API gateways I noticed appeared in 2006. At that time, SOAP web services were still very new, and those messages could get very complex. XML schema validation, XML encryption, XML signatures, SOAP envelopes, SOAP headers and SOAP body, XML namespaces, WSDL, WSS, XSL, XPath—all of these technologies appeared in a relatively short period of time. And all of them had to be mastered in order to support a web service infrastructure.
- Message structure
SOAP messages have an envelope and a body. The header is optional.
SOAP fault appears in error messages only.
- Two namespaces including aliases
gw: http://gateway.example.com/ws/policy/fault . This namespace is specific to this message
- Elements with no namespaces
faultcode, faultstring, faultactor, details
- Message encoding
UTF-8
Elements and attributes
To access the value status, XPath is required. The expression looks like this:
/soapenv:Envelope/soapenv:Body/soapenv:Fault/detail/gw:policyResult/@status
That is quite a selector for such a short message! It is not easy to build by anyone who is new to XML-related technologies. For this purpose, some API gateways provide easy-to-use graphical interfaces that allow users to create this XPath expression by simply selecting the element based on an example message.
Unfortunately, wherever it was required to expose business interfaces as SOAP web services, developers needed to support these technologies in all of their APIs. This required specific knowledge and had potential for mistakes, especially when it came to encryption, digital signatures, and SAML. From a business logic point of view, having to support these technologies was considered overhead since they were not really part of the API feature itself.
Similar to mobile applications today, there is always the effort of implementing the foundation of the app. Components that handle security, authentication, authorization, session management, all of that is not really the app, but still required. To help mobile developers, SDKs are available to take care of many of these technologies.
For SOAP web services, products have been made available to help API developers, similar to mobile SDKs to help app developers. The first products I worked with were the Layer 7 SecureSpan SOA Gateway (later the CA API Gateway)2 and IBM DataPower.3 The main goal was to take the burden off of developers by supporting all these XML-based technologies in an easily accessible way. Developers could then concentrate on their API business logic and let the API gateways handle all the complicated, non-business API logic separately.
A typical scenario, which I have worked on myself, was the following.
A medical report would be sent to the health insurance company. The report would be signed and partially encrypted. In addition, the report would be sent via TLS with client authentication.
The API gateway would validate the SSL session, the digital signature, would check for rate limits and messages size, and it would do XML schema validation. It would take care of validating the complete message. Decrypting the message would be an option, depending on the use case. Developers of backend business APIs could now expect to handle validated messages only! For example, the signature validation would not be necessary since it was already done!
Nowadays these SOAP web services may still be running but new SOAP web services rarely appear. Instead, RESTful API interfaces have taken over. Although message structures may not be as complex anymore, the need for introspection still exists. The complex message structure has been replaced by having to support multiple parameters including optional ones, explicit HTTP method validations, and different types of payloads. Requirements such as rate limits and message size validations have not changed. In addition, new protocols such as OAuth and OpenID Connect have been created and need to be handled, too.
What Are API Gateways Used For?
- 1.
Access control (i.e. who can access)
- 2.
Network-level security (i.e. use of TLS)
- 3.
Message security (i.e. message encryption)
- 4.
Message validation and transformation (i.e. from JSON to XML)
- 5.
Message routing (i.e. forwarding messages via HTTP)
- 6.
API availability (i.e. accessible during certain hours)
- 7.
Logging
- 8.
Threat protection (i.e. protecting against SQL injection)
- 9.
Support for messaging (i.e. HTTP to MQTT)
- 10.
Support for accessing data sources (i.e. accessing databases)
It is a long list and it’s not even complete. Other categories can be found, but they are more like “you can but you should not necessarily do this or that using an API gateway.” As mentioned, on a high level it is all about externalizing non-business, API-related features in the API gateway. What that means in detail is described best by an example.
Let’s say a business API has to validate and persist incoming paystubs. The requests must be sent by authorized clients who are identified by an OAuth access_token. The API must apply validations to assure reliability and security. Once the request has been validated, the API must associate the given paystub with a user and persist it. All actions must be logged for historical and compliance reasons.
- 1.
Error handling
- 2.
SSL/TLS
- 3.
OAuth with SCOPE paystub
- 4.
Authentication
- 5.
Authorization
- 6.
Rate limit
- 7.
Replay attack protection
- 8.
Message size validation
- 9.
SQL injection protection
- 10.
Validate and persist paystub
- 1.Preparation → API infrastructure
- a.
Effort spent to build some kind of foundation including requirements 1 - 9
- a.
- 2.Implementation → API business logic
- a.
Effort spent implementing the actual, required logic for requirement 10
- a.
The percentage may not match in all cases, but it is roughly what I have seen in real-life scenarios. The point is, everything that is not part of the core feature of an API is part of its infrastructure. If you multiply this by the number of APIs, it can be imagined that many resources are used only to get to a point where the business implementation can be started.
The goal of having an API gateway is to externalize these tasks. With an API gateway, those 9 requirements can be implemented in front of the business API, done by a different developer team. And, even if it is the same team, the API gateway provides features that support the developers to get ahead fast. The API gateway not only speeds up implementations, it also eases the operation of services! Internal standards can be applied, logging can be aligned, and API protection can be aligned—all of this due to having one single component that “sees” all traffic.
The API gateway takes care of most requirements and filters incoming requests to only forward valid messages to the backend. Steps 1 and 2 on the backend side are additional required steps. They are to assure that requests were received by the API proxy and that the user is the one associated with the paystub. After that, the backend can handle its paystub feature.
If you now ask why the user can’t be authorized by the API proxy, the answer is quite simple: API gateways should not implement business-relevant validations! Only the business API itself should do that. Otherwise, a lot of context needs to be made available to the API gateway and that raises privacy concerns and potential replication challenges.
In many cases, the requirements in front of different business APIs will be the same, or at least similar. Knowing this opens the door for tremendous simplifications. Ideally it becomes so simple that a developer could be placed in front of a website and simply fill out a dialog with check boxes and input fields. A click on a “Deploy service” button will create a new API on the API proxy and all these requirements will be implemented. Simple as that!
I would like to share a prototype that I have worked on. This is the scenario.
At the end a “Deploy service” button generates a deployment descriptor that goes into a version control system and is used to generate code for different API proxy target systems. Since the code for the API proxy is generated, no errors are possible. Each API follows best practices, include error handling and generating comments so that readers of the implementation will know what it does.
It is hopefully obvious that anyone could fill out these dialogs. No hardcore technology knowledge is needed. The best part about all of this is the fact that none of this influences the business API. It can be developed without knowing anything about the API proxy implementation. The contract between those two would be an API definition such as the Swagger document discussed previously.
Mocking APIs
A client passes in an HTTP header (line number 3) named x-testcase, which sets a variable named testcase. Further down the API takes the appropriate action. For a success message, it will return what is shown in the lower right dialog.
Many cases can be handled. It can also be done based on other incoming parameters or request payloads. Ideally the API gateway will require request messages and return responses that also match the Swagger API definition. With that, only the content would be fake, but message formats and structures could already be verified!
Another powerful feature is the capability of simulating failing backend systems, added latency, connection breakdowns—all the cases no one would like to see in a production system, but still need to handle!
Why Is It Important to Leverage API Gateways?
- 1.
External network from internal network
- 2.
External API interface from internal API interface
- 3.
External URL from internal URL
- 4.
External network topology from internal network topology
- 5.
External API version from internal API version
- 1.
The API gateway can be managed by a different team than the backend API team.
- 2.
Teams such as operations (DevOps) or security can specify their own requirements without influencing the business API.
- 3.
The separation of features per API. For example, the API gateway could expose a JSON-based API and translate that to an XML interface for the backend API. The API gateway can also translate between different protocols.
- 1.
API gateways can connect to other components to prepare the request to the backend API.
- 2.
API gateways can be scaled independently of backend APIs.
- 1.
Termination of TLS/ SSL connections
- 2.
Service orchestration. One exposed API could leverage multiple other APIs internally. A response of a simple /overview API could be the result of five or more API calls, managed and implemented by the gateway.
- 3.
Caching, to reduce the load of messages being send to the backend system
All these reasons enable different teams to work on features at the same time, each one on its own component. Let’s have a closer look into each one.
Decoupling
The API gateway is located within the DMZ and receives requests from external clients. The requests include an OAuth access_token and a JSON payload. The API gateway will validate the access_token and retrieve the associated username. In addition, it will inspect the payload. Depending on that, it is able to evaluate which backend API of which version should handle this message (this is also called content-based routing). Since the backend API requires an XML payload, the API gateway transforms the JSON payload into that.
The API gateway creates a request for the target backend API and includes the username as an HTTP header in addition to the XML payload. The load balancer in between the API gateway and the backend APIs serves one IP address, which resolves to the matching backend API by examining the requests URL path. Each target backend receives and processes the request that matches exactly their interface.
This setup provides the flexibility for the backend API developers to develop and test new versions whenever they feel like it. Once the backend API is ready for production, the load balancer can be updated to route to the new backend API if required. Lastly, the API gateways logic to inspect the payload gets updated. That enables it to include the new backend API as a target location. The API gateway can also be prepared earlier in the development process but either ignores requests that are received with a newer payload or simply forwards them to an older version. Many possibilities appear once this setup has been established.
Separation of Concerns
Figure 7-7 shows several components: the API gateway, a firewall, a load balancer, and multiple backend servers. What appears to be a complex setup at a first glance is very much required. The backend APIs may be maintained by different teams that are specialists in a particular area. At the same time, network administrators can update the load balancer with configurations as necessary. The API gateway can handle requirements that are not directly business API-relevant but reduce complexity in that area.
Often, components in the DMZ require special audits due to the nature of the DMZ. Everything in the DMZ is exposed to the Internet and is therefore a potential target for attacks. The security team can apply strong measures to reduce risks. This does not apply to the backend APIs since they are not located in the DMZ and message inspections are done in the DMZ. The risk of receiving an invalid message is very much reduced.
Although the separation is very useful, it also requires well documented interfaces. If one component goes down, a procedure to identify it quickly is needed. A useful trick to trace messages is to include a requestID that is supported practically everywhere. As soon as a request is received, a requestID should be created and included in all subsequent calls and back again. If this value is logged, it can be searched for it and the failing component can be identified, sometimes even by not finding the value for component xyz in the logs!
Integration and Scaling
The beauty is that API gateways already support the technology to integrate with each of these systems. They only require configurations (exceptions exist). The backend API developer does not need to include a single line of code to support these integrations. Having these possibilities enables new use cases also. A request could be received via an email, checked for viruses using ICAP, and afterwards forwarded to a backend API. If this had to be handled by the backend itself, it could require skill sets that may not exist in teams.
Note
Because of these integration capabilities, API gateways are sometimes also referred to as lightweight ESBs.4 ESBs receive a request and processes it through many different channels until the message is finally sent to the main recipient.
Scaling is another big and important topic. Scaling may be required to serve a higher load from the external network but sometimes also into the internal network. A combination of both is certainly also possible. However, scaling is nothing that can be handled by any component itself. The network topology must be prepared for it. Adding a new API gateway or backend API is not an easy task and requires the correct setup beforehand. Needless to say, this is something that is necessary, no matter what component has to be scaled.
If the backend needs scaling, it can be done independently of the API gateway. Session stickiness between the API gateway and a particular backend API may be required but that can be handled. Scaling in that direction can also include rate limiting per backend API. The exposed API on the gateway can be configured to prevent the backend API from failing or getting overloaded.
Caching is also an interesting capability. Often, APIs return the same result, for example, configurations. In those cases, an API gateway could cache these types of responses for a specified amount of time to reduce the overall load to backend services and the overall latency.
Figure 7-9 contains multiple API gateways and backend APIs. It should illustrate the idea that components are created based on demand. Since they are all interconnected with each other in one way or the other, a well-planned strategy for scaling is required. A client will always send requests to the same address, but this request may be handled by different servers. This scenario needs to be supported! Having stateless components with no sessions is helpful but not always possible.
Having unique feature sets and playing such a vital role, API gateways are also great for auditing and logging. If all requests are processed by an API gateway, the API gateway is able to log and audit all messages if desired! This comes at a cost of decreased performance, but it may be valid for some environments. Turning on and off auditing and logging for certain time frames or during escalations is also possible.
Let’s consider a case where employees use an enterprise app. However, this app sometimes fails for some users. In addition, due to their roles, the app may access an API on backend XXX and sometimes on backend ZZZ. If logging only exists on the backend APIs, it could be challenging to discover what is going wrong. Having the API gateway in-between allows logging of usernames, request times, target backend APIs, backend API response errors, latency between APIs—all in a central location no matter what the target API may be.
If you feel that logging and auditing causes concerns in regard to privacy or access control nightmares, there is good news. API gateways (at least some) include role-based access. It is possible to limit who can do what and who can see what based on user roles or other attributes. This not only includes access to log files or audits; this also includes access to deployed APIs. This can also be used to simplify the case where API gateways are shared between different teams. Depending on team membership, these developers may only have access to reading API implementations or audits or configurations. Fine granular controls are available and should be leveraged.
API Gateway Alternatives
As an architect who has worked with API gateways since 2006, I usually say, There is no alternative. You need to have it! Needless to say, this is too easy as an acceptable answer. Of course alternatives exist!
Home-grown: Often, customers work with a home-grown solution. These systems usually start off as a small project to address exactly one requirement. Over time, more features are added as the need comes up. However, a typical conversation starts like this: We have this home-grown system that we need to replace. We do not have anyone left who is able to maintain it anymore! The other conversation sounds like this: We would like to introduce an API gateway but many of our systems depend on our home-grown system. Your API gateway needs to integrate the home-grown token format until we have moved all systems off of it! Don’t get me wrong: home-grown systems are not generally bad ideas. But, as it happens, these systems are often not treated as a product and therefore the development does not follow best practices for product development lifecycles. Lack of documentation, lack of testing, lack of knowledge-sharing are some of the issues. The investment into existing products may be avoided. But if the home-grown system is not treated as a product right from the beginning, it will be more cost-intensive in the long run. This alternative usually grows organically. A developer has this cool idea and promises that he can implement this one single feature easily by himself. Since it worked out, he is asked to implement another feature. One after one, and suddenly it’s a complete server that runs well, at least, as long as this one developer is around.
WAF5 and security per REST API: This option is found where environments serve pure REST APIs (i.e. JSON or XML payloads) and web content (i.e. HTML). Generally, environments that have started with web content only (web applications) leverage a WAF. Over time, when systems began to introduce REST services, the limitations of WAFs were compensated by adding features directly into each new REST API. Identified redundancies of common code was externalized and put into libraries or SDKs. For example, within a PHP-based ecosystem, REST APIs would use the same includes to handle rate limiting, message size validation, and other tasks. At some point in time, developers will have the desire to completely externalize these tasks into something like an API gateway or a WAF extension. And this ends up in a home-grown system again!
Per API solution: Rare, but also seen, are systems where each API takes care of its own security in all aspects. Teams that maintain an API, or a group of APIs, implement the business logic and additionally anything else that is needed for securing them. Knowledge may be shared with other teams, but the main mindset follows the idea of it has to work for us. For example, a system that has multiple web sites or REST APIs that handle online payment. Or, at least, the collection of payment details in order to forward these to payment card processing providers. How this collection of data is implemented or secured or displayed to users may be different per web site with no alignment between teams! As long as security reviews do not identify potential vulnerabilities and as long as the specific process is successful, any attempt for refactoring code is neglected. For this approach to be successful, the same requirements apply as they do for the home-grown use case.
A drawback of not having a dedicated component to handle incoming and outgoing traffic is the distribution of data. It is very difficult to apply company-wide, or at least business unit-wide, guidelines for securing and auditing APIs. Different teams have to be convinced of the usefulness and encouraged to follow these guidelines. Even small things like auditing become challenging since most teams have their own rules for auditing what, when, and how. If teams feel that adhering to these guidelines requires extra effort, they will ignore them as long as possible!
Nevertheless, I have seen one example where customers built a well-working system themselves. It was built from the ground up. The reason for its success was the fact that a team was formed and treated as a product development team. Requirements were collected, the scope of the project was well defined, timelines were specified, and releases were made available often to collect feedback from other employees. In addition, the system was well documented.
Summary
API gateways (API proxies) play a vital role in any environment that exposes RESTful APIs. API gateways are located so that all incoming and outgoing traffic can be inspected by them. Applying security rules, implementing logging requirements, enabling reviewers—all of this is supported. The burden of implementing non-functional requirements in regard to a business API is taken of the developers’ plates. Each team can concentrate on its own strengths. Alternatives do exist, but they often end up being a maintenance nightmare and difficult to manage.