9 Scaling Applications Using Multiple Processors in Node.js


In Chapter 4, “Using Events, Listeners, Timers, and Callbacks in Node.js,” you learned that Node.js applications run on a single thread rather than multiple threads. Running application logic on a single thread avoids the overhead of switching between and synchronizing threads, which keeps Node.js processes efficient. But most servers have multiple processors, and you can scale your Node.js applications by taking advantage of them. Node.js allows you to fork work from the main application to separate processes that then run in parallel with each other and the main application.

To facilitate using multiple processes, Node.js provides three specific modules. The process module provides access to the running process. The child_process module provides the ability to create child processes and communicate with them. The cluster module implements clustered servers that share the same port, thus allowing multiple requests to be handled simultaneously.

Understanding the Process Module

The process module is a global object that can be accessed from your Node.js applications without calling require(). This object gives you access to the running process as well as information about the underlying hardware architecture.

Understanding Process I/O Pipes

The process module provides access to the standard I/O pipes for the process: stdin, stdout, and stderr. stdin is the standard input pipe for the process, which is typically the console. You can read input from the console using the following code:

process.stdin.on('data', function(data){
  console.log("Console Input: " + data);
});

When you type in data to the console and press Enter, the data is written back out. For example:

some data
Console Input: some data

The stdout and stderr attributes of the process module are Writable streams that can be treated accordingly.
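Because stdout and stderr are Writable streams, you can call write() on them directly instead of going through console.log(). The following is a minimal sketch of that idea; the report() helper and its message text are hypothetical names used only for illustration.

```javascript
// Hypothetical helper: write a status line to stdout and, optionally,
// a warning line to stderr.
function report(message, warning) {
  // write() returns false when the stream's internal buffer is full.
  const flushed = process.stdout.write('status: ' + message + '\n');
  if (warning) {
    process.stderr.write('warning: ' + warning + '\n');
  }
  return flushed;
}

report('started');
report('disk check finished', 'disk usage above 90%');
```

Keeping warnings on stderr lets whoever runs the script redirect the two streams separately.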

Understanding Process Signals

A great feature of the process module is that it allows you to register listeners to handle signals sent to the process from the OS. This is helpful when you need to perform certain actions, such as cleaning up, before a process is stopped or terminated. Table 9.1 lists the process events that you can add listeners for.

To register for a process signal, simply use the on(event, callback) method. For example, to register an event handler for the SIGBREAK event, you would use the following code:

process.on('SIGBREAK', function(){
  console.log("Got a SIGBREAK");
});

Table 9.1 Events that can be sent to Node.js processes

Event	Description
`SIGUSR1`	Emitted when the Node.js debugger is started. You can add a listener; however, you cannot stop the debugger from starting.
`SIGPIPE`	Emitted when the process tries to write to a pipe without a process connected on the other end.
`SIGHUP`	Emitted on Windows when the console window is closed, and on other platforms under various similar conditions. Note: Windows terminates Node.js about 10 seconds after sending this event.
`SIGTERM`	Emitted when a request is made to terminate the process. This is not supported on Windows.
`SIGINT`	Emitted when an interrupt is sent to the process, for example, when Ctrl+C is pressed.
`SIGBREAK`	Emitted on Windows when Ctrl+Break is pressed.
`SIGWINCH`	Emitted when the console has been resized. On Windows, this is emitted only when you write to the console, when the cursor is being moved, or when a readable TTY is used in raw mode.
`SIGKILL`	Emitted on a process kill. Cannot have a listener installed.
`SIGSTOP`	Emitted on a process stop. Cannot have a listener installed.
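As a sketch of the cleanup use case described above, the following registers one handler for two of the signals in Table 9.1. The cleanup() function is a hypothetical placeholder; a real application might close files or sockets there.

```javascript
// Hypothetical cleanup routine; replace the body with real teardown work.
function cleanup(signal) {
  console.log('Received ' + signal + ', cleaning up');
  return signal;
}

// Register the same handler for SIGINT and SIGTERM.
['SIGINT', 'SIGTERM'].forEach(function (signal) {
  process.on(signal, function () {
    cleanup(signal);
    // Once a listener is installed, Node.js no longer exits on its own
    // for this signal, so exit explicitly.
    process.exit(0);
  });
});
```

Signal listeners do not keep the process running by themselves, so this is only useful in an application that has other work pending, such as a server or a timer.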

Controlling Process Execution with the `process` Module

The process module also gives you some control over the execution of processes, specifically, the ability to stop the current process, kill another process, or schedule work to run on the event queue. These methods are attached directly to the process module. For example, to exit the current Node.js process, you would use:

process.exit(0)

Table 9.2 lists the available process control methods on the process module.

Table 9.2 Methods that can be called on the process module to affect process execution

Method	Description
`abort()`	Causes the current Node.js application to exit immediately and generate a core file.
`exit([code])`	Causes the current Node.js application to exit and return the specified `code`.
`kill(pid, [signal])`	Causes the OS to send a kill signal to the process with the specified `pid`. The default `signal` is SIGTERM, but you can specify another.
`nextTick(callback)`	Schedules the `callback` function to run as soon as the currently executing code completes, before the event loop moves on to other queued events.
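To illustrate how nextTick() defers work, the following minimal sketch records the order in which statements run. The order array is an illustrative name, not part of the process module.

```javascript
const order = [];

order.push('start');

// The callback is queued; it does not run until the current code finishes.
process.nextTick(function () {
  order.push('tick');
  console.log(order.join(' -> ')); // prints "start -> end -> tick"
});

order.push('end');
```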

Getting Information from the `process` Module

The process module provides a wealth of information about the running process and the system architecture. This information can be useful when implementing your applications. For example, the process.pid property gives you the process ID that can then be used by your application.

Table 9.3 lists the properties and methods that you can access from the process module and describes what they return.

Table 9.3 Properties and methods of the process module used to gather information

Property/Method	Description
`version`	Specifies the version of Node.js.
`versions`	Provides an object containing the required modules and version for this Node.js application.
`config`	Contains the configuration options used to compile the current node executable.
`argv`	Contains the command arguments used to start the Node.js application. The first element is the node executable, and the second element is the path to the main JavaScript file.
`execPath`	Specifies the absolute path of the executable that started the Node.js process.
`execArgv`	Specifies the node-specific command-line options used to start the application.
`chdir(directory)`	Changes the current working `directory` for the application. This can be useful if you provide a configuration file that is loaded after the application has started.
`cwd()`	Returns the current working directory for the process.
`env`	Contains the key/value pairs specified in the environment for the process.
`pid`	Specifies the current process’s ID.
`title`	Specifies the title of the currently running process.
`arch`	Specifies the processor architecture the process is running on (for example, `x64`, `ia32`, or `arm`).
`platform`	Specifies the OS platform (for example, `linux`, `win32`, or `freebsd`).
`memoryUsage()`	Returns an object describing the current memory usage of the Node.js process. You can print the object using the `util.inspect()` method. For example, `console.log(util.inspect(process.memoryUsage()));` prints output such as `{ rss: 13946880, heapTotal: 4083456, heapUsed: 2190800 }`.
`maxTickDepth`	Specifies the maximum number of events scheduled by `nextTick()` that will be run before allowing blocking I/O events to be processed. You should adjust this value as necessary to keep your I/O processes from being starved.
`uptime()`	Returns the number of seconds the Node.js process has been running.
`hrtime()`	Returns a high-resolution time as a tuple array `[seconds, nanoseconds]`. This can be used if you need to implement a granular timing mechanism.
`getgid()`	On POSIX platforms, returns the numerical group ID for this process.
`setgid(id)`	On POSIX platforms, sets the numerical group ID for this process.
`getuid()`	On POSIX platforms, returns the numerical user ID for this process.
`setuid(id)`	On POSIX platforms, sets the numerical or string user ID for this process.
`getgroups()`	On POSIX platforms, returns an array of group IDs.
`setgroups(groups)`	On POSIX platforms, sets the supplementary group IDs. Your Node.js application needs root privileges to call this method.
`initgroups(user, extra_group)`	On POSIX platforms, initializes the group access list with the information from `/etc/group`. Your Node.js application needs root privileges to call this method.

To help you understand accessing information using the process module, Listing 9.1 makes a series of calls and outputs the results to the console, as shown in Listing 9.1 Output.

Listing 9.1 process_info.js: Accessing information about the process and system using the process module
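The following is a minimal sketch of what such a script might look like, not the original listing. It assumes a hypothetical collectProcessInfo() helper that gathers a handful of the values from Table 9.3 into one object before printing it.

```javascript
const util = require('util');

// Hypothetical helper: gather some of the properties and method results
// described in Table 9.3 into a single object.
function collectProcessInfo() {
  return {
    pid: process.pid,
    title: process.title,
    version: process.version,
    arch: process.arch,
    platform: process.platform,
    cwd: process.cwd(),
    execPath: process.execPath,
    argv: process.argv,
    uptimeSeconds: process.uptime(),
    hrtime: process.hrtime(),       // [seconds, nanoseconds]
    memory: process.memoryUsage()   // includes rss, heapTotal, heapUsed
  };
}

console.log(util.inspect(collectProcessInfo()));
```

The values printed depend on the machine and on how the script was started, so no sample output is shown here.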

Understanding the `ChildProcess` Object

Property	Description
`stdin`	An input `Writable` stream.
`stdout`	A standard output `Readable` stream.
`stderr`	A standard output `Readable` stream for errors.
`pid`	An ID of the process.
`connected`	A Boolean that is set to `false` after `disconnect()` is called. When this is `false`, you can no longer `send()` messages to the child.
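As a rough illustration of these properties, the sketch below starts a second Node.js instance with spawn() from the child_process module and reads from the resulting ChildProcess object. The inline script passed with -e is an assumption made to keep the example self-contained.

```javascript
const childProcess = require('child_process');

// Start a second Node.js instance that prints one line and exits.
const child = childProcess.spawn(process.execPath, [
  '-e',
  'console.log("hello from child")'
]);

console.log('child pid: ' + child.pid);

// From the parent's side, child.stdout is a Readable stream.
child.stdout.on('data', function (data) {
  console.log('child stdout: ' + data.toString().trim());
});

child.on('exit', function (code) {
  console.log('child exited with code ' + code);
});
```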

Using the Cluster Module

Property	Description
`settings`	Contains the `exec`, `args`, and `silent` property values used to set up the cluster.
`isMaster`	Is `true` if the current process is the cluster master; otherwise, it is `false`.
`isWorker`	Is `true` if the current process is a worker; otherwise, it is `false`.
`setupMaster([settings])`	Accepts an optional settings object that contains `exec`, `args`, and `silent` properties. The `exec` property points to the worker JavaScript file. The `args` property is an array of parameters to pass, and `silent`, when `true`, keeps the workers' output from being sent to the master's standard I/O.
`disconnect([callback])`	Disconnects the IPC mechanism from the workers and closes the handles. The `callback` function is executed when the disconnect finishes.
`worker`	References the current `Worker` object in worker processes. This is not defined in the master process.
`workers`	Contains the `Worker` objects, which you can reference by ID from the master process. For example: cluster.workers[workerId]

Understanding the `Worker` Object

Property	Description
`id`	Represents the unique ID of this worker.
`process`	Specifies the `ChildProcess` object this worker is running on.
`suicide`	Is set to `true` when `kill()` or `disconnect()` is called on this worker. You can use this flag to determine whether you should break out of loops and shut down gracefully.
`send(message, [sendHandle])`	Sends a message to the master process.
`kill([signal])`	Kills the current worker process by disconnecting the IPC channel and then exiting. Sets the `suicide` flag to `true`.
`disconnect()`	When called in the worker, closes all servers, waits for the `close` event, and then disconnects the IPC channel. When called from the master, sends an internal message to the worker causing it to disconnect itself. Sets the `suicide` flag.
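To show how the cluster and Worker properties above fit together, here is a minimal sketch. The startCluster() helper is a hypothetical name; it forks workers from the master, and each worker reports its ID back over the IPC channel. Newer Node.js releases also expose isMaster under the name isPrimary.

```javascript
const cluster = require('cluster');

// Hypothetical helper: in the master, fork `count` workers;
// in a worker, report back to the master.
function startCluster(count) {
  if (cluster.isMaster) {
    for (let i = 0; i < count; i++) {
      const worker = cluster.fork();
      // Messages arrive from the worker's send() call below.
      worker.on('message', function (message) {
        console.log('worker ' + worker.id + ' reported pid ' + message.pid);
        worker.disconnect(); // ask the worker to shut down gracefully
      });
    }
    cluster.on('exit', function (worker) {
      console.log('worker ' + worker.id + ' exited');
    });
  } else {
    // cluster.worker is only defined inside worker processes.
    cluster.worker.send({ id: cluster.worker.id, pid: process.pid });
  }
}
```

To try it, call startCluster(2) at the bottom of the script. Each forked worker runs the same file again, so the same call takes the worker branch there.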
