Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

3. CNN and RNN Using PyTorch

Pradeepta Mishra¹

(1)

Bangalore, Karnataka, India

Probability and random variables are an integral part of computation in a graph-computing platform like PyTorch. Understanding probability and the associated concepts are essential. This chapter covers probability distributions and implementation using PyTorch, as well as how to interpret the results of a test. In probability and statistics, a random variable is also known as a stochastic variable , whose outcome is dependent on a purely stochastic phenomenon, or random phenomenon. There are different types of probability distribution, including normal distribution, binomial distribution, multinomial distribution, and the Bernoulli distribution. Each statistical distribution has its own properties.

Recipe 3-1. Setting Up a Loss Function

Problem

How do we set up a loss function and optimize it? Choosing the right loss function increases the chances of model convergence.

Solution

In this recipe, we use another tensor as the update variable, and introduce the tensors to the sample model and compute the error or loss. Then we compute the rate of change in the loss function to measure the choice of loss function in model convergence.

How It Works

In the following example, t_c and t_u are two tensors. This can be constructed from any NumPy array.

../images/474315_1_En_3_Chapter/474315_1_En_3_Figa_HTML.jpg

The sample model is just a linear equation to make the calculation happen and the loss function defined if the mean square error (MSE) shown next. Going forward in this chapter, we will increase the complexity of the model. For now, this is just a simple linear equation computation.

../images/474315_1_En_3_Chapter/474315_1_En_3_Figb_HTML.jpg

Let’s now define the model. The w parameter is the weight tensor, which is multiplied with the t_u tensor. The result is added with a constant tensor, b, and the loss function chosen is a custom-built one; it is also available in PyTorch. In the following example, t_u is the tensor used, t_p is the tensor predicted, and t_c is the precomputed tensor, with which the predicted tensor needs to be compared to calculate the loss function.

../images/474315_1_En_3_Chapter/474315_1_En_3_Figc_HTML.jpg

The formula w * t_u + b is the linear equation representation of a tensor-based computation.

../images/474315_1_En_3_Chapter/474315_1_En_3_Figd_HTML.jpg

The initial loss value is 1763.88, which is too high because of the initial round of weights chosen. The error in the first round of iteration is backpropagated to reduce the errors in the second round, for which the initial set of weights needs to be updated. Therefore, the rate of change in the loss function is essential in updating the weights in the estimation process.

../images/474315_1_En_3_Chapter/474315_1_En_3_Fige_HTML.jpg

There are two parameters to update the rate of loss function: the learning rate at the current iteration and the learning rate at the previous iteration. If the delta between the two iterations exceeds a certain threshold, then the weight tensor needs to be updated, else model convergence could happen. The preceding script shows the delta and learning rate values. Currently, these are static values that the user has the option to change.

../images/474315_1_En_3_Chapter/474315_1_En_3_Figf_HTML.jpg

../images/474315_1_En_3_Chapter/474315_1_En_3_Figg_HTML.jpg

This is how a simple mean square loss function works in a two-dimensional tensor example, with a tensor size of 10,5.

Let’s look at the following example. The MSELoss function is within the neural network module of PyTorch.

../images/474315_1_En_3_Chapter/474315_1_En_3_Figh_HTML.jpg

When we look at the gradient calculation that is used for backpropagation, it is shown as MSELoss.

../images/474315_1_En_3_Chapter/474315_1_En_3_Figi_HTML.jpg