Teacher forcing

As seen in the illustration above, when predicting the output at step n of the sequence, y(n), we use the previous output y(n-1) as the input to the LSTM. We then use the output from this time step to predict y(n+1).

The problem with doing this during training is that if the prediction for y(n-1) is wrong, the prediction for y(n) will be even more wrong: errors compound along the sequence. This cascade of compounding errors can make training very slow to converge.

A somewhat obvious solution to this problem is to replace the model's own prediction at each time step with the correct value at that time step. So, rather than feeding the LSTM its prediction for y(n-1), we would feed it the actual value of y(n-1) from the training set.

Using this idea, which is known as teacher forcing, can give the model's training process a substantial boost.
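To make this concrete, here is a minimal sketch of a training step that can toggle between the two input strategies. It is written in PyTorch, and everything in it (the decoder architecture, the dimension choices, the train_step helper) is illustrative rather than taken from the text:

```python
import torch
import torch.nn as nn

# Hypothetical next-token decoder: an embedding, an LSTM, and a
# projection back to the vocabulary. Sizes are arbitrary examples.
vocab_size, embed_size, hidden_size = 100, 32, 64

embedding = nn.Embedding(vocab_size, embed_size)
lstm = nn.LSTM(embed_size, hidden_size, batch_first=True)
to_logits = nn.Linear(hidden_size, vocab_size)
loss_fn = nn.CrossEntropyLoss()

def train_step(targets, teacher_forcing=True):
    """targets: LongTensor of shape (batch, seq_len) holding the true sequence."""
    batch, seq_len = targets.shape
    state = None
    inp = targets[:, :1]  # start from the true first token
    loss = 0.0
    for t in range(1, seq_len):
        out, state = lstm(embedding(inp), state)
        logits = to_logits(out[:, -1])            # (batch, vocab_size)
        loss = loss + loss_fn(logits, targets[:, t])
        if teacher_forcing:
            # Feed the ground-truth token y(t) as the next input.
            inp = targets[:, t:t+1]
        else:
            # Feed the model's own (possibly wrong) prediction back in,
            # so errors can compound as described above.
            inp = logits.argmax(dim=-1, keepdim=True)
    return loss / (seq_len - 1)

# Example usage with random data standing in for a real training batch:
targets = torch.randint(0, vocab_size, (8, 20))
loss = train_step(targets, teacher_forcing=True)
loss.backward()
```

The only difference between the two branches is which token becomes the next input; with teacher_forcing=True the loss at each step is computed against a correct history, which is what speeds up training.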

Teacher forcing can sometimes make it difficult for our model to robustly generate sequences outside of those seen in training, since the model never learns to recover from its own mistakes (it only ever sees correct inputs during training), but in general the technique is helpful.
