This Apress imprint is published by the registered company APress Media, LLC, part of Springer Nature.
The registered company address is: 1 New York Plaza, New York, NY 10004, U.S.A.
This book takes the reader through the journey of natural language processing, starting from n-gram models and neural network architectures such as RNNs before moving to today's state-of-the-art technology: the transformer. The book details the transformer architecture, focusing on the self-attention mechanism that is the foundation of the transformer concept.
The book covers transformers in depth, with examples from different NLP areas such as text generation, sentiment analysis, zero-shot learning, and text summarization. It takes a deep dive into the Hugging Face APIs and their usage to create simple Gradio-based applications, covering not only how to use pretrained models but also how to fine-tune existing models with your own datasets.
We cover models such as BERT, GPT-2, and T5, and showcase how these models can be used directly to create a wide range of applications in natural language processing and understanding.
The book doesn't limit its exploration of transformers to NLP: it also covers, at a high level, how transformers are being used in areas such as computer vision.
All source code used in this book can be found at github.com/apress/intro-transformers-nlp.