Contents

About This Book

About The Authors

Acknowledgments

Chapter 1 Introduction to Text Analytics

Overview of Text Analytics

Text Mining Using SAS Text Miner

Information Retrieval

Document Classification

Ontology Management

Information Extraction

Clustering

Trend Analysis

Enhancing Predictive Models Using Exploratory Text Mining

Sentiment Analysis

Emerging Directions

Handling Big (Text) Data

Voice Mining

Real-Time Text Analytics

Summary

References

Chapter 2 Information Extraction Using SAS Crawler

Introduction to Information Extraction and Organization

SAS Crawler

SAS Search and Indexing

SAS Information Retrieval Studio Interface

Web Crawler

Breadth First

Depth First

Web Crawling: Real-World Applications and Examples

Understanding Core Component Servers

Proxy Server

Pipeline Server

Component Servers of SAS Search and Indexing

Indexing Server

Query Server

Query Web Server

Query Statistics Server

SAS Markup Matcher Server

Summary

References

Chapter 3 Importing Textual Data into SAS Text Miner

An Introduction to SAS Enterprise Miner and SAS Text Miner

Data Types, Roles, and Levels in SAS Text Miner

Creating a Data Source in SAS Enterprise Miner

Importing Textual Data into SAS

Importing Data into SAS Text Miner Using the Text Import Node

%TMFILTER Macro

Importing XLS and XML Files into SAS Text Miner

Managing Text Using SAS Character Functions

Summary

References

Chapter 4 Parsing and Extracting Features

Introduction

Tokens and Words

Lemmatization

POS Tags

Parsing Tree

Text Parsing Node in SAS Text Miner

Stemming and Synonyms

Identifying Parts of Speech

Using Start and Stop Lists

Spell Checking

Entities

Building Custom Entities Using SAS Contextual Extraction Studio

Summary

References

Chapter 5 Data Transformation

Introduction

Zipf’s Law

Term-By-Document Matrix

Text Filter Node

Frequency Weightings

Term Weightings

Filtering Documents

Concept Links

Summary

References

Chapter 6 Clustering and Topic Extraction

Introduction

What Is Clustering?

Singular Value Decomposition and Latent Semantic Indexing

Topic Extraction

Scoring

Summary

References

Chapter 7 Content Management

Introduction

Content Categorization

Types of Taxonomy

Statistical Categorizer

Rule-Based Categorizer

Comparison of Statistical versus Rule-Based Categorizers

Determining Category Membership

Concept Extraction

Contextual Extraction

CLASSIFIER Definition

SEQUENCE and PREDICATE_RULE Definitions

Automatic Generation of Categorization Rules Using SAS Text Miner

Differences between Text Clustering and Content Categorization

Summary

Appendix

References

Chapter 8 Sentiment Analysis

Introduction

Basics of Sentiment Analysis

Challenges in Conducting Sentiment Analysis

Unsupervised versus Supervised Sentiment Classification

SAS Sentiment Analysis Studio Overview

Statistical Models in SAS Sentiment Analysis Studio

Rule-Based Models in SAS Sentiment Analysis Studio

SAS Text Miner and SAS Sentiment Analysis Studio

Summary

References

Case Studies

Case Study 1 Text Mining SUGI/SAS Global Forum Paper Abstracts to Reveal Trends

Introduction

Data

Results

Trends

Summary

Instructions for Accessing the Case Study Project

Case Study 2 Automatic Detection of Section Membership for SAS Conference Paper Abstract Submissions

Introduction

Objective

Step-by-Step Instructions

Summary

Case Study 3 Features-based Sentiment Analysis of Customer Reviews

Introduction

Data

Text Mining for Negative App Reviews

Text Mining for Positive App Reviews

NLP Based Sentiment Analysis

Summary

Case Study 4 Exploring Injury Data for Root Causal and Association Analysis

Introduction

Objective

Data Description

Step-by-Step Instructions

Part 1: SAS Text Miner

Part 2: SAS Enterprise Content Categorization

Summary

Case Study 5 Enhancing Predictive Models Using Textual Data

Data Description

Step-by-Step Instructions

Summary

Case Study 6 Opinion Mining of Professional Drivers’ Feedback

Introduction

Data

Analysis Using SAS® Text Miner

Analysis Using the Text Rule-builder Node

Summary

Case Study 7 Information Organization and Access of Enron Emails to Help Investigation

Introduction

Objective

Step-by-Step Software Instruction with Settings/Properties

Summary

Case Study 8 Unleashing the Power of Unified Text Analytics to Categorize Call Center Data

Introduction

Data Description

Examining Topics

Merging or Splitting Topics

Categorizing Content

Concept Map Visualization

Using PROC DS2 for Deployment DEPLOYMENT

Integrating with SAS® Visual Analytics

Summary

Case Study 9 Evaluating Health Provider Service Performance Using Textual Responses

Introduction

Summary

Index

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.143.5.15