Home Page Icon
Home Page
Table of Contents for
Cover
Close
Cover
by Marshall Presser
Data Warehousing with Greenplum, 2nd Edition
Foreword to the Second Edition
Foreword to the First Edition
Preface
Why Are We Rewriting This Book?
Why Did We Write This Book in the First Place?
Who Are the “We”?
Who Should Read This Book?
What the Book Covers
What It Doesn’t Cover
Where You Can Find More Information
How to Read This Book
Acknowledgments
1. Introducing the Greenplum Database
Problems with the Traditional Data Warehouse
Responses to the Challenge
A Brief Greenplum History
What Is Massively Parallel Processing?
The Greenplum Database Architecture
Master and Standby Master
Segments and Segment Hosts
Private Interconnect
Mirror Segments
Additional Resources
Greenplum Documentation
Greenplum Best Practices Guide
Greenplum Cluster Concepts Guide
PivotalGuru (Formerly Greenplum Guru)
Pivotal Greenplum Blogs
Greenplum YouTube Channel
Greenplum Knowledge Base
Greenplum.org
Other Sources
2. What’s New in Greenplum?
What’s New in Greenplum 5?
What’s New in Greenplum 6?
Additional Resources
3. Deploying Greenplum
Custom(er)-Built Clusters
Greenplum Building Blocks
Public Cloud
Private Cloud
Greenplum for Kubernetes
Choosing a Greenplum Deployment
Additional Resources
4. Organizing Data in Greenplum
Distributing Data
Polymorphic Storage
Partitioning Data
Orientation
Compression
Append-Optimized Tables
External Tables
Indexing
Additional Resources
5. Loading Data
INSERT Statements
COPY Command
The gpfdist Process
The gpload Tool
Additional Resources
6. Gaining Analytic Insight
Data Science on Greenplum with Apache MADlib
What Is Data Science and Why Is It Important?
Common Data Science Use Cases
Apache MADlib
Scale and Performance
Familiar SQL Interface
Algorithm Design
R Interface
Deep Learning
Text Analytics
Brief Overview of GPText Architecture
Configuring Solr/GPText
Defining Your Analysis and Performing Text Searches
Administering GPText
Additional Resources
7. Monitoring and Managing Greenplum
Greenplum Command Center
Workload Management
Resource Queues
Resource Groups
Greenplum Management Tools
Basic Command and Control
System Health
Disaster Recovery and Data Replication
Operations and System Management
Other Tools
Additional Resources
8. Accessing External Data
dblink
Foreign Data Wrappers
Platform Extension Framework
Greenplum Stream Server
Greenplum-Kafka Integration
Greenplum-Informatica Connector
GemFire-Greenplum Connector
Greenplum-Spark Connector
Amazon S3
External Web Tables
Additional Resources
9. Optimizing Query Response
Fast Query Response Explained
GPORCA Recent Accomplishments
Additional Resources
Search in book...
Toggle Font Controls
Playlists
Add To
Create new playlist
Name your new playlist
Playlist description (optional)
Cancel
Create playlist
Sign In
Email address
Password
Forgot Password?
Create account
Login
or
Continue with Facebook
Continue with Google
Sign Up
Full Name
Email address
Confirm Email Address
Password
Login
Create account
or
Continue with Facebook
Continue with Google
Next
Next Chapter
Data Warehousing with Greenplum
Add Highlight
No Comment
..................Content has been hidden....................
You can't read the all page of ebook, please click
here
login for view all page.
Day Mode
Cloud Mode
Night Mode
Reset