Index

A

Administration
clustered tables
database objects
databases
data share
parameters
warehouses
Advanced Encryption Standard (AES)
Agile data warehousing
ALTER WAREHOUSE command
Alteryx
Amazon Web Services (AWS)
Analytical ecosystem
Apache MLflow
Apache Spark
cloud providers
components
connector
data flow process
key features
stages
data scientists
machine learning
optimal strategy
vs . Snowflake
AVRO file
generation
JSON sample file
loading data
metadata
schema
working
AWS Snowball
Azure Data Box
Azure Databricks
connecting Snowflake
creation
data
delta caching
environment
notebook
Spark cluster

B

Batch method
Bulk data loading
compression methods
COPY statement
encoding
encryption
file formats
staging area
storage locations
user interface, loading
Business intelligence (BI)

C

Cloud computing
deployment models
key terms
modern bandwidth
role of hypervisor
service models
Shared responsibility model, AWS
virtualization
COPY command
COPY INTO table statement
CREATE SHARE command

D

Database management commands
Databricks
elements
components
Data clustering
Data Definition Language (DDL)
Data Manipulation Language (DML)
DataRobot
Data science tool
Data sharing
Datasource API
Data system
lifecycle
retention period
Data warehouse
Data warehouse as a service (DWaaS)
Data warehouse (DW)
Data warehouse migration
architecture
business analytics
cloud analytics
goal
on-premise analytics
organizational part
data
deadlines/budget
development/deployment process
documentation
migration approach
outcomes
repoint tools
run
security
Snowflake
test plan
overview
technical part
Delta Lake
Discretionary access control (DAC)

E

Extract-load-transform (ELT)
Extract-transform-load (ETL) processing

F

File preparation, bulk data
CSV files
file sizing
semistructured data
splitting files
File staging
ETL processing
logical paths
named stage
staged data

G, H, I

Google Cloud Platform (GCP)
GraphX

J, K

JSON format
extracting attributes
FLATTEN function
NASDAQ
Snowflake
SQL
table
tree structure

L

Loading data files

M, N

Machine learning
Managed Streaming for Kafka (MSK)
Massive parallel processing (MPP)
data management solution
data mining techniques
principles
Redshift
vs. SMP
Snowflake
Materialized views (MVs)
benefit
data manipulation language
similarities and differences
Matillion ETL
ML Libraries
Modern solution architecture
Multicluster virtual warehouses
Multifactor authentication (MFA)

O

Optimized Row Columnar (ORC)

P

Parquet file
creating metadata
CSV sample file
PyArrow
transforming data
uploading data and copying to target table
working
Partition pruning
Pattern matching
Penetration testing
Planning
cloud provider
limitations
regions
pricing model
cloud storage
virtual warehouse size
Snowflake editions
tools
JDBC
ODBC
SnowSQL
web interface
Pushdown optimization

Q

Qubola
Querying staged files

R

Real-world project
big data
challenges
DW architecture
ETL tool
streaming
Tableau
Regions
Resource consumption, administration
data storage, usage
data transfer, usage
usage permissions
VWs, usage
Role-based access control (RBAC)
Roles and users, administration
access control models
account menu
create user
hierarchy, role
marketing role
role, commands
Snowflake account
user, commands
R Studio

S

Schema-on-read approach
Security reference architecture
account and user authentication
AES
audit and logging
history audit functions
penetration tests
query history audit logs functions
Query Profiler
layers
network and site access
object security
physical security
validations
VPC
Semistructured data
data types
file formats
schema-on-read approach
Shared responsibility model (SRM)
Snowflake
account creation
architecture
aspects
cloud providers
connection
data sharing
benefits
process
DLL
ETL processing
internal/external stages
JDBC driver
key layers
planning
SeePlanning
scalability
table
consumer account
metadata
steps
stock data
VW
SeeVirtual warehouses (VWs)
web interface
SeeWeb interface, Snowflake
XML
Snowflake partner ecosystem
connect page
drivers
Matillion ETL
creation
definition
key elements
modern solution architecture
objects
tables
Tableau
best-of-breed technologies
connection window
data
desktop connection
sign in
SQL query
visualization
Snowflake Spark Connector
Snowpipe
benefits
options
Snowpipe Auto-Ingest
data pipeline
CloudWatch logging
Kinesis Firehose delivery stream
PUT statement
S3 bucket
SQS
stream events
testing
Snowpipe REST API
SnowSQL
commands
installation
curl commands
Downloads dialog
introduction screen
platform-specific versions
Summary tab
load data
multiple sessions
SnowSQL configuration
connection settings
variables
active session
command line
config file
Spark dataframe
SqlDBM model
SQLDbm tool
Stream method

T

Tableau
Time travel feature
data retention parameter
sample table
table clone
table creation
table state
Time travel SQL extension
process
statements
work

U

UNDROP DATABASE
USE ROLE command
USE WAREHOUSE command

V

Virtual private cloud (VPC)
Virtual Private Snowflake (VPS)
Virtual warehouses (VWs)
building
creating
load monitoring
query statuses
start/resume, suspend, and resize
TESTWAREHOUSE
USE WAREHOUSE command
caching impacts
multicluster
choosing minimum and maximum number
credits and usage
query design
scaling
sizes and features
choosing right size
concurrency
USE WAREHOUSE command

W

Warehouse commands
Web interface, Snowflake
databases page
help menu
history page
partner connect page
shares page
user preferences menu
warehouses page
worksheets page

X, Y

XML format
built-in functions
choosing warehouse
downloading source file
example
extracting values
file creation
LATERAL FLATTEN table function
loading data into table
selecting load options
structure
XMLGET function

Z

Zero-Copy cloning
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.142.42.176