Index
A
B
C
- Cassandra
- classifications, with Naïve Bayes
- closeness centrality algorithm
- Cloudera
- cluster design, Apache Spark / Cluster design
- clustering, with K-Means
- cluster management
- cluster management, Databricks
- connected components algorithm
D
- dashboards / Overview
- data
- databases / Overview
- Databricks
- Databricks file system (DBFS) / The table data
- Databricks tables
- DataFrames
- data sources, Apache Spark streaming
- DataStax Spark Cassandra connector
- data visualization
- DBFS
- dbutils.fs class
- dbutils package
- deep learning
- development environments, Databricks
- discrete stream (DStream) / Overview
- Docker
E
- end of file markers (EOF) / Using Cassandra
- environment, H2O
- environment configuration, MLlib
- Extract, Transform, Load (ETL)
F
G
H
J
- JavaScript Object Notation (JSON) files
- jobs
K
L
- LabeledPoint
- libraries
- local Hive Metastore server
M
- markdown
- Mazerunner, for Neo4j
- Mazerunner algorithms
- MLlib
- MNIST
N
O
- OOM (Out of Memory) messages / Memory
- Oryx system
P
- P (Spam|Buy) / Theory
- PageRank algorithm
- Parquet files
- performance
- PostgreSQL connector library
- PredictionIO
R
S
T
U
- user-defined functions (UDFs)
V