Index

A

Active learning
Adamic-adar index
Algorithm
Alpha Vantage’s API
Anaconda Distribution
managing packages
Miniconda
reproducing environments
Spyder launch
Apache Arrow
Apache Parquet
Application programming interfaces (APIs)
CDCs
fibonacci numbers
generic sequencer function
law of diminishing returns
roles
AToughGame problem
attach_model function
Augmented ball descend
problem specification
version 1.1
boundaries and movement
path finding engine
retrospectives
satellite image dataset
version 1.2
enhancing input subsystem
enhancing output subsystem
retrospective
version 1.3
approaches
baseline
nonrecursive simple path finder
ParallelSimplePathFinder class
performance optimization
retrospective
reuse-based software engineering characteristics
test terrain function
Average clustering coefficient

B

Betweenness centrality
Ball Descend project
data model
code
matrix function
NumPy
print statement
problem specification
simulator
test automation
Bernoulli’s probability distribution
Bias-variance trade-off
Big Data
four Vs
LHC
MOOC
old vs. modern data science projects
Bipartite graphs
application actor
bipartite.py module
IoT platform
rated relation
Boiling frog attack
Boolean indexing technique
Brown Cow model
Brute-force program

C

Closeness centrality
Chaos domain
Click-through rate (CTR)
Closest pair algorithm
API-centric and object-oriented development
Euclidian distance
interactive information radiators
dashboards
DSL
tabular presentation data
version 1.0
Brute-Force implementation
SnakeViz
version 2.0, divide-and-conquer technique
version 3.0
FastClosestPair Class
sorting functions
visualize calling hierarchy
Collect_mse inner function
combine function
Community common neighbors
Community resource allocation
Complex domain
Complicated domain
Concepts/techniques, machine learning
collinearity
data_generator.py module
error terms
evaluate_model function
features vs. outputs
gaussian random variable
MSE
normal distribution
observer.py module
overfitting
real world process
regularization
residuals plot
demo_residuals function
evaluate_model function
fit method
set_params() method
runtime model
scikit-learn framework
session.py module
trivial training process
underfitting
warning module
%connect_info magic command
Consumer-driven contracts (CDCs)
Cross-validation (CV) score
Cyclomatic complexity
ascending order
built-in function
execution flows
global evaluation score
sorting routine
spyder
Cynefin domains, quadrants
Cynefin framework

D

Dask array
Dask dataframe
Dask delivers
Dask framework
Data engineers
DataLoader class
Data modification attack
Data preparation
Data preprocessing, financial model
features
AAPL price log
data_preprocessing.py Module
data_visualization.py module
driver.py module
histograms
log returns
normalization
outliers
returns
volatility
IPython console
Pandas data frame
style parameter
time series
AAPL closing levels
moving average
scaling
timestamp
Data processing
abstractions vs. latent features
augmented ball descend case study
SeeAugmented ball descend
compressing rating matrix
Data science project
Brown Cow model
case study (Cholera)
learning
domain knowledge
domain-related terms
programming experience
old vs. modern
phases
Data security
backup procedure
collection
disclosure
incident regarding e-mail
inference
Microsoft security development lifecycle
OWSP
phases
problems
suspicious e-mails
tool samples
unsecured connection
Data streaming
Data type
Data visualization
closest pair case study
SeeClosest pair algorithm
matplotlib architecture, case study
geographic map
higher-level components
plotting temperatures
Data wrangling
Deep learning
Degree centrality
demo_metrics_and_mse function
demo_regularization function
Digit, counting
Disorder
Divide-and-conquer technique
Divisible numbers, counting
Document structure
abstract
data science life cycle phases
dataset
drawbacks
future work
motivation
project
SeeWikipedia edits project
references
Domain-specific language (DSL)
driver.py script
program, restructure
reusability

E

E-Commerce customer segmentation
CSV format
EDA
marketing domain
project in Spyder
SeeSpyder project
structured data
Edge betweenness centrality
Euclidian distance
European economic area (EEA)
expectedGain function
Exploratory data analysis (EDA)
extract_config method

F

Feature engineering
column types
custom
Age_Group
bar plots
CTR
multilevel data frame
NaN
nonzero method
number of clicks
defined
describe method
dummy variables
e-commerce
logged-out category
pandas package
scatter plots
SciPy
summarize function
Filter bubble
Financial modeling
data preprocessing
SeeData preprocessing, financial model
data retrieval
feature engineering
correlation matrix
create features
feature_engineering.py Module
heat map plotting
log returns
mean reversion
scatter plot
TA-Lib
target feature
time series analysis
timestamping
find_path method
Fixing bug, software methods
agile methods
correct version, code
language of business
defect code
improved fix
unit test
vectorized version
business-associated arguments
classical arguments
NumPy framework
Full-batch learning
Funny elevator case study
divide and conquer paradigm
socio-economic/socio-technical aspects
testing
unoptimized variant

G

General data protection regulation (GDPR)
absolutistic approach
architecture
attribute-based access control model
behavioral patterns
controllers/processors, regulation
data breach
DevOps paradigm
domain-driven design
EU-based organizations
fundamental rights
health care system
lawful processing
microservices
remote access
requests
risk management
security measures
generate_points method
Geocoding API
GitHub repository
Global Historical Climatology Network-Daily (GHCN-Daily)
Graph analysis
bipartite graphs
quality attributes
social networks
usage matrix
built-in matplotlib engine
generic template
Graphviz
multigraph
NetworkX
optimization task
sample instance
UCs
UML use case diagram
usage_matrix.py Module, consent

H

High-dimensional datasets
Homoscedastic outputs

I

in_degree method
Intelligent machines
IPyWidgets
i-th shadow model

J

Jaccard coefficient
JupyterHub
JupyterLab
Anaconda Navigator
code execution
abrupt stoppage
cells
doctest tests
error message
Hanoi Tower Solver
HTML
notebook
output
project
SeeBall Descend project
screen
simulator notebook
Jupyter Notebook
Jupyter project
notebook
principal components
tools
Jupyter widgets and notebook extensions

K

Kafka
Kaggle
KDD Cup 1999 Data
Kernel
K-fold CV

L

Local clustering coefficient (LCC)
Label flipping
Label modification attack
Large Hadron Collider (LHC)
Large-scale software system
adaptive maintenance
corrective maintenance
definition
fixing bugs
holistic approach
knowledge areas
life cycle model
preparation/planning phase
preventive correction
preventive maintenance
scope
tournament chess game, analogy
types
Las Vegas model
Lehman’s laws of software evolution
LensKit for Python (LKPY)
extract_config method
Fallback class
knn wrapper package
lkpy_demo.py file
MultiEval facility
nDCG Top-N accuracy metric
package structure
PMML serialized models
README.txt file
SciPy style
UML class diagram
UserUser algorithm
Linear regression
nonparametric
parametric
semiparametric
%load lkpy_demo.py
Logistic regression
Log returns

M

Machine-based feature engineering
implementation from scratch
implementation with PyTorch
Machine learning
big data
concepts/techniques
SeeConcepts/techniques, machine learning
methods
styles
Machine learning as a service (MLaaS)
Manual feature engineering
aggregated state
combine function
Massive open online course (MOOC)
Mean squared error (MSE)
Membership inference attack
black-box
MLaaS
overfitted model
training dataset
white-box
Mini-batch learning
Miniconda distribution
Monstrous models

N

Naive algorithm
Naming abstractions
nbconvert
nbviewer
NetworkX
next_neighbor method
Not a Number (NaN)
NumPy package

O

Old vs. Modern data science projects
Online learning
Open-source software (OSS)
OpenStreetMap
Open triads
Open Web Application Security Project (OWASP)

P

pairplot function
Path finder
find_path function
terrain
top-down decomposition
wall function
Perl module
plot_mse function
Poisoning attack
categories
linear regression model
Poison insertion attack
Policy authoring point (PAP)
Policy decision point (PDP)
Policy enforcement point (PEP)
Policy information point (PIP)
Prediction accuracy metrics
Predictive Model Markup Language (PMML)
predict_proba methods
Preferential attachment
Principal component analysis (PCA) method
Privacy, meaning
Problem-solving ability
Public data sources
data analysis
Internet, accessible
%pwd magic command
scikit-learn

Q

Quandl
Quantitative data

R

read_csv function
Recommender systems
machine learning, usage
mashup movie
JSON response
OMDb service
simple_movie_recommender.py Module
tastedrive_service.py Module
MovieLens
categories
collaborative filtering
content-based
context-based systems
information retrieval system
interfaces
relational database system
types
untracked option
standard root mean squared error
Reinforcement learning
requests package
Resilient distributed dataset (RDD)
Ridge regression
runtime model

S

Scalable graph loading
CSV file
load graph
nodes.csv file, content
standard edge list format
Scaled percent returns
Scatter matrix plot
SciPy ecosystem
Semi-supervised learning
Shadow training
attack model
highest-scored label
overall structure
target system
test sets
Simple domain
Simple moving average (SMA)
sklearn.feature_selection module
Social network
centrality
eccentricity
karate club network
LCC
local/global graph metrics
machine learning systems
NetworkX’s documentation
prediction measures
single metric graph
six degrees of separation
Socio-∗ pieces of software production
socio-economic aspects
socio-technical aspects
Software engineering
agile principles
application
bug free legacy code
context awareness and communication
bug
SeeFixing bug, software engineering
cone of uncertainty
constant feedback loop
context and knowledge oblivious
cyclomatic complexity
knowledgeable
conventions
faculty code
large-scale software systems
legacy code
Python ecosystem
quality assurance tools
risk
rules and principles
Software engineering vs. data science
Source lines of code (SLOC)
Spyder project
chunk data
code restructure, CSV file
dataset
download
feature, association
SeeFeature engineering
file sizes
IPython console
shell-related magic commands
driver code
folder structure
nyt_data.py Script
results
Apache Parquet
driver.py
multilevel index
Parquet engine
summarize function
test pipeline
click-through rate
get_file_number function
number of clicks
traverse function
UI
Staged processing
Stochastic gradient descent
Streaming linear regression
Apache Spark’s MLlib framework
driver.py Module
RDD
setIntercept method
streaming_regression.py Module
training and test streams
StreamingLinearRegressionWithSGD class
Streaming system
Stream processing
Structured data
style parameter
Sum of squared errors (SSE)
Supervised learning
Systems/software development life cycle (SDLC)

T

TA-Lib
Total cost of ownership (TCO)
Transitivity
t-SNE method
features and components
power
variables
Turing test setup

U

UCI Machine Learning Repository
Unbiased minimal variance estimator
Unsupervised learning
Use cases (UCs)

V

VarianceThreshold class
Variety
Velocity
Veracity
Visualization
Volume

W, X, Y

Wikipedia edits project
abstract
data
drawbacks
JupyterLab notebook
motivation
specification fixing

Z

zipfile package
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.143.255.36