M. Paluszek, S. Thomas, MATLAB Recipes, https://doi.org/10.1007/978-1-4842-6124-8_1

1. Coding Handbook

Michael Paluszek¹ and Stephanie Thomas²

(1) Princeton, NJ, USA

(2) Princeton Junction, NJ, USA

The purpose of this chapter is to provide an overview of MATLAB syntax and programming, highlighting features that may be underutilized by many users and noting important differences between MATLAB and other programming languages and IDEs. You should also become familiar with the very detailed documentation that is available from the MathWorks in the help browser. The Language Fundamentals section describes entering commands, operators, and data types.

MATLAB has matured a lot in the last two decades from its origins as a linear algebra package. Originally, all variables were double precision matrices. Today, MATLAB provides additional variable types such as integers and data structures, supports object-oriented programming with classes, and integrates with Java. The MATLAB application is a full IDE with an integrated editor, debugger, command history, and code analyzer and report capabilities. Engineers who have been working with MATLAB for many years may find that they are not taking advantage of the full range of capabilities now offered, and in this text we hope to highlight the more useful new features.

The first part of this chapter provides an overview of the most commonly used MATLAB types and constructs. We’ll then provide some recipes that make use of these constructs to show you some practical applications of modern MATLAB.

MATLAB Language Primer

Brief Introduction to MATLAB

MATLAB is both an application and a programming language. It was developed primarily for numerical computing and is widely used in academia and industry. MATLAB was originally developed by a college professor in the 1970s to provide easy access to linear algebra libraries, and the MathWorks was founded in 1984 to continue the development of the product. The name is derived from MATrix LABoratory. Today, MATLAB uses the LAPACK libraries for the underlying matrix manipulations. Many toolboxes are available for different engineering disciplines; in this book, we will focus only on features available in the base MATLAB application.

The MATLAB application is a rich development environment for the MATLAB language. It provides an editor, command terminal, debugger, plotting capabilities, creation of graphical user interfaces, and more recently the ability to install third-party apps. MATLAB can interface with other languages including FORTRAN, C, C++, Java, and Python. A code analyzer and profiler are built-in. Extensive online communities provide forums for sharing code and asking questions.

The main components of the MATLAB application are

Command Window: Terminal for entering commands and operating on variables in the base workspace. The MATLAB prompt is >>.
Command History: List of previously executed commands.
Workspace display: List of the variables and their values in the current workspace (application memory). Variables remain in the memory once created until you explicitly clear them or close MATLAB.
Current Folder: File browser displaying contents of the current folder and providing file system navigation. Recent versions of MATLAB can also display SVN status on configuration managed files.
File details: Panel displaying information on the file selected in the Current Folder panel.
Editor: Editor for m-files with syntax coloring and a built-in debugger. This can also display any type of text file and will recognize and appropriately color other languages including Java, C/C++, and XML/HTML.
Variables editor: Spreadsheet-like graphical editor for variables in the workspace.
App Designer: Application development window.
Help browser: Searchable help documentation on all MATLAB products and third-party products you have installed.
Profiler: Tool for timing code as it runs.

These components can be docked in various configurations. The default layout of the main application window or desktop contains the first five components listed earlier and is shown in Figure 1.1. The Command Window is in the center. The upper-left panel shows a file browser with the contents of the Current Folder. Under this is a file information display. On the right-hand side is the Workspace display and the Command History panel. The base workspace is all the variables currently in the application memory. Commands from the history can be dragged onto the command line to be executed, or double-clicked. The extensive toolbar includes buttons for running the code analyzer and opening the code profiler and the help window, as well as typical file and data operations. Note the PLOTS and APPS tabs above the toolbar. The PLOTS tab allows the graphical creation and management of plots from data selected in the workspace browser. The APPS tab allows you to access and manage third-party apps that you install.

../images/335353_2_En_1_Chapter/335353_2_En_1_Fig1_HTML.jpg — Figure 1.1
MATLAB desktop with the Command Window.

You can rearrange the components in the application window, moving, resizing, or hiding them, and save your own layouts. You can “undock” any component, moving it to its own window. You can also revert to the default layout at any time or choose from several other available configurations. You can also hide the toolstrip to get more real estate for your windows. Each version adds new capabilities for customizing your interface, so explore what’s new!

The editor with the default syntax coloring is shown in Figure 1.2, with a file from this chapter shown. The horizontal lines show the division of the code into “cells” using a double-percent sign, which can be used for sequential execution of code and for creating sections of text when publishing. The cell titles are bolded in the editor. MATLAB keywords are highlighted in blue, comments in green, and strings in pink. The toolbar includes buttons for commenting code, indenting, and running or debugging the code. The “Go To” pop-up menu gives access to subfunctions within a large file (see Section 1.10). Note the PUBLISH and VIEW tabs with additional features on publishing, covered in the next chapter, and options for the editor view.

../images/335353_2_En_1_Chapter/335353_2_En_1_Fig2_HTML.jpg — Figure 1.2
MATLAB file editor.

The last window we will show is the help browser in Figure 1.3. MATLAB has extensive help including examples and links to online videos and tutorials. Third-party toolboxes can also install help into this browser. As in any browser, you can have multiple tabs open, there is a search utility, and you can mark favorite topics. We will refer to topics available in the help browser throughout this book.

../images/335353_2_En_1_Chapter/335353_2_En_1_Fig3_HTML.jpg — Figure 1.3
MATLAB help window.

Everything Is a Matrix

By default, all variables in MATLAB are double precision matrices. You do not need to declare a type for these variables. Matrices can be multidimensional and are accessed using one-based indices via parentheses. You can address elements of a matrix using a single index, taken column-wise, or one index per dimension. Use square brackets to enclose the matrix data and semicolons to mark the end of rows. Use a final semicolon to end the line, or leave it off to print the result to the command line. To create a matrix variable, simply assign a value to it, like this 2x2 matrix a and 2x1 matrix b:

../images/335353_2_En_1_Chapter/335353_2_En_1_Figc_HTML.jpg

You can simply add, subtract, multiply, and divide matrices with no special syntax. The matrices must be the correct size for the linear algebra operation requested. A transpose is indicated using a single quote suffix, A', and the matrix power uses the operator ^.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figd_HTML.jpg
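As a minimal sketch of the matrix syntax described above (the values are illustrative and are not taken from the original listings), the following creates a 2x2 and a 2x1 matrix and applies the basic operators and both indexing styles:

```matlab
% Create a 2x2 matrix and a 2x1 matrix; semicolons inside the brackets end rows
a = [1 2; 3 4];
b = [5; 6];

c   = a*b;      % matrix product, 2x1: [17; 39]
aT  = a';       % transpose: [1 3; 2 4]
aSq = a^2;      % matrix power, same as a*a: [7 10; 15 22]

% Indexing is one-based
e21 = a(2,1);   % one index per dimension: row 2, column 1 -> 3
e3  = a(3);     % single index, counted column-wise -> 2
```

Leaving off the trailing semicolon on any of these lines prints the result in the Command Window.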

By default, every variable is a numerical variable. You can initialize matrices to a given size using the zeros, ones, eye, or rand functions, which produce zeros, ones, identity matrices (ones on the diagonal), and random numbers, respectively. Use isnumeric to identify numeric variables. Table 1.1 shows key matrix functions.

Table 1.1

Key Functions for Matrices

Function	Purpose
zeros	Initialize a matrix to zeros
ones	Initialize a matrix to ones
eye	Initialize an identity matrix
rand, randn, randi	Initialize a matrix of random numbers
isnumeric	Identify a matrix or scalar numeric value
isscalar	Identify a scalar value (a 1 x 1 matrix)
size	Return the size of the matrix

Strings Are Simple

Character arrays are defined using single quotes. They can be concatenated using the same syntax as matrices, namely, square brackets. They are indexed the same way as matrices. Here is a short example of character array manipulation:

../images/335353_2_En_1_Chapter/335353_2_En_1_Fige_HTML.jpg
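A short sketch of character array handling, with illustrative values rather than those of the original listing, might look like this:

```matlab
s1 = 'Hello';
s2 = 'World';

s     = [s1 ' ' s2];   % concatenate with square brackets: 'Hello World'
first = s(1:5);        % index like a matrix: 'Hello'
n     = length(s);     % number of characters: 11
tf    = ischar(s);     % true
te    = isempty('');   % true, '' is a 0x0 character array
```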

Use ischar to identify character variables. Also note that isempty returns TRUE for an empty array, that is, ''.

Since R2016b, MATLAB has also provided a string type, defined using double quotes. Some newer functions are designed to operate specifically on strings, but most work on both text types. If you concatenate strings using square brackets, they are maintained as separate elements in an array rather than combined as character arrays are. To append strings, use the “+” operator (see Recipe 1.5). isempty returns FALSE for a string with no characters, that is, ""; this creates a 1-by-1 string with no characters rather than an empty array.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figf_HTML.jpg
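The difference between the two text types can be sketched as follows. The values are illustrative, and the double-quote literal syntax assumes R2017a or later:

```matlab
s1 = "Hello";
s2 = "World";

arr = [s1 s2];           % 1x2 string array, the elements stay separate
s   = s1 + " " + s2;     % append with +: "Hello World"
n   = strlength(s);      % 11 characters
tf  = isempty("");       % false, "" is a 1x1 string
tz  = strlength("") == 0;  % true, test the character count instead
```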

For a description of string syntax, type help strings at the MATLAB command line, and for a comprehensive list of string and character functions, type help strfun. Table 1.2 shows a selection of key string functions.

Table 1.2

Key Functions for Strings

Function	Purpose
ischar	Identify a character array
isstring	Identify a string
char	Convert integer codes or cell array to character array
sprintf	Write formatted data to a string
strcmp, strncmp	Compare strings
strfind	Find one string within another
num2str, mat2str	Convert a number or matrix to a string
lower	Convert a string to lowercase
contains	Search for patterns in string arrays
split	Split strings at whitespace

Use Strict Data Structures

Data structures in MATLAB are highly flexible, leaving it up to the user to enforce consistency in fields and types. You are not required to initialize a data structure before assigning fields to it, but it is a good idea to do so, especially in scripts, to avoid variable conflicts.

Replace

../images/335353_2_En_1_Chapter/335353_2_En_1_Figg_HTML.jpg

with

../images/335353_2_En_1_Chapter/335353_2_En_1_Figh_HTML.jpg

In fact, we have found it is generally a good idea to create a special function to initialize larger structures that are used throughout a set of functions. This is similar to creating a class definition. Generating your data structure from a function, instead of typing out the fields in a script, means you always start with the correct fields. Having an initialization function also allows you to specify the types of variables and provide sample or default data. Remember, since MATLAB does not require you to declare variable types, doing so yourself with default data makes your code that much clearer.

TIP

Create an initialization function for data structures.
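A minimal sketch of such an initialization function is shown here as a script with a local function, which assumes R2016b or later. The function name DefaultDataStructure and its fields are hypothetical, chosen only for illustration:

```matlab
% Always start from the initialization function, then override defaults
d      = DefaultDataStructure();
d.mass = 12.5;

function d = DefaultDataStructure()
% DefaultDataStructure  Hypothetical initializer for a shared structure.
% Every field is created with a default of the intended type.
d = struct( ...
    'name',     '', ...           % character array
    'mass',     1.0, ...          % double scalar
    'position', zeros(3,1), ...   % 3x1 double
    'tags',     {{}} );           % empty cell array (note the double braces)
end
```

The double braces around the cell default matter: passing a bare cell array to struct creates a structure array with one element per cell instead of a single field holding a cell.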

You make a data structure into an array simply by assigning an additional copy. The fields must be in the same order, which is yet another reason to use a function to initialize your structure. You can nest data structures with no limit on depth.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figi_HTML.jpg

MATLAB now allows for dynamic field names using variables, that is, structName.(dynamicExpression). This provides improved performance over getfield, where the field name is passed as a string. This allows for all sorts of inventive structure programming. Take our data structure array in the previous code snippet, and let’s get the values of field a using a dynamic field name; the values are returned in a cell array.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figj_HTML.gif
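As a sketch of structure arrays and dynamic field names, assuming a small two-field structure with illustrative values (not the original listing):

```matlab
d    = struct('a', 1.0, 'b', 'first');
d(2) = struct('a', 2.0, 'b', 'second');   % second element, same fields

name = 'a';                % field name held in a variable
v1   = d(1).(name);        % dynamic field access on one element: 1
vals = {d.(name)};         % all elements, collected in a cell array: {1, 2}
nums = [d.(name)];         % or concatenated when the types allow: [1 2]
tf   = isfield(d, 'b');    % true
```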

Use isstruct to identify structure variables and isfield to check for the existence of fields. Note that isempty will return false for a struct initialized with struct, even if it has no fields. Table 1.3 lists some key functions for interacting with structs.

Table 1.3

Key Functions for Structs

Function	Purpose
struct	Initialize a structure with or without fields
isstruct	Identify a structure
isfield	Determine if a field exists in a structure
fieldnames	Get the fields of a structure in a cell array
rmfield	Remove a field from a structure
deal	Set fields in a structure to a value

Cell Arrays Hold Anything and Everything

One variable type unique to MATLAB is the cell array. This is really a list container, and you can store variables of any type in elements of a cell array. Cell arrays can be multidimensional, just like matrices, and are useful in many contexts.

Cell arrays are indicated by curly braces, {}. They can be of any dimension and contain any data, including strings, structures, and objects. You can initialize them using the cell function, recursively display the contents using celldisp, and access subsets using parentheses just like for a matrix. The following is a short example.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figk_HTML.gif
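The two access styles can be sketched as follows, with illustrative contents:

```matlab
c    = cell(1,3);              % 1x3 cell array of empty matrices
c{1} = 'a character array';
c{2} = [1 2; 3 4];
c{3} = struct('x', 1);

sub  = c(2);                   % parentheses return a 1x1 cell array
m    = c{2};                   % curly braces return the 2x2 double itself
tfSub = iscell(sub);           % true
tfM   = isnumeric(m);          % true

names = {'alpha', 'beta', 'gamma'};
match = strcmp(names, 'beta'); % logical array [0 1 0]
```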

Using curly braces for access gives you the element data as the underlying type. When you access elements of a cell array using parentheses, the contents are returned as another cell array, rather than the cell contents. MATLAB help has a special section called Comma-Separated Lists which highlights the use of cell arrays as lists. The code analyzer will also suggest more efficient ways to use cell arrays, for instance:

Replace

a = {b{:} c};

with

a = [b {c}];

Cell arrays are especially useful for sets of strings, with many of MATLAB’s string search functions optimized for cell arrays, such as strcmp.

Use iscell to identify cell array variables. Use deal to manipulate structure array and cell array contents. Table 1.4 shows a selection of key cell array functions.

Table 1.4

Key Functions for Cell Arrays

Function	Purpose
cell	Initialize a cell array
cellstr	Create cell array from a character array
iscell	Identify a cell array
iscellstr	Identify a cell array containing only strings
celldisp	Recursively display the contents of a cell array

Optimize Your Code with Logical Arrays

A logical array is composed of only ones and zeros. You can initialize logical matrices using the true and false functions, and there is an islogical function to test if a matrix is logical. Logical arrays are outputs of numerous built-in functions, like isnan, and are often recommended by the code analyzer as a faster alternative to manipulating array indices. For example, you may need to set any negative values in your array to zero.

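Instead of finding the indices of the negative values and then assigning to those indices, index with the logical array directly, as in x(x<0) = 0, where x<0 produces a logical array with 1 where the values of x are negative and 0 elsewhere.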
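As a sketch of the negative-value example, the following compares an index-based version using find, which is an assumption about what the slower form might look like, with the logical-array version:

```matlab
x = [3 -1 4 -1 5 -9];

% Index-based version: find the indices, then assign
y    = x;
k    = find(y < 0);
y(k) = 0;

% Logical-array version: use the logical mask directly as the index
z        = x;
z(z < 0) = 0;              % [3 0 4 0 5 0]

same = isequal(y, z);      % true
```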

MATLAB provides both the traditional short-circuit logical operators, that is, && for AND and || for OR, which operate on scalar logical values, as well as element-wise operators. These element-wise operators, that is, single & and |, compare matrices of the same size and return logical arrays. Table 1.5 shows some key functions for logical operations.

Table 1.5

Key Functions for Logical Operations

Function	Purpose
logical	Convert numeric values to logical
islogical	Identify a logical array (composed of 1s and 0s)
true	Return a true value (1) or array (M,N)
false	Return a false value (0) or array (M,N)
any	Return true if any value in the array is a nonzero number
all	Return true if none of the values in the array is 0
and, or	Functional forms of element-wise operators & and |
isnan, isinf, isfinite	Values testing functions returning logical arrays

Use Persistent and Global Scope to Minimize Data Passing

In general, variables defined in a function have a local scope and are only available within that function. Variables defined in a script are available in the workspace and, therefore, from the command line.

MATLAB has a global scope which is the same as any other language, applying to the base workspace and maintaining the variable’s value throughout the MATLAB session. Global variables are empty once declared, until initialized. The clear and clearvars functions each have flags for removing only the global variables. This is shown in the example below.

../images/335353_2_En_1_Chapter/335353_2_En_1_Fign_HTML.gif

MATLAB has a unique scope that pertains to a single function, persistent. This is useful for initializing a function that requires a lot of data or computation and then saving that data for use in later calls. The variable can be reset using the clear command on the function, that is, clear functionName. This can also be a source of bugs so it is important to note the use of persistent variables in a function’s help comments, so you don’t get unexpected results when you switch models.

TIP

Use a persistent variable to store initialization data for subsequent function calls.
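A minimal sketch of this pattern follows, written as a script with a local function (R2016b or later). The function lookupSquare and its table of squares are hypothetical stand-ins for an expensive initialization:

```matlab
y1 = lookupSquare(3);    % first call builds the table, returns 9
y2 = lookupSquare(4);    % later calls reuse the stored table, returns 16

function y = lookupSquare(k)
% lookupSquare  Hypothetical example of a persistent lookup table.
% Note: the table is held in a persistent variable between calls.
persistent squares
if isempty(squares)
    squares = (1:10).^2;   % stands in for a costly one-time computation
end
y = squares(k);
end
```

A persistent variable is empty the first time the function runs, which is why the isempty test is the usual guard for the one-time setup.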

Variables can also be in scope for multiple functions defined in a single file, if the end keyword is used appropriately. In general, you can omit a final end for functions, but if you use it to wrap the inner functions, the functions become nested and can access variables defined in the parent function. This allows subroutines to share data without passing large numbers of arguments. The editor will highlight the variables that are so defined.

In the following example, the constant variable is available to the nested function inside the parent function.

Nested Function

../images/335353_2_En_1_Chapter/335353_2_En_1_Figo_HTML.gif
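A sketch of a nested function sharing a variable with its parent is given below. The names parentFunction and scaleIt are hypothetical, and the code is assumed to be saved as parentFunction.m:

```matlab
function y = parentFunction(x)
% parentFunction  Hypothetical parent with one nested function.
constant = 2.5;              % visible inside the nested function
y = scaleIt(x);

    function z = scaleIt(u)
        % constant is not passed in; it comes from the parent workspace
        z = constant*u;
    end
end
```

Calling parentFunction(2) returns 5. Both functions are closed with end, which is what makes scaleIt nested rather than a separate local function.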

Table 1.6 shows a selection of scope functions.

Table 1.6

Key Functions for Scope Operations

Function	Purpose
persistent	Specify persistent scope for a variable in a function
global	Specify global scope for a variable
clear	Clear a function or variable
who, whos	List variables in a workspace
mlock, munlock	Lock (and unlock) a function or MEX-file which prevents it from being cleared

Understanding Unique MATLAB Operators and Keywords

Some common operators have special features in MATLAB, which we call attention to here.

Colon

The colon operator for creating a list of indices in an array is unique to MATLAB. A single colon used by itself addresses all elements in that given dimension; a colon used between a pair of integers creates a list.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figp_HTML.gif

The colon operator applies to all variable types when accessing elements of an array: cell arrays, strings, data structure arrays.

The colon operator can also be used to create an array using an interval, as a shorthand to linspace. The interval and the endpoints can be doubles. Using it for matrix indices is really an edge case using a default interval of 1. For example, 0.1:0.2:0.5 produces 0.1 0.3 0.5.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figq_HTML.gif
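Both uses of the colon can be sketched with a few illustrative lines:

```matlab
m    = magic(4);

col2 = m(:,2);            % all rows of column 2: [2; 11; 7; 14]
blk  = m(2:3,1:2);        % rows 2 to 3, columns 1 to 2: [5 11; 9 7]

idx  = 1:5;               % default interval of 1: [1 2 3 4 5]
t    = 0:0.25:1;          % explicit interval: [0 0.25 0.5 0.75 1]
tLin = linspace(0,1,5);   % the same points, specified by count
down = 5:-1:1;            % negative interval: [5 4 3 2 1]
```

With the colon you choose the spacing and MATLAB determines the number of points; with linspace you choose the number of points and MATLAB determines the spacing.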

Tilde

The tilde (~) is the logical NOT operator in MATLAB. The output is a logical matrix of the same size as the input, with values of 1 if the input value is 0 and a value of 0 otherwise.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figr_HTML.gif

In newer versions, it also can be used to ignore an input or output to a function, and this is suggested often in the code analyzer as preferable to the use of a dummy variable.
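Both uses of the tilde can be sketched briefly, with illustrative values:

```matlab
x = [0 1 2 0];
n = ~x;                     % logical NOT: [1 0 0 1]

% Ignore the first output of max and keep only the index of the maximum
[~, idx] = max([3 9 4]);    % idx is 2
```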

Dot

By dot, we mean using a period with a standard arithmetic operator, like .* or .\ or .^. This is a special syntax in MATLAB used to apply an operator on an element-by-element basis over the matrices, instead of performing the linear algebra operation otherwise implied. This is also termed an array operation as opposed to a matrix operation. Since the matrix and array operations are the same for addition and subtraction, the dot is not required.

MATLAB is optimized for array operations. Using this syntax is a key way to reduce for loops in your MATLAB code and make it run faster. Consider the traditional alternative code:

../images/335353_2_En_1_Chapter/335353_2_En_1_Figu_HTML.gif

Even this simple example takes two to three times as long to run as the vectorized version shown above.
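As a sketch of the comparison (the expression is illustrative and is not the one in the original listings), the following computes the same result with array operations and with an explicit loop, and contrasts the matrix and array forms of multiplication:

```matlab
x = linspace(0, 1, 1000);
a = 2;

% Array (element-wise) operations, no loop required
yVec = a*x.^2 .* exp(-x);

% Equivalent explicit loop
yLoop = zeros(size(x));
for k = 1:numel(x)
    yLoop(k) = a*x(k)^2*exp(-x(k));
end
err = max(abs(yVec - yLoop));   % the two agree to within round-off

% Matrix versus array multiplication on the same data
A = [1 2; 3 4];
P = A*A;                        % matrix product: [7 10; 15 22]
Q = A.*A;                       % element-wise:   [1 4; 9 16]
```

Wrapping each version in tic and toc is a simple way to measure the difference on your own machine.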

end

The end keyword serves multiple purposes in MATLAB. It is used to terminate for, while, switch, try, and if statements, rather than using braces as in other languages. It is also used to serve as the last index of a variable in a given dimension. Using end appropriately can make your code more robust to future changes in the size of your data.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figv_HTML.gif
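The indexing use of end can be sketched as follows, with illustrative values:

```matlab
v     = [10 20 30 40 50];
last  = v(end);          % last element: 50
tail3 = v(end-2:end);    % last three elements: [30 40 50]
v(end+1) = 60;           % append one element; v is now 1x6

m  = magic(3);
br = m(end,end);         % bottom-right element: 2
```

None of these lines needs to change if the size of v or m changes later.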

Harnessing the Power of Multiple Inputs and Outputs

Uniquely, MATLAB functions can have multiple outputs. They are specified in a comma-separated list just like the inputs. Additionally, you do not need to specify the data types of the inputs or outputs, and you can silently override the output types by assigning any data you want to the variables. Thus, a function can have an infinite number of syntaxes defined within a single file. Outputs must be assigned the names given in the signature; you cannot pass a variable to the return keyword.

MATLAB provides helper functions for specifying a variable number of inputs or outputs, namely, varargin and varargout. These variables are cell arrays, and you access and assign elements using curly braces. Here is an example function definition:

../images/335353_2_En_1_Chapter/335353_2_En_1_Figw_HTML.gif

The following example demonstrates that the outputs were correctly assigned.

Using varargout and varargin

../images/335353_2_En_1_Chapter/335353_2_En_1_Figx_HTML.gif

This allows you to accept unlimited arguments or parameter pairs in your function. It is up to you to create consistent forms for your function and document them clearly in the help comments.

You can also count the input and output arguments for a given call to your function using nargin and nargout and use this with logical statements or a switch statement to handle multiple cases.
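A minimal sketch combining varargin, varargout, nargin, and nargout is shown below as a script with a local function (R2016b or later). The function sumAndMore is hypothetical and is not the function from the original listing:

```matlab
[s, p]       = sumAndMore(1, 2, 3);   % s = 6, p = 6
[s2, p2, n2] = sumAndMore(2, 5);      % s2 = 7, p2 = 10, n2 = 2

function varargout = sumAndMore(varargin)
% sumAndMore  Hypothetical example accepting any number of numeric scalars.
vals         = [varargin{:}];         % unpack the cell array of inputs
varargout{1} = sum(vals);
if nargout > 1
    varargout{2} = prod(vals);        % computed only when requested
end
if nargout > 2
    varargout{3} = nargin;            % number of inputs actually passed
end
end
```

Checking nargout before filling each output avoids computing results the caller did not ask for.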

If you need very complex input handling, MATLAB now provides an inputParser class, which allows you to parse and validate an input scheme. You can define functions to validate the inputs, optional arguments, and predefine parameter pairs.
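A sketch of an inputParser scheme with one required input and two parameter pairs follows. The function parseOptions and the parameter names n, Tol, and Method are hypothetical, and addParameter assumes R2013b or later:

```matlab
opts = parseOptions(5, 'Method', 'fast');
% opts.n is 5, opts.Method is 'fast', opts.Tol keeps its default of 1e-6

function r = parseOptions(varargin)
% parseOptions  Hypothetical example of parsing and validating inputs.
p = inputParser;
addRequired(p,  'n',      @(x) isnumeric(x) && isscalar(x) && x > 0);
addParameter(p, 'Tol',    1e-6, @(x) isnumeric(x) && isscalar(x));
addParameter(p, 'Method', 'accurate', @ischar);
parse(p, varargin{:});   % errors if any validation function returns false
r = p.Results;           % structure with one field per defined input
end
```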

Use Function Handles for Efficiency

Function handles are pointers to functions. They are closely related to anonymous functions, which allow you to define a short function inline, and return the function handle. When you create a handle, you can change the input scheme and give values for certain inputs, that is, parameters. Using handles as inputs to integrators and similar routines is much faster than passing in a string variable of the function name.

In the following snippet, we create an anonymous function handle to myFunction with a different signature and a specific value for a. Note the use of the @, which designates a function handle. The handle can be evaluated with inputs just like a regular function.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figy_HTML.gif
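As a sketch of the idea, the following fixes the parameter a when the handle is created. Here myFunction is a hypothetical three-input stand-in, defined as an anonymous function so that the example is self-contained:

```matlab
% Hypothetical stand-in for a function with the signature myFunction(t, x, a)
myFunction = @(t, x, a) a*x + t;

a = 2;
h = @(t, x) myFunction(t, x, a);   % two-input handle; a is captured now

y  = h(1, 3);                      % same as myFunction(1, 3, 2), gives 7
a  = 10;                           % changing a later does not affect h
y2 = h(1, 3);                      % still 7

tf = isa(h, 'function_handle');    % true
```

The value of a is stored in the handle when it is created, so h must be recreated to pick up a new parameter value.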

The handle h can be passed to a function such as an integrator that is expecting a signature with only two variables. You will also commonly use function handles to specify an events function for integrators or similar tools, as well as output functions that are called between major steps. Output functions can print information to the screen or a figure. See, for example, odeplot and odeprint.

In order to test if a variable is a function handle, you need to use the function handle class name with isa, that is:

isa(h,'function_handle')

as ishandle works only for graphics handles. For more information, see the help documentation for function_handle. Table 1.7 provides the few key functions for dealing with function handles.

Table 1.7

Key Functions for Handles

Function	Purpose
feval	Execute a function from a handle or string
func2str	Construct a string from a function handle
str2func	Construct a handle from a function name string
isa	Test for a function handle

Numerics

While MATLAB defaults to doubles for any data entered at the command line or in a script, you can specify a variety of other numeric types, including single, uint8, uint16, uint32, uint64, and logical (i.e., an array of booleans). The use of the integer types is especially relevant to using large data sets such as images. Use the smallest data type you need, especially when your data sets are large.
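The memory savings can be sketched with a small, illustrative comparison. The array size is arbitrary:

```matlab
d = 200;                  % double by default
u = uint8(d);             % 8-bit unsigned integer, range 0 to 255
s = u + 100;              % integer arithmetic saturates: 255, not 300
f = single(pi);           % single precision, 4 bytes instead of 8

img8 = zeros(480, 640, 'uint8');   % one byte per element
imgD = zeros(480, 640);            % eight bytes per element

info8 = whos('img8');
infoD = whos('imgD');
ratio = infoD.bytes/info8.bytes;   % 8
```

Note the saturation in the third line: integer types clip at their limits rather than wrapping around or promoting to double.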

Images

MATLAB supports a variety of formats including GIF, JPG, TIFF, PNG, HDF, FITS, and BMP. You can read in an image directly using imread, which can determine the type automatically from the extension, or fitsread. (FITS stands for Flexible Image Transport System, and the interface is provided by the CFITSIO library.) imread has special syntaxes for some image types, such as handling alpha channels for PNG, so you should review the options for your specific images. imformats manages the file format registry and allows you to specify handling of new user-defined types, if you can provide read and write functions.

You can display an image using either imshow, image, or imagesc, which scales the colormap for the range of data in the image.

For example, we use a set of images of cats in Chapter 7, Face Recognition. The following is the image information for a typical image:

../images/335353_2_En_1_Chapter/335353_2_En_1_Figaa_HTML.gif

This is the metadata that tells the camera software, and image databases, where and how the image was generated. This is useful when learning from images as it allows you to correct for resolution (width and height), bit depth, and other factors.

If we view this image using imshow, it will issue a warning that the image is too big to fit on the screen and that it is displayed at 33%. If we view it using image, there will be a visible set of axes. image is useful for displaying other two-dimensional matrix data as individual elements per pixel. Both functions return a handle to an image object; only the axes properties are different. Figure 1.4 shows the use of imshow and image.

../images/335353_2_En_1_Chapter/335353_2_En_1_Fig4_HTML.jpg — Figure 1.4
Image display options.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figab_HTML.gif

Table 1.8 shows key image functions.

Table 1.8

Key Functions for Images

Function	Purpose
imread	Read an image in a variety of formats
imfinfo	Gather information about an image file
imformats	Manage the image file format registry
imwrite	Write data to an image file
image	Display image from an array
imagesc	Display image data scaled to the current colormap
imshow	Display an image, optimizing figure, axes, and image object properties and taking an array or a filename as an input
rgb2gray	Convert an RGB image to grayscale
ind2rgb	Convert index data to RGB
rgb2ind	Convert RGB data to indexed image data
fitsread	Read a FITS file
fitswrite	Write data to a FITS file
fitsinfo	Information about a FITS file returned in a data structure
fitsdisp	Display FITS file metadata for all HDUs in the file

Datastore

Datastores allow you to interact with files containing data that are too large to fit in memory. There are different types of datastores for tabular data, images, spreadsheets, databases, and custom files. Each datastore provides functions to extract smaller amounts of data that do fit in memory for analysis. For example, you can search a collection of images for those with the brightest pixels or maximum saturation values. We will use the directory of cat images included with the code as an example.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figac_HTML.gif

Once the datastore is created, you use the applicable class functions to interact with it. Datastores have standard container-style functions like read, partition, and reset. Each type of datastore has different properties. The DatabaseDatastore requires the Database Toolbox and allows you to use SQL queries.
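As a sketch of the read loop, the following builds a small tabular text datastore from a temporary CSV file instead of the cat image folder, so that it is self-contained. The file name, variable names, and ReadSize value are illustrative assumptions:

```matlab
% Write a small CSV file into a temporary folder
folder = tempname;
mkdir(folder);
T = table((1:6)', ((1:6)').^2, 'VariableNames', {'k', 'kSquared'});
writetable(T, fullfile(folder, 'data.csv'));

ds          = datastore(folder);   % detects a tabular text datastore
ds.ReadSize = 4;                   % read at most four rows per call

total = 0;
while hasdata(ds)
    chunk = read(ds);              % a table holding the next block of rows
    total = total + sum(chunk.kSquared);
end
reset(ds);                         % return to the start of the data

rmdir(folder, 's');                % clean up the temporary folder
```

Only one chunk is in memory at a time, which is the point of the datastore when the underlying files are large.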

MATLAB provides the MapReduce framework for working with out-of-memory data in datastores. The input data can be any of the datastore types, and the output is a key-value datastore. The map function processes the datastore input in chunks, and the reduce function calculates the output values for each key. mapreduce can be sped up by using it with the MATLAB Parallel Computing Toolbox, Distributed Computing Server, or Compiler. Table 1.9 shows key datastore functions.

Table 1.9

Key Functions for Datastore

Function	Purpose
datastore	Create a datastore
read	Read a subset of data from the datastore
readall	Read all of the data in the datastore
hasdata	Check to see if there is more data in the datastore
reset	Reset the datastore to its initial state
partition	Excerpt a portion of the datastore
numpartitions	Estimate a reasonable number of partitions
ImageDatastore	Datastore of a list of image files
TabularTextDatastore	A collection of one or more tabular text files
SpreadsheetDatastore	Datastore of spreadsheets
FileDatastore	Datastore for files with a custom format, for which you provide a reader function
KeyValueDatastore	Datastore of key-value pairs
DatabaseDatastore	Database connection, requires the Database Toolbox

Tall Arrays

Tall arrays were introduced in R2016b. They are allowed to have more rows than will fit in memory. You can use them to work with datastores that might have millions of rows. Tall arrays can use almost any MATLAB type as a column variable, including numeric data, cell arrays, strings, datetimes, and categoricals. The MATLAB documentation provides a list of functions that support tall arrays. Results for operations on the array are only evaluated when they are explicitly requested using the gather function. The histogram function can be used with tall arrays and will execute immediately.
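Deferred evaluation can be sketched with a tall array created from an in-memory vector. This is only for illustration, since real use would start from a datastore; it assumes R2016b or later:

```matlab
t = tall((1:1000)');     % tall column vector built from in-memory data

m = mean(t.^2);          % not evaluated yet; m is an unevaluated tall value
tf = istall(m);          % true

mv = gather(m);          % triggers the computation, returns an ordinary double
```

Here mv is the mean of the squares of 1 through 1000, that is, 333833.5. If the Parallel Computing Toolbox is installed, gather may start a parallel pool, depending on your preferences.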

The MATLAB Statistics and Machine Learning Toolbox™, Database Toolbox, Parallel Computing Toolbox, Distributed Computing Server, and Compiler all provide additional extensions for working with tall arrays. For more information about this new feature, use the following topics in the documentation:

Tall Arrays
Analysis of Big Data with Tall Arrays
Functions That Support Tall Arrays
Index and View Tall Array Elements
Visualization of Tall Arrays
Extend Tall Arrays with Other Products
Tall Array Support, Usage Notes, and Limitations

Table 1.10 shows key tall array functions and Table 1.11 shows key sparse matrix functions.

Table 1.10

Key Functions for Tall Arrays

Function	Purpose
tall	Initialize a tall array
gather	Execute the requested operations
summary	Display summary information to the command line
head	Access first rows of a tall array
tail	Access last rows of a tall array
istall	Check the type of the array to determine if it is tall
write	Write the tall array to disk

Table 1.11

Key Functions for Sparse Matrices

Function	Purpose
sparse	Create a sparse matrix from a full matrix or from a list of indices and values
issparse	Determine if a matrix is sparse
nnz	Number of nonzero elements in a sparse matrix
spalloc	Allocate nonzero space for a sparse matrix
spy	Visualize a sparsity pattern
spfun	Selectively apply a function to the nonzero elements of a sparse matrix
full	Convert a sparse matrix to full form

Sparse Matrices

Sparse matrices are a special category of matrix in which most of the elements are zero. They appear commonly in large optimization problems and are used by many such packages. The zeros are “squeezed” out, and MATLAB stores only the nonzero elements along with index data such that the full matrix can be recreated. Many regular MATLAB functions, such as chol or diag, preserve the sparseness of an input matrix.
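A short sketch of creating and inspecting sparse matrices, with illustrative sizes and values:

```matlab
n = 1000;
S = speye(n);              % sparse n-by-n identity matrix
S(1, n) = 5;               % add one off-diagonal entry

tf = issparse(S);          % true
k  = nnz(S);               % 1001 nonzero elements
F  = full(S);              % convert to a full matrix

wS = whos('S');
wF = whos('F');
smaller = wS.bytes < wF.bytes;   % true, only the nonzeros are stored

% Build a sparse matrix from lists of row indices, column indices, and values
rows = [1 2 3];
cols = [1 2 3];
vals = [4 5 6];
D    = sparse(rows, cols, vals, 3, 3);
```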

Tables and Categoricals

Tables were introduced in release R2013b of MATLAB and allow tabular data to be stored with metadata in one workspace variable. They are an effective way to store and interact with data that one might put in, or import from, a spreadsheet. The table columns can be named, assigned units and descriptions, and accessed as one would fields in a data structure, that is, T.DataName. See readtable on creating a table from a file, or try out the Import Data button on the Home tab. Table 1.12 shows key functions for tables and categoricals.

Table 1.12

Key Functions for Tables and Categoricals

Function	Purpose
table	Create a table with data in the workspace
readtable	Create a table from a file
join	Merge tables by matching up variables
innerjoin	Join tables A and B retaining only the rows that match
outerjoin	Join tables including all rows
stack	Stack data from multiple table variables into one variable
unstack	Unstack data from a single variable into multiple variables
summary	Calculate and display summary data for the table
categorical	Create a categorical array of discrete data
iscategorical	Determine whether an input is a categorical array
categories	List of categories in the array
iscategory	Test for a particular category
addcats	Add categories to an array
removecats	Remove categories from an array
mergecats	Merge categories

Categorical arrays allow for storage of discrete nonnumeric data, and they are often used within a table to define groups of rows. For example, time data may have the day of the week, or geographic data may be organized by state or county. They can be leveraged to rearrange data in a table using unstack. Searching a categorical array is also more efficient than searching the elements of a cell array. See categorical and categories.
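A small sketch of a table with a categorical grouping variable follows. The data values and variable names are made up for illustration.

```matlab
% Day of the week stored as a categorical, temperature as numeric data
day  = categorical({'Mon'; 'Tue'; 'Mon'; 'Wed'});
temp = [20.5; 22.1; 19.8; 21.0];

T = table(day, temp, 'VariableNames', {'Day', 'Temperature'});
T.Properties.VariableUnits        = {'', 'degC'};
T.Properties.VariableDescriptions = {'Day of the week', 'Air temperature'};

cats    = categories(T.Day);                     % unique categories, sorted
monRows = T(T.Day == 'Mon', :);                  % select rows by category
meanMon = mean(T.Temperature(T.Day == 'Mon'));   % access a column like a field
```

Comparing the categorical column against a category name returns a logical index, which is how the rows are grouped here.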

You can also combine multiple data sets into single tables using join, innerjoin, and outerjoin, which will be familiar to you if you have worked with databases.
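The difference between the two join types can be sketched with two small tables that share a key variable. The tables and the name Key are hypothetical.

```matlab
% Two tables sharing the variable 'Key'
A = table([1; 2; 3], {'a'; 'b'; 'c'}, 'VariableNames', {'Key', 'Name'});
B = table([2; 3; 4], [10; 20; 30],    'VariableNames', {'Key', 'Value'});

% innerjoin keeps only the rows whose keys appear in both tables
C = innerjoin(A, B);

% outerjoin keeps all rows; missing numeric entries are filled with NaN.
% 'MergeKeys' combines the two key columns into one.
D = outerjoin(A, B, 'MergeKeys', true);
```

By default, both functions use the variables with common names as the keys, which is Key in this sketch.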

Large MAT-files

You can access parts of a large MAT-file without loading the entire file into memory by using the matfile function. This creates an object that is connected to the requested MAT-file without loading it. Data is only loaded when you request a particular variable or part of a variable. You can also dynamically add new data to the MAT-file.

For example, we can load a MAT-file of neural net weights.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figad_HTML.gif

We can access a portion of the previously unloaded w variable or add a new variable name, all using this object m.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figae_HTML.gif

There are some limits on indexing into unloaded data; for example, you cannot index into struct arrays or sparse arrays. Also, partial loading with matfile requires MAT-files in the version 7.3 format, which is not the default for a generic save operation as of R2016b. You must either create the MAT-file using matfile to take advantage of these features or use the '-v7.3' flag when saving the file.
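The workflow described above might look like the following sketch. The file name and the contents of w are stand-ins chosen for illustration and are not the neural net weights used in the book's listing.

```matlab
% Create a version 7.3 MAT-file to work with (hypothetical file name)
fileName = fullfile(tempdir, 'matfileDemo.mat');
w = magic(6);                             % stand-in for a large weight matrix
save(fileName, 'w', '-v7.3');

% Connect to the file; nothing is loaded at this point
m = matfile(fileName, 'Writable', true);

block = m.w(1:2, 1:3);                    % loads only this part of w
[nRows, nCols] = size(m, 'w');            % query the size without loading w
m.b = [1 2 3];                            % add a new variable to the file

% Check what the file now contains, then clean up
info = whos('-file', fileName);
varNames = sort({info.name});
clear m
delete(fileName);
```

Setting 'Writable' to true is what permits the assignment to m.b.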

Advanced Data Types

The data types discussed so far are all that are needed for most engineering programming. However, for specialized applications, there are additional options for data types, including:

Classes: Classes, with properties and methods, can be defined using the classdef keyword in an m-file similar to writing a function. See also the properties, methods, and events keywords. See Chapter 6 for recipes using classes.
Time series: The timeseries object and the related tscollection object provide methods for associating data samples with timestamps. Plotting a timeseries object will use the stored time vector automatically.
Map containers: The map container allows you to store and look up data using a key which may be nonnumeric. This is an object instantiated via containers.Map.
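As a brief sketch of the map container, the following stores unit strings under character keys. The keys and values are illustrative only.

```matlab
% A map with the default key type (char) and arbitrary values
units = containers.Map();
units('mass')   = 'kg';
units('length') = 'm';

hasMass  = isKey(units, 'mass');    % true
hasTime  = isKey(units, 'time');    % false
u        = units('length');         % look up a value by its key
allKeys  = keys(units);             % cell array of keys, in sorted order
nEntries = units.Count;             % number of key-value pairs
```

Looking up a key that is not present raises an error, so isKey is a useful guard before indexing.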

Primer Recipes

The next part of this chapter provides recipes for some common tasks in modern MATLAB, like using different data types, adding help to your functions, loading binary data, writing to a text file, creating a MEX file, and parsing functions into “pcode.”

1.1 Initializing a Data Structure Using Parameters

It’s always a good idea to use a special function to define a data structure you are using as a type in your codebase, similar to writing a class but with less overhead. Users can then overwrite individual fields in their code, but there is an alternative way to set many fields at once: an initialization function that can handle a list of parameter pairs as input. This allows you to do additional processing in your initialization function. Also, your parameter string names can be more descriptive than the names you would choose for your fields.

Problem

We want to initialize a data structure so that the user clearly knows what they are entering.

Solution

The simplest way to implement the parameter pairs is using varargin and a switch statement. Alternatively, you could use an inputParser object, which allows you to specify required and optional inputs as well as named parameters. In that case, you have to write separate or anonymous validation functions that can be passed to the inputParser, rather than just writing out the validation in your code.

How It Works

We will use the data structure developed for the automobile simulation as an example. The header lists the input parameters along with the input dimensions and units, if applicable.

AutomobileInitialize.m

../images/335353_2_En_1_Chapter/335353_2_En_1_Figaf_HTML.gif

The function first creates the data structure using a set of defaults and then handles the parameter pairs entered by a user. After the parameters have been processed, two areas are calculated using the dimensions and the height.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figag_HTML.gif

../images/335353_2_En_1_Chapter/335353_2_En_1_Figah_HTML.gif

../images/335353_2_En_1_Chapter/335353_2_En_1_Figai_HTML.gif
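The pattern described above can be sketched as follows. This is not the book's AutomobileInitialize function; the function name VehicleInit, the field names, the parameter strings, and the default values are all hypothetical.

```matlab
function d = VehicleInit(varargin)
% VehicleInit  Hypothetical sketch of parameter-pair initialization.
%   d = VehicleInit('mass', 1200, 'body height', 1.5)

% Start from a set of defaults
d = struct('mass', 1500, 'length', 4.5, 'width', 1.8, 'height', 1.4);

if mod(nargin, 2) ~= 0
  error('VehicleInit:inputs', 'Parameters must be name/value pairs.');
end

% Process the parameter pairs; names may be more descriptive than fields
for k = 1:2:nargin
  value = varargin{k+1};
  switch lower(varargin{k})
    case 'mass'
      d.mass = value;
    case 'body length'
      d.length = value;
    case 'body width'
      d.width = value;
    case 'body height'
      d.height = value;
    otherwise
      warning('VehicleInit:unknown', 'Unknown parameter: %s', varargin{k});
  end
end

% Derived quantities are computed after all parameters are processed
d.frontArea = d.width*d.height;
d.sideArea  = d.length*d.height;
end
```

Computing the derived areas last means they always reflect the final dimensions, regardless of the order in which the user supplies the pairs.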

To perform the same tasks with inputParser, you add an addRequired, addOptional, or addParameter call for every item in the switch statement. The named parameters require default values. You can optionally specify a validation function; in the following example, we use isnumeric to limit the values to numeric data.

../images/335353_2_En_1_Chapter/335353_2_En_1_Figaj_HTML.gif

In this case, the parsed parameter values are stored in the Results property of the parser object, which is a structure with one field per parameter.
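A minimal sketch of the inputParser approach follows, assuming a hypothetical function VehicleParse with two named parameters. It is not the book's listing.

```matlab
function d = VehicleParse(varargin)
% VehicleParse  Hypothetical sketch of named parameters with inputParser.
%   d = VehicleParse('mass', 1200)

p = inputParser;

% Each named parameter needs a default; the validator is optional
addParameter(p, 'mass',   1500, @isnumeric);
addParameter(p, 'height', 1.4,  @(x) isnumeric(x) && isscalar(x) && x > 0);

% parse throws an error if a value fails its validation function
parse(p, varargin{:});

% The parsed values, including unchanged defaults, are in p.Results
d = p.Results;
end
```

Calling VehicleParse('mass', 'heavy') raises an error because the value fails the isnumeric check, so the validation needs no additional code in the function body.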