Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 4
Arrays, Strings, and Pointers

WHAT YOU WILL LEARN IN THIS CHAPTER:

How to use arrays
How to define and initialize arrays of different types
How to use the range-based for loop with an array
How to define and use multidimensional arrays
How to use pointers
How to define and initialize pointers of different types
The relationship between arrays and pointers
How to define references and some initial ideas on their uses

WROX.COM CODE DOWNLOADS FOR THIS CHAPTER

You can find the wrox.com code downloads for this chapter on the Download Code tab at www.wrox.com/go/beginningvisualc. The code is in the Chapter 4 download and individually named according to the names throughout the chapter.

HANDLING MULTIPLE DATA VALUES OF THE SAME TYPE

You already know how to define and initialize variables of various types that each holds a single item of information; I’ll refer to single items of data as data elements. The most obvious extension to the idea of a variable is to be able to reference several data elements of a particular type with a single variable name. This would enable you to handle applications of a much broader scope.

Let’s consider an example. Suppose that you needed to write a payroll program. Using a separate variable for each individual’s pay, tax liability, and so on, would be an uphill task to say the least. A more convenient way to handle such a problem would be to reference an employee by some kind of generic name — employeeName to take an imaginative example — and to have other generic names for the kinds of data related to each employee, such as pay and tax. Of course, you would need some means of picking out a particular employee from the whole bunch, together with the data from the generic variables associated with them. This kind of requirement arises with any collection of like entities that you want to handle, whether they’re baseball players or battleships. Naturally, C++ provides you with a way to deal with this.

Arrays

One way to solve these problems is to use an array. An array is a number of memory locations called array elements or simply elements, each of which stores an item of data of the same given data type, and which are all referenced through the same variable name. The employee names in a payroll program could be stored in one array, the pay for each employee in another, and the tax due for each employee could be stored in a third array.

You select an element in an array using an index value. An index is an integer representing the sequence number of the element in the array. The first element has the index 0, the second 1, and so on. You can also envisage the index for an array element as being the offset from the first element. The first element has an offset of 0 and therefore an index of 0, and an index value of 3 will refer to the fourth element of an array.

The basic structure of an array is illustrated in Figure 4-1.

Figure 4-1 shows an array with the name height that has six elements. These might be the heights of the members of a family, for instance, recorded to the nearest inch. Because there are six elements, the index values run from 0 through 5. You refer to a particular element by writing the array name followed by the index value of the element between square brackets. The third element is height[2], for example. If you think of the index as the offset from the first element, it’s easy to see that the index for the fourth element will be 3.

The memory required to store each element is determined by its type, and all the elements of an array are stored in a contiguous block of memory.

Declaring Arrays

You define an array in essentially the same way as you defined the variables that you have seen up to now. The only difference is that you specify the number of array elements between square brackets following the array name. For example, you could define the integer array height, shown in the previous figure, with the following statement:

long height[6];

A long value occupies 4 bytes, so the whole array requires 24 bytes. Arrays can be of any size, subject to the constraints imposed by the amount of memory in the computer on which your program is running.

Arrays can be of any type. For example, to define arrays to store the capacity and power output of a series of engines, you could write:

double engine_size[10];      // Engine size in cubic inches
double horsepower[10];       // Engine power output

If auto mechanics is your thing, this would enable you to store the cubic capacity and power output of up to 10 engines, referenced by index values from 0 to 9. As you have seen with other variables, you can define several arrays of a given type in a single statement, but in practice it is almost always better to define them in separate statements.

TRY IT OUT: Using Arrays

Imagine that you have recorded the amount of gasoline you have bought for the car and the odometer reading each time. You can write a program to analyze this data to see how the gas consumption looks on each occasion you bought gas:

// Ex4_01.cpp
// Calculating gas mileage
#include <iostream>
#include <iomanip>
        
using std::cin;
using std::cout;
using std::endl;
using std::setw; 
        
int main()
{
   const int MAX {20};                     // Maximum number of values
   double gas[ MAX ];                      // Gas quantity in gallons
   long miles[ MAX ];                      // Odometer readings
   int count {};                           // Loop counter
   char indicator {'y'};                   // Input indicator
        
   while( ('y' == indicator || 'Y' == indicator) && count < MAX )
   {
      cout << endl << "Enter gas quantity: ";
      cin >> gas[count];                   // Read gas quantity
      cout << "Enter odometer reading: ";
      cin >> miles[count];                 // Read odometer value
        
      ++count;
      cout << "Do you want to enter another(y or n)? ";
      cin >> indicator;
   }
        
   if(count <= 1)                   // count = 1 after 1 entry completed
   {                                // ... we need at least 2
      cout << endl << "Sorry - at least two readings are necessary.";
      return 0;
   }
        
   // Output results from 2nd entry to last entry
   for(int i {1}; i < count; i++)
   {
     cout << endl
          << setw(2) << i << "."             // Output sequence number
          << "Gas purchased = " << gas[i] << " gallons" // Output gas
          << " resulted in "                 // Output miles per gallon
          << (miles[i] - miles[i - 1])/gas[i] << " miles per gallon.";
   }
   cout << endl;
   return 0;
}

The program assumes that you fill the tank each time, so the gas bought was the amount used by driving the distance recorded. Here’s an example of the output:

Enter gas quantity: 12.8
Enter odometer reading: 25832
Do you want to enter another(y or n)? y
        
Enter gas quantity: 14.9
Enter odometer reading: 26337
Do you want to enter another(y or n)? y
        
Enter gas quantity: 11.8
Enter odometer reading: 26598
Do you want to enter another(y or n)? n
        
 1.Gas purchased = 14.9 gallons resulted in 33.8926 miles per gallon.
 2.Gas purchased = 11.8 gallons resulted in 22.1186 miles per gallon.

How It Works

Because you need to take the difference between two odometer readings to calculate the miles covered for the gas used, you use only the odometer reading from the first pair of input values — you ignore the gas bought in the first instance as that would have been used earlier. During the second period in the output, the traffic must have been really bad — or maybe the parking brake was left on.

The dimensions of the arrays gas and miles that store the input data are determined by the value of the constant, MAX. By changing the value of MAX, you can change the program to accommodate a different maximum number of input values. This technique makes a program flexible in the amount of information that it can handle. Of course, all the program code must be written taking account of the array dimensions, or of any other parameters specified by const variables. This presents little difficulty in practice, so there’s no reason not to adopt this approach. You’ll see later how to allocate memory as the program executes, so that you don’t need to fix the memory for data storage in advance.

Entering the Data

The data values are read in the while loop. Because the loop variable count can run from 0 to MAX - 1, the user cannot enter more values than the array can handle. You initialize count and indicator to 0 and 'y' respectively, so the while loop is entered at least once. There’s a prompt for each input value and the value is read into the appropriate array element. The element used to store a particular value is determined by count, which is 0 for the first input. The array element is specified in the cin statement by using count as an index, and count is then incremented, ready for the next value.

After you enter each value, the program prompts for confirmation that another value is to be entered. The character entered is read into indicator and tested in the loop condition. The loop will terminate unless 'y' or 'Y' is entered and count is less than the specified maximum value, MAX.

After the input loop ends (by whatever means), count contains one more than the index of the last element entered in each array. (Remember, you increment it after you enter each value). This is checked to verify that at least two pairs of values were entered. If this wasn’t the case, the program ends with a suitable message because two odometer values are needed to calculate a mileage value.

Producing the Results

The output is generated in the for loop. The control variable i runs from 1 to count-1, allowing mileage to be calculated as the difference between the current element, miles[i], and the previous element, miles[i - 1]. An index value can be any expression evaluating to an integer that represents a legal index for the array in question, which is a value from 0 to one less than the number of elements in the array.

If the value of an index is outside the range of the array elements, you will reference a spurious location that may contain other data, garbage, or even program code. If the reference to such an element is in an expression, you will use some arbitrary value in the calculation, which certainly produces a result that you did not intend. If you are storing a result in an array element using an illegal index, you will overwrite whatever happens to be in that location. When this is part of your program code, the results are catastrophic. If you use illegal index values, there are no warnings produced, either by the compiler or at run time. The only way to guard against this is to code your program to prevent it from happening.

Initializing Arrays

To initialize an array in its definition, you put the initializing values in an initializer list. Here’s an example:

int engine_size[5] { 200, 250, 300, 350, 400 };

The array has the name engine_size and has five elements that each store a value of type int. The values in the initializing list correspond to successive index values, so in this case engine_size[0] has the value 200, engine_size[1] the value 250, engine_size[2] the value 300, and so on.

You must not specify more initializing values than there are elements in the array, but you can include fewer. If there are fewer, the values are assigned to successive elements, starting with the first — which is the one corresponding to index 0. Array elements for which you don’t provide a value are initialized with zero. This isn’t the same as supplying no initializing list. Without an initializing list, the array elements contain junk values. You can initialize all array elements to zero with an empty initializer list. For example:

long data[100] {};          // Initialize all elements to zero

You can also omit the dimension of an array, provided you supply initializing values. The number of array elements will be the number of initializing values. For example:

int value[] { 2, 3, 4 };

This defines an array with three elements that have initial values 2, 3, and 4.

TRY IT OUT: Initializing an Array

This example demonstrates that you’ll have junk values in arrays that you don’t initialize:

// Ex4_02.cpp
// Demonstrating array initialization
#include <iostream>
#include <iomanip>
        
using std::cout;
using std::endl;
using std::setw; 
        
int main()
{
   int value[5] { 1, 2, 3 };
   int junk [5];
        
   cout << endl;
   for(int i {}; i < 5; i++)
      cout << setw(12) << value[i];
        
   cout << endl;
   for(int i {}; i < 5; i++)
      cout << setw(12) << junk[i];
        
   cout << endl;
   return 0;
}

You define two arrays, value and junk. You initialize value in part, and you don’t initialize junk at all. The program generates two lines of output, which on my computer look like this:

           1           2           3           0         0
  -858993460  -858993460  -858993460  -858993460  -858993460

The second line (corresponding to values of junk[0] to junk[4]) may be different on your PC.

How It Works

The first three values of the value array are the initializing values, and the last two have the default value of 0. In the case of junk, all the values are meaningless in the context of your program because you didn’t provide any initial values.

Using the Range-based for Loop

You have seen that you can use a for loop to iterate over all the elements in an array. The range-based for loop makes this even easier. The loop is easy to understand through an example:

double temperatures[] {65.5, 68.0, 75.0, 77.5, 76.4, 73.8,80.1};
double sum {};
int count {};
for(double t : temperatures)
{
  sum += t;
  ++count;
}
double average = sum/count;

This calculates the average of the values in the temperatures array. The parentheses following for contain two things separated by a colon; the first specifies the variable that will access each of the values from the collection specified by the second. The t variable will be assigned the value of each element in the temperatures array in turn before executing the loop body. This accumulates the sum of the array elements . The loop also accumulates the total number of elements in count so the average can be calculated after the loop.

You could also write the loop using the auto keyword:

for(auto temperature : temperatures)
{
  sum += temperature;
  ++count;
}

The auto keyword tells the compiler to determine the type for the local variable that holds the current value from the array type. The compiler knows that the array elements are of type double, so t will be of type double.

You cannot modify the values of the array elements in the range-based for loop as it is written here. You can only access the element values for use elsewhere. With the loop written as it is, element values are copied to the loop variable. You could access the array elements directly specifying the loop variable as a reference. You learn about references later in this chapter.

NOTE The C++ library provides the _countof() function that returns the number of elements in an array. You just put the array name between the parentheses. The cstdlib header needs to be included to use this function. Many other standard library headers such as iostream include cstdlib. You could calculate the average temperature like this:

for(auto temperature : temperatures)
{
  sum += temperature;
}
sum /= _countof(temperatures);

_countof()is a Microsoft extension and not standard C++.

Multidimensional Arrays

Arrays with one index are referred to as one-dimensional arrays. You can define an array with more than one index, in which case it is a multidimensional array. Suppose you have a field in which you are growing bean plants in rows of 10, and the field contains 12 rows so there are 120 plants in all. You could define an array to record the weight of beans produced by each plant using the statement:

double beans[12][10];

This defines the two-dimensional array beans, the first index being the row number, and the second index the plant number within the row. To refer to an element requires two index values. For example, you could set the value of the element reflecting the fifth plant in the third row with the statement:

beans[2][4] = 10.7;

Remember that index values start from zero, so the row index is 2 and the index for the fifth plant within the row is 4.

Being a successful bean farmer, you might have several identical fields planted with beans in the same pattern. Assuming that you have eight fields, you could use a three-dimensional array to record data about these, defined thus:

double beans[8][12][10];

This records production for the 10 plants in each of the 12 rows in a field and the leftmost index references one of the 8 fields. If you ever get to bean farming on an international scale, you can use a four-dimensional array, with the extra dimension designating the country. Assuming that you’re as good a salesman as you are a farmer, growing this quantity of beans is likely to affect the ozone layer.

Arrays are stored in memory such that the rightmost index varies most rapidly. Thus, the array data[3][4] is three one-dimensional arrays of four elements each. The arrangement of this array is illustrated in Figure 4-2.

The elements of the array are stored in a contiguous block of memory, as indicated by the arrows in Figure 4-2. The first index selects a particular row within the array, and the second index selects an element within a row.

A two-dimensional array is really a one-dimensional array of one-dimensional arrays. An array with three dimensions is actually a one-dimensional array of elements where each element is a one-dimensional array of one-dimensional arrays. This is not something you need to worry about most of the time. However, it implies that for the array in Figure 4-2, the expressions data[0], data[1], and data[2] reference one-dimensional arrays.

Initializing Multidimensional Arrays

To initialize a multidimensional array, you use an extension of the method used for a one-dimensional array. For example, you can define and initialize a two-dimensional array, data, with the statement:

long data[2][4] {
                   { 1,  2,  3,  5 },
                   { 7, 11, 13, 17 }
                };

The initial values for each row are within their own pair of braces. Because there are four elements in each row, there are four initial values in each group, and because there are two rows, there are two groups between braces, each group of initial values being separated from the next by a comma.

You can omit initial values in any row, in which case the remaining elements in the row are zero. For example:

long data[2][4] {
                   { 1,  2,  3       },
                   { 7, 11           }
                };

I have spaced out the initial values to show where values have been omitted. The elements data[0][3], data[1][2], and data[1][3] have no initializing values and are therefore zero.

To initialize the entire array with zeros you can write:

long data[2][4] {};

If you are initializing arrays with even more dimensions, remember that you need as many nested braces for groups of initial values as there are dimensions in the array — unless you’re initializing the array with zeros.

You can let the compiler work out the first dimension in an array, but only the first, regardless of the number of dimensions.

TRY IT OUT: Using a Multidimensional Array

You can use a multidimensional array to figure out the average bean plant production in each of a number of rows in a field:

// Ex4_03.cpp
// Storing bean plant production in an array
#include <iostream>                       // For stream I/O
#include <iomanip>                        // For stream manipulators
using namespace std;                      // Any name in std namespace
        
int main()
{
  const int plant_row_count{ 6 };         // Count of plants in a row
  double beans[][plant_row_count] {       // Production for each plant
    { 12, 15 },
    { 0, 10, 13, 0, 11, 2 },
    { 8, 7, 10, 10, 13    },
    { 9, 8, 11, 13, 16    }
  };
 
  double averages[_countof(beans)] {};    // Stores average plant production
  for (int row{}; row < _countof(beans); ++row)
  {
    for (int plant{}; plant < plant_row_count; ++plant)
    {
      averages[row] += beans[row][plant];
    }
    averages[row] /= plant_row_count;
  }
 
  cout << "Average production per row is :" 
       << setiosflags(ios::fixed)                  // Fixed point output
       << setprecision(2)                          // 2 decimal places
       << endl;
 
  int n{};                                         // Row number
  for (double ave : averages)
    cout << "Row " << ++n << setw(10) << ave << endl;
 
   return 0;
}

How It Works

There’s a using directive so all names in the std namespace can be used without qualification. The first statement in main() defines plant_row_count as a const variable, and this stores the number of plants in a row. The next statement defines and initializes the two-dimensional beans array. The first dimension is deduced by the compiler from the initializer list and the second dimension is specified by plant_row_count. This would result in an error message if plant_row_count was not const. The initial values for elements in each row are between braces, and the number of pairs of inner braces determines the first row dimension. The number of elements in a row is defined explicitly, so if you inadvertently specify more initial values than there are elements in a row, you will get an error message. Where you specify fewer than plant_row_count values for a row, the remaining elements in the row will be 0.

The next statement defines and initializes an array to hold the average plant production for each row:

  double averages[_countof(beans)] {};   // Stores average plant production

The _countof() macro determines the number of rows in beans. Specifying just the array name references the array of rows. The array name with a single index would reference a particular row, so _countof(beans[0]) would return the number of elements in the first row. There are no values in the initializer list so all elements in averages will be initialized to 0.

The averages for the rows are calculated in nested loops:

  for (int row{}; row < _countof(beans); ++row)
  {
    for (int plant{}; plant < plant_row_count; ++plant)
    {
      averages[row] += beans[row][plant];
    }
    averages[row] /= plant_row_count;
  }

The outer for loop iterates over the rows. The inner loop iterates over the plants in a row, adding each production value to the averages element for the row. The average is calculated by dividing the sum accumulated in averages[row] by the number of plants in a row.

When the nested loops end, this statement executes:

  cout << "Average production per row is :" 
       << setiosflags(ios::fixed)                  // Fixed point output
       << setprecision(2)                          // 2 decimal places
       << endl;

This outputs a message to precede the output of the averages and writes two manipulators to the stream. The std::setiosflags() manipulator is used to set flags that affect how output is presented. In this case ios::fixed appears between the parentheses, which ensures a floating-point value is displayed with fixed-point notation and not in scientific notation. Sending std::setprecision() to the stream causes subsequent floating-point output to include the number of decimal places specified between the parentheses.

The averages are output using a range-based for loop:

  int n{};                                         // Row number
  for (double ave : averages)
    cout << "Row " << ++n << setw(10) << ave << endl;

The loop variable ave will be assigned each of the values in the averages array in turn. You could use auto instead of type double.

WORKING WITH C-STYLE STRINGS

An array of char elements is called a character array and is generally used to store a C-style string. A character string is a sequence of characters with a special character appended to indicate the end of the string. This character is defined by the escape sequence . It’s referred to as the null or NUL character because it’s a byte with all bits zero. A string terminated by null is referred to as a C-style string because it originated in the C language.

This is not the only representation of a string. You’ll meet much safer representations in Chapter 8. You should avoid using C-style strings in new code, but they often occur in existing programs so you need to know about them.

Each character in a non-Unicode string occupies one byte, so with the terminating null, the number of bytes a string occupies is one more than the number of characters in the string. You can define a character array and initialize it with a string literal like this:

char movie_star[15] {"Marilyn Monroe"};     // 14 characters plus null

The terminating '' is supplied automatically. If you include one explicitly in the string literal, you’ll end up with two. You must allow for the terminating null when you specify the array dimension.

You can omit the dimension and let the compiler work it out:

char president[] {"Ulysses Grant"};

The compiler allocates enough elements to hold the characters in the string plus the terminating null, so this array will have 14 elements. Of course, if you use the array later to store a different string, the new string must not exceed 14 bytes including its terminating null character. In general, it is your responsibility to ensure that an array is large enough for any string you store in it.

You can create strings of Unicode characters, the characters in the string being of type wchar_t:

wchar_t president[] {L"Ulysses Grant"};

The L prefix indicates that the literal is a wide character string, so each character, including the terminating null, will occupy two bytes. Of course, indexing the string references characters, not bytes, so president[2] corresponds to the character L'y'.

The Unicode encoding for type wchar_t is UTF-16. There are other encodings such as UTF-8 and UTF-32. Whenever I refer to just Unicode in the book I mean UTF-16.

String Input

The iostream header contains definitions of functions for reading characters from the keyboard. The one that you’ll look at here is the getline() function that reads a sequence of characters from the keyboard and stores it in an array as a string terminated by ''. You typically use getline()like this:

const int MAX {80};               // Maximum string length including 
char name[MAX];                   // Array to store a string
cin.getline(name, MAX, '
'),     // Read input line as a string

These statements define the char array name with MAX elements and then read characters from cin using getline(). The source of the data, cin, is written as shown, with a period separating it from the function name. The period indicates that the getline() function is the one belonging to the cin object. You will learn more about this syntax when you learn about classes. Meanwhile, just take it for granted. The significance of each argument to the getline() function is shown in Figure 4-3.

Because the last argument is ' '(newline or end line character) and the second argument is MAX, characters are read from cin until the ‘' character is read, or when MAX-1 characters have been read, whichever occurs first. The maximum number of characters read is MAX-1 rather than MAX to allow for the '' character to be appended to the characters stored in the array. The ' ' character is generated when you press the Return key and is therefore usually the most convenient character to end input. You can specify something else by changing the last argument. The ' ' isn’t stored in the array name, but as I said, '' is stored at the end of the input string in the array.

TRY IT OUT: Programming with Strings

This program reads a string from the keyboard and counts its characters.

// Ex4_04.cpp
// Counting string characters
#include <iostream>
using std::cin;
using std::cout;
using std::endl;
        
int main()
{
   const int MAX {80};                // Maximum array dimension
   char buffer[MAX];                  // Input buffer
   int count {};                      // Character count
        
   cout << "Enter a string of less than "
        << MAX << " characters:
";
   cin.getline(buffer, MAX, '
'),    // Read a string until 

        
   while(buffer[count] != '')       // Increment count as long as
      count++;                        // the current character is not null
        
   cout << endl
        << "The string "" << buffer
        << "" has " << count << " characters.";
   cout << endl;
   return 0;
}

Typical output from this program is as follows:

Enter a string of less than 80 characters:
Radiation fades your genes
The string "Radiation fades your genes" has 26 characters.

How It Works

This program defines a character array buffer and reads a string into it from the keyboard after prompting for the input. Input ends when the user presses Enter, or when MAX-1 characters have been read.

A while loop counts the number of characters in buffer. The loop continues as long as the character in buffer[count] is not ''. This sort of checking on the current element while stepping through an array is a common technique. The only action in the loop is to increment count for each non-null character. There is a library function that will do what this loop does; you learn about it later in this chapter.

Finally, the string and the character count are displayed by a single output statement. Note the use of the escape sequence '"' to output a double quote.

String Literals

You have seen that you can write a string literal between double quotes and you can add L as a prefix to specify a Unicode string. You can split a long string over more than one line with each segment between double quotes. For example:

"This is a very long string that "

"has been spread over two lines."

C++ supports the use of regular expressions through the regex header. I don’t have the space to cover these in this book, but regular expressions typically involve strings with lots of backslash characters. Having to use the escape sequence for each backslash character makes regular expressions hard to enter correctly and even harder to read. The raw string literal gets over the problem. A raw string literal can contain any character, without necessitating the use of escape sequences. Here’s an example:

R"(The " " escape sequence is a tab character.)"

As a normal string literal, this would be:

"The "\t" escape sequence is a tab character."

The R indicates the start of a raw string literal and the string is delimited by "( and )". All characters between the delimiters are “as is” — escape sequences are not recognized as such. This immediately raises the question of how you include )" as part of a raw string literal. This is not a problem. The delimiters for a raw string literal in general can be "char_sequence( at the beginning and )char_sequence" at the end. char_sequence is a sequence of characters that must be the same at both ends and can be up to 16 characters; it must not contain parentheses, spaces, control characters, or backslashes. Here’s an example:

R"*("a = b*(c-d)")*" is equivalent to ""a = b*(c-d)""

The raw string contains the characters between "*( and )*". You can define a raw string of wide characters by prefixing R with L.

Using the Range-based for Loop with Strings

You can use a range-based for loop to access the characters in a string:

char text[] {"Exit signs are on the way out."};
int count {};
cout << "The string contains the following characters:" << endl;
for (auto ch : text)
{
  ++count;
  cout << ch <<  "  ";
}  
cout << endl << "The string contains " 
<< (count-1) << " characters." << endl;

The loop outputs each string character, including the null at the end, and accumulates a count of the total number of characters. The count includes the null that terminates the string so its value is reduced by 1 before output.

TRY IT OUT: Storing Multiple Strings

You can use a two-dimensional array to store several C-style strings. You can see how this works with an example:

// Ex4_05.cpp
// Storing strings in an array
#include <iostream>
using std::cout;
using std::cin;
using std::endl;
        
int main()
{
   char stars[6][80] { "Robert Redford",
                       "Hopalong Cassidy",
                       "Lassie",
                       "Slim Pickens",
                       "Boris Karloff",
                       "Oliver Hardy"
                     };
   int dice {};
        
   cout << endl
        << "Pick a lucky star!"
        << "Enter a number between 1 and 6: ";
   cin >> dice;
        
   if(dice >= 1 && dice <= 6)          // Check input validity
      cout << endl                     // Output star name
           << "Your lucky star is " << stars[dice - 1];
   else
      cout << endl                     // Invalid input
           << "Sorry, you haven't got a lucky star.";
        
   cout << endl;
   return 0;
}

How It Works

Apart from its incredible inherent entertainment value, the main point of interest in this example is the definition of the stars array. It is a two-dimensional array of elements of type char that can hold up to six strings, each of which can be up to 80 characters, including the terminating null. The initializing strings for the array are enclosed between braces and separated by commas.

A disadvantage of using arrays in this way is the memory that is almost invariably left unused. All of the strings are fewer than 80 characters, and the surplus elements in each row of the array are wasted. You’ll see later in this chapter how you could avoid this.

You can let the compiler work out how many strings you have by omitting the first array dimension:

   char stars[][80] { "Robert Redford",
                      "Hopalong Cassidy",
                      "Lassie",
                      "Slim Pickens",
                      "Boris Karloff",
                      "Oliver Hardy"
                    };

This causes the compiler to define the first dimension to accommodate the initializing strings. Because you have six, the result is exactly the same, but it avoids the possibility of an error. You can’t omit both array dimensions. You can only omit the first dimension in an array.

Of course, if you do omit the first array dimension, you would need to update the rest of the code to figure out the dimension instead of hard-coding 6. The _countof() function helps. The statement affected would then look like this:

  cout << endl
    << "Pick a lucky star!"
    << "Enter a number between 1 and " << _countof(stars) << ": ";
  cin >> dice;
 
  if (dice >= 1 && dice <= _countof(stars))      // Check input validity
    cout << endl                                 // Output star name
    << "Your lucky star is " << stars[dice - 1];
  else
    cout << endl                                 // Invalid input
    << "Sorry, you haven't got a lucky star.";

Where you reference a string for output in Ex4_05.cpp, you only specify the first index value:

      cout << endl                               // Output star name
           << "Your lucky star is " << stars[dice - 1];

A single index selects a particular 80-element subarray, and the output operation displays the contents up to the terminating null. The index is dice-1 because dice varies from 1 to 6 and the index values need to be from 0 to 5.

INDIRECT DATA ACCESS

Variables you have dealt with so far provided you with the ability to name a memory location in which you can store data of a particular type. The contents of a variable are either entered from an external source, such as the keyboard, or calculated from other values. There is another kind of variable that does not store data that you normally enter or calculate, but greatly extends the power and flexibility of your programs. This kind of variable is called a pointer.

What Is a Pointer?

Each memory location in your PC has an address. The address provides the means for the hardware to reference that location. A pointer is a variable that stores the address of another variable of a given type. A pointer has a variable name just like any other variable and also has a type that designates what kind of variables its contents refer to. Note that the type of a pointer variable includes the fact that it’s a pointer. A variable that is a pointer, that can hold an address of a location containing a value of type int, is of type ‘pointer to int'.

Declaring Pointers

A definition for a pointer is similar to that of an ordinary variable, except that the pointer name has an asterisk in front of it to indicate that it’s a pointer. For example, to define a pointer pnumber of type pointer to long, you could use the following statement:

long* pnumber;

This definition has been written with the asterisk close to the type name. If you want, you can also write it as:

long *pnumber;

The compiler won’t mind; however, the type of pnumber is ‘pointer to long', which is often indicated by placing the asterisk close to the type name. Whichever way you choose to write a pointer type, be consistent.

You can mix definitions of ordinary variables and pointers in the same statement. For example:

long* pnumber, number {99};

This defines pnumber of type ‘pointer to long' as before, and also defines the variable number, of type long. On balance, it’s probably better to define pointers separately from other variables; otherwise, the statement can appear misleading as to the type of variables defined, particularly if you prefer to place the * adjacent to the type name. The following statements certainly look clearer, and putting definitions on separate lines enables you to add comments for them individually, making for a program that is easier to read:

long number {99};    // Declaration and initialization of long variable
long* pnumber;       // Declaration of variable of type pointer to long

It’s a common convention to use variable names beginning with p to denote pointers. This makes it easier to see which variables are pointers, which in turn can make a program easier to follow.

Let’s take an example to see how this works, without worrying about what it’s for. I will get to how you use pointers very shortly. Suppose you have the long variable number containing the value 99 because you defined it in the preceding code. You could use the pointer pnumber of type pointer to long to store the address of number. But how do you obtain the address of a variable?

The Address-of Operator

What you need is the address-of operator, &. This is a unary operator that obtains the address of a variable. It’s also called the reference operator, for reasons I discuss later in this chapter. To set up the pointer, you could write this assignment statement:

pnumber = &number;            // Store address of number in pnumber

The result of this operation is illustrated in Figure 4-4.

The & operator obtains the address of any variable, but you need a pointer of the appropriate type to store it. To store the address of a double variable for example, the pointer must be of type double*, which is ‘pointer to double'.

Using Pointers

Taking the address of a variable and storing it in a pointer is all very well, but the really interesting aspect is how you can use it. Fundamental to using a pointer is accessing the data in the variable to which it points. You do this using the indirection operator, *.

The Indirection Operator

You use the indirection operator, *, to access the contents of the variable to which a pointer points. The name “indirection operator” stems from the fact that the data is accessed indirectly. It is also called the dereference operator, and the process of accessing the data in the variable pointed to by a pointer is termed de-referencing the pointer.

One aspect of this operator that can seem confusing is the fact that you now have several different uses for the same symbol, *. It is the multiply operator, it is the indirection operator, and it is used in the definition of a pointer. Each time you use *, the compiler can distinguish its meaning by the context. When you multiply two variables, A*B for instance, there’s no meaningful interpretation of this expression for anything other than a multiply operation.

Why Use Pointers?

A question that usually springs to mind at this point is, “Why use pointers at all?” After all, taking the address of a variable you already know and sticking it in a pointer so that you can dereference it seems like overhead you can do without. There are several reasons why pointers are important.

As you will see, you can use pointer notation to operate on data stored in an array. Also, when you get to define your own functions, you will see that pointers are used extensively for enabling access within a function to large blocks of data, such as arrays, that are defined outside the function. Most importantly, you will see that you can allocate space for variables dynamically — that is, during program execution. This capability allows your program to adjust its use of memory depending on the input. Because you don’t know in advance how many variables you are going to create dynamically, the way for doing this is using pointers — so make sure you get the hang of this bit.

Initializing Pointers

Using pointers that aren’t initialized is extremely hazardous. You can easily overwrite random areas of memory through an uninitialized pointer. The resulting damage depends on how unlucky you are, so it’s more than just a good idea to initialize your pointers. It’s very easy to initialize a pointer to the address of a variable that has already been defined. Here you can see that I have initialized the pointer pnumber with the address of the variable number just by using the operator & with the variable name:

int number {};                       // Initialized integer variable
int* pnumber {&number};              // Initialized pointer

When initializing a pointer with the address of another variable, remember that the variable must already have been defined prior to the pointer definition.

Of course, you may not want to initialize a pointer with the address of a specific variable when you define it. In this case, you can initialize it with the pointer equivalent of zero, nullptr, which is a pointer that doesn’t point to anything. You can define and initialize a pointer using the following statement:

int* pnumber {nullptr};              // Pointer not pointing to anything

Because nullptr is the equivalent of zero for pointers, an empty initializer list would work just as well. Setting a pointer to nullptr ensures that it doesn’t contain an address that will be accepted as valid, and provides the pointer with a value that you can check in an if statement, such as:

if(pnumber == nullptr)
   cout << endl << "pnumber does not point to anything.";

Before nullptr was added to C++, 0 or NULL (which is a macro for which the compiler will substitute 0) was used to initialize a pointer, and of course, these still work. However, it is much better to use nullptr.

Because the literal nullptr can be implicitly converted to type bool, you can check the status of the pointer pnumber like this:

if(!pnumber)
   cout << endl << "pnumber does not point to anything.";

nullptr converts the bool value to false, and any other pointer value converts to true. Thus, if pnumber contains nullptr, the if expression will be true and will cause the message to be written to the output stream.

TRY IT OUT: Using Pointers

You can try out various aspects of pointer operations with an example:

// Ex4_06.cpp
// Exercising pointers
#include <iostream>
using std::cout;
using std::endl;
using std::hex;
using std::dec;
        
int main()
{
   long* pnumber {};             // Pointer definition & initialization
   long number1 {55}, number2 {99};
        
   pnumber = &number1;           // Store address in pointer
   *pnumber += 11;               // Increment number1 by 11
   cout << endl
        << "number1 = " << number1 
        << "   &number1 = " << hex << pnumber;
        
   pnumber = &number2;           // Change pointer to address of number2
   number1 = *pnumber*10;        // 10 times number2
        
   cout << endl
        << "number1 = " << dec << number1
        << "   pnumber = " << hex << pnumber
        << "   *pnumber = " << dec << *pnumber;
        
   cout << endl;
   return 0;
}

You should compile and execute the release version of this example. The debug version will add extra bytes that are used for debugging purposes; these will cause the variables to be separated by 12 bytes instead of 4. On my computer, this example generates the following output:

number1 = 66   &number1 = 003CF7F0
number1 = 990   pnumber = 003CF7F4   *pnumber = 99

How It Works

There is no input. All operations are carried out with the initial values for the variables. After storing the address of number1 in the pointer pnumber, the value of number1 is incremented indirectly through the pointer in this statement:

*pnumber += 11;                       // Increment number1 by 11

The indirection operator determines that you are adding 11 to the contents of the variable pointed to by pnumber, which is number1. If you forgot the * in this statement, you would be attempting to add 11 to the address in the pointer.

The values of number1, and the address of number1 that is stored in pnumber, are displayed. You use the hex manipulator to generate the address output in hexadecimal notation. You can output the values of ordinary integer variables as hexadecimal using the hex manipulator. You send it to the output stream in the same way as endl, with the effect that all following output is in hexadecimal notation. To restore decimal output, you use the dec manipulator in the next output statement, which switches output back to decimal mode.

After the first line of output, pnumber is set to the address of number2. number1 is then changed to the value of 10 times number2:

   number1 = *pnumber*10;                // 10 times number2

This is calculated by accessing the contents of number2 indirectly through the pointer. The second line of output shows the results.

The address values you see in your output may well be different from those shown here because they reflect where the program is loaded in memory, which depends on the state of your operating system environment.

Note that the addresses &number1 and &number2 differ by four bytes. This shows that number1 and number2 occupy adjacent memory locations, because each long variable occupies four bytes. The output demonstrates that everything is working as you would expect.

Pointers to char

A pointer of type const char* has the interesting property that it can be initialized with a string literal. For example, you can define and initialize such a pointer with the statement:

const char* proverb {"A miss is as good as a mile."};

This looks similar to initializing a char array, but it’s quite different. This creates a string literal (actually an array of type const char[]) with the character string appearing between the quotes and terminating with '', and stores the address of the literal in the pointer proverb. The address of the literal will be the address of its first character. This is shown in Figure 4-5.

TRY IT OUT: Lucky Stars with Pointers

You could rewrite the lucky stars example using pointers instead of an array:

// Ex4_07.cpp
// Initializing pointers with strings
#include <iostream>
using std::cin;
using std::cout;
using std::endl;
        
int main()
{
   const char* pstr1 {"Robert Redford"};
   const char* pstr2 {"Hopalong Cassidy"};
   const char* pstr3 {"Lassie"};
   const char* pstr4 {"Slim Pickens"};
   const char* pstr5 {"Boris Karloff"};
   const char* pstr6 {"Oliver Hardy"};
   const char* pstr {"Your lucky star is "};
        
   int dice {};
        
   cout << endl
        << "Pick a lucky star!"
        << "Enter a number between 1 and 6: ";
   cin >> dice;
        
   cout << endl;
   switch(dice)
   {
      case 1: cout << pstr << pstr1;
              break;
      case 2: cout << pstr << pstr2;
              break;
      case 3: cout << pstr << pstr3;
              break;
      case 4: cout << pstr << pstr4;
              break;
      case 5: cout << pstr << pstr5;
              break;
      case 6: cout << pstr << pstr6;
              break;
        
      default: cout << "Sorry, you haven't got a lucky star.";
   }
        
   cout << endl;
   return 0;
}

How It Works

The array in Ex4_05.cpp has been replaced by the six pointers, pstr1 to pstr6, each initialized with the name of a star. You have also defined pstr, initialized with the phrase that you’ll use at the start of a normal output line. Because you have discrete pointers, it is easier to use a switch statement to select the appropriate output message rather than an if, as you did in the original version. Incorrect values entered are taken care of by the default option of the switch.

Outputting the string pointed to couldn’t be easier. As you can see, you simply write the pointer name. It may cross your mind at this point that in Ex4_06.cpp you wrote a pointer name in the output statement, and the address that it contained was displayed. Why is it different here? The answer is in the way the stream output operation views a pointer of type ‘pointer to char.’ It treats a pointer of this type in a special way, in that it regards it as a string (which is an array of char), and so outputs the string, rather than its address.

Using pointers has eliminated the waste of memory that occurred with the array version of this program, but the program seems a little long-winded now. There must be a better way. Indeed there is — using an array of pointers.

TRY IT OUT: Arrays of Pointers

With an array of pointers of type char, each element can point to an independent string, and the lengths of each of the strings can be different. You can define an array of pointers in the same way that you define a normal array. Let’s go straight to rewriting the previous example using a pointer array:

// Ex4_08.cpp
// Initializing pointers with strings
#include <iostream>
using std::cin;
using std::cout;
using std::endl;
        
int main()
{
   const char* pstr[] { "Robert Redford",  // Initializing a pointer array
                        "Hopalong Cassidy",
                        "Lassie",
                        "Slim Pickens",
                        "Boris Karloff",
                        "Oliver Hardy"
                      };
   const char* pstart {"Your lucky star is "};
        
   int dice {};
        
   cout << endl
        << "Pick a lucky star!"
        << "Enter a number between 1 and "<< _countof(pstr) << ": ";
   cin >> dice;
        
   cout << endl;
   if(dice >= 1 && dice <= _countof(pstr))     // Check input validity
      cout << pstart << pstr[dice - 1];        // Output star name
        
   else
      cout << "Sorry, you haven't got a lucky star."; // Invalid input
        
   cout << endl;
   return 0;
}

How It Works

In this case, you are nearly getting the best of all possible worlds. You have a one-dimensional array of pointers to type char defined, such that the compiler works out what the dimension should be from the number of initializing strings. The memory usage that results from this is illustrated in Figure 4-6.

FIGURE 4-6

Compared to using a “normal” array, a pointer array generally carries less overhead in terms of space. With an array, you would need to make each row the length of the longest string, and six rows of seventeen bytes each is 102 bytes, so by using a pointer array you have saved a whole -1 bytes! What’s gone wrong? The simple truth is that for this small number of relatively short strings, the size of the extra array of pointers is significant. You would make savings if you were dealing with more strings that were longer and more variable in length.

Space saving isn’t the only advantage of using pointers. In many circumstances you save time, too. Think of what happens if you want to move "Oliver Hardy" to the first position and "Robert Redford" to the end. With the pointer array in Ex4_08.cpp, you just swap the pointers — the strings themselves stay where they are. If you had stored these simply as strings, a great deal of copying would be necessary — you’d need to copy the string "Robert Redford" to a temporary location while you copied "Oliver Hardy" in its place. Then you’d need to copy "Robert Redford" to the end position. This requires significantly more computer time.

Because you use pstr as the array name, the variable holding the start of the output message needs to be different; it is called pstart. You select the string to output by means of an if statement, similar to that in the original version of the example. You either display a star selection, or a suitable message if the user enters an invalid value.

The sizeof Operator

The sizeof operator produces an integer value of type size_t that gives the number of bytes occupied by its operand, where size_t is a type defined by the standard library. Many standard library functions return a value of type size_t, and size_t is defined using a typedef statement to be equivalent to one of the fundamental types, usually unsigned int. The reason for using size_t rather than a fundamental type directly is that it allows flexibility in what the actual type is in different C++ implementations. The C++ standard permits the range of values accommodated by a fundamental type to vary, to make the best of a given hardware architecture, and size_t can be defined to be the equivalent of the most suitable fundamental type in the current machine environment.

Look at this statement that refers to dice in the previous example:

cout << sizeof dice;

The value of the expression sizeof dice is 4 because dice is type int and therefore occupies 4 bytes. Thus, this statement outputs the value 4.

The sizeof operator can be applied to an element in an array or to the whole array. When you apply the operator to an array name by itself, it produces the number of bytes occupied by the whole array, whereas when you apply it to a single element, it results in the number of bytes occupied by that element. In the last example, you could output the number of elements in pstr with the expression:

cout << (sizeof pstr)/(sizeof pstr[0]);

The expression (sizeof pstr)/(sizeof pstr[0]) divides the number of bytes occupied by the whole array, by the number of bytes occupied by the first element. Because each array element occupies the same amount of memory, the result is the number of elements in the array. The code fragment you saw earlier that computed the average for an array of temperatures could be written like this:

double temperatures[] {65.5, 68.0, 75.0, 77.5, 76.4, 73.8, 80.1};
double sum {};
for(auto t : temperatures)
  sum += t;
double average = sum/((sizeof temperatures)/(sizeof temperatures[0]));

Of course, as I noted earlier, you can use _countof() to obtain the number of array elements and this is much clearer and will result in a compile-time error message if you pass a pointer to it instead of an array name.

You can apply the sizeof operator to a type name, in which case the result is the number of bytes occupied by a variable of that type. In this case, the type name should be enclosed in parentheses. For example:

size_t long_size {sizeof(long)};

The variable long_size will be initialized with the value 4. The variable long_size is of type size_t to match the type of value produced by the sizeof operator. Using a different integer type for long_size may result in a warning message from the compiler.

Constant Pointers and Pointers to Constants

You defined pstr in Ex4_08.cpp like this:

   const char* pstr[]  { "Robert Redford",  // Initializing a pointer array
                         "Hopalong Cassidy",
                         "Lassie",
                         "Slim Pickens",
                         "Boris Karloff",
                         "Oliver Hardy"
                       };

Each pointer in the array is initialized with the address of a string literal, "Robert Redford", "Hopalong Cassidy", and so on. The type of a string literal is ‘array of const char,’ so you are storing the address of a const array in a const pointer. This prevents modification of the literal used as the initializer, which is quite a good idea. There is no ambiguity about the const-ness of the strings pointed to by the elements of the pstr pointer array. If you now attempt to change these strings, the compiler flags this as an error at compile time.

However, you could still legally write this:

pstr[0] = pstr[1];

Those lucky individuals due to be awarded Mr. Redford would get Mr. Cassidy instead since both pointers now point to the same name. Note that this isn’t changing the strings pointed to — it is changing the address stored in pstr[0]. You probably want to inhibit this kind of change as well; some people may reckon that good old Hoppy may not have the same sex appeal as Robert. You can do this with the following statement:

   // Array of constant pointers to constants
   const char* const pstr[] = { "Robert Redford",
                                "Hopalong Cassidy",
                                "Lassie",
                                "Slim Pickens",
                                "Boris Karloff",
                                "Oliver Hardy"
                              };

Now the characters in the strings cannot be modified and neither can any of the addresses in the array.

You can distinguish three situations relating to const, pointers, and the objects to which they point:

A pointer to a constant object
A constant pointer to an object
A constant pointer to a constant object

In the first situation, the object pointed to cannot be modified, but you can set the pointer to point to something else:

int value {5};
const int* pvalue {&value};
*pvalue = 6;                                // Will not compile!
pvalue = nullptr;                           // OK

In the second situation, the address stored in the pointer can’t be changed, but the object pointed to can be:

int value {5};
int* const pvalue {&value};
*pvalue = 6;                                // OK
pvalue = nullptr;                           // Will not compile!

Finally, in the third situation, both the pointer and the object pointed to have been defined as constant and, therefore, neither can be changed:

int value {5};
const int* const pvalue {&value};
*pvalue = 6;                                // Will not compile!
pvalue = nullptr;                           // Will not compile!

Pointers and Arrays

Array names can behave like pointers under some circumstances. In most situations, if you use the name of a one-dimensional array by itself, it is automatically interpreted as a pointer to the first array element . Note that this is not the case when the array name is used as the operand of the sizeof operator.

If you have these definitions,

double* pdata {};
double data[5];

you can write this assignment:

pdata = data;       // Initialize pointer with the array address

This assigns the address of the first element of the data to the pointer pdata. Using the array name by itself refers to the address of the array. If you use the array name data with an index value, it refers to the contents of the element corresponding to that index value. So, to store the address of an element in the pointer you use the address-of operator like this:

pdata = &data[1];

Here, pdata contains the address of the second array element.

Pointer Arithmetic

You can perform arithmetic operations with pointers. You are limited to addition and subtraction, but you can also compare pointer values to produce a logical result. Arithmetic with a pointer implicitly assumes that the pointer points to an array, and that the arithmetic operation is on the address contained in the pointer. For the pointer pdata, for example, you could assign the address of the third element of the data array to it with this statement:

pdata = &data[2];

In this case, the expression pdata+1 would refer to the address of data[3], the fourth element of the data array, so you could make pdata point to this element by writing this statement:

pdata += 1;          // Increment pdata to the next element

This increments the address in pdata by the number of bytes occupied by one element of the data array. In general, pdata+n, where n can be any expression resulting in an integer, adds n*sizeof(double) to the address in pdata, because it is of type pointer to double. This is illustrated in Figure 4-7.

In other words, incrementing or decrementing a pointer works in terms of the type of the object pointed to. Increasing a pointer to long by one changes its contents to the next long address, and so increments the address by four. Similarly, incrementing a pointer to short by one increments the address by two. The more common notation for incrementing a pointer by one is using the increment operator. For example:

pdata++;            // Increment pdata to the next element

This is equivalent to (and more common than) the += operator. However, I used += earlier to make it clear that although the increment value is specified as one, the effect is always an address increment greater than one except for the case of a pointer to type char.

You can, of course, dereference a pointer on which you have performed arithmetic (there wouldn’t be much point to it otherwise). For example, if pdata is still pointing to data[2], this statement,

*(pdata + 1) = *(pdata + 2);

is equivalent to this:

data[3] = data[4];

The parentheses are necessary when you want to dereference a pointer after incrementing the address it contains because the precedence of the indirection operator is higher than that of the arithmetic operators, + and -. If you write *pdata+1, instead of *(pdata+1), this adds one to the value stored at the address in pdata, which is equivalent to executing data[2]+1. Because this isn’t an lvalue, its use in the previous assignment statement would cause the compiler to generate an error message.

You can use an array name as though it were a pointer for addressing elements of an array. Suppose you have the array defined as:

long data[5];

Using pointer notation, you can refer to the element data[3], for example, as *(data+3). This kind of notation can be applied generally so that, corresponding to the elements data[0], data[1], data[2], you can write *data, *(data+1), *(data+2), and so on.

TRY IT OUT: Array Names as Pointers

You can practice this aspect of array addressing with a program to calculate prime numbers (a prime number is divisible only by itself and one):

// Ex4_09.cpp
// Calculating primes
#include <iostream>
#include <iomanip>
using std::cout;
using std::endl;
using std::setw;
        
int main()
{
   const int MAX {100};           // Number of primes required
   long primes[MAX] { 2,3,5 };    // First three primes defined
   long trial {5};                // Candidate prime
   int count {3};                 // Count of primes found
   bool found {false};            // Indicates when a prime is found
        
   do
   {
      trial += 2;                     // Next value for checking
      found = false;                  // Set found indicator
        
      for(int i {}; i < count; i++)   // Try division by existing primes
      {
         found = (trial % *(primes + i)) == 0;// True for exact division
           if(found)                          // If division is exact
              break;                          // it's not a prime
      }
        
      if (!found)                      // We got one...
         *(primes + count++) = trial;  // ...so save it in primes array
   }while(count < MAX);
        
   // Output primes 5 to a line
   for(int i {}; i < MAX; i++)
   {
      if(i % 5 == 0)               // New line on 1st, and every 5th line
         cout << endl;
      cout << setw(10) << *(primes + i);
   }
   cout << endl;
        
   return 0;
}

If you compile and execute this example, you should get the following output:

         2         3         5         7        11
        13        17        19        23        29
        31        37        41        43        47
        53        59        61        67        71
        73        79        83        89        97
       101       103       107       109       113
       127       131       137       139       149
       151       157       163       167       173
       179       181       191       193       197
       199       211       223       227       229
       233       239       241       251       257
       263       269       271       277       281
       283       293       307       311       313
       317       331       337       347       349
       353       359       367       373       379
       383       389       397       401       409
       419       421       431       433       439
       443       449       457       461       463
       467       479       487       491       499
       503       509       521       523       541

How It Works

You have the usual #include statements for the iostream header for input and output, and for iomanip, because you use a stream manipulator to set the field width for output.

You use the constant MAX to define the number of primes that you want the program to produce. The primes array, which stores the results, is initialized with the first three primes to start the process off. All the work is done in two loops. The outer do-while loop picks the next value to be checked and adds the value to the primes array if it is prime, and the inner for loop that checks the value to see whether it’s prime or not.

The algorithm in the for loop is very simple and is based on the fact that if a number is not a prime, it must be divisible by one of the primes found so far — all of which are less than the number in question because all numbers are either prime or a product of primes. In fact, it is only necessary to divide by primes less than or equal to the square root of the number in question, so this example isn’t as efficient as it might be:

found = (trial % *(primes + i)) == 0;  // True for exact division

This statement sets found to true if there’s no remainder from dividing trial by the current prime *(primes + i) (remember that this is equivalent to primes[i]), and to false otherwise. The if statement causes the for loop to be terminated if found is true because the candidate in trial can’t be a prime in that case.

After the for loop ends (for whatever reason), it’s necessary to decide whether or not the value in trial was prime. This is indicated by the value in found:

*(primes + count++) = trial;   // ...so save it in primes array

If trial does contain a prime, this statement stores the value in primes[count] and then increments count through the postfix increment operator.

After MAX primes have been found, they are output with a field width of 10 characters, 5 to a line, as a result of this statement:

if(i % 5 == 0)              // New line on 1st, and every 5th line
   cout << endl;

This starts a new line when i has the values 0, 5, 10, and so on.

TRY IT OUT: Counting Characters Revisited

To see how to handle strings in pointer notation, you could produce a version of the program you looked at earlier for counting the characters in a string:

// Ex4_10.cpp
// Counting string characters using a pointer
#include <iostream>
using std::cin;
using std::cout;
using std::endl;
        
int main()
{
   const int MAX {80};                 // Maximum array dimension
   char buffer[MAX];                   // Input buffer
   char* pbuffer {buffer};             // Pointer to array buffer
        
   cout << endl                        // Prompt for input
        << "Enter a string of less than "
        << MAX << " characters:"
        << endl;
        
   cin.getline(pbuffer, MAX, '
'),    // Read a string until 

        
   while(*pbuffer)                     // Continue until 
      pbuffer++;
        
   cout << endl
        << "The string "" << buffer
        << "" has " << pbuffer - buffer << " characters.";
   cout << endl;
        
   return 0;
}

Here’s an example of output from this example:

Enter a string of less than 80 characters:
The tigers of wrath are wiser than the horses of instruction.
The string "The tigers of wrath are wiser than the horses of
instruction." has 61 characters.

How It Works

The program uses the pointer pbuffer rather than the array name buffer. You don’t need the count variable because the pointer is incremented in the while loop until '' is found. When '' is found, pbuffer will contain the address of that position in the string. The count of the number of characters in the string is therefore the difference between the address in pbuffer, and the address of the beginning of the array denoted by buffer.

You could have incremented the pointer in the loop by writing the loop like this:

while(*pbuffer++);                   // Continue until

Now the loop contains no statements, only the test condition. This would work adequately, except that the pointer would be incremented after '' was encountered, so the address would be one more than the last position in the string. You would therefore need to express the count of the number of characters in the string as pbuffer–buffer-1.

Note that you can’t use the array name here in the same way that you have used the pointer. The expression buffer++ is strictly illegal because you can’t modify the address value that an array name represents. Even though you can use an array name in an expression as though it is a pointer, it isn’t a pointer, because the address value that it represents is fixed.

Using Pointers with Multidimensional Arrays

Using a pointer to store the address of a one-dimensional array is relatively straightforward, but with multidimensional arrays, things can get a little complicated. If you don’t intend to use pointers with multidimensional arrays, you can skip this section, as it’s a little obscure; however, if you have previous experience with C++, this section is worth a glance.

If you have to use a pointer with multidimensional arrays, you need to keep clear in your mind what is happening. By way of illustration, you can use an array beans, defined as follows:

double beans[3][4];

You can define and assign a value to the pointer pbeans, as follows:

double* pbeans;
pbeans = &beans[0][0];

Here, you are setting the pointer to the address of the first element of the array, which is of type double. You could also set the pointer to the address of the first row in the array with the statement:

pbeans = beans[0];

This is equivalent to using the name of a one-dimensional array, which is replaced by its address. You used this in the earlier discussion; however, because beans is a two-dimensional array, you cannot set an address in the pointer with the following statement:

pbeans = beans;           // Will cause an error!!

The problem is one of type. The type of the pointer is double*, but the array is of type double[3][4]. A pointer to store the address of this array must be of type double*[4]. C++ associates the dimensions of the array with its type, and the preceding statement is only legal if the pointer has been defined with the dimension required. This can be done with a slightly more complicated notation than you have seen so far:

double (*pbeans)[4];

The parentheses here are essential; otherwise, you would be declaring an array of pointers. Now the previous statement is legal, but this pointer can only be used to store addresses of an array with the dimensions shown. The auto keyword can help out here. You can write the statement as:

auto pbeans = beans;

Now the compiler will deduce the correct type for you.

Pointer Notation with Multidimensional Arrays

You can use pointer notation with an array name to reference elements of the array. You can reference each element of the array beans that you defined earlier, which had three rows of four elements, in two ways:

Using the array name with two index values
Using the array name in pointer notation

Therefore, the following two expressions are equivalent:

beans[i][j]
*(*(beans + i) + j)

Let’s look at how these work. The first expression uses normal array indexing to refer to the element with offset j in row i of the array.

You can determine the meaning of the second expression by working from the inside outwards. beans refers to the address of the first row of the array, so beans+i refers to row i. The expression *(beans+i) is the address of the first element of row i, so *(beans+i)+j is the address of the element in row i with offset j. The whole expression therefore refers to the value of that element.

If you really want to be obscure — and it isn’t recommended that you should be — the following two statements, where you have mixed array and pointer notation, are also legal references to the same element of the array:

*(beans[i] + j)
(*(beans + i))[j]

There is yet another aspect to using pointers that is the most important of all: the ability to allocate memory dynamically. You’ll look into that next.

DYNAMIC MEMORY ALLOCATION

Working with a fixed set of variables in a program can be very restrictive. You’ll often want to allocate space for variables at execution time, depending on the input data. Any program that processes a number of data items that is not known in advance can take advantage of the ability to allocate memory at run time. For example, in a program that stores information about the students in a class, the number of students is not fixed and their names will vary in length, so to deal with the data most efficiently, you’ll want to allocate space at execution time.

Obviously, because dynamically allocated variables can’t have been defined at compile time, they can’t be named in your code. When they are created, they are identified by their address, which you store in a pointer. With the power of pointers, and the dynamic memory management tools in Visual C++, writing your programs to have this kind of flexibility is quick and easy.

The Free Store, Alias the Heap

In most instances, when your program is executed, there is unused memory in your computer. This unused memory is called the heap, or the free store. You can allocate space within the free store for a new variable of a given type using a special operator that returns the address of the space allocated. This operator is new, and it’s complemented by the operator delete, which releases memory previously allocated by new.

You can allocate space in the free store for variables in one part of a program, and then release the space and return it to the free store after you have finished with it. This makes the memory available for reuse by other dynamically allocated variables. This is a powerful technique; it enables you to use memory very efficiently and in many cases results in programs that can handle much larger problems, involving considerably more data than otherwise might be possible.

The new and delete Operators

Suppose that you need space for a double variable. You can define a pointer to type double and then request that the memory be allocated at execution time. You can do this using the new operator:

double* pvalue {};
pvalue = new double;      // Request memory for a double variable

This is a good moment to recall that all pointers should be initialized. Using memory dynamically typically involves a number of pointers floating around, so it’s important that they should not contain spurious values. You always set a pointer that doesn’t contain a legal address value to nullptr.

The new operator in the second line of code should return the address of the memory in the free store allocated to a double variable, and this address is stored in the pointer pvalue. You can then use this pointer to reference the variable using the indirection operator, as you have seen. For example:

*pvalue = 9999.0;

Of course, the memory may not have been allocated because the free store had been used up, or because the free store is fragmented by previous usage — meaning that there aren’t sufficient contiguous bytes to accommodate the variable for which you want to obtain space. You don’t have to worry too much about this. The new operator will throw an exception if the memory cannot be allocated for any reason, which terminates your program. Exceptions are a mechanism for signaling errors in C++; you learn about these in Chapter 6.

You can initialize a variable created by new. Taking the example of the double variable that was allocated by new and the address stored in pvalue, you could have set the value to 999.0, as it was created with this statement:

pvalue = new double {999.0};   // Allocate a double and initialize it

Of course, you could create the pointer and initialize it in a single statement, like this:

double* pvalue { new double{999.0} };

When you no longer need a variable that has been dynamically allocated, you can free the memory that it occupies with the delete operator:

delete pvalue;                 // Release memory pointed to by pvalue

This ensures that the memory can be used for something else. If you don’t use delete, and you store a different address in pvalue, it will be impossible to free the memory or to use the data that it contains, because access to the address is lost. In this situation, you have what is referred to as a memory leak, especially when it recurs in your program.

You should set a pointer to nullptr when you release the memory to which it points. If you don’t, you have what is called a dangling pointer, through which you might attempt to access memory that has been freed.

Allocating Memory Dynamically for Arrays

Allocating memory for an array dynamically is very straightforward. To allocate an array of type char, you could write this statement:

char* pstr {new char[20]};     // Allocate a string of twenty characters

This allocates space for a char array of 20 characters and stores its address in pstr. To remove the array that you have just created, you use the delete operator. The statement would look like this:

delete [] pstr;                // Delete array pointed to by pstr

Note the use of square brackets to indicate that you are deleting an array. When removing arrays from the free store, you should always include the square brackets, or the results will be unpredictable. Note that you do not specify any dimensions here, simply use [].

Of course, pstr now contains the address of memory that may already have been allocated for some other purpose, so it certainly should not be used. When you use delete to discard memory you previously allocated, you should always reset the pointer, like this:

pstr = nullptr;

This ensures that you cannot access the memory that has been released.

You can initialize an array allocated in the free store:

int *data {new int[10] {2,3,4}};

This statement creates an array of 10 integer elements and initializes the first three with 2, 3, and 4. The remaining elements will be initialized to 0.

TRY IT OUT: Using Free Store

You can see how dynamic memory allocation works by rewriting the program that calculates an arbitrary number of primes, this time using memory in the free store to store the primes:

// Ex4_11.cpp
// Calculating primes using dynamic memory allocation
#include <iostream>
#include <iomanip>
using std::cin;
using std::cout;
using std::endl;
using std::setw;
        
int main()
{
   int max {};                               // Number of primes required
   cout << endl
        << "Enter the number of primes you would like (at least 4): ";
   cin >> max;                    
        
   if(max < 4)                               // Test the user input, if less than 4
      max = 4;                               // ensure it is at least 4
        
   // Allocate prime array and initialize with seed primes   
   long* pprime {new long[max] {2L, 3L, 5L} }; 
 
   long trial {5L};                          // Candidate prime
   int count {3};                            // Count of primes found
   bool found {false};                       // Indicates when a prime is found
 
 
 
        
   do
   {
      trial += 2L;                           // Next value for checking
      found = false;                         // Set found indicator
        
      for(int i {}; i < count; i++)          // Division by existing primes
      {
         found =(trial % *(pprime + i)) == 0;// True for exact division
         if(found)                           // If division is exact
            break;                           // it's not a prime
      }
        
      if (!found)                            // We got one...
         *(pprime + count++) = trial;        // ...so save it in primes array
   } while(count < max);
        
   // Output primes 5 to a line
   for(int i {}; i < max; i++)
   {
      if(i % 5 == 0)                         // New line on 1st, and every 5th line
         cout << endl;
      cout << setw(10) << *(pprime + i);
   }
        
   delete [] pprime;                         // Free up memory...
   pprime = nullptr;                         // ...and reset the pointer
   cout << endl;
   return 0;
}

Here’s an example of the output from this program:

Enter the number of primes you would like (at least 4): 20
         2         3         5         7        11
        13        17        19        23        29
        31        37        41        43        47
        53        59        61        67        71

How It Works

After receiving the number of primes required in the int variable max, you make sure that max can be no less than 4. This is because the program requires space to be allocated for at least the three seed primes, plus one new one. You specify the size of the array by putting the variable max between the square brackets following the array type specification:

   long* pprime {new long[max] {2L, 3L, 5L} };

The program would terminate at this point if the memory could not be allocated for pprime. The statement also initializes the first three elements of the array to the first three prime values. The remaining elements will be 0.

The calculation of the primes is exactly as before; the only change is that the name of the pointer, pprime, replaces the array name, primes, that you used in the previous version. Equally, the output process is the same. Acquiring space dynamically is really not a problem. After it has been allocated, it in no way affects how the computation is written.

After you finish with the array, you remove it from the free store using delete, remembering to include the square brackets to indicate that it is an array:

   delete [] pprime;             // Free up memory

Although it’s not essential here, you also reset the pointer:

   pprime = nullptr;            // and reset the pointer

All memory allocated in the free store is released when your program ends, but it is good to get into the habit of resetting pointers to nullptr when they no longer point to valid memory areas.

Dynamic Allocation of Multidimensional Arrays

Allocating memory in the free store for a multidimensional array involves using the new operator in a slightly more complicated form than is used for a one-dimensional array. Suppose that you define the pointer pbeans like this:

double (*pbeans)[4] {};

To obtain the space for the array beans[3][4] that you used earlier in this chapter, you could write this:

pbeans = new double [3][4];              // Allocate memory for a 3x4 array

You just specify both array dimensions between square brackets after the type name for the elements. Of course, you could do it all in one go:

double (*pbeans)[4] {new double [3][4]};

Allocating space for a three-dimensional array simply requires that you specify the extra dimension, as in this example:

auto pBigArray (new double [5][10][10]); // Allocate memory for a 5x10x10 array

This uses auto to have the pointer type determined automatically. Don’t forget — you can’t use an initializer list with auto. You could write it as:

auto pBigArray = new double [5][10][10]; // Allocate memory for a 5x10x10 array

However many dimensions there are in the array that has been created, to destroy it and release the memory back to the free store, you write the following:

delete [] pBigArray;                     // Release memory for array
pBigArray = nullptr;

You always use just one pair of square brackets following the delete operator, regardless of the dimensionality of the array.

You have already seen that you can use a variable as the specification of the dimension of a one-dimensional array to be allocated by new. This extends to two or more dimensions, but with the restriction that only the leftmost dimension may be specified by a variable. All the other dimensions must be constants or constant expressions. So, you could write this,

pBigArray = new double[max][10][10];

where max is a variable; however, specifying a variable for any dimension other than the leftmost causes an error message to be generated by the compiler.

USING REFERENCES

A reference appears to be similar to a pointer in many respects, which is why I’m introducing it here, but it really isn’t. The importance of references becomes apparent only when you get to explore their use with functions, particularly in the context of object-oriented programming. Don’t be misled by their simplicity and what might seem to be a trivial concept here. As you’ll see later, references provide some extraordinarily powerful facilities, and in some contexts enable you to achieve results that would be impossible without them.

What Is a Reference?

Essentially, a reference is a name that can be used as an alias for something else. There are two kinds of references: lvalue references and rvalue references.

An lvalue reference is an alias for another variable; it is called an lvalue reference because it refers to a persistent storage location that can appear on the left of an assignment operation. Because an lvalue reference is an alias, the variable for which it is an alias has to exist when the reference is defined. Unlike a pointer, a reference cannot be altered to represent something else.

An rvalue reference can be used as an alias for a variable, just like an lvalue reference, but it differs from an lvalue reference in that it can also reference an rvalue, which is a temporary value that is essentially transient.

Declaring and Initializing Lvalue References

Suppose that you have defined a variable as:

long number {};

You can define an lvalue reference for this variable using this statement:

long& rnumber {number};        // Declare a reference to variable number

The ampersand following the type name long and preceding the variable name rnumber, indicates that an lvalue reference is being defined, and that the variable name it represents, number, is specified as the initializing value between the parentheses; therefore rnumber is of type ‘reference to long'. You can use the reference in place of the original variable name. For example:

rnumber += 10L;

This will increment number by 10.

Note that you cannot write:

int& rfive {5};                // Will not compile!

The literal 5 is constant and cannot be changed. To protect the integrity of constant values, you must use a const reference:

const int& rfive {5};          // OK

Now you can access the literal 5 through the rfive reference. Because you define rfive as const, it cannot be used to change the value it references.

Let’s contrast the lvalue reference rnumber in the previous code with the pointer pnumber, defined in this statement:

long* pnumber {&number};       // Initialize a pointer with an address

This defines the pointer pnumber, and initializes it with the address of number. This allows number to be incremented with a statement such as:

*pnumber += 10L;               // Increment number through a pointer

There is a significant distinction between using a pointer and using a reference. You must dereference the pointer to access the variable to which it points in the expression. With a reference, there is no need for dereferencing. In some ways, a reference is like a pointer that has already been dereferenced, although it can’t be changed to reference something else. An lvalue reference is the complete equivalent of the variable for which it is a reference.

Using References in a Range-based for Loop

Earlier in this chapter you saw a code snippet using a range-based for loop to iterate over an array of temperatures:

for(auto t : temperatures)
{
  sum += t;
  ++count;
}

The t variable does not reference an array element, only its value, so you cannot use it to modify the element. However, you can by using a reference:

const double FtoC {5.0/9.0};           // Convert Fahrenheit to Centigrade
for(auto& t : temperatures)
  t = (t - 32)*FtoC;
for(auto& t : temperatures)
  cout << "  " << t;
cout << endl;

The variable t will now be of type double& and will reference each array element directly. This loop changes the values in the array from Fahrenheit to Centigrade.

Using a reference in a range-based for loop is particularly valuable when you are working with collections of objects. Copying objects can be expensive on time, so avoiding copying by using a reference type makes your code more efficient. You will learn about collections of objects in Chapter 10 when the range-based for loop comes into its own.

If you want to use references with the range-based for loop for performance reasons, but you don’t want to be able to modify the values, you can use const auto&, as in:

for (const auto& t : temperatures)
  cout << "  " << t;

Creating Rvalue References

I am explaining rvalue references here because the concept is related to that of lvalue references, but I cannot go into the significance of rvalue references at this point. Rvalue references are particularly important in the context of functions, which you’ll learn about in Chapter 5. You’ll also learn more about rvalue references in subsequent chapters.

As you know, every expression is either an rvalue or an lvalue. A variable is an lvalue because it represents a location in memory. An rvalue is different. It represents the result of evaluating an expression. Thus, an lvalue reference is a reference to a variable that has a name, and allows the contents of the memory that the variable represents to be accessed through the lvalue reference. An rvalue reference is a reference to memory containing the result of evaluating an expression.

You specify an rvalue reference type using two ampersands following the type name. Here’s an example:

int x {5};
int&& rExpr {2*x + 3};                   // rvalue reference
cout << rExpr << endl;
int& rx {x};                              // lvalue reference
cout << rx << endl;

Here, the rvalue reference is initialized to reference the result of evaluating the expression 2*x+3, which is a temporary value — an rvalue. The output will be 13. You cannot do this with an lvalue reference. Is this useful? In this case, no, indeed it is not recommended at all; but in a different context, it is very useful.

LIBRARY FUNCTIONS FOR STRINGS

The cstring standard header defines functions that operate on null-terminated strings. These are functions that are specified in the C++ standard and are defined in the std namespace. There are alternatives to some of these that are not standard and therefore not in the std namespace, but which provide a more secure implementation of the function than the original versions. In general, the secure functions have names ending with _s and I’ll use the more secure versions in examples. Let’s explore some of the most useful functions provided by the cstring header.

Finding the Length of a Null-terminated String

The strlen() function returns the length of the argument string of type char* as a value of type size_t. The wcslen() function does the same thing for strings of type wchar_t*.

Here’s how you use the strlen() function:

const char* str {"A miss is as good as a mile."};
std::cout << "The string contains " <<  std::strlen(str) << " characters.";

The output produced when this fragment executes is:

The string contains 28 characters.

As you can see from the output, the length that is returned does not include the terminating null. It is important to keep this in mind, especially when you are using the length of one string to create another.

Both strlen() and wcslen() find the length by looking for the null at the end. If there isn’t one, the functions will happily continue beyond the end of the string, checking throughout memory in the hope of finding a null. For this reason, these functions represent a security risk when you are working with data from an untrusted external source. It is generally better to use the strnlen() and wcsnlen() functions, both of which require a second argument that specifies the length of the buffer in which the string specified by the first argument is stored. For example:

char str[30] {"A miss is as good as a mile."};
std::cout << "The string contains " <<  strnlen(str, _countof(str)) 
          << " characters.";

The second argument to strnlen() is provided by the _countof() macro.

Joining Null-terminated Strings

The strcat() function that concatenates two null-terminated strings is deprecated because it is unsafe. The strcat_s() function is the safe alternative. The string specified by the second argument to strcat_s() is appended to the string specified by the first argument. Here’s an example of how you might use it:

const size_t count {30};
char str1[count] {"Many hands"};
const char* str2 {" make light work."};
        
errno_t error {strcat_s(str1, str2)};
        
if(error == 0)
    std::cout << "Strings joined successfully.
"
              << str1 << std::endl;
        
else if(error == EINVAL)
  std::cout <<"Error! Source or destination string address is a null pointer." 
            << std::endl;
        
else if(error == ERANGE)
  std::cout << "Error! Destination string too small." << std::endl;

For convenience, I defined the array size as the constant count. The first argument to strcat_s() is the destination string to which the source string specified by the second argument is to be appended. The function returns an integer value of type errno_t to indicate how things went. The return value will be zero if the operation is successful, EINVAL if the source or destination is nullptr, or ERANGE if the destination length is too small. In the event of an error, the destination will be left unchanged. The error code values EINVAL and ERANGE are defined in the cerrno header, which is included indirectly in the iostream header. Of course, you are not obliged to test for the error codes that the function might return but it is good practice.

As Figure 4-8 shows, the first character of the string specified by the second argument overwrites the terminating null of the first argument, and all the remaining characters of the second string are copied across, including the terminating null. Thus, the output from the fragment will be:

Strings joined successfully.
Many hands make light work.

The wcscat_s() function is the safe alternative to wcscat() that concatenates wide-character strings, and works in the same way as the strcat_s() function.

With the strncat_s() function you can append part of one null-terminated string to another. The first two arguments are the destination and source strings respectively, and the third argument is a count of the number of characters from the source string that are to be appended. With the strings as defined in Figure 4-8, here’s an example of using strncat_s():

  errno_t error{ strncat_s(str1, str2, 11) };

After executing this statement, str1 contains the string "Many hands make light". The operation appends 11 characters from str2 to str1, overwriting the terminating '' in str1, and then appends a final '' character. The wcsncat_s() provides the same capability as strncat_s() but for wide-character strings.

Copying Null-terminated Strings

The standard library function strcpy() copies a string from a source location to a destination. The strcpy_s() function is a more secure version of strcpy(). The first argument is a pointer to the destination, and the second is a pointer to the source string; the first argument is of type char* and the second is type const char*. strcpy_s()verifies that the source and destination are not nullptr and that the destination has sufficient space to accommodate the source string. If either argument is nullptr or the destination is too small, the program will crash and offer you the option to close the program or start debugging it, thus preventing an uncontrolled copy operation. wcscpy_s() provides analogous wide-character versions of this copy function.

Comparing Null-terminated Strings

The strcmp() function compares two null-terminated strings that you specify by arguments of type const char*. The function returns a value of type int that is less than zero, zero, or greater than zero, depending on whether the string pointed to by the first argument is less than, equal to, or greater than the string pointed to by the second argument. Here’s an example:

const char* str1 {"Jill"};
const char* str2 {"Jacko"};
int result {std::strcmp(str1, str2)};
if(result < 0)
  std::cout << str1 << " is less than " << str2 << '.' << std::endl;
else if(0 == result)
  std::cout << str1 << " is equal to " << str2 << '.' << std::endl;
else
  std::cout << str1 << " is greater than " << str2 << '.' << std::endl;

This fragment compares the strings str1 and str2, and uses the value returned by strcmp() to execute one of three possible output statements.

Comparing strings works by comparing the character codes of successive pairs of corresponding characters. The first pair of characters that are different determines whether the first string is less than or greater than the second string. Two strings are equal if they contain the same number of characters, and the corresponding characters are identical. Of course, the output is:

Jill is greater than Jacko.

The wcscmp() function is the wide-character string equivalent of strcmp().

Searching Null-terminated Strings

The strspn() function searches a string for the first character that is not in a given set and returns the index of the character found. The first argument is a pointer to the string to be searched, and the second is a pointer to a string containing the set of characters. You could search for the first character that is not a vowel like this:

char str[] {"I agree with everything."};
const char* vowels {"aeiouAEIOU "};
size_t index {std::strspn(str, vowels)};
std::cout << "The first character that is not a vowel is '" << str[index]
          << "' at position " << index << std::endl;

This searches str for the first character that is not contained in vowels. Note that I included a space in the vowels set, so a space will be ignored so far as the search is concerned. The output from this fragment is:

The first character that is not a vowel is 'g' at position 3

Another way of looking at the return value from strspn()is that it represents the length of the substring, starting from the first character in the first argument string that consists entirely of characters in the second argument string. In the example it is the first three characters "I a". The wcsspn() function is the wide-character string equivalent of strspn().

The strstr() function returns a pointer to the position in the first argument of a substring specified by the second argument. Here’s a fragment that shows this in action:

char str[] {"I agree with everything."};
const char* substring {"ever"}; 
char* psubstr {std::strstr(str, substring)};
        
if(!psubstr)
  std::cout << """ << substring << "" not found in "" << str << """ << 
std::endl;
else
  std::cout << "The first occurrence of "" << substring
            << "" in "" << str << "" is at position "
            << psubstr-str << std::endl;

The third statement calls strstr()to search str for the first occurrence of substring. The function returns a pointer to the position of the substring if it is found, or nullptr when it is not found. The if statement outputs a message, depending on whether or not substring was found in str. The expression psubstr-str gives the index position of the first character in the substring. The output produced by this fragment is:

The first occurrence of "ever" in "I agree with everything." is at position 13

TRY IT OUT: Searching Null-terminated Strings

This example searches a given string to determine the number of occurrences of a given substring:

// Ex4_12.cpp
// Searching a string
#include <iostream>
#include <cstring>
using std::cout;
using std::endl;
 
int main()
{
  char str[] { "Smith, where Jones had had "had had" had had "had"."
    "
"Had had" had had the examiners' approval." };
  const char* word { "had" };
  cout << "The string to be searched is: " << endl << str << endl;
 
  int count {};                              // Number of occurrences of word in str
  char* pstr { str };                        // Pointer to search start position
  char* found {};                            // Pointer to occurrence of word in str
  const size_t wordLength { std::strlen(word) };
  while (true)
  {
    found = std::strstr(pstr, word);
    if (!found)
      break;
    ++count;
    pstr = found + wordLength;                
// Set next search start as 1 past the word found
  }
  cout << """ << word << "" was found "
       << count << " times in the string." << endl;
  return 0;
}

The output from this example is:

The string to be searched is: Smith, where Jones had had "had had" had had "had".
"Had had" had had the examiners' approval.
"had" was found 10 times in the string.

How It Works

All the action takes place in the indefinite while loop:

  while(true)
  {
    found = std::strstr(pstr, word);
    if (!found)
      break;
    ++count;
    pstr = found + wordLength;                
// Set next search start as 1 past the word found
  }

You first search the string for word starting at position pstr, which initially is the beginning of the string. You store the address strstr() returns in found, which will be nullptr if word was not found so the if statement ends the loop in that case.

If found is not nullptr, you increment the number of occurrences of word, and update pstr so that it points to one character past the word instance that was found. This will be the starting point for the search on the next loop iteration. From the output, you can see that word was found ten times in str. Of course, "Had" doesn’t count because it starts with an uppercase letter.

SUMMARY

You are now familiar with all of the basic types of values in C++, how to create and use arrays of those types, and how to create and use pointers. You have also been introduced to the idea of a reference. However, we have not exhausted all of these topics. I’ll come back to arrays, pointers, and references later in the book.

The pointer mechanism is sometimes a bit confusing because it can operate at different levels within the same program. Sometimes it is operating as an address, and at other times it can be operating with the value stored at an address. It’s very important that you feel at ease with the way pointers are used, so if you find that they are in any way unclear, try them out with a few examples of your own until you feel confident about applying them.

EXERCISES

Write a program that allows an unlimited number of values to be entered and stored in an array allocated in the free store. The program should then output the values, five to a line, followed by the average of the values entered. The initial array size should be five elements. The program should create a new array with five additional elements, when necessary, and copy values from the old array to the new.
Repeat the previous exercise but use pointer notation throughout instead of arrays.
Declare a character array, and initialize it to a suitable string. Use a loop to change every other character to uppercase.
1. Hint: In the ASCII character set, values for uppercase characters are 32 less than their lowercase counterparts.
Define an array of elements of type double that contains twelve arbitrary values that represent monthly average temperatures in Fahrenheit. Use a range-based for loop to convert the values to Centigrade and find and output the maximum, minimum, and average Centigrade temperatures.

WHAT YOU LEARNED IN THIS CHAPTER

TOPIC	CONCEPT
Arrays	An array allows you to manage a number of variables of the same type using a single name. Each dimension of an array is defined between square brackets, following the array name in the definition of the array.
Array dimensions	Each dimension of an array is indexed starting from zero. Thus, the fifth element of a one-dimensional array has the index value 4.
Initializing arrays	Arrays can be initialized by placing the initializing values between curly braces in the definition — in other words, in an initializer list.
Range `for` loop	You can use the range-based `for` loop to iterate over each of the elements in an array.
Pointers	A pointer is a variable that contains the address of another variable. A pointer is defined as a ‘pointer to type’ and may only be assigned addresses of variables of the given type.
Pointers to `const` and `const` pointers	A pointer can point to a constant object. Such a pointer can be reassigned to another object. A pointer may also be defined as `const`, in which case it can’t be reassigned.
References	A reference is an alias for something else. An lvalue reference can be used in place of the variable it references. An rvalue reference can refer to a value stored in a temporary location. A reference must be initialized when it is defined. A reference can’t be reassigned to another variable.
The `sizeof` operator	The `sizeof` operator returns the number of bytes occupied by the object specified as its argument. Its argument may be a variable, or a type name between parentheses.
The `new` operator	The `new` operator allocates memory in the free store. When memory has been assigned, it returns a pointer to the beginning of the memory area. If memory cannot be allocated for any reason, an exception is thrown that by default causes the program to terminate.
The `delete` operator	You use the `delete` operator to release memory that you previously allocated using the `new` operator.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Chapter 4: Arrays, Strings, and Pointers

Create new playlist

Sign In

Sign Up

HANDLING MULTIPLE DATA VALUES OF THE SAME TYPE

Arrays

Declaring Arrays

Initializing Arrays

Using the Range-based for Loop

Multidimensional Arrays

Initializing Multidimensional Arrays

WORKING WITH C-STYLE STRINGS

String Input

String Literals

Using the Range-based for Loop with Strings

INDIRECT DATA ACCESS

What Is a Pointer?

Declaring Pointers

The Address-of Operator

Using Pointers

The Indirection Operator

Why Use Pointers?

Initializing Pointers

Pointers to char

The sizeof Operator

Constant Pointers and Pointers to Constants

Pointers and Arrays

Pointer Arithmetic

Using Pointers with Multidimensional Arrays

Pointer Notation with Multidimensional Arrays

DYNAMIC MEMORY ALLOCATION

The Free Store, Alias the Heap

The new and delete Operators

Allocating Memory Dynamically for Arrays

Dynamic Allocation of Multidimensional Arrays

USING REFERENCES

What Is a Reference?

Declaring and Initializing Lvalue References

Using References in a Range-based for Loop

Creating Rvalue References

LIBRARY FUNCTIONS FOR STRINGS

Finding the Length of a Null-terminated String

Joining Null-terminated Strings

Copying Null-terminated Strings

Comparing Null-terminated Strings

Searching Null-terminated Strings

SUMMARY

EXERCISES

WHAT YOU LEARNED IN THIS CHAPTER

Table of Contents for
Chapter 4: Arrays, Strings, and Pointers