Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

3. More with Data Types

In Chapter 2, we covered all the C# predefined types and briefly touched on the topic of reference types versus value types. In this chapter, we continue the discussion of data types with further explanation of the categories of types.

A mind map shows the contents of chapter 3, which is related to data types.

In addition, we delve into the details of combining data elements together into tuples—a feature introduced in C# 7.0—followed by grouping data into sets called arrays. To begin, let’s delve further into understanding value types and reference types.

Categories of Types

All types fall into one of two categories: value types and reference types. The differences between the types in each category stem from how they are copied: Value type data is always copied by value, whereas reference type data is always copied by reference.

Value Types

Except for string, all the predefined types in the book so far have been value types. Variables of value types contain the value directly. In other words, the variable refers to the same location in memory where the value is stored. Because of this, when a different variable is assigned the same value, a copy of the original variable’s value is made to the location of the new variable. A second variable of the same value type cannot refer to the same location in memory as the first variable. Consequently, changing the value of the first variable will not affect the value in the second variable, as Figure 3.1 demonstrates. In the figure, number1 refers to a location in memory that contains the value 42. After assigning number1 to number2, both variables will contain the value 42. However, modifying either variable’s value will not affect the other.

Similarly, passing a value type to a method such as Console.WriteLine() will result in a memory copy, and any changes to the parameter inside the method will not affect the original value within the calling function. Since value types require a memory copy, they generally should be defined to consume a small amount of memory; value types should almost always be less than 16 bytes in size.

Reference Types

By contrast, the value of a variable of reference type is a reference to a storage location that contains data. Reference types store the reference where the data is located instead of storing the data directly, as value types do. Therefore, to access the data, the runtime reads the memory location out of the variable and then “jumps” to the location in memory that contains the data, an operation known as dereferencing. The memory area of the data a reference type points to is called the heap (see Figure 3.2).

The typical memory representation of the program is shown. — **Figure 3.2: Reference types point to the heap**

A figure illustrates the memory allocation of value type variables in a stack. The source code is as follows: int number1 equals 42; character letter equals 'A;' float pi equals 3.14 F; int number2 equals number 1, using system.IO, string text equals "A cacophony of ramblings from my potpourri of notes"; StringReader reader equals new StringReader(text). It stores 42 in the memory space allocated for the variable number 1; stores 'A' in the memory allocated for variable letter; stores 3.14F in the memory space allocated for the variable pi; stores 42 in the memory space allocated for the variable number 2; stores 0x00A61234 in the memory space allocated for the variable text; stores 0x00A612C0 in the memory space allocated for the variable reader. The memory area of string text and stringReader reader points to the random location in the segment of a heap where it stores the reference type of that variables allocated. The data for string text is as follows: 9C 66 00 20 00 and so on. The data for string Reader reader is as follows: 41 00 20 00 63 and so on.

A reference type does not require the same memory copy of the data that a value type does, which makes copying reference types far more efficient than copying large value types. When assigning the value of one reference type variable to another reference type variable, only the reference is copied, not the data referred to. In practice, a reference is always the same size as the “native size” of the processor—a 32-bit processor will copy a 32-bit reference, a 64-bit processor will copy a 64-bit reference, and so on. Obviously, copying the small reference to a large block of data is faster than copying the entire block, as a value type would.

Since reference types copy a reference to data, two different variables can refer to the same data. If two variables refer to the same object, changing data in the object via one variable causes the effect to be seen when accessing the same data via another variable. This happens both for assignment and for method calls. Therefore, a method can affect the data of a reference type, and that change can be observed when control returns to the caller. For this reason, a key factor when choosing between defining a reference type or a value type is whether the object is logically like an immutable value of fixed size (and therefore possibly a value type), or logically a mutable thing that can be referred to (and therefore likely to be a reference type).

Besides string and any custom classes such as Program, all types discussed so far are value types. However, most types are reference types. Although it is possible to define custom value types, it is relatively rare to do so in comparison to the number of custom reference types.

Begin 8.0

Begin 2.0

Declaring Types That Allow `null`

Often it is desirable to represent values that are “missing.” When specifying a count, for example, what do you store if the count is unknown or unassigned? One possible solution is to designate a “magic” value, such as -1 or int.MaxValue. However, these are valid integers, so it can be ambiguous as to when the magic value is a normal int or when it implies a missing value. A preferable approach is to assign null to indicate that the value is invalid or that the value has not been assigned. Assigning null is especially useful in database programming. Frequently, columns in database tables allow null values. Retrieving such columns and assigning them to corresponding variables within C# code is problematic, unless the data type in C# can contain null as well.

You can declare a type as either nullable or not nullable, meaning you can declare a type to allow a null value or not, with the nullable modifier. (Technically, C# only includes support for the nullable modifier with value types in C# 2.0 and reference types in C# 8.0.) To enable nullability, simply follow the type declaration with a nullable modifier—a question mark immediately following the type name. For example, int? number = null will declare a variable of type int that is nullable and assign it the value null. Unfortunately, nullability includes some pitfalls, requiring the use of special handling when nullability is enabled.

8.0

Dereferencing a `null` Reference

While support for assigning null to a variable is invaluable (pun intended), it is not without its drawbacks. While copying or passing a null value to other variables and methods is inconsequential, dereferencing (invoking a member on) an instance of null will throw a System.NullReferenceException—for example, invoking text.GetType() when text has the value null. Anytime production code throws a System.NullReferenceException, it is always a bug. This exception indicates that the developer who wrote the code did not remember to check for null before the invocation. Further exacerbating the problem, checking for null requires an awareness on the developer’s part that a null value is possible and, therefore, an explicit action is necessary. It is for this reason that declaring of a nullable variable requires explicit use of the nullable modifier—rather than the opposite approach where null is allowed by default (see “Nullability of Reference Types before C# 8.0” later in the section). In other words, when the programmer opts in to allow a variable to be null, he or she takes on the additional responsibility of being sure to avoid dereferencing a variable whose value is null.

Since checking for null requires the use of statements and/or operators that we haven’t discussed yet, the details on how to check for null appear in Advanced Topic: Checking for null. Full explanations, however, appear in Chapter 4.

2.0

Advanced Topic: Checking for `null`

There are numerous statements and operators that developers can use to check for null. Listing 3.1 provides a few examples. The clearest way to check for null is with an if statement and the is operator, as demonstrated in Listing 3.1.

8.0

Listing 3.1: Checking for null

Example	Description	Example Code
1.	Assign a tuple to individually declared variables.	(string country, string capital, double gdpPerCapita) = ("South Sudan", "Juba", 275.18); System.Console.WriteLine( $@"The poorest country in the world in 2017 was { country}, {capital}: {gdpPerCapita}");
2.	Assign a tuple to individually declared variables that are pre-declared.	string country; string capital; double gdpPerCapita; (country, capital, gdpPerCapita) = ("South Sudan", "Juba", 275.18); System.Console.WriteLine( $@"The poorest country in the world in 2017 was { country}, {capital}: {gdpPerCapita}");
3.	Assign a tuple to individually declared and implicitly typed variables.	(var country, var capital, var gdpPerCapita) = ("South Sudan", "Juba", 275.18); System.Console.WriteLine( $@"The poorest country in the world in 2017 was { country}, {capital}: {gdpPerCapita}");
4.	Assign a tuple to individually declared variables that are implicitly typed with a distributive syntax.	var (country, capital, gdpPerCapita) = ("South Sudan", "Juba", 275.18); System.Console.WriteLine( $@"The poorest country in the world in 2017 was { country}, {capital}: {gdpPerCapita}");
5.	Declare a named item tuple and assign it tuple values, and then access the tuple items by name.	(string Name, string Capital, double GdpPerCapita) countryInfo = ("South Sudan", "Juba", 275.18); System.Console.WriteLine( $@"The poorest country in the world in 2017 was { countryInfo.Name}, {countryInfo.Capital}: { countryInfo.GdpPerCapita}");
6.	Assign a named item tuple to a single implicitly typed variable that is implicitly typed, and then access the tuple items by name.	var countryInfo = (Name: "South Sudan", Capital: "Juba", GdpPerCapita: 275.18); System.Console.WriteLine( $@"The poorest country in the world in 2017 was { countryInfo.Name}, {countryInfo.Capital}: { countryInfo.GdpPerCapita}");
7.	Assign an unnamed tuple to a single implicitly typed variable, and then access the tuple elements by their item-number property.	var countryInfo = ("South Sudan", "Juba", 275.18); System.Console.WriteLine( $@"The poorest country in the world in 2017 was { countryInfo.Item1}, {countryInfo.Item2}: { countryInfo.Item3}");
8.	Assign a named item tuple to a single implicitly typed variable, and then access the tuple items by their item-number property.	var countryInfo = (Name: "South Sudan", Capital: "Juba", GdpPerCapita: 275.18); System.Console.WriteLine( $@"The poorest country in the world in 2017 was { countryInfo.Item1}, {countryInfo.Item2}: { countryInfo.Item3}");
9.	Discard portions of the tuple with underscores.	(string name, _, double gdpPerCapita) = ("South Sudan", "Juba", 275.18);
10.	Tuple element names can be inferred from variable and property names (starting in C# 7.1).	string country = "South Sudan"; string capital = "Juba"; double gdpPerCapita = 275.18; var countryInfo = (country, capital, gdpPerCapita); System.Console.WriteLine( $@"The poorest country in the world in 2017 was { countryInfo.country}, {countryInfo.capital}: { countryInfo.gdpPerCapita}");

Description	Example
*Declaration* Note that the brackets appear with the data type. Multidimensional arrays are declared using commas, where the comma+1 specifies the number of dimensions.	string[] languages; // one-dimensional int[,] cells; // two-dimensional
*Assignment* The `new` keyword and the corresponding data type are optional at declaration time. If not assigned during declarations, the `new` keyword is required when instantiating an array. Arrays can be assigned without literal values. As a result, the value of each item in the array is initialized to its default. If no literal values are provided, the size of the array must be specified. (The size does not have to be a constant; it can be a variable calculated at runtime.) Starting with C# 3.0, specifying the data type is optional.	string[] languages = { "C#", "COBOL", "Java", "C++", "TypeScript", "Pascal", "Python", "Lisp", "JavaScript"}; languages = new string[9]; languages = new string[]{"C#", "COBOL", "Java", "C++", "TypeScript", "Pascal", "Python", "Lisp", "JavaScript" }; // Multidimensional array assignment // and initialization int[,] cells = new int[3,3] { {1, 0, 2}, {1, 2, 0}, {1, 2, 1} };
*Forward Accessing an Array* Arrays are zero based, so the first element in an array is at index 0. The square brackets are used to store and retrieve data from an array.	string[] languages = new string[]{ "C#", "COBOL", "Java", "C++", "TypeScript", "Visual Basic", "Python", "Lisp", "JavaScript"}; // Retrieve fifth item in languages array // (TypeScript) string language = languages[4]; // Write "TypeScript" System.Console.WriteLine(language); // Retrieve second item from the end (Python) language = languages[^3]; // Write "Python" System.Console.WriteLine(language);
*Reverse Accessing an Array* Starting in C# 8.0, you can also index an array from the end. For example, item `^1` corresponds to indexing the last element of the array and `^3` corresponds to indexing the third-from-the-last element.
*Ranges* C# 8.0 allows you to identify and extract an array of elements using the range operator, which identifies the starting item up to but excluding the end item.	System.Console.WriteLine($@"^3..^0: { // Python, Lisp, JavaScript string.Join(", ", languages[^3..^0]) }"); System.Console.WriteLine($@"^3..: { // Python, Lisp, JavaScript string.Join(", ", languages[^3..]) }"); System.Console.WriteLine($@" 3..^3: { // C++, TypeScript, Visual Basic string.Join(", ", languages[3..^3]) }"); System.Console.WriteLine($@" ..^6: { // C#, COBOL, Java string.Join(", ", languages[..^6]) }");

Common Mistake	Error Description	Corrected Code
int numbers[];	The square brackets for declaring an array appear after the data type, not after the variable identifier.	int[] numbers;
int[] numbers; numbers = {42, 84, 168 };	When assigning an array after declaration, it is necessary to use the `new` keyword and then specify the data type.	int[] numbers; numbers = new int[]{ 42, 84, 168 }
int[3] numbers = { 42, 84, 168 };	It is not possible to specify the array size as part of the variable declaration.	int[] numbers = { 42, 84, 168 };
int[] numbers = new int[];	The array size is required at initialization time unless an array literal is provided.	int[] numbers = new int[3];
int[] numbers = new int[3]{}	The array size is specified as `3`, but there are no elements in the array literal. The array size must match the number of elements in the array literal.	int[] numbers = new int[3] { 42, 84, 168 };
int[] numbers = new int[3]; Console.WriteLine( numbers[3]);	Array indices start at zero. Therefore, the last item is one less than the array size. (Note that this is a runtime error, not a compile-time error.)	int[] numbers = new int[3]; Console.WriteLine( numbers[2]);
int[] numbers = new int[3]; numbers[^0] = 42;	Same as previous error. The index from end operator uses `^1` to identify the last item in the array. `^0` is one item past the end, which doesn’t exist. (Note that this is a runtime error, not a compile-time error.)	int[] numbers = new int[3]; numbers[^1] = 42;
int[] numbers = new int[3]; numbers[numbers.Length] = 42;	Same as previous error: 1 needs to be subtracted from the `Length` to access the last element. (Note that this is a runtime error, not a compile-time error.)	int[] numbers = new int[3]; numbers[numbers. Length-1] = 42;
int[] numbers; Console.WriteLine( numbers[0]);	numbers has not yet been assigned an instantiated array, so it cannot be accessed.	int[] numbers = {42, 84}; Console.WriteLine( numbers[0]);
int[,] numbers = { {42}, {84, 42} };	Multidimensional arrays must be structured consistently.	int[,] numbers = { {42, 168}, {84, 42} };
int[][] numbers = { {42, 84}, {84, 42} };	Jagged arrays require instantiated arrays to be specified for the arrays within the array.	int[][] numbers = { new int[]{42, 84}, new int[]{84, 42} };

Table of Contents for 3. More with Data Types

Create new playlist

Sign In

Sign Up

3. More with Data Types

Categories of Types

Value Types

Reference Types

Declaring Types That Allow null

Dereferencing a null Reference

Nullable Value Types

Nullable Reference Types

Implicitly Typed Local Variables

Tuples

Arrays

Declaring an Array

Instantiating and Assigning Arrays

Using an Array

Length

Ranges

More Array Methods

Array Instance Members

Strings as Arrays

Common Array Errors

Summary

Table of Contents for
3. More with Data Types

Declaring Types That Allow `null`

Dereferencing a `null` Reference