10 Strings and Regular Expressions


Prefer the standard to the offbeat.

– Strunk & White

Introduction
Strings: string Implementation
String Views
Regular Expressions: Searching; Regular Expression Notation; Iterators
Advice

10.1 Introduction

Text manipulation is a major part of most programs. The C++ standard library offers a string type to save most users from C-style manipulation of arrays of characters through pointers. A string_view type allows us to manipulate sequences of characters however they may be stored (e.g., in a std::string or a char[]). In addition, regular expression matching is offered to help find patterns in text. The regular expressions are provided in a form similar to what is common in most modern languages. Both strings and regex objects can use a variety of character types (e.g., Unicode).
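As a rough illustration of the string_view point, the sketch below takes its text as a std::string_view so that one function serves a std::string, a char[], and a string literal alike. The function name count_char is an assumption made for this example, and the code assumes C++17 or later.

```cpp
#include <cstddef>
#include <string>
#include <string_view>

// Hypothetical helper: count how often c occurs in a character sequence.
// A std::string_view parameter refers to the caller's characters without
// copying them, whether they live in a std::string or in a char array.
std::size_t count_char(std::string_view text, char c)
{
    std::size_t n = 0;
    for (char x : text)
        if (x == c)
            ++n;
    return n;
}
```

Under these assumptions, count_char(std::string{"banana"}, 'a') and count_char("banana", 'a') both return 3; neither call copies the characters.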

10.2 Strings

The standard library provides a string type to complement the string literals (§1.2.1); string is a regular type (§8.2, §14.5) for owning and manipulating a sequence of characters of various character types. The string type provides a variety of useful string operations, such as concatenation. For example:
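A minimal sketch of concatenation follows. The function names compose and add_newline and the sample values in the note after the code are illustrative assumptions; only the + and += operations of std::string are relied on.

```cpp
#include <string>

// Build an e-mail style address by concatenating a name, '@', and a domain.
// operator+ accepts strings and single characters and yields a new std::string.
std::string compose(const std::string& name, const std::string& domain)
{
    return name + '@' + domain;
}

// Append in place: += adds to the end of an existing string.
void add_newline(std::string& s)
{
    s += '\n';
}
```

With these definitions, compose("dmr", "bell-labs.com") yields the string dmr@bell-labs.com.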

Regular Expression Special Characters
.	Any single character (a “wildcard”)	\	Next character has a special meaning
[	Begin character class	*	Zero or more (suffix operation)
]	End character class	+	One or more (suffix operation)
{	Begin count	?	Optional (zero or one) (suffix operation)
}	End count	|	Alternative (or)
(	Begin grouping	^	Start of line; negation
)	End grouping	$	End of line
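The special characters in the table can be tried out with a small helper like the one below. It is only a sketch: the name matches is an assumption, and the code relies on std::regex with its default (ECMAScript) grammar and on std::regex_match, which succeeds only if the whole string matches.

```cpp
#include <regex>
#include <string>

// Hypothetical helper: true if the whole of s matches pattern.
// std::regex uses the ECMAScript grammar by default; a malformed
// pattern makes the std::regex constructor throw std::regex_error.
bool matches(const std::string& s, const std::string& pattern)
{
    return std::regex_match(s, std::regex{pattern});
}
```

For instance, matches("cat", "c.t") is true because . matches any single character, while matches("cat", R"(c\.t)") is false because the backslash makes the dot literal. Similarly, matches("grey", "gr(a|e)y") is true, and matches("b", "[^a]") is true because ^ negates a character class. Raw string literals such as R"(...)" avoid having to double each backslash.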

Repetition
{ n }	Exactly n times
{ n, }	n or more times
{ n, m }	At least n and at most m times
*	Zero or more, that is, {0,}
+	One or more, that is, {1,}
?	Optional (zero or one), that is, {0,1}
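The repetition suffixes can be checked with a sketch like the following, which attaches a suffix to the single-character pattern A. The function name only_as is an assumption for this example. Note that in an actual pattern the count is written without spaces, as in A{3}.

```cpp
#include <regex>
#include <string>

// Hypothetical helper: true if s consists solely of the letter 'A'
// repeated as often as the given repetition suffix allows.
// Example suffixes: "{3}", "{2,}", "{2,4}", "*", "+", "?".
bool only_as(const std::string& s, const std::string& suffix)
{
    const std::regex pat{"A" + suffix};
    return std::regex_match(s, pat);
}
```

Here only_as("AAA", "{3}") is true and only_as("AA", "{3}") is false; only_as("AAAAA", "{2,4}") is false because five repetitions exceed the upper bound, and only_as("", "*") is true because zero repetitions are allowed.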

Character Classes
alnum	Any alphanumeric character
alpha	Any alphabetic character
blank	Any whitespace character that is not a line separator
cntrl	Any control character
d	Any decimal digit
digit	Any decimal digit
graph	Any graphical character
lower	Any lowercase character
print	Any printable character
punct	Any punctuation character
s	Any whitespace character
space	Any whitespace character
upper	Any uppercase character
w	Any word character (alphanumeric characters plus the underscore)
xdigit	Any hexadecimal digit character
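In a pattern, a character class name is written between [: and :] and must itself appear inside a [ ] pair, for example [[:alpha:]]. The sketch below uses two of the classes from the table to recognize identifier-like words; the function name is_identifier and the choice of rule (a letter or underscore followed by letters, digits, or underscores) are assumptions made for illustration.

```cpp
#include <regex>
#include <string>

// Hypothetical helper: true if s is a letter or underscore followed by
// any number of letters, digits, or underscores.
bool is_identifier(const std::string& s)
{
    // [_[:alpha:]] : one underscore or alphabetic character
    // [_[:alnum:]]*: zero or more underscores or alphanumeric characters
    static const std::regex pat{"[_[:alpha:]][_[:alnum:]]*"};
    return std::regex_match(s, pat);
}
```

With the default locale, is_identifier("foo_1") and is_identifier("_x") are true, while is_identifier("1abc") and is_identifier("a-b") are false.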

Character Class Abbreviations
\d	A decimal digit	[[:digit:]]
\s	A space (space, tab, etc.)	[[:space:]]
\w	A letter (a-z) or digit (0-9) or underscore (_)	[_[:alnum:]]
\D	Not \d	[^[:digit:]]
\S	Not \s	[^[:space:]]
\W	Not \w	[^_[:alnum:]]
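In a pattern the abbreviations are written with a leading backslash (\d, \s, \w, and so on). The sketch below shows an abbreviation and its long form finding the same text; the helper name first_match is an assumption, and std::regex_search is used so that the pattern may match anywhere in the string.

```cpp
#include <regex>
#include <string>

// Hypothetical helper: return the first substring of s that matches
// pattern, or an empty string if there is no match.
std::string first_match(const std::string& s, const std::string& pattern)
{
    const std::regex pat{pattern};
    std::smatch m;
    if (std::regex_search(s, m, pat))
        return m[0].str();   // m[0] is the complete match
    return {};
}
```

Both first_match("order 66 of 99", R"(\d+)") and first_match("order 66 of 99", "[[:digit:]]+") return "66". The negated forms work the same way: first_match("ab12", R"(\D+)") returns "ab".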
