Exploring C++: The Adventure Begins

The first thing to know is that this book is written in a conversational — even whimsical style at times. I’ll not be formal unless the topic really calls for it.

Reader Background

In this book I assume you’ve had a course in basic programming. This course would have to entail data types, input/output with the console, branching and looping, functions, separate compilation (your own libraries), basic ‘class‘ design, and the use of at least ‘vector‘s for storing lots of data. If you’ve had some basic file input and output, so much the better! Basically everything you’d find in our first volume. *smile*

I also expect you’ve had a decent amount of math. Some schools require having finished Calculus I, but I think any kind of calculus would be good enough to get you those advanced abstract thinking skills prized by programmers.

Styles

There are some color and style conventions used in the book that might be helpful to know up front as well. For instance, different parts of the C++ language are colored differently. You can see a little sample in this chart:

Typography

‘short‘		‘12’456‘
‘int rand()‘	‘return‘	‘"Welcome"`
‘<iostream>‘	‘cout‘	’\n’

Definitions are given as they are needed but not highlighted in any special way. This highlights that all knowledge is precious — not just that found in a little rounded box! Also, watch for them everywhere — even in footnotes!¹

Also, there are links to online sites/documents. These links look like this one to the awesome website cppreference.com. It is a great place to look up things you’ve forgotten the details of.²

Exercises

I’ve provided no exercises in this text. There are many example codes that are complete and run just fine, but no explorations. This is because my teaching website alongside this one (craie-programming.org) is filled with programming exercises. Each is separated into semesters and kind and difficulty rating. The semesters at my school are CSC121 and CSC122. The kinds are labs for focusing 1-3 topics at a time and projects for synthesizing 3-10 topics into a cohesive whole. The difficulty is given as a Level where 1 is relatively easy at the time the material is learned and 7 is pretty darn challenging at the time the material is learned. They are great for practice even if you aren’t taking my courses so feel free to try them out!

One further note about the assignment ’prompts’ as assigned: they are not perfectly clear and tediously laid out on purpose. Part of learning to program is interacting with the prospective ’customer’ of the program/application. Their initial product description is likely to be imperfect and require some amount of clarification with them — perhaps even a few rounds of it in some cases. Students of programming need to get practice with this process as well. And who better to gather requirements clarification from than their instructor!

Code Availability

As mentioned above, there are numerous code samples gone over in the text. Many are present in the text in full. A few were much longer and are linked to on the companion site (craie-programming.org/OER). The cutoff is about two pages.

I keep this split on purpose even though the codes in the text cannot generally be copy/pasted out — something about fonts for special characters like underscores and quotes — because I feel it is important to the student’s memory to actually type much code for themselves at least in the first two semesters of a typical program of study. This is part of the classical ’muscle memory’ tradition found in numerous studies like math, martial arts, etc.

Self-Study

In any significant study of material, there comes a time for self-study. And programming is no exception! Here that takes the form of realizing you have an unanswered question or concern with a topic and making a test program to clear that up or deepen your knowledge. Such programs are typically 10 or so lines long, but can be pages in later topics.³

It is a good idea to make such programs regularly and document them well with comments, good variable names and the like, etc. Always make the effort to make your code readable and understandable to whomever may come by it later — even if that someone is just you. Don’t underestimate the worth of a good test program in reminding oneself the ways of a feature!

Viewing

I recommend a continuous scroll to keep the flow going from page to page. But it shouldn’t look bad in one-page or 2-up modes, either.

Also, in that vein, since this book was produced to be a PDF and not in print, there is no provided index. You’ve got built-in search, so hit that / and !

Finally, make sure you check regularly for updates as this is an online document and therefore subject to anytime fixes, additions, or clarifications.

Coming Soon!

There are many sections currently marked "Coming Soon!". Most of these will be filled in by the end of the Summer 2024 term. Appendices will come along as time permits after the regular chapter sections are done, however.

This work is copyright (©) Jason James but is hereby released under the Creative Commons Open License of Attribution for non-commercial uses only and a share-alike option. That is, you can use this material freely for any purpose that doesn’t bring anyone profit, but you must give me credit.

I’d also like to plead with you that if you make changes to the work that you share them back to me so that I may have the chance to consider and possibly incorporate them in my own release. Please email me at ’OER at craie-programming dot org’ with any suggestions. Thank you!

Acknowledgements

Here are a very few of the folks to whom I owe a great debt and whose wisdom and service and support helped me make this book you see on your screen:

Storage

C-Style Arrays and Strings

Before there were ‘vector‘s and the ‘array‘ ‘class‘, there were C-style arrays.⁵ These are a statically-sized⁶ storage mechanism like the ‘array‘ ‘class‘. In fact, the ‘array‘ ‘class‘ is a thin veil over the top of a C-style array.

A special subset of the C-style arrays is known as null-terminated strings or C-strings. We’ll also be talking about these in this chapter.

What we’ll eventually find is that the ‘vector‘ and ‘string‘ ‘class‘es are built from these static structures using the techniques of the next chapter ([dynmem] on dynamic memory). In fact, by the end of this book, we would be able to rebuild these ‘class‘es from scratch if we wanted.⁷

Basics of Arrays

Versus vectors

As mentioned in brief above, arrays are static memory objects. That is, they are sized at compile-time by the programmer and never get to change in size throughout the entire program run. The only way to change the size, in fact, is to change it in the source code and rebuild the program all over again.

This is drastically different from the ‘vector‘ ‘class‘ which was able to grow as needed by the program. Sometimes this difference can be a hindrance. But if we keep it in mind as we code, we can keep on top of the difficulties and keep it working.

Do we work with arrays in new designs? NO! Not unless the system we are working on (our target platform) has memory limitations that preclude the use of ‘vector‘s. These embedded or subset compiler environments just don’t have the hardware or OS support necessary for us to use the ‘vector‘ ‘class‘ in our program.

We will, however, use arrays a lot when dealing with legacy libraries that only provide an array interface to their functionality. This alone necessitates that we learn to use arrays and use them well!

Declaration

Again, this constant size can impact us throughout the program, so we make it an actual constant rather than a literal to make it easier to update when we change our mind and decide the current value is too small or too large.⁸

^{\textrm{},}

⁹

Also note that you’ll need a separate variable to track the number of used positions within your array, say:

This tracking variable should be the same type as you chose for your maximum size/capacity constant above. The type you chose can be any of the integer types: ‘short‘, ‘long‘, ‘unsigned short‘, ‘unsigned long‘, or ‘size_t‘. (If you don’t care about platform independence — aka portability — you can even use ‘int‘ or ‘unsigned int‘.)

What’s ‘size_t‘? Well, that’s one of those ‘typedef‘s (or perhaps a ‘using‘ alias) that the library set as a particular ‘unsigned‘ integer type that was ideal on the target platform for any array size information or array positions.¹⁰ That probably makes it the best choice for our purposes here, then.

Which library was that again? It is available from almost any standard library, actually. But if you don’t have anything else included already, just bring in ‘cstddef‘ as a minimal library.

(If you really don’t like the name, you can even make a ‘typedef‘ or ‘using‘ alias for this type if you want (like the ‘vector‘ did for you when it chose whatever ‘unsigned‘ integer type and called it ‘size_type‘).

Initialization

Initialization of array elements is at the programmer’s discretion — sort of… If you begin to initialize an array:

and stop short of the actual number of elements, the remaining positions will be default constructed (0 bit patterns for the built-in types¹¹). Also note that the use tracking variable has been set to the number of initializers!

Although you can, you should not leave the declared size ‘const‘ out even when providing a list of initializers. This leaves you without a known upper bound for loops or other error-checking scenarios. The infamous ‘sizeof(arr)/sizeof(double)‘ — dividing the bytes for the array by the bytes for the base type — will not work outside the original declaration context/scope of the program.¹²

A Brief Example

Here, the ‘enum‘eration ‘const‘ants have their names stored in a ‘const‘ant array so that later we can communicate to the user in words instead of numeric values:

The first ‘cout‘ prints the somewhat cryptic message ‘The light is 0.‘ But with the helper ‘string‘ array, the second ‘cout‘ shows the much more palatable ‘The light is RED.‘ instead.

We’ve simply used the ‘STOP_LIGHT‘ ‘const‘ant as a subscript into the ‘const‘ant array of ‘string‘s to select the proper name for that ‘STOP_LIGHT‘’s value. But what a difference!

While this might be an actual place to use an empty set of square brackets to make the compiler count the elements for us, I decided to be careful and put the extra ‘enum‘eration value in there. This has the one disadvantage of making ‘switch‘es on the ‘STOP_LIGHT‘ type ask for a ‘case‘ involving the extra ‘const‘ant.

Lots of Loops

Since the array is a built-in type, there are no methods to support your programming. There are a few scattered library functions, but we’ll not need them here. (Later we’ll talk about the C-string library, but that’s a very special subset of arrays…)

General

If Full

…if we had a full array, anyway. If it isn’t completely filled, we shouldn’t do the range-based version because there aren’t mechanisms like with the ‘vector‘ to end at the position of the used tracker variable.

Input Is Special

Initial input loops would be ‘do‘ or ‘while‘ depending on your interface/druthers, of course. In addition to making sure your user isn’t done with their data, they should also check to make sure that you do not overrun the maximum capacity of the array during input:

This version may make the user enter an extra piece of data when they totally fill the array, but at least the loop head keeps things safe — we are just being a little rude. We’ll fix that up in an upcoming example.

Passing Arrays to Functions

When you go to pass an array to a function, take note that they are automatically passed as a form of reference — the function gets to change the contents of the array by default. To avoid this, simply add the keyword ‘const‘ to the formal argument’s base type.

Also, note that the sizing brackets of a formal argument are typically left empty. This is because the compiler will not pay attention to their contents, anyway. This allows a function accepting an array to be most generic and accept actual arrays of any length — given the correct base type.

But, to keep the function within the sane bounds of the array — not going past the currently filled in elements, that is — we’ll [almost] always pass to the function an extra parameter to signify the number of used elements within the array.

For using the elements of the array in the function, the function head might look like this:

Or, for changing the elements of the array, the function head might look like this:

Note how this input function might also change the number of elements and so takes a reference to a ‘size_t‘ to track that number of elements for the caller.

Note on sizeof

The note above about not using ‘sizeof‘ to determine the number of elements in an array comes into play once an array has been passed outside its declaring context — to another function, for instance. Here we note that only one element is printed on a modern 64-bit system:

(Remember that red-framed examples are anti-examples and NOT to be followed but avoided!!!)

Since the size of a ‘double‘ here is typically 8 bytes and so is the size of what the function receives for an array, we just get 1 for ‘len‘. What do I mean "what the function receives for an array"? You’ll have to wait until the next chapter (section [defn:arr-ptr]) for that!

Array Subrange Processing

If your function processes a sub-range or contiguous sub-sequence within an array, you’ll need two boundaries:

Most array programmers take the ‘to‘ parameter to be inclusive, but the style with C++ ‘vector‘s is to have it be exclusive. I’ll let you decide what’s best for you (and your function’s callers). Just make it clear in your function’s documentation!!!

A Comparative Example

I have an application handy that uses one-dimensional storage to collect and average the user’s height measurements. I’ve got one version that uses a ‘vector‘ and another version that uses a C-style array. I strongly encourage you to download these and load them side-by-side in your favorite difference checker.

If you don’t have a difference checker, you can find many and their relative qualities on this page at Wikipedia. Such tools allow one to see the commonalities and differences between two pieces of code with ease and can be both a learning tool and a design tool of great use!¹⁴

Basics of C-Strings

A C-string is a specially-treated array of characters. Instead of just holding data characters, it also holds a terminator character which is guaranteed to not be a part of the user’s data. (This can be guaranteed because this character is not able to be typed at a standard keyboard.)

The terminator is placed immediately following the user’s actual data characters in the array. Hence us calling it the terminator — it terminates or ends the user’s data.

Because of this special terminator character, we won’t have to pass a size/length variable to functions which process a C-string! The downside is we’ll be able to store one less character than would otherwise be indicated by the array’s declared size. *shrug* Small price to pay!

What is the terminator character? ASCII 0, of course: ’\0’. (Also called the null character. Don’t confuse this with the ‘nullptr‘ parameter to ‘time()‘, however! We’ll talk more about that constant in the next chapter with respect to pointers.)

C-String Initialization

The array initialization syntax of a comma-separated, curly-brace enclosed list of values can be shortened to a C-string literal. That is, we can do this:

But this is assuming ‘MAX_TITLE‘ is at least 17. If it were 16, the first declaration would fail saying that the initializer was too long — even a string literal has the invisible null terminator implicitly present¹⁵ — whereas the second declaration would ’work’, but not produce a C-string constant! It would instead just be a ‘const‘ array of ‘char‘ ending in ‘’r’‘ — not ’\0’ — and therefore not a C-string.

C-String Library Functions

To help process C-strings in typical ways, the ‘cstring‘ library provides the following functions:¹⁶

So, ‘strcpy‘ is like assigning the source C-string to the destination C-string (‘d = s‘), but we cannot assign arrays!

And ‘strcat‘ is like concatenating the source C-string onto the destination C-string (‘d += s‘), but that would be assigning arrays, too!

You can even do multi-combinations, of course, like if you wanted to know that the first C-string was less than or equal to the second (‘s1 <= s2‘), you could code to check ‘strcmp‘’s result against 0 with ‘<=‘ like this:

Function	Notes
‘strcpy(d, s)‘	copies ‘s‘ into ‘d‘ — destroying ‘d‘’s prior value
‘strcat(d, s)‘	copies ‘s‘ onto the end of ‘d‘
‘strcmp(s1, s2)‘	compares ‘s1‘ to ‘s2‘ lexicographically, returning an integer analogous to that returned by ‘string::compare‘

I want to know if…	So I code…
‘s1 < s2‘	‘strcmp(s1, s2) < 0‘
‘s1 == s2‘	‘strcmp(s1, s2) == 0‘
‘s1 > s2‘	‘strcmp(s1, s2) > 0‘

And, just like with ‘string::compare‘, these comparisons are not truly alphabetical when the data are, they are ASCII-betical! That is, not only do ‘strcmp("hello", "helix")‘ and ‘strcmp("playing", "play")‘ return a positive value, so does: ‘strcmp("apple", "Apple")‘!¹⁷

Protecting from Overrun

Because the ‘strcpy‘ and ‘strcat‘ functions don’t know the declared size of the destination array, they cannot effectively protect that array from overrun by the source array! To do so, the ‘cstring‘ library also provides functions that protect these actions from overrunning the destination array. You might suspect overloading to be involved here, but remember that these are old C routines and they didn’t and still don’t have overloading in C.

So, we had to have different names for these protection functions. Since they were taking a numeric bound on the number of characters that could be copied, the letter n was added to the names. Not at the end, though — in the middle:

Whenever the declared size of the destination array is known, these functions should be used preferentially over the others. They are inherently safer by far!

Function	Notes
‘strncpy(d, s, n)‘	copies at most ‘n‘ ‘char‘s from ‘s‘ into ‘d‘
‘strncat(d, s, n)‘	copies at most ‘n‘ ‘char‘s from ‘s‘ onto the end of ‘d‘
‘strncmp(s1, s2, n)‘	compares at most ‘n‘ ‘char‘s of ‘s1‘ to those in ‘s2‘ to determine their lexicographic order

Unfortunately, there is a caveat. When the standard was first interpreted, some folks said it meant to copy the ‘n‘

^{\textrm{}th}

character and stop immediately — sometimes leaving the destination not a C-string at all! Others demanded that a null character be stored in the ‘n‘

^{\textrm{}th}

spot to make sure the destination was a C-string after all. Eventually the POSIX organization, which proposes standards to fix holes or shortcomings in other standards agreed with the latter folk and so, on a POSIX-compliant library implementation, you’ll always end up with a C-string.

But finding out if your current library is POSIX-compliant can be no small task! Perhaps a safety net is in order? Toward this end, many programmers will simply follow up any ‘strncpy‘ or ‘strncat‘ call with this assignment:

Here we cap off the destination array with a null character at its last physical position. This makes the result a C-string no matter what came before it!

With this in mind, what parameter do we pass for ‘n‘ for these two functions? Well, for ‘strncpy‘, we usually pass ‘MAX_D-1‘ as the upper bound. This avoids copying that last character since we are going to overwrite it anyway.

But for ‘strncat‘ we need to know how many characters are already in the destination in order to find out how much room is left there! Since we designed against having to track the logical length of the C-strings with null termination, we are kinda out of luck here.

We’ll have to use a helper function: ‘strlen‘. It takes a single C-string and counts and returns the number of data characters — those preceding the null terminator — therein. This is an expensive operation and not to be called upon lightly! Beware!

Now, in the third parameter to ‘strncat‘ we can place: ‘MAX_D-strlen(d)-1‘. This accounts for the maximum size and the current logical size and leaves room for the null terminator we’ll add at the end of the array upon return.

But what is ‘strncmp‘ for? There are no buffer overrun issues there unless one of the arrays isn’t really a C-string, right? True, but it is also useful to build a power-user friendly, command-driven interface.

What’s a power user? That’s someone who memorizes all the keyboard shortcuts for their apps so that they can keep their hands on the keyboard 95% of the time to maximize efficiency. Many programmers naturally turn into power users during their years of using clunky GUI interfaces so closely during development processes.

What’s a command-driven interface? It’s like you took the menu from a console program and replaced it with typed commands instead. Just like implementing your own shell — like the terminal many of you compile and run in on your Unix/Linux box.

Why? Well, it is easier for many console applications to avoid the menu and its typical 10-item-or-less rule and just implement a ’help’ command that the user can type at any time to remind them of the various commands available. Then they can type ’help command’ — listing a particular command, of course — to get more detailed help on that command.

Anyway, I spent many glorious hours in undergrad transferring files. This was a common time-wasting activity before the Web and when you didn’t have enough money to buy real games. We’d — yes, I was not alone — download music, art, freeware games, and the like from anonymous FTP sites. And with the command-line app, we’d type commands like ’get’ and ’quit’ to retrieve files from various FTP servers and to end the program when we were done.

I saved many minutes of those hours — commonly put toward visiting another site, of course — by typing ’qui’ for the quit command. This was possible because ’qui’ was a significant prefix for the quit command. Why wasn’t it just ’q’? Or ’qu’? Well, those would have been potentially confused with the ’quote’ command for sending special instructions to the remote site. But ’qui’ was just enough! *chuckle* Ah… Memories…

Case-Insensitive Comparison?

While certain systems have their own pet C-string compare functions that are not ASCII-betical, these are not part of the C++ standard. On those systems you might find ‘stricmp‘ (on Windows) or ‘strcasecmp‘ (on Unix/Linux). But I won’t recommend them here as they are not portable. I’d recommend you use the boost library’s ‘isequals‘ function or roll your own!

Output of C-Strings

‘cout‘ tries to interpret any ‘char‘ array that is inserted (‘<<‘) into it as a C-string — whether it has a null-terminator or not! Beware!!!

Oh, not convinced? Well, try it. You’ll find that a plain ‘char‘ array with no null terminator displays not only its contents but also an arrangement of garbage values that might include smiley faces, musical notes, line drops, tabs, and other such things. This is because without the null terminator, ‘cout‘ doesn’t know when to stop. Luckily there are many places in memory where there are 8 zero bits in a row, so it is bound to hit one ...eventually!

Input of C-Strings

‘cin‘’s extraction operator (‘>>‘) knows to store a null-terminator at the end of a ‘char‘ array so that it becomes a C-string after extraction. However, we don’t just want to do ‘cin >> str‘, since this leaves the maximum size of the C-string’s array in question and us vulnerable to overrun!

This let’s ‘cin‘ know how long the array for the proposed C-string ‘s‘ has been allocated to be and it will stop one ‘char‘ short of this maximum to store the null-character. It is still space-separated, though, so you may not reach ‘MAX_S-1‘ ‘char‘s of data. (Recall that the ‘setw‘ manipulator is in the ‘iomanip‘ library.)

C-String Input with Embedded Spacing

There is another variant with a third parameter to specify the input stop ‘char‘ so you aren’t stuck with ’\n’-style lines.

In addition to not playing well with prior extractions — which leave behind newlines with abandon, there is an extra caveat with ‘cin‘’s C-string ‘getline‘. When it fails to reach the ’\n’ — or whatever stop ‘char‘ you specified — it will tell ‘cin‘ that he has ‘fail‘ed! To fix this, you may want to encapsulate your use of ‘cin‘’s ‘getline‘ in a function like so:

Here I’ve parameterized the C-string, its size, the desired stopping character, and a ‘bool‘ to indicate that the buffer should be emptied upon ‘fail‘ure. Either way, the ‘fail‘ure is returned from the function. (The ‘bool‘ also has two constants for the caller to use to avoid the magic of ‘bool‘ literals. Always a good idea!)

The function takes care of the common ‘getline‘ — C-string and ‘string‘ ‘class‘ alike — problem of a left-behind stop character in the buffer by ‘peek‘ing for it and ‘ignore‘’ing it if present. This requires a ‘flush‘ of ‘cout‘ to ensure any waiting prompt is displayed since ‘peek‘ doesn’t actually read and so many libraries won’t force the display of prompts for its call.

Then the function takes care of a ‘fail‘ during the ‘getline‘ by recording it and possibly ‘clear‘ing it as well.

This handy function can be put in an input library and just d to help out in any program!

Standard C-String Processing Loop

For any other processing that you need to do, write your own loop! Yea! C-string processing loops by-and-large look like this:

This will re-process your entire C-string once for every character within it. That is, if your C-string were

n

characters long, then

(n+1)^2

‘char‘s will be processed! A simple 99-character C-string, then, would end up processing ‘10’000‘ characters!

Arrays as class Members

Let’s explore putting arrays into a ‘class‘ as member data by using example code. Here is a new take on the ‘Student‘ ‘class‘ from the ‘vector‘ chapter of the last volume:

Here we have a plain array of ‘double‘ for storing the student’s grades and a C-string array to store their name. Of course, only the plain array needs a used positions tracker.

Let’s now look at the constructor patterns for these array members. Here is the default constructor:

In much legacy code, you’ll find that array members are left out of the member initialization list. This is because in C++98 and C++03 such usage was illegal. In a plain array, it is only necessary to initialize the used tracker to 0. This is because later code won’t use the elements beyond that point, anyway. For the C-string member, we need to make sure the first character of the array (position 0) is a null character to keep it a C-string and not just a plain ‘char‘ array.

The copy constructor just copies all data from the other object without error checks since the other object is of our ‘class‘ type and has been error-checked its whole life. (We can trust this guy!) For C++03 and before, this would look like so:

For a modern implementation, it would sadly look exactly the same. Only if we use the ‘vector‘, ‘array‘, or ‘string‘ ‘class‘es can we copy the elements from an old version of ourselves. Arrays cannot be copied or assigned.

Another constructor would typically not take values for the ‘grades‘ member and just get a value for the ‘name‘:

Here we delegate to the default constructor for cleanup and then mutate the ‘name‘ member to avoid overrun concerns. (This new data came from outside the ‘class‘, after all! We can’t just trust it to be safe.)

Accessors for the ‘grades‘ array are much like those for a ‘vector‘ member of a ‘class‘. You ask which one the caller is interested in and return that. You also have a meta-accessor to return the count of used elements in the array member.

I’ve used -42 as the error indicator because I like 42 and teachers might want to use -1 thru -5 — or whatever — as flag values for special circumstances.

The mutator for the ‘grades‘ member takes a parameter telling it which grade to change and another telling it the new value:

Although we should check the domain of the new value, I’ve eschewed it here to focus on the array management.

But we also need a mutator that adds new elements to the ‘grades‘ array like so:

As to the C-string member (‘name‘), it also needs care to be taken due to its underlying array-based storage.

Let’s start with an accessor for C-string member. They give us an array in which to store a copy of our member. If they give us the declared length, we can protect the copy from overrun. If not, we just copy blithely away…

Why they wouldn’t just use our above constant to declare their array may at first elude you, but there are situations where they might want their array for this data to be smaller or larger than ours. They may be retrieving the student’s name to print in a table, for instance. Then they’d need only enough to fill the column rather than the entire 94 possible characters of the name! Or they might be storing it into a larger area and wanting to later concatenate into that area more text. You never can tell what the other programmer is going to want to do with the data you give them!

I left you with one question: Should 1 be considered a degenerate array length or not? A one-length array isn’t a useful C-string as it only holds the null character and no data. I left it with the 0 branch and used ‘strcpy‘. You can move it to the ‘strncpy‘ branch if you like, but it’ll do nothing useful there!

Some people consider that they should we return ‘false‘ when the caller’s C-string was too long. But that would require a call to ‘strlen‘ — doubling our processing time! I say, "No way!"

Here is the code all in one go and with a fairly complete driver on the book’s website. The driver only omits testing for the copy and name-based constructors.

2D Arrays

Declaration

We still use the ‘size_t‘ for declared size info, of course. The base type of the array is up to you and your application. I’ve chosen ‘double‘ here rather arbitrarily. The rows maximum comes first followed by the columns maximum — both in square brackets. The dimensions themselves are up to you and your application. I’m using 50 and 100 to show that it need not be square.

We have used trackers for both dimensions because the user won’t necessarily use the full extent of either dimension we allot for them. These start out at 0, of course.

Initialization

You can also use the initialization syntax we used before for two-dimensional arrays. Just nest the lists for each row inside an outer list for the number of rows.

Each row can be ‘MAX_ROWS‘ elements long. If it is shorter, the remaining elements are default constructed as with one-dimensional arrays. There can be at most ‘MAX_COLS‘ rows. If there are fewer, all left over rows are default constructed in their entirety. (Here, too, I’d’ve initialized the used trackers to 2 in each direction instead of 0. *smile*)

Arrays of C-Strings

But the use of the ‘typedef‘ makes the declaration and sub-array extraction more convenient.

Sub-Arrays

As you may well guess from this example, you may subscript a two-dimensional array by fewer than its full indices. Doing so gives you a sub-array of the remaining dimensions. That is:

Passing to Functions

Passing a multidimensional array to a function is a major pain (unless you are packaging the dimensions in ‘typedef‘s). The first dimension (left-most in declaration) can be left empty just as with a 1D array (the compiler will ignore any value provided there for the formal argument). The second dimension *MUST** be filled in with the proper declared constant!!!

But Why?

The reason for this is that a two-dimensional array is actually stored as a long linearized space with address jumps to the start of each row based on the number of columns. (Did I mention that the length of a row is the same as the number of columns? And vice-versa... *shiver*)

That is, we like to think of 2D memory as looking like this somewhere in the RAM:

We’ll talk more about exactly how this happens when we discuss dynamic multi-dimensional arrays, but for now, suffice it to say that the mapping from our natural 2D coordinates of ‘array[r][c]‘ would be ‘array[r*MAX_COLS+c]‘. That is, for each row desired, we pass through ‘MAX_COLS‘ entries in the array and then we move ‘c‘ more entries past that position.

An Example

On the website you can find an example of a 2D program using an array of C-strings to store the user’s name — one ’word’ per row.¹⁸ The goal of the program is quite simple: read in the user’s name, shuffle the words, repeat it back.

On lines 29-37 we input the user’s name. We start with a ‘flush‘ of ‘cout‘ because many implementations don’t do this when ‘peek‘ing ‘cin‘. We ‘peek‘ to find the newline that ends the input to stop the loop, of course. The ‘while‘ head also protects us from running out of space by checking for the maximum number of rows.

Line 34 does the core reading. It uses ‘setw‘ to protect the input name component from overrun. It also updates the number of components that have been read. This merging of the ‘++‘ into the subscript is frowned upon in modern development as too ugly and confusing. But past generations sought this kind of thing as elegant coding. So even though the current trend is away from this style, you’ll find it all over the place in the wild. Don’t be afraid! It works just fine and merely needs you to be versed in the differences between pre- and post- increment. We’ll go over those in a later chapter on operator overloading.

The comment on lines 35 and 36 mentions that we don’t really check that the input component was complete. If the input was followed by something that wasn’t a space, then an incomplete word was read and more of it is still in the buffer. We might should warn the user about this, but right now we are just looping around and gathering that trailing bit as its own component separate from its beginning.

Lines 38-42 let the user know if they had more name components than we could handle. Then line 43 cleans up the buffer either way.

The ‘if‘ that follows makes sure at least one name was entered. If so we display their name, shuffle it about, and reprint it with a quirky message about the shuffled form being ’cooler’. (The else just prints a smarmy message about them having no name at all.)

Looking below, we find a ‘swap‘ function for use by the ‘shuffle‘ function. It is well documented and laments not being the everything we’d ever want in a C-string swapper. As it says, we will get around to making an improved one when we discuss ‘template‘s later in the book.

Next is the ‘shuffle‘ function itself. As mentioned, it ‘swap‘s C-strings with any that precede them in the array — possibly including themselves! This method is perfectly sound and gives statistically good shufflings of the data. This despite your long years of shuffling cards and shaking dice until they are frayed, scarred, and ruined.

The ‘display‘ function is nothing amazing. We’ve been there and done that before. It just points out that the variable name lowercase l is horrible and should never be used by anyone — ever!

Arrays Beyond 2D

All of this extrapolates directly to 3D and beyond. You add each new dimension before the previous ones like so:

Here we’ve added a number of planes (like a 2D sheet) in front of the dimensions for each of these 2D structures. Thus, we’ll end up with ‘MAX_PLANES‘ sheets of size ‘MAX_ROWS‘ by ‘MAX_COLS‘ each.

The worst of it is the linearization of the subscripting. Here, accessing position ‘ThreeD[p][r][c]‘ would be done with ‘ThreeD[p*MAX_ROWS*MAX_COLS+r*MAX_COLS+c]‘. This can be simplified with Horner’s method to have fewer multiplies: ‘ThreeD[(p*MAX_ROWS+r)*MAX_COLS+c]‘.¹⁹

As can be inferred, all dimensions after the first must be filled in for passing 3D and higher dimensional structures to functions.

Wrap Up

In summation, we’ve covered a LOT of information in this chapter! We learned how to store data in a statically-sized array. We used this technique and the special nature of the ASCII 0 value to make handling strings of text feasible. We learned how to treat these storage types as members of a ‘class‘. And we learned how to chain them together to make multidimensional storage structures.

I hope this chapter end finds you well and not struggling. If you have any troubles, please see your instructor or a qualified tutor for help! Don’t just search the Internet. People are helpful there, but often too helpful. They’ll teach you things you aren’t prepared for and even give bad advice at times. If you must search, make sure you corroborate any advice with several sources and don’t just trust the first blog or other posting you find on a subject.

Memory Management

Memory is managed by the operating system (as are all system resources). Normally, a program is given a fixed amount of memory for its function execution ’stack’. Each time a function is called, its local variables as well as its arguments are carved from a new ’activation record’ on this stack. When the function returns, its record is reclaimed so that the space can be used by the next function to be called.²⁰

However, your machine generally has quite a bit of RAM left over after it, system drivers, and user applications are loaded. This extra RAM is known (historically) as the ’heap’.²¹ If you detect that you need extra memory as your program is running, you can request some of this heap memory from the OS. If it can spare it, you will be given a ’pointer’ into that memory. If not, the OS will generally return to you the ‘nullptr‘ address.²²

But before we get into that, let’s learn more about these things called pointers.

Pointers

A pointer is a variable that holds the address of some location in memory. That is, the contents of a pointer variable is an indication of where in the memory of the program another piece of information is located. This is typically an offset within the program’s memory area in a modern system. This allows indirect access to other memory locations by ’following’ the pointer to its destination or target.²³ Diagrammatically, we show pointers like this:

In the past, this allowed programmers to pass such an address to a function for processing without having to make a copy of the original memory area. In C++, we have references to do that sort of thing. So now we typically use pointers only for dynamic memory handling, but on rare occasions we’ll have to interface to older/legacy routines that use pointers for ’referring’ to original memory.

Versus References

We’ll be discussing the differences between pointers and references throughout the chapter, but let’s just start off that aspect of the discussion by saying that references are a layer of syntactic sugar on top of pointers. But since the compiler manages all the underlying details, we have to — get to — think high-level about them. That makes it much easier to work with concepts like changing another block of memory than using pointers directly as you’ll soon find out.

Declaration

When declaring a pointer, you must tell the compiler the type of data to which you intend to point. To distinguish a pointer type from a normal type (as used for a variable or constant declaration), our C ancestors used the star/asterisk (‘*‘) symbol as a modifier for the [base] type. So, to declare a pointer (we’ll cleverly call it ‘ptr‘) to a ‘short‘ integer, for instance, we would declare this in our code:

But isn’t the Asterisk..?

The spacing around the ‘*‘ symbol here is optional, as usual. This leads to many different styles of spacing and warring camps of programmers…again, as usual. The problem with the spacing is that some programmers are lead to believe that the ‘*‘ symbol is completely independent of the type and the identifier. However, given its role as a type modifier, it seems more logical (and, indeed is true) that it is actually associated with the data type …which, in turn, defines the very nature of the identifier.

The other point (sorry…couldn’t avoid it…*snicker*) at work here is the fact that the ‘*‘ symbol used to ’modify the data type’, doesn’t truly stick to the type it modifies. Instead, it correlates to the identifier! This leads to programmers creating declarations such as:

And then they think that both ‘p‘ and ‘q‘ are going to be pointers — since the ‘*‘ was modifying the type. Instead, they’ve declared ‘p‘ as a pointer to the correct type and ‘q‘ as a plain old variable of that type! To make both identifiers be pointers, you would have to declare them like this:

So, the nature of the thing makes it seem that the ‘*‘ should reside with the type, but the syntax of the thing requires that the ‘*‘ reside with the identifier! Most perplexing! Our champion of choice here is the trusty ‘typedef‘inition:

Now both ‘p‘ and ‘q‘ are correctly declared as pointers to the desired type! The ‘typedef‘inition makes the ‘*‘ stick to the data type it is modifying as nature intended — language syntax rules be damned! All hail the ‘typedef‘inition! ‘typedef‘inition! ‘typedef‘inition!²⁴

Terminology

As mentioned above, when dealing with a pointer, you must also be careful of your vocabulary! Many a fine algorithm has been delayed or outright doomed by one programmer loosely discussing the ’value of the pointer’ and another misinterpreting the meaning.

With a pointer, the value is technically the address of some other memory location. Note its basic definition: a pointer is a variable which holds the address of another memory location. Therefore, when we discuss matters involving pointers, we should always clearly define terms amongst ourselves before-hand. A reasonable convention is to use terms like ’target’ or ’destination’ for the memory location to which the pointer points and then leave the terminology about the pointer itself alone. Then a discussion of the ’value of the pointer’ is understood by all involved to mean the pointer’s contained address. And the phrases ’value of the target’ or ’target’s value’ would as simply be clear in meaning.

Of course, we could always then take the address of a pointer — and store that in another pointer …perhaps that’s a topic best left for another section, ’eh?

To Point, but to where?

What value should we place in a pointer variable to indicate that it does not yet point to a valid address? We long ago created the symbolic constant ‘NULL‘ to represent this idea.

In C, the ‘NULL‘ identifier is, oddly enough, a C-style constant (aka a macro with no parameters) which is [typically] equal to the value 0. For years it was a matter of fervid debate in the C++ world as to which is the more proper to use — the named constant or the literal.

Well, technically, the debate is truly over whether the literal or a C++-style constant would be better. But the way the standard, all technical reports from the standards committee, and the stern opinions of Bjarn himself are worded, there seemed to be no real way to define such a beast…sadly.

But it appears all those nay-sayers were wrong! Continued efforts produced such a constant. It is named ‘nullptr‘ — note there is no underscore! In compilers supporting this (those supporting C++11), it would be highly preferred to ‘NULL‘.

So, then, when creating a pointer that you aren’t otherwise immediately initializing, always set it to ‘nullptr‘:

Basic Pointer Operations

There are two basic operators to support pointer actions: ‘&‘ and ‘*‘. The former is used in legacy coding situations to take the address of a variable. This address can then be stored in a pointer:

This at first glance appears to be the same ‘&‘ used for references, but it is an actual unary²⁵ operator. The ‘&‘ used for references is simple syntax to show a variable refers to another memory location in a special way. Context should make it clear which is being used.

The ’latter’, then, is the unary ‘*‘ operator. It is used to follow a pointer to its destination and use the value stored there:

This would print the value of the ‘object‘ from above, for instance. Again, there is much confusion amongst new programmers between this operation and the syntax used to declare a pointer in the first place. I’d apologize, but it is more the fault of Kernighan and Ritchie — the designers of the C language.²⁶

I call this operator the ’follow’ operator as it follows the pointer to the destination. But its real name is ’dereference’ because of its C heritage of being used to indirectly access other memory — referenced memory. You can use either term, but don’t get pointers and references confused, please! They are syntactically and semantically very different ways to access other memory!

For instance, just right off the bat, a reference must be initialized to a valid memory location when created but a pointer can defer this proper initialization until later with either none or a ‘nullptr‘ initialization.

In terms of semantics and syntax, the reference is just another name for its original memory location and requires nothing more than the ‘&‘ in its declaration for use. For pointers, the initial ‘*‘ is important to note what it is, but this is followed up on by loads of dereferencing ‘*‘ operations to use the value at the target or no ‘*‘ to use the address of the target itself.

Passing to Functions

Passing a pointer to a function is fairly simple, but certain aspects of it can be confusing to novice programmers.

Here the ‘ret‘ stands in place of the return type and the ‘....‘ hold the place of any other arguments this function might take.

The argument ‘parg‘ will be a pointer to the specified ‘type‘ within the function.

So, as you can see, the call syntax isn’t terribly difficult, either. (Note there is no ‘*‘ on the actual argument!)

But what’s happening in memory is a bit disconcerting to programmers new to pointers:

Here we can see that the argument pointer (‘parg‘) is a copy of the original caller’s pointer (‘pcaller‘) and points to the same memory location. We normally frown upon having two pointers point to the same block of memory, but in this situation, it is almost impossible to avoid.

Because ‘parg‘ is a value argument — a copy of its actual argument, it cannot be used to change the actual argument itself. But since it points to the same memory block as the original pointer, it can be used to affect a change to the data there:

To avoid this kind of thing — protecting the data pointed to as well as the original pointer — we can apply ‘const‘ to the ‘type‘ of the formal argument:

This can be handy in a situation where you don’t want to change the caller’s data but just look at it like the earlier ‘print_arr‘ or ‘sum_arr‘ functions from section [use:arr-arg].

In a little while (section [def:dynmem]), we’ll face the need to change where an original pointer points within a function to which it was passed. Since we can’t do it with the above syntax, is there a way? Of course! All we need is to refer to the pointer like so:

This kind of thing needs to be read from right to left: ‘parg_ref‘ is a reference to a pointer to a ‘type‘ memory location.²⁷

Now ‘parg_ref‘ doesn’t point to the same place as the caller’s pointer as a separate pointer itself. Now it refers directly to the caller’s pointer like so:

Here the single-line arrow is for a pointer as usual and the double-line arrow is for a reference.

Now, if we could come up with a new address to store in the pointer, we could do something like this:

C-Style Referencing

To be honest, there is another way to effect this task. It was the basic purpose of pointers in our ancestor language C, in fact. It’s why the unary ‘*‘ operator is called dereference officially.

A C programmer — not us, so don’t emulate this! — with a need to change the caller’s memory would take its address like so:

And then the caller would supply a variable’s address using the unary ‘&‘ operator — for taking an address — like so:

They didn’t have references, you see, and had to do all caller memory changes by pointer/address. Even their ‘swap‘ functions would look like this:

They were sad times…and still are for C programmers! Be thankful Bjarne added the reference syntax to hide all this pointer nonsense!

Pointers to class Objects

If a pointer is pointing to a ‘class‘ base type, we can get to the object’s member data (if in a ‘class‘ scope) and functions (if ‘public‘ or in a ‘class‘ scope) via the pointer. This can be done in one of two ways. The first is a bit tricky due to a precedence battle between ‘*‘ for following a pointer and ‘.‘ for member access. It looks like this:

The seemingly extra parentheses are necessary because without them, the ‘.‘ operator would try to access a member of the pointer itself!

Due to the tedious typing of this, our C ancestors had the forethought to add a new operator to access members of things like ‘class‘es and ‘struct‘s via a pointer: the ‘->‘ operator. That’s a dash and greater than right up next to one another. It works like this:

This not only avoids the typing nightmare of using ‘.‘ with pointers, but also reminds us of the diagrams of pointers with their arrows from pointer to destination!

Arrays Revisited

If a pointer points to a C-style array, you are allowed to use the subscript operator on it like so:

As for function arguments, such a pointer to an array may be passed by the same syntax as above or by the alternative syntax:

The watchful reader will note that this is the same syntax we’ve been using to pass actual arrays to functions as opposed to passing a pointer to an array to a function. This is no coincidence!

This allocates enough space for ‘CONST_SIZE‘ items of the desired ‘type‘ on the stack as well as a pointer to the first element of this space:

Is there any difference between this pointer and one we declare with the ‘*‘ syntax? A slight bit, actually. This pointer is a ‘const‘ant pointer. That is, it will never be able to point to anywhere but its initial destination. This is as opposed to a pointer to a ‘const‘ant value which cannot change the contents of the destination memory but can point to another block of memory if it needed to.

Four Types of Constant

What would it look like if we wanted to declare a ‘const‘ant pointer? Something like this:

(Here I’ve used ‘object‘ in a more general sense of any memory location rather than a variable of a ‘class‘ type.)

This must be read, as usual, from right to left for full comprehension: "‘ptr‘ is a ‘const‘ant pointer to a ‘type‘ object".

But since we have two ends to any pointer and each can be either ‘const‘ or not, we actually have four types of ‘const‘-ness! (See the grid? It’s a 2x2…)

Some people balk at the first row there, but it really is a type of ‘const‘-ness — nothing is ‘const‘ or a lack of ‘const‘-ness.

I’m guessing the first row’s utility is understood by now, but what of the others’? The ability to mark a base-type ‘const‘ant has been discussed earlier, but it basically protects the values inside the array from being changed on an array pointer. For simple pointers to single memory locations, it does the same: protects the target from change.

The marking of the pointer itself as ‘const‘ant is a bit awkward in usage. The compiler does a lot of things to pointers for us automatically like making an array pointer ‘const‘ and keeping it that way. Or like degrading an array pointer to a normal pointer that is a copy of the original target’s address when passing an array to a function. This protects the original array from being reset to a new target address within the function. But we can do some of this on our own whenever we want to keep a pointer from changing to a new target, just put a ‘const‘ after its ‘*‘ in the declaration.

Pointer Math

So what happens internally when we say ‘arr[p]‘ in our program? Well, the compiler takes that and makes it into this interesting bit of code: ‘*(arr + p)‘. Here we are adding a number of element positions to a pointer and then following the result.

What is meant by this addition? Technically, it adds ‘p*sizeof(type)‘ — where ‘type‘ is the base type of the array — to the address the array is located.

But this need for ‘sizeof‘ is entirely internal to the compiler. We only need to add a number of elements and the rest is automatic. We normally just use the subscript operator, but we can use the add/follow syntax if we so choose:

This is, of course, unnatural and lots more typing. So most of us don’t do that exactly. However, this addition of positions to pointers to get new pointers is quite a minefield of new ideas! Let’s explore.

For instance, since we can use ‘+‘ here, we might equally expect to be able to use the ‘+=‘ and ‘++‘ shorthand operators, as well. This is entirely true! ‘++‘ can even be used on either side.

Here we have a local pointer to ‘double‘s that starts pointed to the same location as ‘arr‘ and then advances one position at a time as the loop progresses. We’ve used the comma operator to put two initializations and two updates into this ‘for‘ loop head. Some would argue this was bad style. Others see it as efficient use of programmer time and code space.

Either way, we’ve removed the clunkier parentheses and addition and replaced it with a nice simple follow and separate increment.

We can even take it a step further. Since adding a position to a pointer gives a new pointer, what must subtracting two pointers yield? That’s right! A number of positions! So this version:

removes our need to track the index any longer. We still need the ‘count‘ to know when to stop, but the ‘i‘ variable is no more!

Of course, why didn’t we just say ‘parr < arr+count‘ in the test? Well, that gets into memory issues with some systems — what is a lesser address, for instance? We could use ‘!=‘, however, and keep things above-board.

Further, we can use this pointer arithmetic to good ends in situations like this:

Here we are using a C-style ‘swap‘ function which takes pointers instead of references — use what you have handy! Note that we just add the offsets to the array pointer instead of addressing an indexed element like so: ‘&(arr[max])‘ That would be quite the mess!

Pop quiz: What type of sorting routine would swap a maximum element with a current element, anyway?

Declaration(s)	Meaning
‘type * ptr1;‘	both the pointer’s destination and the value at that address can be altered

‘type const * ptr2b;‘	the pointer’s destination can be altered, but the value at that address can NOT be altered
	the pointer’s destination can NOT be altered, but the value at that address can be altered

‘type const * const ptr4bptr2a;‘	neither the pointer’s destination nor the value at that address can be altered

More From C-Strings

Now that we know the special relationship between pointers and arrays, we can revisit the ‘cstring‘ library and find more helpful functions!

Let’s start with ‘strchr‘ and ‘strstr‘. These take a C-string to search through and either a ‘char‘ or another C-string to try to locate within the first argument. The result is either a pointer to the location found or ‘nullptr‘ if not found.

Also note that ‘strcpy‘, ‘strcat‘, and so on take pointers as their arguments and so we can do things like this:

Here we are copying from a certain offset into the source C-string instead of its beginning and storing the data not over the entire destination, but after a certain offset into it. So above, ‘dest‘ should now equal ‘"Welcome Jason!"`.

Finally, there is the infamous ‘strtok‘ function. Some people fear it and some people revere it. Just be careful and read its documentation carefully as well. You can find out more about it on any Unix-style box by typing ‘man strtok‘ at the terminal prompt. Or you can look this up on the Web to find similar information.

In addition to these ‘cstring‘ functions, we can also update our end-of-string loop like so:

Weirdness!

Since you can add a number of element positions to a pointer, it stands to reason that you can also subtract them as well. So it does make sense at times to do something like: ‘*(p - 4)‘ or even ‘p[-4]‘. But you must be careful to make sure ‘p‘ here is not the original address of the array! Make sure it is at least 4 positions away from that address, in fact. If not, you are accessing memory that isn’t yours and this is dangerous!

In other disturbing subscripting news, addition is commutative. This means that the following transformation sequence is viable:

Not all compilers allow it, but when it works, it is terrible fun as a prank on the new hire.

Finally, make sure you document carefully whether your function’s pointer argument should be a single object or an array. There is no way to know what the caller has actually sent you, after all, so making sure they know what you wanted is ever-so-important!

Iterators

‘iterator‘s are to ‘vector‘s and ‘string‘ ‘class‘es what pointers are to arrays and C-strings …sorta:

Essential Usage

An ‘iterator‘ holds a reference to the data at a certain position within a ‘string‘ or ‘vector‘. To declare one, you use this syntax:

Here we have an ‘iterator‘ that can hold a reference to a ‘double‘ within a ‘vector‘ of ‘double‘s. To make it actually hold such a reference, we must assign it:

‘class‘es	Built-Ins
‘vector‘/‘string‘	array/C-string
‘at‘/‘[]‘/‘size_type‘	‘[]‘/‘size_t‘
‘iterator‘	pointer

Here we’ve made it ’iterate’ the first (0th) element of the ‘vector‘ ‘vec‘.²⁸ Now, how do we access that ‘double‘?! Use the unary ‘*‘ operator (dereference...remember?):

The question remains, does ‘itr‘ iterate anything? Hunh? We assigned it — and even printed it, but does that position exist within the ‘vector‘? Oh… Let’s check:

Much better! Now we know it exists before we store into it or use it — not doing so would be a definite no-no. What’s that? What’s ‘.end()‘ there? That returns an ‘iterator‘ to one past the last element in the container. So you can use it to detect invalid positions within an ‘iterator‘. This is used a lot as a flag from functions to indicate they couldn’t do something. It is also used as the end of an iterated range of positions — as usual, a range of values in C++ has the last position excluded and the first included.

Intermediate Usage

Just make sure that the ‘size_type‘ you are adding/subtracting is legitimate for the current ‘vector‘/‘string‘’s ‘size()‘!

If you pass an ‘iterator‘ to a function, you do NOT need to also pass the ‘vector‘ into which it iterates in order to affect that ‘vector‘’s elements!!!

That’s right. Remember that an ‘iterator‘ holds a REFERENCE to the ‘vector‘’s element and so you can alter a ‘vector‘ element without even having the ‘vector‘ at hand!

Pop quiz: What type of sorting routine would swap two neighboring elements, anyway?

If a ‘vector‘ is ‘const‘ant, you cannot use an ‘iterator‘ into it. Instead you need to use a ‘const_iterator‘ — which holds a ‘const‘ant reference to the ‘vector‘’s element. (You can also use a ‘const_iterator‘ on a regular vector when you just don’t want to change the contents of any ‘vector‘ or ‘string‘ accidentally.)

The usage of ‘const_iterator‘ is identical to ‘iterator‘ except that you cannot store into the dereferenced value.

This all works because the ‘const‘ keyword is useful to overload ‘class‘ functions in addition to their argument lists. What? Yeah. A ‘vector‘ or ‘string‘ which is a ‘const‘ant will always call the ‘const‘ version of the ‘begin‘ function and we’ll receive a ‘const_iterator‘ as a result. That way you cannot get a plain ‘iterator‘ into a ‘const‘ant ‘vector‘ or ‘string‘ — no accidental changes!

But what about if I just want to look at the elements in a non-‘const‘ ‘vector‘ or ‘string‘? Well, through the magic of inheritance (which we’ll discuss later in the book), the ‘iterator‘ type is upwardly compatible with the ‘const_iterator‘ ‘class‘. (This is somewhat like the way you can store a ‘double‘ value into a ‘long double‘ but not the other way around.)

Odds and Ends

Where ‘ip‘ is an ‘iterator‘ position, an ’‘s‘’ indicates the ’source’ ‘vector‘ and ’‘b‘’ and ’‘e‘’ represent ’begin’ and ’end’, respectively. These pairs are treated as ‘begin‘ and ‘end‘ ‘iterator‘s on a range and so the end ‘iterator‘ is not placed into the sequence — ends are ALWAYS exclusive.

(There are iterator versions of these functions for the ‘string‘ ‘class‘ as well, but we already had such functionality with ‘size_type‘s, so it isn’t that big of a deal. *smile* Oh, but ‘string‘ also has ‘iterator‘ versions of its ‘replace‘ functions!)

So instead of coding a loop for insertion or removal like we did in the first volume, we can simply code:

This would insert the value 4.2 in front of position 4 in the ‘vector‘. The ‘iterator‘ in question doesn’t have to be directly related to ‘.begin()‘, of course. It could have come from some other source. In fact, most of these functions give you a new ‘iterator‘! They return an ‘iterator‘ to the first element in the affected region: ‘insert‘ returns an ‘iterator‘ to the just inserted element or first thereof and ‘erase‘ returns an ‘iterator‘ to the first element following those erased or ‘.end()‘ if nothing follows. Only ‘assign‘ returns nothing.

There are also constructors for both ‘string‘ and ‘vector‘ ‘class‘es which accept pairs of ‘iterator‘s delimiting a range from which to initialize the new sequence.

In fact, the mechanism by which these ‘iterator‘s are passed to the constructors is so powerful that it can accept anything that acts like an ‘iterator‘ — even an old pointer! This finally gave us a way to initialize a ‘vector‘ without a horrid sequence of ‘push_back‘s before we had initialization syntax for ‘vector‘s added in C++11:

Invalidation

When is an ‘iterator‘ not really an ‘iterator‘? Sounds like a trick question, but I’m being quite serious!

Note that ‘iterator‘s into a ‘vector‘/‘string‘ which are at or after an ‘erase‘d position are considered ’invalidated’ — they should not be dereferenced …upon pain of death!

Even though the item ‘gg‘ is still in the ‘vector‘, I am not allowed to dereference ‘itr2‘ because it and ‘itr‘ and any other ‘iterator‘s which iterated at/beyond ‘itr‘ are considered invalidated by the erasure!

Furthermore, if I were to accidentally dereference ‘itr2‘, I would be erroneously using memory I no longer have legitimate access to. (It may still contain a ‘gg‘, but it won’t be the one from inside the ‘vector‘!)

Notice that ‘dd‘ and ‘ee‘ are gone and ‘vec‘ is now only ‘6‘ elements long — but seems to be physically ‘8‘ positions in length! That’s because there was no need to physically remove the memory locations after shifting the data down to cover up the ‘erase‘d elements. The ‘vector‘ merely remembers that those last two positions are not part of the data anymore.

Unfortunately, no one told the ‘iterator‘s that certain positions were no longer valid! Here we’ve only created two ‘iterator‘s into the ‘vector‘, but a typical application may have tens or more ‘iterator‘s into a ‘vector‘ when an ‘erase‘-ure occurs! It is somewhere between impractical to impossible for the ‘vector‘ to inform all ‘iterator‘s of position invalidation.

Still, why are the ‘iterator‘s invalid? They both still iterate positions in the physical ‘vector‘, right? Well, yes, but they are no longer iterating the data the programmer expected them to iterate. ‘itr‘ no longer iterates ‘dd‘ because it is no longer part of the ‘vector‘. And ‘itr2‘ no longer iterates ‘gg‘ because the ‘vector‘’s actual data ‘gg‘ is two positions left of ‘itr2‘’s position. Notice what happens to ‘itr2‘ when I perform the following assignment:

What? You still don’t see it? Look where ‘itr2‘ iterates: ‘gg‘. But there is no longer any ‘gg‘ in the ‘vector‘! Now it is more obvious why ‘itr2‘ is considered invalidated, right?

Still not sure about ‘itr‘, eh? Well, all I can say is that it was expected to have been iterating ‘dd‘ and we no longer have such a beast! It currently iterates ‘ff‘ — whomever that is!

Essentially, ‘itr2‘ is lost to us, but we can return ‘itr‘ to a valid state by using the result from ‘erase‘:

Most programmers, though simply re-establish the position for an invalidated ‘iterator‘ the way they did in the first place. This might be by some search technique or relative positioning or …

For the Adventurous

But I had a nice algorithm that used ‘size_type‘s to process my ‘string‘ (or ‘vector‘) in reverse. How can I do that with ‘iterator‘s?

These are similar to forward ‘iterator‘s in behavior — only in reverse! For instance, you can initialize them with the ‘rbegin‘ function and check them for validity with the ‘rend‘ function.

The most confounding thing about them to most beginners is that when you increment them — ‘++ritr‘ — they move to a previous element. Remember, though, they are iterating in reverse!

This loop will go through all the elements of the ‘vector‘ ‘vec‘ from the last to the first.

Dynamic Memory

As mentioned above, sometimes you realize you need more memory than you’ve planned on having. This is usually done on purpose. Because traditional array declarations can waste significant amounts of memory for many runs and not have enough memory for others we will opt to have either a small amount of memory in the heap or even no memory with a ‘nullptr‘ initialization and then to ask the OS for more memory later. The asking is called allocation. Let’s delve deeper…

Allocation and Deallocation

The way that you would request such extra memory is with the operators ‘new‘ and ‘new[]‘. ‘new‘ is used to request (aka allocate) memory for a single object whereas ‘new[]‘ is used to allocate memory for an array of objects.

When calling ‘new‘, you can supply constructor arguments so that the newly created object will be initialized appropriately. If you don’t, it will be default constructed.

‘new[]‘, however, always default constructs ALL of your array’s element objects. (There just isn’t the syntax for placing a custom constructor call…sorry.)

Note that the ‘10‘ passed to the ‘[]‘ of ‘new[]‘ here can be any integer expression — not just literals or constants like with static²⁹ array allocation!

try/catch vs nothrow/nullptr

I said above that the OS traditionally returned a ‘nullptr‘ to indicate it couldn’t find the memory requested in a ‘new‘ or ‘new[]‘ operation. But that isn’t the case in C++ any longer. Now the default behavior of these operators is to ‘throw‘ an ‘exception‘ at you when they can’t get the memory from the OS.

Thus, we can let this ‘exception‘ go and let it crash the program, but that seems extreme given that it won’t be hard to protect the rest of the program from missing memory. So, what I recommend to avoid this potential crash is one of two approaches.

The first is a bit bulky but works fine. This is to use a ‘try‘/‘catch‘ around any ‘new‘ or ‘new[]‘ operations you do and set the pointer to ‘nullptr‘ in the ‘catch‘:

Here we try to allocate an array to pointer ‘p‘ that is ‘len‘ long. When we ‘catch‘ ‘bad_alloc‘, we know not enough memory was available and we reset the pointer to ‘nullptr‘. ‘catch‘ing the ‘bad_alloc‘ ‘exception‘ will require ’ing the library ‘new‘. It is in the ‘std‘ ‘namespace‘ as well.

Other options at this juncture are, of course, to ‘catch‘ the ‘exception‘ with a variable — preferably by ‘const&‘ to avoid copies. Then we can print a message for the user about the issue using the ‘what‘ method all ‘exception‘s have.

We can also choose to re-‘throw‘ the ‘exception‘. To do this without ‘catch‘ing by name, just use the statement ‘throw;‘ in the ‘catch‘ block.³⁰

The second method is to use overloaded versions of ‘new‘ and ‘new[]‘ that won’t ‘throw‘ an ‘exception‘ in the first place. There is one extra argument to this overload and it is always the object ‘nothrow‘:

This object is found in the ‘new‘ library if you have trouble using it. It is in the ‘std‘ ‘namespace‘ as well.

This overload automatically puts the pointer to ‘nullptr‘ when the OS can’t get the requested amount of memory for us.

Use of Dynamic Memory

Whichever way you get back to the original ‘nullptr‘-for-failure behavior, you now have a way to protect the rest of your code from trying to use a pointer that couldn’t get the needed memory: an ‘if‘ check.

Below I’ve used the ‘nothrow‘ overloads of ‘new‘ and ‘new[]‘, but a ‘try‘/‘catch‘ would work out just fine as well.

We just surround any usage of the dynamically allocated pointer by an ‘if‘ that checks for ‘nullptr‘. Here we are checking for the purpose of storing new data in the dynamic memory. But next we’ll check again for access/reading of the stored data:

Note that usage of memory in a program is typically done in helper functions and those functions aren’t naturally synchronized with the allocation step. Some other programmer on the team may call them out of order and really mess up a program run if we don’t put these ‘nullptr‘ checks around all usages of the dynamic memory.

Cleaning Up with delete

To release the memory back to the OS when you are done, use either the ‘delete‘ or ‘delete[]‘ operator as appropriate for the original allocation. That is, ‘delete‘ when you allocated a single object with ‘new‘ or ‘delete[]‘ when you allocated an array of objects with ‘new[]‘. Doing so will give the OS back control of that heap space and it is no longer allowed for you to use it in any way.

No special care has to be taken if the pointer in question holds the value ‘nullptr‘. This was the case long ago in very early C++ compilers, but that has long since been taken out as a ’feature’. We can now simply ‘delete‘ a ‘nullptr‘ with abandon!

nullptr Assignment After delete

Even though the OS will get back control of the released memory, NEITHER the OS NOR the ‘delete‘ operators will alter your pointer’s content. That is, you’d still have the dynamic address and could technically, if not legally, access it!!!

In addition, the data on the heap you are pointing to won’t be altered in any way. If you are working in a highly secure arena, you might consider overwriting it with some masking values as well.

So, if the program will continue to run for a bit — or you are just extra careful — you should really assign the ‘nullptr‘ address to your deallocated pointer. This will avoid using that released memory after the fact by accident.

What, some of you might be thinking, about const pointers? Can they be dynamically allocated?

So far so good. We got the space, we checked it and used it. What’s left? Oh, yeah, deallocation! Let’s try that:

It looks okay at first glance — the pointer has been released to the OS, after all. But then we realize that we can’t do ‘nullptr‘ assignment because ‘z‘ is a constant pointer! That means we can’t protect ourselves from reusing this memory during the rest of the program. So unless this happens inside a particular block scope — like a function or loop body — we probably don’t want to do this — ever!

Reallocation of a Dynamic Array

When you have a dynamically allocated array of objects and realize that you are going to need more space, you’ll need to re-allocate the array.

The only step in this process that can move is the last one that updates any length or size information related to the dynamic memory. Typically we carry a ‘size_t‘ alongside any dynamically allocated array to remember the upper bound on the usable space. We also often keep another ‘size_t‘ to track how many spaces within the array we’ve filled in.

This process is analogous to what we did with static arrays back in the day (section [decl:arr]), but the upper bound is now a variable instead of a constant.

Here I’ve moved the size/counter updates to before the copying loop to provide the new upper bound for that loop in a possibly changed number of actual elements. The reason we’ve used a minimum finding function to change the logical size (actual number of elements) is that we might have shrunk the array instead of making it bigger!

How can this be?! Well, for the ‘more_space‘ parameter, I used not a ‘size_t‘ but an ‘ssize_t‘. This is the POSIX data type that is the ‘signed‘ counterpart to the ‘unsigned‘ ‘size_t‘. Although not a part of C++ itself, this type is often available and fills a much-needed gap in the type system.

If you are wary to use a non-standard type, however, you can make your own like so:

Chunks vs Multipliers

To decide how many more elements to allocate, one of two schemes is typically chosen:

With a chunk sizing you’ll typically determine the actual size of the chunks by statistical analysis of a program’s expected data. This is easily realized with the above implementation by using the chunk size as the amount of ‘more_space‘ requested.

But do you have to rewrite the above code to use the common multiplier technique? Of course not! Just use ‘(multiplier-1)*physical_size‘ as the amount of ‘more_space‘ requested.

The most common multiplier in use is 2. There was much research in the last decade of the 20

^{\textrm{}th}

century indicating this as the most efficient multiplier to use in programs due to memory patterns and typical OS allocation schemes.

If using the multiplier scheme and also allowing your caller to set an initial size for the array, provide some mechanism to adjust their requested size into a power of your multiplier. This will make things go more smoothly should you decide to allow both shrinkage and growth in your system. In fact, it is so important, I’m just going to tell it straight out! It is a mildly clever bit of math involving logarithms, so I know you could come up with it, but why not share, right? The next higher power of two above

x

— or

x

if it already is one — comes out to be:

Special care should be taken, of course, when

x=0

, since logarithms don’t really work there…

Only the multiplier scheme will amortize the cost of reallocation. This cost comes from both the actual allocation of more memory from the OS — a small cost — and the copying of the elements from the old space to the new space — a HUGE cost!

But what is amortization again? That’s a term that comes from accounting circles and means basically to average out the cost of something over time. For instance, a company wanting to buy a computer might be hesitant to outlay thousands of dollars for a top-of-the-line model. But if they see evidence that these models can last for many years without much in the way of maintenance costs, they might be more accepting of the idea.

Here we are averaging out the cost of the element copies over the time they are not needed by doubling the space we have on hand. If we had 8 elements on hand, for instance, and grew to 16 slots, we would incur a copy time of ’8’. But now we won’t have to grow for 8 more additions of data so those 8 copies average away.

Growth vs Shrinkage

Most new programmers are okay with the idea of getting more memory from the OS for dynamic arrays that have grown beyond their original bounds. That doesn’t sound too far-fetched. But shrinking space? What’s that about?!

Well, some programs don’t run for just a few minutes or hours. Some programs run for days or weeks at a time on large servers. Such applications may see a cyclic need for more memory during peak times. Then, during off hours, their memory needs will wane.

If they keep the extra memory they aren’t needing any more, they are not playing well with all the other programs on the system! Other programs might be using more memory during the same time that this one doesn’t need as much. In an extreme case, other programs might be failing to do their work because they can’t get enough memory to run!

To avoid these problems and just play nicely with other apps, we set our sights on shrinking memory when it is no longer necessary. This would involve sending in a ‘more_space‘ parameter to the above function that was negative, of course.

But if we just shrink a chunk or half each time we can, we are in for a possible race condition. This could happen, for instance, when the program just dropped an item, but is about to add an item right afterward. If this happened over and over at some juncture, we’d be stuck shrinking and growing back and forth. Not very efficient after all!

To avoid this race of allocations and deallocations, we typically use pessimistic shrinkage. In this scheme we don’t drop back our memory usage a single step until we’ve got two steps worth of memory unused.

For instance, in a doubling scheme, we’d wait until we were at a quarter of our allocated size used before shrinking by half.

Dynamic class Members

Often it is important to have members of a ‘class‘ placed on the heap. (Perhaps it is an array we want to grow/shrink!) This is possible but takes some care. Let’s explore…

Where the Parts Go

To place dynamic memory management in a ‘class‘, we merely need to identify the three portions of the process and place them correctly into the ‘class‘ structure.

The first portion of the dynamic memory process is allocation. This should fairly obviously be placed in the constructor — or some helper function called from there. It might also happen in a mutator. This makes the helper idea a real win-win.

Second comes use of the dynamically allocated memory. This can be placed in any and all ‘class‘ functions which need to use the memory — but don’t forget to put ‘nullptr‘ checks in EVERY SINGLE ONE OF THEM!!!

Last would be deallocation. Where should that go? There seems to be no answer. *snap* Guess we can’t do this, eh?

Of course we can! But we’ll need a new kind of method! Let’s call it a deconstructor! No…that would make sense! Okay, okay. Then the destructor. Much better. Now it’s an homage to a freaky — albeit really cool — 80s movie.³¹

Destructors

The name of the destructor shall be formed similarly to that of the constructors: from the name of the ‘class‘. However, to distinguish it from the constructors, we’ll place a tilde (‘ ‘) in front of it. (Since it is going to be used for memory deallocation, we can maybe think of it as waving bye-bye to our old memory. *chuckle*)

(Just so you know, the tilde in the destructor’s name has ABSOLUTELY NOTHING to do with the tilde operator. We’ll talk about that operator when we discuss files, flags, and operator overloading later. But it is COMPLETELY UNRELATED to dynamic memory management!!!)

For the following examples, presume the following ‘class‘ and dynamic members:

This ‘class‘ — ‘DynC‘ because it is a ‘class‘ with a dynamic member — is just a demonstrator model and doesn’t contain all the features we’ll want in a typical situation like this in live code. But it will serve its purpose to show us what methods are needed beyond the destructor and why each is necessary.

Here we see that the dynamic array is defaulted to just the ‘nullptr‘ value — no allocation at all. Then in the constructor that takes a size, we try to allocate that amount of space and reset the ‘max‘ member if we are unsuccessful. Finally, the destructor frees the space for the dynamic member. It could reset the contents of all the members then, but it won’t be necessary as the destructor is being called only when an object is leaving scope.

In fact, they will be automatically called in a manner analogous to constructors. Destructors are called when objects fall out of scope — the ends of their lifetimes. Objects’ destructors are, in fact, called in the reverse order of their constructors.

For example, let’s say we had a scope with two objects of our ‘class‘ ‘DynC‘:

At the close curly brace there, both objects will be destructed automatically just as they were constructed automatically when we declared them. But, since we declared ‘A‘ before ‘B‘, it was constructed first. That means that ‘A‘ will be destructed last:

This at first seems like just odd trivia, but it comes into play in the discussions that follow.

Copy Constructor

Beware copying objects one unto another. During these times, your ‘class‘ is in particular danger of leaving dangling pointers laying about the system and/or causing future double-deallocations of dynamic memory!

And remember that this is no small matter, since copying happens so frequently in C++. Not only things like this:

Here, the ‘arg‘ will be copy constructed from the actual argument in the call and the ‘return‘ value will be copy constructed from the expression on the ‘return‘ statement!

The default copy constructor just takes the value of ‘A‘’s pointer member and copies it directly into ‘B‘’s pointer member and so we might end up with this kind of memory structure:

When the ‘B‘ object destructs and releases the ‘DATA‘ to the operating system, all is fine:

But when the ‘A‘ object later destructs itself and tries to release the SAME MEMORY to the operating system, the best case is confusion. The worst case might be actual damage within the memory subsystem of the OS. *shiver*

This simple problem — double-deletion of a resource — assumes that the older object exists in the same scope as the new object and they die relatively close together.

As you can see, if the new object is a function argument and the old object will go on living after the new object exits scope — when the function returns — you face various memory violation errors as the old object accesses memory your program no longer owns!

To fix the problems caused by the compiler-supplied copy constructor, we can do something like this:

Now the newly constructed object will have its own dynamic memory area and its own copy of the data from the old object. When the new object is destructed, it will only affect its own dynamic memory — not the older object’s.

This will alleviate the double-free and dangling pointer issues the compiler-supplied copy constructor gave us when combined with a proper destructor.

The above approach says to reset the counter members to ‘0‘ to reflect their pointer being ‘nullptr‘ for consistency of the object and so later code won’t be harmed when other member functions try to use any of them to safeguard pointer target usage. This sometimes is accompanied by a function to access the object’s state called a validator or such. The purpose of this would be to let the caller know that the object is currently in possession of valid memory and not a ‘nullptr‘ which would prove useless to them.

Some prefer not to have a validator at all and just know all objects are in a valid state in the program. Or at least have a more ’in-your-face’ way to know an object has ended up invalid. This would involve the use of exceptions. You would ‘throw‘ an exception in the ‘else‘ above and the code that tried to create the object would have to ‘try‘ to ‘catch‘ it. The main reason I don’t like this here is that copy constructors are called all over the place — value arguments to name one of the most difficult. We can’t really ‘catch‘ while passing an argument to a function. We’d have to ‘try‘ all function calls that pass their arguments by value and then ‘catch‘ the exception. Do you know how many programmers pass by value instead of ‘const&‘ around here? So many!!!

Anyway, if you want to use this technique, read more about it in section [defn:excpt].

operator=

The pictorial representation above also goes for a poorly defined assignment ‘operator‘ (aka the one provided by the compiler). That is like here:

Not when being constructed, but sometime later in the code when one is assigned a new value with the single equal ‘operator‘ (‘=‘).

When the objects later destruct, the same problems may occur. So we might expect our assignment ‘operator‘ to look something like this:

Such an ‘operator‘ function will be called whenever the compiler sees the ‘operator‘ itself in action. In more detail than above:

So the object on the left of the ‘operator‘ is used as the calling object for the member function call. (More on that when we overload other ‘operator‘s later.)

But, the threat is bigger with the assignment ‘operator‘ because it is replacing an object that currently exists with a copy of another object. If we aren’t careful, we can lose our old identity and leave a pointer allocated but lose its address — a memory leak! (Wasn’t that what the destructor was protecting us from in the first place?)

Notice how ‘OLD DATA‘ is still allocated but no-one is pointing to it! (That’s a memory leak, all right. Icky, hunh?) To fix this, we’d need to precede our new request for storage by a deallocation like so:

That’ll clean up the old crap before we allocate the new crap...er, data. (*grin*)

However, to top this all off, we have ancestral problems left from the C language. Did you know that you could do this:

Probably.³² It is a multi-assignment and quite convenient when you want to initialize several variables to the same value (all 0 or all ‘false‘, for instance). The problem is that all operators are treated equally and so you can change the precedence of them via parenthesization.

This sets ‘B‘ to be a copy of ‘A‘ and then immediately obliterates that copy to make ‘B‘ be a copy of ‘C‘. Stupid but time consuming. *shrug*

Here we make ‘B‘ be a copy of ‘A‘ and then update it by adding information from ‘C‘. This saves us several keystrokes and a line of code. Again: *shrug*

But we cannot play favorites! All ‘operator‘s just ‘return‘ a result which is then used in the next operation by standard precedence rules — including their override via parentheses!

And on a related note, it is important to protect the programmer who codes this:

Hunh? Not probably literally, but maybe via ‘?:‘ factoring. We did this above in the constructor from a given ‘MAX‘ to reset ‘max‘ when there was no memory allocated.

If we are deleting our old ‘CRAP‘ before copying the new guy’s stuff, this kind of thing would be nasty because the old guy and the new guy are the SAME guy!!! Once we ‘delete‘ the old ‘CRAP‘, we’d have no ‘CRAP‘ to copy!?!

To solve both of these issues, we’ll need a helper. We need to know the identity of the calling object — the one from the left side of the ‘=‘ operation. It’s name is left with the caller. But it still resides at a memory location. So, C++ decided to provide us with the address of the calling object.

In fact, it has been doing so since the very beginning. For all of our ‘class‘ member functions, C++ has given us an invisible, implicit parameter: the address of the calling object. We’ve never discussed it for two reasons: we didn’t know what addresses and pointers were and we didn’t care. Now we both know what pointers are and care about the identity of the calling object in more than a "the compiler will automatically use members of the calling object in a member function" kind-of way.

There is, of course, a keyword used to access the calling object’s address. That keyword is one of the most poorly chosen identifiers we’ve ever seen. And we’ve seen some doozies: ‘cstdlib‘, ‘log‘/‘log10‘, ‘npos‘, … I could go on, but I think you get the idea.

So what is the keyword for the calling object’s address? It is ‘this‘. What? No, that was it: ‘this‘. ‘this‘ is the keyword. The word ’‘this‘’ is the keyword. I swear it!

It is so hard to tell you about this keyword because it is a pronoun and people always think I’m talking about something else when I really mean ’‘this‘’!

Anyway… How do we use ’‘this‘’ to solve the problems of multi-assignment and self-assignment? All we have to do is check the address of the argument object against that of the calling object to make sure they are not the same object. (We couldn’t just check if they are equal objects because we want to know they are physically the same object — not that they have equivalent content.)

Place the entire rest of the function — except the ‘return‘, of course — inside this ‘if‘ structure to fix self-assignment.

And multi-assignment? That one is slightly trickier. There we need to ‘return‘ a reference to the calling object from the ‘operator=‘ — just like the built-in assignment operation does. But we have an address — a pointer — not a reference.

Well, having a pointer, we can follow it to the original object. Then we can just reference that! Like so:

And just like that we’ve fixed another double-‘delete‘, another dangling pointer, and another memory leak!

The Big Three

That completes what are known as the Big Three. The destructor was necessary to prevent memory leaks from dynamic members. And its addition necessitated the addition of a copy constructor and an ‘operator‘ assignment. Since every time you have a dynamic ‘class‘ member you need these three functions added to the ‘class‘, we have named them the Big Three.

Debugging with Hex Addresses

On the support website you can find this complete program with the ‘DynC‘ ‘class‘ and a short driver main. This program not only tests the ‘DynC‘ methods to make sure they are working, but those methods print the addresses of objects involved in their operations along the way.

If you sit the source code on one side of your screen and a terminal running the program on the other, you can follow along back and forth which addresses must coincide with which objects and this will help you build your intuition about construction and destruction order and the like.

The thing about addresses and printing is they come out in hexadecimal — base 16 numbers. This is disconcerting at first, but you can become accustomed to it. Don’t worry so much about their values, just be able to tell one of them from the others so you can pinpoint which object did what.

2D Dynamic Arrays

One thing that comes up from time to time is multidimensional storage. We’ve explored this in terms of nested ‘vector‘ and ‘array‘ ‘class‘es in the previous volume³³ and with static arrays earlier in this volume (section [decl:arr]).

But what about dynamic arrays? Do we ever take those multidimensional? Of course! Let’s explore this starting with two-dimensional structures.

There are two basic approaches to 2D arrays on the heap: the physically accurate and the mapped. Some applications call for one or the other, so we’ll explore both here.

Physically Accurate

This approach is the most ’obvious’. That is, it will give us the simple use of two subscript operations to access the data when we are done allocating it — just like a normal statically allocated 2D array. The allocation won’t be as clean and simple as other approaches, however, so we may want to give those some serious consideration later.

It uses an array of pointers where each element points to an array of data. Overall this effects a 2D structure as can be seen in the diagram below. See how I’ve arranged the rows of data to sort of stack on top of one another and you could line them up to form a nice grid if they were movable.³⁴

Here the commas are the pointers and the arrows show where they point as usual. The ‘arr1‘ is the initial pointer that we have to start our ’journey’ through the structure. If we subscript it, we pick out a single pointer from the column array to the left. This element is another pointer. Thus we can subscript it as well to get to one of the data items it points to.

So just one pointer gets us to all the data with two subscript operations as we would have done with static 2D arrays. That’s pretty nice and a cozy place to work.

But how do we describe this in code? Well, we’d have to set everything up, of course — nothing comes for free in the dynamic world!

Here we’ve done something quite new. ‘arr1‘ is a double pointer! This is because the things it points to aren’t real data but are themselves pointers. So ‘arr1‘ points to pointers that in turn point to data.

That also means that when we allocate space for the column array on the left that its base type isn’t the data type we are interested in but "pointer to data type". This is reflected in the ‘new[]‘ allocation for ‘arr1‘ itself.

Note how the allocation stops when a row fails to allocate or when we’ve allocated all the rows. But we keep track of how many rows were successfully allocated for later with the ‘num_rows‘ helper variable.

See how we use two subscripts just like we would a statically allocated 2D array? So nice!

But, care must be taken when we’re done to avoid leaving allocated but inaccessible fragments of memory:³⁵

Here we’ve made sure that we release the rows of data first before releasing the column array of row pointers. This order is very important! Remember that once deallocated, a pointer will retain its old address but we have lost the rights to that memory. The system can reallocate it at any moment. So we must free up the row spaces before freeing up the row pointers!

Row Mapped

The second allocation scheme follows the compiler’s own method of laying out the 2D grid in a long, linearized space row-by-row. That is, we take each row of the array and line it up right after the previous row in the memory so that the second abuts the first and so on. Doing so is going to be less 2D-friendly in the usage phase in that we won’t be able to use two subscript operations, but has advantages for both the setup and tear-down phases of the plan. It might be worth a look if it can simplify those processes.

Let’s look at this in more detail. As we discussed earlier (section [defn:2Darr]), we like to think of 2D memory as being — well, two-dimensional. We like to think of it looking something like this diagram below in the RAM:

And the compiler uses this mapping from our normal double subscripts to 1D positions:

Note that there is no chance to have lost desired rows — we get the whole array or nothing. There is also just a single pointer ‘*‘ so that’s nice.

What’s the problem here? It isn’t a tedious setup and take-down. That’s all been streamlined! It’s the mapping in the middle. It just isn’t natural. *pout*

There are two basic ways to hide it away. The first is an ‘inline‘ helper function like this one:

But this is just a bit icky — it still doesn’t hide enough details. Look at it in action, after all:

If you are going to use this allocation approach, the coolest way would have to be to embed the array in a ‘class‘ and overload ‘operator()‘ to do your 2D indexing.

Why ‘operator()‘? Why not overload ‘operator[]‘? Well, the subscript ‘operator‘ in C++ only takes one argument — the position along the single dimension to access, presumably. And we need to access along two dimensions.

So ‘operator()‘ to the rescue! It is the parentheses used to call a function. It doesn’t modify precedence or anything like that. It is normally placed after a function’s name to surround any parameters that function might need to run. Since functions can be designed to take any number of arguments, this ‘operator‘ can take an arbitrary number of operands — things to operate on. We need it to take two for our dimension positions. Perfect!

As you can see, we take in the two dimensional positions and do the mapping inside the ‘operator()‘ and ‘return‘ that array position by reference so the caller can either change or look at the data at their discretion. If we want to provide similar access for ‘Arr2D‘ objects that have been passed as ‘const&‘, we can even overload it a second time:

All we changed was to mark the function ‘const‘ so it would work for a constant ‘Arr2D‘ object and ‘return‘ed by value instead of by reference. That C++ allows these two function overloads to sit side-by-side in the same ‘class‘ is pretty neat and useful at times like these!

Still not completely normal, but you’ve hidden the allocation details, the deallocation details, and, of course, the mapping details inside the ‘class‘.

Another possibility would be to use a helper position ‘class‘. For this approach we do overload ‘operator[]‘ but instead of passing it a ‘size_t‘, we pass it some kind of ‘class‘ or ‘struct‘ that holds two values at once. These, then, are mapped to the row and column by our formula and the result ‘return‘ed.

For simplicity I’m going to use the ‘pair‘ construct provided by the standard library ‘utility‘ and explored in the previous volume.³⁶ This might look like so:

This doesn’t look too bad, either. It’s a little more typing than our ‘operator()‘ approach above — especially pre-C++17, but still not too bad.

A Step Back

Merging the Two Methods

But, for those of you out for the ultimate dynamic 2D experience — the best of both worlds as it were — you can try out this variation. Note well, though, you’ll have to be willing to pay the costs of both worlds, too…

We start by making the one-dimensional row mapping as in the latter technique. But in addition to that, we make an array of pointers as in the former approach. This time the pointers in the array will aim at the beginning of each row within the linearized space. Diagrammatically it looks something like this:

How will this help us? Well, it gets us the single allocation for all the data space as with the mapped method. It also gets us double subscripted access like the physically accurate method.

What? How? Look at the ’column’ array: it holds pointers to locations with data. And we’ll have a pointer to it as it is an array. So we’ll select a pointer with a first subscript and then select an element with a second subscript!

Although we have the potential of failing to allocate the row-start pointers array separately from the ‘data‘’s allocation, this method can give nicer access patterns for general use. You can even place the conditional based on ‘arr2D‘’s successful allocation in your new ‘class‘ access method to take advantage of the ‘[][]‘ ability vs a mapping calculation!

More than Two Dimensions

Both methods can be easily extended to multiple dimensions. The first way naturally extends as you’d expect (add a ‘*‘, add another loop for allocation, etc.).

Here ‘p‘ is the plane or cross-section of the 3D structure you want. I’ll let you continue the math to higher dimensions.

Note that you can factor out the multiply by ‘COLS‘ from the first two terms to speed things up a bit. This is known as Horner’s method.

It might also be worth noting that you can use a ’dimensional’ array to store supplementary information in several ways to make access faster. (This is fodder for a much later discussion, however.)

main Arguments

So far our main function has always taken no parameters. Some of us even put the keyword ‘void‘ in its parentheses to point this out more explicitly. But any command run from the command prompt (and even many of those run by double-clicking an icon on the desktop) can take information right then from the user without having to use ‘cout‘/‘cin‘ to discuss the matter.

Command-Line Arguments

Look at the compiler you might use at a typical *nix prompt,³⁹ for instance. You tell it which file(s) to compile and what options should be enforced during this compilation, right? Sometimes you even tell it what to name the final executable file.

In C++, all of this information is prepared by the part of the OS known as the shell and given to the main function in the form of a 2D dynamic array of C-strings.⁴⁰

How does this look from the program’s side of things? Well, the head of the main changes to look like this for starters:

The first is the count of arguments — sortof — and the second is the values of the arguments as C-strings — more or less.

‘argc‘ is actually 1 more than the number of arguments because, for some reason, the first ’argument’ is given as the name of the program.

Thus, ‘argv[0]‘ holds the name used on the command-line to invoke your program. Some programs use this to invoke different behaviors. For instance, and are often the same executable but the code looks at its ‘argv[0]‘ value to see if its goal should be compression or decompression during this execution.

The rest of the ‘argv‘ entries are the actual arguments typed by the user when they ran the program. They are typically taken apart at spacing — tabs or spaces.

But, on the other hand, if we put quotes around some of the arguments, we see something more like this:

Note how the shell groups things within matching quotes together to form a single C-string argument for the main function to process.

Also of importance is the way the user types parameters to your program. The standard for doing options is to use either a single dash (‘-‘) or slash (‘/‘) followed by one or more single-letter options — so-called short options. These are often case-sensitive so that ‘z‘ and ‘Z‘ indicate different options. A more verbose style is to use a doubled prefix character (like ‘–‘ for instance) before a whole-word option — so-called long options.

If a command-line option requires a filename or number, it will typically follow a short option immediately or sometimes in the next ‘argv‘ slot. But if a long option requires such information, it will immediately follow an immediate ‘=‘ — within the same ‘argv‘ slot.

These all give help information about running the command as well as a little idea of what the command does. With , the ‘-h‘ gives a tiny amount of help and the ‘–help‘ gives a lot of help.

Let’s look at a sample main looking for short options. It takes options to process a number, a Boolean value, and a filename.

As you can see, it looks at the first character of the C-string parameter for a dash and, if found, checks the next character for any of its known short option letters. If it doesn’t know the option, it reports as much. And if it doesn’t start with a dash, it is simply repeated back out as a plain argument. Here is a downloadable copy so you can play with it on your own.

The Environment

There is another way that programs sometimes get information from users: the environment. This memory area is managed by the shell and a copy of it is sent to any main asking for such. It contains a list of variables and their values separated by equal signs. The variables traditionally have no spaces, but the values can have such.

This parameter is represented as a 2D dynamic array of C-strings as well. But it isn’t counted by an integer argument like ‘argv‘. It is instead ended by a ‘nullptr‘ in the last slot.⁴¹ It looks like so:

A typical way to process the environment is to use a ‘char‘ double pointer and walk it from C-string to C-string looking for the variable we have interest in. When we find it, we can take the value into our program and make adjustments in configuring this run.

Here we look at each pointer with a single dereference to see if it is ‘nullptr‘ yet. If it isn’t we process that line using the earlier hinted at ‘strtok‘ from the ‘cstring‘ library.⁴²

Once the environment variable is processed, we ‘++‘ to the next slot and see if we are done yet.

So what kinds of things can come in from environment variables? All kinds! They can be numeric, string, Boolean, etc. Anything the user can send in a command-line argument, they can send in an environment variable.

So why bother? Well, environment variables can be set once and forgotten. Command-line options must be retyped on each run of the program.

So how does the user set environment variables? That’s a bit of a sticky wicket! It varies drastically from system to system. I’ll describe here how to do it in *nix and Windows how to set them temporarily. Sadly, setting them permanently in all systems is beyond the scope of this book.

3.25ex -1em **nix: In a *nix terminal, type the following command to set an environment variable for the current terminal session:

This will make that variable available to the system environment until this terminal is closed. Thus the variable will be available to all programs you run from this prompt.

3.25ex -1em *Windows: In a Windows prompt, type the following command to set an environment for the current command prompt duration:

This will make that variable available to the system environment until this window is closed. Thus the variable will be available to all programs you run from this prompt.

Wrap Up

In this chapter we’ve learned standard C++ tools to manage memory during the run of a program. This included a fairly detailed look at pointers and indirect access to data as well as a hefty examination of dynamic memory allocation, use, and deallocation. We ended with a foray into passing information right into a main function from the command line which, it turns out, uses memory managed by the OS on our behalf and in the same shape as we learned to use on our own.

File Streams In Depth

In the previous volume we covered file stream processing at a shallow but effective level. This time around, we’re pulling out all the stops and diving deep into the topic!

Some of this might feel a little repetitive to the coverage in the last volume. But we do want a complete coverage in one place, so.

Concepts

So what are files and what are they used for? Well, they store information for some program to process, typically. They are just bits stored on the disk/drive in their primal form, but at a high level, they represent information to us.

We’ve used files for years, of course. Word files, Excel files, video files, audio files, etc. Most recently, we’ve been working with C++ source files and their executable forms after compilation.

This should imply to us that there are at least two types of files out there: binary and text. This is quite true! We’ll be focusing on the text files in this text, but there will be a brief introduction to binary file processing in a later section ([file:bin]).

Since we are focused on text files, that means we can create sample data files for our programs to process in the same environment that we’ve been using for C++ source files! Any plain text editor will do, but if you are used to an environment, keep using it!

Let’s start our discussions with input since that is what we mostly do with files to start — read in immense amounts of data to process!

Input

Input from a text file is almost just like using ‘cin‘. It differs in just three respects, really:

But let’s start with a simple file format without comments to make things as simple as possible. Let’s say we wanted to read in a sequence of space-separated numbers from a file. Data still needs to be separated by spacing for even a file stream in text mode to read it properly.

Connecting to Files

The basics of connecting to a file are having the name of the file as the user sees it and then calling the proper method. To get the user’s file name, we need to have a ‘string‘ of some kind. I’ll actually use a ‘string‘ ‘class‘ variable for convenience so we don’t have to worry about overflow on input of the name:

Make sure to use ‘getline‘ to read file names since most users like to put spaces inside their files and/or folders these days!

Once we’ve got the name, we need to call the proper method. But of what ‘class‘ and which method?

Since we are trying to input from this file, we’ll use the type ‘ifstream‘ — short for Input File STREAM. This ‘class‘ is found in the ‘fstream‘ library with all File STREAM codes.

The proper method is called ‘open‘. You just need to pass it the file name ‘string‘ from earlier:

And that should do it — assuming the user typed the right file name and possibly folder path. The file name the user types must include not only the name, but also the extension and path of the file. If the file is in the current directory/folder, the path can be skipped. But if the file is located in another place, they can use a relative or absolute path to indicate this to us.

A relative path would start with the subfolder name if the file is under the current folder or with if the file is located above the current folder. An absolute path would start with either a drive letter and colon (on Windows) or a if on a *nix machine.

Once the file is ‘open‘ — connected to our program via the ‘ifstream‘ variable we made earlier, we can read from it with any techniques we used with ‘cin‘! We can use ‘>>‘ for basic reading or ‘getline‘ for ‘string‘ data or we could ‘peek‘ to make decisions about what to do next or we could ‘ignore‘ to skip past data, etc.

eof Loops

The tricky bit is that we don’t typically have just one piece of data in a file or even a known number of data in a file. It is usually an arbitrarily long sequence of data we are expected to process. With numeric data on the console we did this via calling ‘fail‘ to determine when the user was done entering real data. But in files, there is a special marker at the end of all files we can use to determine when the sequence is done!

This marker is known as ‘eof‘ — End Of File. We call the ‘eof‘ method to determine if we’ve reached it. But that terminology was a little sloppy. It doesn’t return ‘true‘ until the program has tried to use the marker as data. We can read the last piece of data next to it successfully and it still won’t go off! So we need to be careful when formulating an ‘eof‘ loop.

This format borrowed from our ‘fail‘ on console experience will work much of the time:

But it isn’t as good as we can do! We can tweak it a bit to make it work on any input file! It turns out there are two ways to make it work so generally: a simple tweak and a more object-oriented approach.

Tweaking eof

To tweak it, we just prime the ‘eof‘ condition with something other than reading data.⁴³ The problem with the file variants that don’t work with the above form is all about spacing.

I believe this stems from the POSIX demand that all text files now end in a newline character (before the ‘eof‘ marker). But whatever the case, the tweak to fix things is to extract a contiguous sequence of whitespace as priming:

This makes sure that stray whitespace after the last piece of data or in an otherwise empty file won’t cause us problems.

The one difference is that if you are reading with ‘getline‘ instead of extraction as above, you should not prime with ‘ws‘ as it would eat some of the whitespace you were trying to preserve with ‘getline‘! Instead, prime with a ‘peek‘ operation. One of the odd things about ‘peek‘ is that even though it doesn’t actually read anything it will trigger ‘eof‘ if we see the marker at the current location. This would, then, look like so:

Had to change the file variable and data variable names, of course, but you get the idea…

Object-Oriented eof

Another approach that works despite the rampage of spacing in modern files is also more object-oriented, so that’s nice. *smile*

And all is well! This not only makes the loop more object-oriented, but makes it smaller, too!

Again, both ‘>>‘ and ‘getline‘ return the stream used for input as their result. And this stream in the ‘bool‘ context of the ‘while‘ head are ‘true‘ when the file is happy and ‘false‘ when the file has had any sort of problem — like an ‘eof‘ encounter.

You’ll find other forms of ‘eof‘ loop all over the web. They do not work as well as these two forms! Please always use a tweaked or OO ‘eof‘ loop in C++ programming!

Disconnecting from Files

One thing that escapes many folk first learning about file processing is then need to disconnect their file from the program after it is done being used. This need comes in several flavors depending on the work the file has been doing, but it is always a need!

We use the ‘close‘ method to do this job and it is quite simple. But there is one thing about it that is a little odd: it either leaves the file in the original state before closing (‘eof‘, ‘fail‘, etc.) or in a new but still erroneous state.

The latter is because some library implementers want to signal something to themselves upon reuse of this file variable. Not good practice, but not forbidden by the standard, either. The former is just laziness. So, to be sure we can reuse this variable later in the program, we should also ‘clear‘ it after we ‘close‘ it:

Why close Files?

All files use system resources called file handles. These resources are given to any attempted file open for use in processing the file through the OS. But these resources tend to be finite — albeit very large — in number on a system. So if we don’t give them back by ‘close‘’ing a file variable, we can run the user’s system out of them! (This wouldn’t just affect us, either, but all programs running on the system!)

Another reason is worse for output files. They store their ’displayed’ information in a buffer just like ‘cout‘. The difference is that, while ‘cout‘’s buffer was around 2 kilobytes, that of a file stream is typically 16 kilobytes or more! That’s a lot of text that would get lost if the file didn’t ‘close‘ properly!

To compensate for these things, the destructor of the stream ‘class‘es has been made to call ‘close‘ for us. But it still doesn’t alleviate the problem when we want to reuse a file variable later in the program to access another file and the file variable hasn’t been destroyed and recreated in some inner scope like a function or loop body.

A Complete Example

Just some space-separated data along a line. But remember that any spacing will do. You could add blank lines or just newlines here and there and tabs are fine as well.

Output

The second most popular use of files in programming is outputting to them. We have to save the user’s data somewhere for future sessions, after all!

This is much simpler than input, too. You just ‘open‘ a connection to a disk space and output to it. Don’t forget to ‘close‘ the connection when you are done.

What type of variable do we use? Well, ‘ofstream‘, of course! That’s for Output File STREAM — just in case. *smile*

And, once ‘open‘, you can do anything to it that you’ve ever done to ‘cout‘. You can insert to it (‘<<‘), you can ‘flush‘ it, you can format it with various manipulators or methods, etc.⁴⁴

Errors on Output

Although you can check for output errors on file streams, this is almost as fruitless as checking for output errors on ‘cout‘. It is usually something catastrophic that has happened and cannot be recovered.

Might as well just go blissfully along writing to a dead file variable and being silently ignored until you are done as to try to stop early and report the problem to the user. Well, maybe if you were processing lots of data it might be worth the effort.

A Complete Example

Again, you don’t have to check for errors on an output file,⁴⁵ but if you did, there’s how to do it. You can use the ‘!‘ operator directly on the stream which causes the usual evaluation of it in a ‘bool‘ context. Or you can ask the stream if it has ‘fail‘ed or is ‘!‘ ‘good‘. Once you know it had problems, you would, of course, ‘clear‘ it and print some sort of message for the user to know.

Opening Issues

Thus far we’ve just ‘open‘ed a file connection and let fly with activity on that file. But what if the file can’t connect? Can this be a problem? Yes!

At its simplest, the user may misspell the file’s name or get the path to the file wrong. At the worst, files may exist on external drives and those might be detached during an ‘open‘ operation causing a ‘fail‘ure.

Files exist in folders/directories and these have permissions or allowed actions by different users on the system. If we try to ‘open‘ an output file in a folder that is unwritable, this will ‘fail‘. If we try to ‘open‘ an input file from an unreadable directory, it will ‘fail‘. Files also have permissions on them and ‘open‘ing an unreadable input file will cause a ‘fail‘ure as well.

Input Opening

To fix problems with ‘open‘ing an input file, we just need to write code like this:

Note the ‘close‘/‘clear‘ pair. The ‘close‘ ensures that the previously allocated resource from the OS is released before we request a new one. The ‘clear‘ gets the stream variable over its ’funk’ after the connection from its previous ’file’ ‘fail‘ed. (It also ‘clear‘s the state that some library implementers set after a ‘close‘ operation for bookkeeping. We’re about to reuse this stream so we need it happy!)

How, then, do we test if the file ‘fail‘ed to ‘open‘? We can use the ‘fail‘ method, of course:

Or we could be more general — expecting the worst — and check that the stream is not happy:

Or we could be more object-oriented and make the file variable act as a ‘bool‘ for a moment:

Output Opening

In addition to the problems we noted above, when an output file is connected, the physical disk file is effectively erased or emptied and you are placed at the beginning. In other words, opening an existing file for output destroys it.

If you open a file with gigabytes of important information — your only copy of it in the world! — as an output file, it will all be gone — forever!

Obviously, opening such files should be done with care. Luckily, input files simply won’t open when a file doesn’t exist! So we can open an input connection to the desired file and, if that fails, re-open it for output with a clear conscience. If the input connection succeeds, however, maybe we should try a different name…

So we get a filename, ‘open‘ the file as if we wanted to input from it (as an ‘ifstream‘), and see if that worked. If it did, we have an existing file!⁴⁷ We report the problem to the user and ask what they want to do.

A complete list of possibilities is: get a new name, overwrite the file after all, append to the file (see below), or just give up at this time. (The last is useful if they’ve tried this in a menu option that can just be ditched and go back to the menu, for instance.)

If they choose to give up, we set ‘abort‘ to ‘true‘ but if they choose to overwrite or append it’ll be ‘okay‘.

If the input connection didn’t open successfully, we are safe to ‘open‘ this file for output as it doesn’t yet exist. So it’s gonna be ‘okay‘.

This "‘open‘-as-input" loop continues while the user isn’t ‘abort‘ing and we still aren’t ‘okay‘ to proceed.

So how do we ‘open‘ for appending? Well, you can ‘open‘ a file in a few different modes. This tells the file to act slightly differently while it is connected to the program with this stream. Append mode is different from output mode in that it doesn’t erase the file but rather adds new output to the end of the current content.

To ‘open‘ in a different mode, use the [optional] second argument to ‘open‘ to change the mode:

This would ‘open‘ the ‘file_var‘ (an ‘ofstream‘) for output at the end of the file’s current content — not erasing but adding to the file.

The other two major modes are ‘ios_base::in‘ and ‘ios_base::out‘. These are the default for their specific file types and rarely need to be stated explicitly.

Passing to Functions

Just because we have a new tool doesn’t mean we need to use it all in the main function! We should also learn to pass streams about between functions.

There are two things to be aware of when passing a stream to a function. Both involve the type used in the transfer.

To Copy or Not?

The first is that streams must always be passed by reference. To see why, imagine the structure of the stream variable and how it communicates to its disk file. Internally, the stream has a buffer and a current position within that buffer where processing is active. This is the get position for an input stream or the put position for an output stream.

If a stream were passed to a function by value, a copy would be made of all of this internal structure. The function’s local copy would then process along in the stream and at the least the buffer position would be updated. But if it were an output stream, the buffer content could also be updated. And if the position reaches the end of the buffer it is either refilled for input or ‘flush‘ed for output! Lots of actions are happening here — some with more permanent effects than others!

But, when the function ‘return‘ed, the local copy would be destroyed and we’d be back to the original stream. The exact original stream. It would be in the same buffer state — position and all — that it was before the call! Now processing would pick up from there. If it was for input, we’d reprocess the same data the function did already. If it was for output, we’d start overwriting what the function created for us!

Passing a stream by value would be quite the mess! So, to avoid that and force us to use reference on stream parameters, one of two things is done in their ‘class‘es. In older implementations (pre-C++11), the copy constructors would be made ‘private‘ and implemented broken so that they didn’t really copy anything. This was effective to a point, but C++11 upped the ante a bit. It introduced a new use for the ‘delete‘ keyword. Now the stream ‘class‘es have their copy constructors set equal to ‘delete‘ like so:

This tells the compiler that there will be no copy constructor for this ‘class‘ — ever! Needless to say, this is terribly powerful and should be used with caution.

A Stream by Any Other Type...

The second thing to keep in mind when passing a stream to a function is how similar the streams are to one another. Not ‘ofstream‘s and ‘ifstream‘s. But ‘ifstream‘s and whatever ‘class‘ ‘cin‘ happens to be of. They share a plethora of operations, right? In fact, you could almost drop one in where the other was and the code would keep working just fine.

The makers of these ‘class‘es noted this and took advantage of it during their design. Using a technique we’ll learn later for ourselves called inheritance, they connected these ‘class‘es together in a sort of family. The input streams all come from a common ancestor ‘class‘ as do all the output streams. These two ancestral ‘class‘ types can be used to refer to either descendant due to these inheritance relationships! (Again, more on inheritance in chapter [inh:defn].)

So, we like to take advantage of this by making our code as general as possible and allowing for the possibility that we are talking to either a file or a console stream — not caring which it is! This is pretty easy to do for output situations. We like to label our output nicely in either a report file or an on-screen report, right? But we do need to watch ourselves a little for input. We don’t want to prompt for file input, after all!⁴⁸

Okay, okay, don’t keep us in suspense any longer! What’re these familial types?! Oh, right. They are ‘istream‘ and ‘ostream‘. So, if you are making a function to do output, you would pass the stream to use as an ‘ostream&‘ type or, for an input function, you’d use the ‘istream&‘ type.

Two Caveats

This new power comes with two little issues. Neither is truly bad or good, but has aspects of both.

When You Have to Be a File

Sometimes a file just has to be a file. That is, consoles don’t share all the same operations as the file streams, after all. The two things they don’t do — or rather do automatically and can’t be changed — is to ‘open‘ and ‘close‘. If you are doing either of these things to a file stream within a function, it must be passed as full-on ‘ifstream&‘ or ‘ofstream&‘. It can’t be helped.

Defaulting a Stream

Two things come into conflux here. We’ve long-since had the ability to make some arguments default to initial values when the caller failed or forgot to pass them to the function. We’ve also long-since had references which couldn’t default unless they were ‘const&‘. But now we have these nice pure reference streams and we’d probably like to default those to the console so we didn’t have to type ‘cout‘ and ‘cin‘ even more often!

As luck would have it, we can! The reason we couldn’t default a pure reference before was because we had no global objects to default them to. You’d have needed such a global to refer to so that the function could see it from anywhere, after all. But now we have the global console streams! You can default an ‘ostream&‘ to ‘cout‘ or an ‘istream&‘ to ‘cin‘ with ease.

An Old Example Revisited

Let’s take a practical example by looking at an old friend: ‘display_money‘. We haven’t seen this function in quite some time, so if you need a refresher, you can look back to the original discussion in the last volume.

Here we’ve got a function to display monetary values with lots of configuration options:

Moving About in a Stream

Sometimes it becomes evident that it is necessary to reprocess some or all of an input file’s data.⁴⁹ In these situations, never ‘close‘ the file and ‘open‘ it back up again! This is terribly wasteful of both your time and that of the user. It takes a good deal of communication with the OS, you see, which never goes quickly or smoothly.

Instead, just seek out a new place to get information from in the file. What? That’s an odd way to say it? Well, yes, but it matches the syntax chosen by the standards committee. You see, they’ve called the relevant function ‘seekg‘ which is short for ’seek where to move the get position’ — effectively.

As mentioned earlier, all input streams have a position from which they are currently getting information — the get position. The beginning of the file is, of course, position 0.⁵⁰ So to reprocess all of the file, you could just do something like this:

Where ‘infile‘ is the name of your ‘ifstream‘ variable.⁵¹ (The ‘seekg‘ function is available for ‘cin‘, but it doesn’t really work, so...)

To reprocess only part of the stream, we’ll need a little help. A method called ‘tellg‘ will report to you a representation of this position — tell you where the file is getting information — in terms of how many bytes it is past the beginning of the file. This ‘return‘ is of the data type ‘streampos‘. This is an odd data type that is restricted to ‘unsigned‘ integral values except for the occasion that the library itself needs to store -1 in it for book keeping. It is typically a ‘class‘ type to make this happen as smoothly as possible. But for many purposes it works as an ‘unsigned‘ integer type.

Once you get a ‘tellg‘ result, you can pass it to ‘seekg‘ later to return to that position in the file for reprocessing:

But that’s not all! We can also ‘seekg‘ to positions not just from ‘tellg‘ or 0, but also relative to places other than the beginning of the file. There is a second argument to ‘seekg‘ that allows this. This other overload of the function takes a ‘streamoff‘-typed offset and a relative position indicator. A ‘streamoff‘ value is like a ‘streampos‘ except it is ‘signed‘. The the standard even provides that any ‘streampos‘ can be converted to a ‘streamoff‘. (The reverse is obviously not possible all the time.)

So what are these ’relative position indicators’? Well, there are three of them: ‘ios_base::beg‘, ‘ios_base::cur‘, and ‘ios_base::end‘. The first is the beginning of the file and is therefore just like the one-argument ‘seekg‘. The second is the current position — aka from where ‘tellg‘ would report. Positions relative to ‘cur‘ can be either positive, zero, or negative.

But before you go off trying to back up by the number of characters you just read from the stream — a ‘char‘ being typically one byte — consider this next bit. There is one ‘char‘ that is of variable size depending on its originating OS: the newline. This wee beasty is one byte on Unix-like systems (including MacOS and ChromeOS) but two bytes on Windows and derivatives!⁵²

This can obviously cause trouble when trying to move around in text files and you can’t just rely on seeking to the number of bytes based on the length of the last ‘string‘ you read. Even if you remembered that ‘getline‘ tosses the newline it processed from the stream, how many bytes was that? If you are developing a dedicated app that’ll never run on any other OS, you can get by with hard-coding the 1 or 2 additional bytes. But if you are trying to be portable, that’s not possible.

That’s where ‘tellg‘ also comes in handy! It tells you where you’ve been so you can back up that way and not have to rely on ‘ios_base::cur‘-relative seeking. I know it feels odd to learn a new tool and then be told not to use it, but it really is more trouble than it is worth — at least in a text file. If you want to use ‘cur‘ seeking to good effect, try binary files like in Appendix [file:bin].

Well, what about that last relative position indicator: ‘ios_base::end‘? That one can come in handy from time to time. The important thing to remember here is that offsets can be only 0 or negative. Zero? Yep. The end-of-file marker is considered 0 relative to the ‘end‘ of the file.

Be Careful!

Just make sure you never go too far relative to a certain position or in the wrong direction from a certain position. There are typically no safeguards in place to keep you from going before the ‘beg‘inning or past the ‘end‘ of a file! If you try to go negative from the ‘beg‘inning or positive from the ‘end‘, it is liable to ’work’. This could corrupt things on your disk in the worst cases, so be very careful!

Full Disclosure

Another way to deal with those Windows line-ending bytes is to convert them to 1-byte variants upon transfer of the file. This is technically allowed for by the ASCII transfer mode in an FTP-style agent, but I haven’t seen it properly implemented in years. Most systems default to binary transfer mode and leave the newlines intact.

You can also convert the line endings after transfer more manually. No, don’t load it into an editor and retype them all! Just run them through a handy conversion tool like which is found on or is available for most Unix-like systems. Such a tool is run from the terminal or command or shell prompt — depending on who you talk to. *smile*

A Full Example

Perhaps an example is in order. The following program calculates the size in bytes of a file and also converts it to a more human-friendly form for good measure.

Here we see that we record the position of the file before seeking the ‘end‘ so that the caller doesn’t have to time their request for the file’s size to a certain point in their code. That is, they don’t have to record the get position and back up themselves after we’ve moved things around. Then we move to the ‘end‘ and ask where we are. This is the position of the end-of-file marker and — since it is 0-based — it is also the number of bytes that preceed the marker! Luckily we don’t even need to add 1 since the eof marker doesn’t count as part of the file’s size by convention. Once we back up to their original position we ‘return‘ the length we recorded before.

The rest of the program just converts the bytes to a more human-readable form with typical metric prefixes. Just note that in file sizes⁵³ a kilo is 1024 rather than 1000 as in science class.

Layout of Data in a File

The first thing to remember is that the program itself determines the layout or order of data in the file. The user must comply and is not in charge here! When you write the program to read the data from the file in a certain way, that is the way the data must be layed out from then on.

So what ways are there to arrange data in a file? Well, there are two basic ways: sequential and block.

In a sequential layout, we put the data one after another with some sort of spacing between until the end of the file. This works for simple files, but not so well for complex ones — or even moderately complex ones!

In a block layout, the data are grouped together in pairs or triples or however many items are needed to fully describe a single entity in the program — a single ‘class‘ object, for instance. This has applications in a wide variety of situations as you might expect.

Sequential Layout

In a purely sequential layout, we might have a set of numbers to do statistics on listed one after another in a file. Or we might have some names for contacts or products on separate lines of a file. Or...

If the data is more complex, a pure sequential file is a right pain. Let’s look, for instance, at a simple list of 2D points. The natural way to list them would be interleaved

x

and

y

coordinates. But that would be in blocks, technically. To make it purely sequential, you’d have to list all the

x

values first and then all the

y

values.

How do you know where one type of value is done and we are starting the second? We could count as we read them all in and split the list in half in a second pass or we could mandate that the user enter the number of points at the top of the file before the

x

coordinates start. Either is messy and to be avoided.

Common Misconception

A common mistake by programmers new to data file management is thinking that the way they have their test file layed out is the one and only right way. This can lead to trouble when a user sends in a file that is having issues and the layout is even slightly different. The programmer can have a knee-jerk reaction to think the slight difference is the cause of the problem when it has nothing whatsoever to do with it in actuality. For instance, let’s say the programmer had a test file with these data in it:

This file is processing just fine but the user sends in this rearrangement of the file and says there is a problem:

The programmer could immediately think the slight rearrangement of the data is the issue, but it probably isn’t. Let’s look at it from a buffer perspective. Here is the first file:

The only difference is that some of the spacing has changed from space to newline and vice-versa. That is: there is no difference here! Something else must be going wrong for the user and not this file. After all, the only thing ‘>>‘ cares about is the separation by whitespace — not what kind it is.

All of these work the same as the first two! (And it wouldn’t matter if there were tabs in them or anything else that was considered spacing...)

Block Layout

Let’s start the block layout exploration with a simple example of why this is even important. Remember the problem with the

x

and

y

coordinates earlier? What if the data was much more complex? What if we had information about students in a little database like so:

Note that the leading 5 tells us how many entities are described in the file. This is followed by the students’ names, their class standings, their GPAs (11-point scale), and their branches. Branch names have spaces in them and so are on separate lines.

This is clearly a mess and not readable by the human user trying to make manual adjustments to the data. By simply blocking the data into groups (sometimes called records and the individual data in them fields), we can make it much nicer:

The user could even keep things more spaced out for further readability enhancement if they so chose:

And, noting that the student’s name will be extracted (‘>>‘) from the file, we could even allow more blank space before it to help separate the blocks from one another:

How would we tackle reading in something like this with the data all mixed together like that? Quite simply with a standard EOF loop:

Here ‘object‘ is an object of the ‘class‘ we’ve written to represent the ’student’ concept from the program. We call this ‘class‘’ ‘read‘ method and pass it the file stream to read from (as an ‘istream&‘, of course). Then we can put the ‘object‘ in a ‘vector‘ or process it immediately or...whatever we need to do with it!

Like always, read into local variables and call the mutators. I like to ‘return‘ a ‘bool‘ to let the caller know it went well or not, even if they don’t care.

Labeling Data

With JSON and XML being such popular ways to transport data these days, it’s easy to forget that labeling data has been around for many, many years. So, let’s start with the basics and do some name-value pairs or label-value pairs for our data files.

Taking the student database above as an example, we might see the following in a data file:

Here each piece of data is stored alongside a labeling string of characters like "name" or "class". This lets the user see immediately what the value is in context of the database’s idea/schema. The labels and values are separated by some conveniently chosen symbol like an equal sign or colon. I’ve chosen to not have the labels need quotes around them like in a JSON file. Nor the values, for that matter. The separator is enough to tell one side of the line from the other and so begins our adventure into parsing (reading) such data.

First Try

Here ‘label‘ and ‘name‘ are C-strings with the implied maximum allocations and ‘sep‘ is a ‘char‘ to hold the equal sign.

But this attempt doesn’t account for several things. Firstly, the user might re-order their information and, as long as it is within the block, this should be fine by us:

Note that the same data is present as before, but the blocks have been internally shuffled. The assumption here is that your program should be able to place the correct data into the correct variables (of the ‘class‘ object) by the context of the labels. That is a major assumption! Are we ready for it? I say let’s GO!

Another Go

Let’s pause after reading the ‘label‘ part and see what it is before we deal with the value part:

Here I’ve made a simple array of C-strings with all the labels we are supposed to know about in this program. Then, after reading the ‘label‘ and separator, we look for the ‘label‘ read amongst the labels known. At the end of the loop, the variable ‘L‘ will either be the index of the matched label or it will be the constant ‘MAX_KNOWN_LABELS‘. (Both of these are ‘size_t‘ type variables, of course.)

After this simple linear search we can use a ‘switch‘ to find out which label was seen and take an appropriate action:

A Third Tack

But, if we are labeling data so that it may be easily read, it might also be easy to edit/change outside our program. Users don’t type as carefully as our program reads, of coruse. The user might end up with something like this:

We can write a case-insensitive C-string compare function to dodge the case issues this person is facing in the labels. This kind of thing was done for the ‘string‘ ‘class‘ in the first book of this series, for instance.

But what about all the spacing issues? Having the separator up against the label like that is disastrous, for instance.

Luckily the labeled data is one item per line of the file. So, we can read the whole line at once and then do some C-string processing — which we are getting pretty good at — to break up the pieces within our program:

Here ‘search‘ is a function to look [linearly] through a C-string for a particular ‘char‘acter. ‘min‘ is found in the standard library ‘algorithm‘ if you don’t want to code it for yourself.

We have to use the pointer to the separator for a couple of tasks here that at first seem unusual. First we overwrite that position with a null ‘char‘acter. This makes a single ‘char‘ array into two C-strings essentially. Since we have the pointer, though, we have the locations of both.

Then we use that address (‘sep_at+1‘) as the starting point to copy the value from and into the ‘value‘ C-string.

But we also now have an issue looking up the label in the array of known labels – skipping that spacing that may/may not have been on either side of the separator. It is now in our ‘label‘ C-string and it won’t prove the same as the labels in the known array.

We could make our case-insensitive C-string comparison function able to skip spaces — that’s a reasonable option to include on such a function, after all. Or we could make a general space-removal function to help in lots of places this might become an issue. I guess I really telegraphed that one, eh?

When making this into a function, make sure you give the caller the option to remove/strip spaces from either the left end, the right end, or both of the C-string. This is easy to do with an ‘enum‘eration. And even cooler if you use some bit manipulation techniques.

(Of course, if you’re willing to use pointers, the shifting loop could be done with ‘strcpy‘ instead.)

With all this in place, the ‘class‘’ input function probably looks something like this:

Wait, what are ‘end_of_block‘ and ‘seen_any‘? Those are there to track two other conditions we didn’t mention yet. First, since the block elements can now be rearranged, it is a little tricky to tell when we’ve read all the elements for a block. So we have to keep track somehow and I’m flagging that all of them have been read with a ‘bool‘ flag for our loop. Second, it is best to handle the situation of a block with missing data. ‘seen_any‘ is a ‘bool‘ to track that any of our elements were read properly and thus we have a partial block of data we can report to the caller.

How do we detect/track these two situations? There are numerous ways, of course. For simplicity we could use a set of ‘bool‘s for a short set of known labels we were tracking. But for extensibility, we’d probably use an array paralleling the known labels array itself. Then we can use a simple loop to accumulate the state of ‘end_of_block‘ and ‘seen_any‘ after each line has been processed.⁵⁴

One other issue for the ‘end_of_block‘ situation ties in with the partial block situation. This is when the partial block isn’t the last block in the file. How do we detect this and what do we do about it?

Well, we can use our ‘bool‘ array to realize we’ve seen a label already to detect that we’ve read too far and that this duplicate label is not ours but the next block’s data. But how to fix that we’ve already read that line and how does the next block get back that data? Hmm... Aha! ‘seekg‘/‘tellg‘! We’ll do a ‘tellg‘ before each line is read and we’ll then ‘seekg‘ back to that location when the duplicate label is detected. We will also mark ‘end_of_block‘ as ‘true‘ forcibly so we don’t forget and try to read even more!

The first we can do with either functions like ‘stod‘, ‘stol‘, etc. from the ‘string‘ library or with their underlying core functions ‘strtod‘, ‘strtol‘, etc. The underlying functions take C-string’s as pointers and allow for more configurability in the translation like from what base the numbers are to be assumed and so on. Or we could use the tools from the next section on ‘stringstream‘s (section [str:strm]).

For the second we can either ignore offending lines without separators or with unknown labels. Or we can print error messages. Or we can ‘throw‘ ‘exception‘s as discussed in chapter [assort:tools].

For the third we can just default initialize the object before the EOF loop begins so that everything will be in a clean state if not overwritten during processing.

What about the main program? They just need to call the ‘read‘ method in their ‘while‘ head in an object-oriented fashion:

When this loop ends, there will be no more blocks in the file and thus no more data to process. If the ‘read‘ method ‘return‘s ‘true‘ it has read at least a partial block and it is safe to process it.

Mixing Layouts

So far our designs have been pretty simplistic. But is that as complex as it gets or can get? Of course not! Even in a pretty basic program we can have things like nested data! In ‘class‘ terms, we could have a ‘vector‘ inside a ‘class‘. In data file terms, we’d have a sequence inside a block.

These nested sequences could be numeric or non-numeric in general terms. Let’s look at each situation in turn.

Numeric Sequences

It is important to note that the numeric sequence is last and that the first thing in the block is a name of some kind. This let’s us detect the end of the numeric sequence by either an EOF or the ‘fail‘ure that will happen when we read for a number and hit the next block’s name. Overall we might code this:

Another way this could go down is that the data could be labeled. This is actually advantageous in that we no longer have to worry about the name coming next in a roundabout read — there will be a label or EOF after the numeric sequence! The big question here is: should the sequence still be on a line beside its label or should it be allowed to wrap to the next line or further?

After all, if we just extract the numbers (‘>>‘), they’ll be able to cross lines with their list. To prevent this is almost excessive — it is their data, after all. If we wanted to, though, we can just ‘peek‘ for a newline ‘char‘acter, of course, so it isn’t really that bad on our end.

One solution is to introduce syntax for a continuation mark. The C & C++ languages do this with a \ character, for instance. This is implemented fairly crazily, though, in that the wrapped line is immediately concatenated with the original line without indention removal.⁵⁵ We could look for such a marker as easily as a newline and know that processing of the sequence should keep going until the next newline.

Non-Numeric Sequences

If you have a sequence of non-numeric data inside your block, more care is warranted. Since numbers read into ‘string‘s just fine, we need to either ‘peek‘ for a digit (‘isdigit‘) before each extraction (‘>>‘). We also need to know in general where the end of this sequence is because it might otherwise subsume the next element of data!

Labeling the data can help here as long as the labels themselves are not allowed as list members. We can also use a special flag value that can’t be a list member to terminate each sequence. Or we could fall back on the old "put the count at the start of the list" method we used back in the pure sequence file that contained both

x

and

y

coordinates.

Comments in Data Files

Often times data files are commented with what are knonw as meta-data. This is typically information that isn’t abotu the file contents so much as it is about those who have processed the file — who edited it, when, why, what did they change, etc. Such information can be important when the department’s sales figures are all wrong — then we’ll know who to blame. *grin*

Your program won’t need to read and understand these comments anymore than the compielr doesn’t understand your comments in your C++ program. However, you will have to process past them to get to the next bit of data — much like when we did nice input notation with our user back in the previous volume.

First we must pick some sort of syntax that will denote a comment. One of the most common symbols is the pound sign (). But others are also used: ‘’*’‘ in FORTRAN, ‘’;’‘ in ini (initialization) files, ‘’!’‘ in Access data bases, etc. And many use not just a single character but a short sequence of them to denote a comment: ‘"rem"` for both ini files and BASIC, ‘"//"` for C++ or Java, etc. We’ll just pick a single character to represent our comment mark for simplicity.⁵⁶

Whole-line comments are pretty much a subset of end-of-line comments that just don’t allow for there to be other information before the comment mark on the line — most likely whitespace, but nothing else.

An end-of-line comment, though, can appear at any point on the file line and extends to the end of that line. This is the most common allowed comment style and isn’t too hard — especially in a labeled format.

Block comments are pretty tricky in that they can span either several lines of the file or just a smaller part of a line that isn’t necessarily to the end of that line. There is also the issue of whether you allow them to nest inside of one another. Although this can be done, not even the C++ language allows block comments to be nested! This might be a goal for later or just some extra credit when you have more time on your hands. *smile*

Lastly, should comments be relegated to just the top of the data file or should they be allowed at any point in the file. This is a big deal and can lead to both trouble and efficiency. The trouble is that user’s will place comments all over the place whether you remind them of the top-only requirement or not.

The efficiency potential is that if you have a bunch of comments at the top of the file and need to reprocess the file later, you can remember where the comments ended with a ‘tellg‘ and seek back to that point later instead of to 0 and rescan the comments all over again!

Overall, I recommend not limiting the user to comments at the top because it is relatively easy — especially in a labeled process — to strip comments from an input before processing a line for data. In fact, if you follow the idea from labeled line processing and combine it with the techniques of the next section, you can strip end-of-line comments from any file with ease and even take care of separating blank lines at the same time!

string Streams

Our last necessary topic for streams is from the ‘sstream‘ library. It is a combination of streams and ‘string‘s to allow you to trun a ‘string‘ into a stream — essentially.

"Why would this be useful?" you may ask. Well, we’ve been focused so far on either console interaction or storing/retrieving data from permanent storage on a drive of some sort. But much of our interfaces with the world are through other means. And those deal muchly with ‘string‘ (or C-string) data.

For instance, GUI⁵⁷ front ends do lots of ‘string‘ work with labels for buttons, windows, and messages for the user. And when the user enters info into an input text field, that comes to your program as a ‘string‘ — even if it has a number in it!

Also, network communication is almost all done with ‘string‘s of information instead of raw numbers and such. Being able to turn our regular data into a ‘string‘ form is essential to such transmission. And, at the other end of it, we’ll need to pull the data back out of the received ‘string‘.

Let’s start with putting the data into a ‘string‘ and then we’ll work on getting it back out again.

Output to strings

To ’output to a ‘string‘’ we have to create an ‘ostringstream‘⁵⁸, output to it, and then ask for the resulting ‘string‘. So if we wanted to put the number 42 into a ‘string‘ we could do this:

Now the ‘string‘ ‘s‘ has the value 42 in it. This will work with variables of all types, of course. But further, the ‘ostringstream‘ ‘class‘ is compatible with ‘ostream‘ just like the console stream ‘cout‘ is. So anything you could do to an output file stream or ‘cout‘ you can do to an ‘ostringstream‘ as well! You can even pass an ‘ostringstream‘ to an ‘ostream&‘ parameter to get it worked on in a custom function and then get back the ‘string‘ from its ‘str‘ method afterwards.

But the main thing about that is that we can take the attributes of ‘cout‘ or some other ‘ostream‘-compatible source and copy them into our ‘ostringstream‘ so it will output to its ‘string‘ just like they would have. Take this function, for instance:

Here we take the ‘flags‘ and ‘precision‘ from the given ‘ostream‘ and set them into the ‘ostringstream‘ so that the given number is formatted like it would have been on the given stream. This is important so that we don’t use default formatting when the caller has set up special treatments.

But if they have an ‘ostream‘, why are they turning this data into a ‘string‘? Well, like we said, the data might be bound for a GUI or network interface instead of ‘cout‘ or a file. But they want formatting to be consistent throughout their program, perhaps. (Not necessarily a thing for a network, but definitely necessary for a GUI!)

Should we take the ‘fill‘ and ‘width‘ as well? That’s debatable. Some situations call for it and others don’t. I think it will be fine here for an individual value to turn it into a ‘string‘ and let the output of that set these formats. But if it were going out to a GUI, we might need to do it ourselves.

Wait, what about that ‘imbue‘ stuff? That is fancy verbiage for, "tell me how you format thousands-group separators and decimal points in the local customs."⁵⁹ ‘getloc‘ will return the current ‘locale‘ information from the ‘ostream‘ we are given. This tells that local customs are a ‘’,’‘ and a ‘’.’‘ or vice versa or whatever they may be. And we then ‘imbue‘ that information into our ‘ostringstream‘ to make it act the same way as local custom dictates. We wouldn’t want to start giving a European customer American looking measurements, now would we?

And, to do other data types is similar but doesn’t need ‘precision‘ set up because only floating-point types need that.

You don’t have to use this as a separate function, of course. But having a set of routines in a library isn’t a bad idea, either.

Input from strings

Similar to output, ’input from a ‘string‘’ is accomplished with an ‘istringstream‘ object. You either construct the ‘istringstream‘ with the ‘string‘ or set it with the ‘str‘ method. (If setting a new ‘string‘ after you’ve already processed another, make sure you ‘clear‘ the ‘istringstream‘ before calling ‘str‘!) In other words:

You can also ‘imbue‘ the ‘locale‘ from an ‘istream‘ like ‘cin‘ to get local customs on decimal point and thousands group separators as with ‘ostringstream‘s. There are also some ‘flags‘ that are used in input, so you might want to set those, too. ‘precision‘ doesn’t really get used during input, though, so you can skip that. *smile*

The main problem we might have here is that if the caller wants to take more than a single value from the same ‘string‘, this function won’t work. See here:

This will print 4.2 twice instead of 4.2 and -8.5. The reason is that we pass the same ‘string‘ to the function twice, but the local ‘istringstream‘ is therefore constructed twice and starts each time from the beginning of the given ‘string‘. If we could remove the first value from the ‘string‘ before the second call, we could make progress. But that would depend on application specifics as to how the data are separated in the ‘string‘.

Perhaps separate functions/calls for parsing a single ‘string‘ aren’t the best idea? But, still, could we make it work? The problem is that the functions are using separate ‘istringstream‘s each time. If we could get the functions to share a single ‘istringstream‘, things would work much more smoothly.

We could put the ‘istringstream‘ in the global area of the program... *eww* That would be horrible! We NEVER use global variables!

We could put them together in a ‘class‘. Hmm...that has potential! Let’s try it:

Now we can use a single ‘Extractor‘ object to get all the data from a single ‘string‘:

This now results in 4.2 and -8.5 being displayed! (The space between the values is eaten by the ‘>>‘ on the ‘istringstream‘ as usual.)

A Practical Example

Here, then is a more practical example using a simple ‘class‘ for rolling a set of dice for gaming:⁶⁰

And let’s focus on the implementations of its ‘input‘ and ‘output‘ methods more carefully. Let’s do output first as that is easier:

Note that this collects the entire ‘DieRoll‘’s data together in a single ‘string‘ before it is output. This will make sure the entire object is in one ‘width‘ setting together instead of just the ‘count‘ or the ‘’d’‘ being in the ‘width‘ alone. This can make or break a formatting setup and is a great side-benefit of the ‘string‘ stream tool!

Here I’ve mixed some ‘istringstream‘ with some ‘istream‘ to make sure we’ve hit all the features properly. Particularly I wanted to make sure we showed how to ‘clear‘ an ‘istringstream‘ before calling ‘str‘ to set its ‘string‘ in the face of possible EOF or ‘fail‘ure.

A more general ‘input‘ method might read from a ‘string‘ instead of a ‘istream‘. This would prove useful in a GUI or network program, for instance. But having ‘input‘ come from a stream is more traditional and avoids a temporary buffer copy (see below).

Caveats and Tips

Other than the obvious uses listed above, we also noted that you can use an ‘ostringstream‘ to collect an entire object’s data into one ‘string‘ so that it fits into a single ‘width‘ specification on an ‘ostream‘.

We also saw that an ‘istringstream‘ could be useful to help an ‘istream‘ with more complex line inputs. (But it isn’t always necessary here if you are careful with your ‘istream‘ you’ll usually find a way to just work with it directly.)

But don’t take this as an ’in’ to read an entire file into a ‘string‘ and then start parsing that! Such a duplication of the entire stream buffer or even a whole disk file into a ‘string‘ is not only RAM and heap intensive, but really, really SLOW, too.

Wrap Up

In this chapter we’ve explored stream input and output more deeply than we had in the first volume. We briefly reviewed the basics of reading and writing files, but quickly dove deeper for a look at several aspects of the topic. We discussed what can go wrong during a file open attempt and how to deal with the most basic problems. We spoke of how to pass files to functions and why it is the way it is. We moved around in the stream and tracked our location therein. Then it was time for a deep look at how to arrange data in a file. And finally we looked at treating ‘string‘s as if they were streams making it easier to connect our C++ code to alternative front-end systems.

That’s it for this chapter, but if you want more on file topics, be sure to check out Appendix [files:extra] as well!

C++ Tools

operator Overloading

Some students ask at this time, "What exactly is ‘operator‘ overloading?" Well, it allows us to make our ‘class‘ types and some other types interact with the standard C++ ‘operator‘s just like the buit-in types do. This has the effect of removing much of the need for dotting method calls and also makes possible some advanced techniques of generic programming (see Chapter [defn:templ]).

An Example

For instance, we can take a ‘class‘ whose objects currently have to input and output like so:

All the operators

Rather than try to maintain my own list of ‘operator‘s for this ever-changing language,⁶¹ I’ve decided to link you to a few lists maintained by experts and enthusiasts alike.

It remains, then, to find out what exactly precedence and associativity are. Precedence is simply the order of operations when they are mixed together — which comes before others. For instance, multiplication has a higher precedence than addition.

Associativity is whether an ‘operator‘ processes its operands from left-to-right or rather from right-to-left. For instance, addition works from left-to-right but assignment works from right-to-left. This rarely affects us in daily programming, but can at times and is historically placed in this chart together with precedence.

Rules of Overloading operators

There are three sets of rules involving the overloading of ‘operator‘s in C++. They tell you things you can never do, things you have to do, and things you should do.⁶²

Thou Shalt Not

Thou Shalt Always

Thou Should Always

Again, you don’t have to follow these last three ’rules’, but they are strongly recommended!

Patterns for Unary and Binary

While the wikipedia article on overload-ability mentioned above does have prototypes of each ‘operator‘ in both member and non-member forms where appropriate, I’d like to talk a little about it here as well.

‘operator‘s come in two basic arities as far as overloading goes: unary and binary. I’ll show the general patterns for overloading in each arity in either member form or non-member form.

Unary operators

First, what ‘operator‘s are unary? Well, there are, of course, logical not (‘!‘) and arithmetic negation/opposite (‘-‘). But there are others that we don’t think about or haven’t yet seen as well: unary plus (‘+‘), ones complement (‘ ‘), dereference (‘*‘), and address of (‘&‘). In addition, although arrow for member access from a pointer (‘->‘) is a binary ‘operator‘, it is overloaded in unary style. I’m also going to leave increment and decrement to their own section ([defn:incr]) as they are a bit more complex.

I’ll show a general example of overloading for the ‘operator‘ for arithmetic negation, but the others would be the same overall. Let’s start with the member function form. Let’s say you were defining the function non-‘inline‘ as well. The declaration in the ‘class‘ — let’s call it ‘OOver‘ — would look like so:

Note that the function name is the keyword ‘operator‘ followed by the ‘operator‘’s symbol. Very unusual and not normal, but still intuitive. There can even be space between the symbol and the keyword if you like that:

Note also that there is no argument since there will be a calling object to negate.

Further, we mark this ‘operator‘ ‘const‘ to avoid changing that calling object. We want to send back a new object that represents the calling object’s negation — not change the calling object itself! This is just the way arithmetic works on numbers and variables in math and built-in type evaluations and we like to keep it that way.⁶⁴

But, in some compilers, you might receive an error message in a different form. I’ve seen compilers show problem reports like this ‘x.operator-()‘ rather than ‘-x‘ when the ‘operator‘ is one that has been overloaded by the programmer. This is odd at first glance and freaks out those new to ‘operator‘ overloading. I just wanted you to be forewarned in case your compiler was of this ilk.⁶⁵

If, on the other hand, you wanted the ‘operator‘ to be a non-member — perhaps even a ‘friend‘, you’d code the declaration like so:

Here we note that the ‘operator‘ function does need an argument because we are defining it as a non-member. Also, this declaration would go outside the ‘class‘ definition if we weren’t making it a ‘friend‘, as usual.

Also, we make the argument a ‘const‘ reference for speed and to protect it from change. This keeps that arithmetic style as we did for the member function version above.

And the call would look exactly the same as above. Unless there were a crazy error report. Then it might look something like this: ‘operator-(x)‘. Note that now ‘x‘ is an argument to the ‘operator‘ function instead of its calling object.

Binary operators

The binary ‘operator‘s are numerous and include every one of them besides the above and ‘?:‘ for deciding between two values. But to be complete, here is a list of the ones that will concern us:

They are in no particular top-to-bottom order or even left-to-right order. I just wanted to group them by general function category. You’ll probably note that the function call ‘operator‘ — ‘()‘ — is missing. It has arbitrary arity as it can have as many or few arguments to the function as needed. I’ll treat it specially in a separate section ([defn:fcall]). Likewise, I’ll treat the left and right shift ‘operator‘s (‘<<‘ and ‘>>‘) separately due to their special treatment in C++ as regards output and input ([defn:inoutops]).

Further, let me just blanketedly say that all the ‘bool‘ operations should not change their single operand and should result in a ‘bool‘ value. Also all the assignments should follow the style of their base ‘operator‘ (‘=‘) as we learned in the section on overloading that ‘operator‘ before ([defn:opasgn]) and ‘return‘ the left-hand operand by pure reference.

And, lastly, the comma ‘operator‘’s normal purpose is to separate two expressions where only one should normally exist. This is sometimes handy in a ‘for‘ loop head to have multiple index variables initialized and updated at once.

Again, let’s take a particular ‘operator‘ as an example. I’ll take addition (‘+‘) as my focus. As a ‘class‘ member, it would be declared like so:

Again, if not a ‘friend‘, you’d make that outside the ‘class‘ definition.

And those strange error messages might look like ‘z.operator(x)‘ for a member version or ‘operator(z,x)‘ for a non-member version. Note how the left-hand operand is either the calling object of the ‘operator‘ function or its first argument. This is always the case as we mentioned when overloading ‘operator=‘ ([defn:opasgn]) back in the dynamic memory section ([def:dynmem]).

Stream operators

Let’s say we had a very basic ‘class‘ for working with rational numbers (fractions — improper or otherwise). It would have two getters and a single setter — since the numerator and denominator might cancel with one another we must mutate them together, after all.

Imagining such a ‘class‘, let’s override ‘operator‘s for its input and output with streams. Their prototypes would look like so:

Here we see that the streams themselves act as the left-hand operand and that, therefore, we have to overload these ‘operator‘s as non-member functions as per the rule mentioned above.

Also note that the ‘Rational‘ argument to the extraction ‘operator‘ (‘>>‘) is by pure reference so we can change it but the one to the insertion ‘operator‘ (‘<<‘) is a ‘const&‘ for speed and safety.

Note how we can chain these together naturally. This is because we have ‘return‘ed the original streams from the functions to act as the next operated on object:

See how the input of ‘r1‘ results in ‘cin‘ which is then used in the subsequent input of ‘r2‘ and so on until the final resultant ‘cin‘ is just laid aside due to the semi-colon ending the statement.

And what do the functions’ definitions look like? We would definitely have ‘operator>>‘ look like so:

Here we call the setter after locally reading in the data format needed for the ‘class‘.⁶⁶ Then we use the success of the stream itself combined with the success of the setter call to decide whether to set the stream to a ‘fail‘ed state or not.

Once again we use the getters to display the parts of the ‘class‘ object onto the stream in proper format/notation. But we also use that last copy of the stream as our ‘return‘ value! Instead of having two separate statements where one sets the stream aside and the next picks it up to ‘return‘ it, why not just ‘return‘ it straight away?

Increment/Decrement operators

As mentioned before, the increment and decrement ‘operator‘s are a bit different. The reason for this difference is that they are actually two ‘operator‘s in one. Remember that you can place the ‘++‘ or ‘–‘ on either the left or right of the operand. In fact, these two positions have different behaviors — it isn’t just for looks!

When you pre-increment, you change the value of the operand to be one more than it was before and the result of the operation in the surrounding context is the new value. Thus:

will result in both ‘x‘ and ‘y‘ being 5. Pre-decrement is similar but you end up with the value being one less than at the start.

However, when you post-increment, you still increase the value of the operand, but the result in the surrounding context is the value from before the increment! So:

would produce ‘x‘ as 5 but ‘y‘ as 4. (And similarly for post-decrement.)

With that knowledge out of the way, let’s look at how to code these for a ‘class‘ of our own. Let’s keep working with the ‘Rational‘ ‘class‘ from before. As member functions, the prototypes of the two increment ‘operator‘s — the decrements are left as an exercise — would look like this:

Wait. What’s that unnamed ‘int‘ there? And why is one ‘return‘ing a reference and the other not? One thing at a time. Hold your horses.

The unnamed ‘int‘ argument is to distinguish the overloads from one another. After all, the names of the two functions are the same and the ‘return‘ type is ignored during overload determination. So the only way to distinguish them from one another was to add an unused argument. Unused? Yep. We not only don’t name it, we don’t even use it! It is a placeholder only — no functionality.

As to the reference ‘return‘ versus not, the post-increment is sending back the original value of the operand and so that can’t be the calling object as its value would have changed. But for the pre-increment, the value ‘return‘ed is the new value and we can go ahead and return the calling object itself. Since this object is guaranteed to still exist in the caller’s context, we can send it back by reference. Also, this is how the built-in types behave and we want to be as much like them as possible.

What that reference means, of course, is that you are allowed to chain the pre-increment if so desired: ‘++(++x)‘ would have the same effect as ‘x += 2‘.

So how would the definitions for these look? We typically do the bulk of the work in the pre-increment and then reuse that work in the post-increment’s definition:

I didn’t call the setter to check for cancellation because this is guaranteed to work based on the rules of basic arithmetic: we can’t end up with a new cancellation due to this addition.

As noted in the comment, we could just say ‘operator++()‘ to call for pre-incrementation in the post-increment definition. I merely used the ‘this‘ pointer since I was using it already and because it was a little shorter to type.

What if you wanted to code these as non-members? Then the prototypes would look like so:

Note that the unnamed ‘int‘ is still there, but moved to the 2

^{\textrm{}nd}

argument position.

As an aside, the fact that pre-increment does the bulk of the work and post-increment reuses that work with one or two additional copy operations leads us to use pre-increment for most of our loop work. The only time most folks these days use post-increment is for special occasions or on built-in types where speed is always guaranteed by hardware. On ‘class‘ types — like ‘Rational‘ here or ‘iterator‘s, we always pre-increment instead.

A Case Study in Compatibility

To talk to the point of "thou should" make your ‘operator‘s compatible with built-in types whenever possible/necessary, I’ll use the ‘Rational‘ ‘class‘ and ‘operator+‘ in binary form. Seeing as the integers are a subset of the rationals in math, we should probably make them inter-compatible in the programming world, too.

We’ll look at this in several passes to see several nuances of how these operations work and how constructors get involved as well. In all cases, the constructor and setter will call this function to make sure the ‘Rational‘ object is in a standard form:

Here I’ve called for a greatest common divisor of the numerator and denominator. This can be your own ‘gcd‘ function or the one from the ‘numeric‘ standard library. Then I fix up some other issues that might make the object less presentable — like negative denominators.

(This technically has nothing to do with the compatibility issue or the ‘operator‘ overloading, but I will show calls to it so I wanted it here for completeness. I won’t show the setter or getters, but they are pretty standard.)

First Pass

This first pass will involve a pretty plain constructor and 3 overloads of ‘operator+‘. One will handle the adding of two ‘Rational‘ objects. Another will handle adding a ‘Rational‘ to a ‘long‘. The third will handle adding a ‘long‘ to a ‘Rational‘. Wait. Isn’t that the same thing? Well, in mathematical terms, yes. But in the compiler’s eyes, no. The compiler doesn’t understand commutativity, you see. It’s been taught that idea for the built-in types, but not for any types you create. So if a type is commutative, we have to explicitly show that in our code.

I’ll make these member functions whenever possible. Here are the prototypes and the definition of the constructor:

The only thing with the constructor is that it is kinda cool. It has, after all, three call signatures! It can be called with two, one, or even no arguments. Just keep this in mind as it comes into play in later passes...

Notice that the first two use RVO to optimize away a local temporary and the third just rotates its arguments to call the earlier version. If we had ‘inline‘d all of these, they would be uber efficient!

Also note that the middle one could be as coded or could invoke the constructor in either of these forms: ‘operator+(Rationaln)‘ or ‘*this+Rationaln‘. These would make a ‘Rational‘ anonymous object from the integer argument and that would force a call to the first overload using our calling object as its calling object.

Since they ‘return‘ new objects arithmetically, we can even chain them together like so:

Of course, if you are going to do a lot of adding the same ‘Rational‘ number to itself, you should probably invest in an ‘operator*‘ or two. *smile*

A Second Take

This second version of overloading ‘operator+‘ for intercompatibility between ‘Rational‘ and integers looks a lot like the previous one. In fact, it uses all the same code except for one thing: the middle overload for ‘Rational‘ plus integer is missing!

All works fine — no compile messages or anything! What gives? Did we not need that other overload after all? Well, it would seem we didn’t here. So why is that? What is happening to make up for it?

Remembering what we said in that ‘operator+‘ about being able to create an anonymous ‘Rational‘ out of the integer parameter, we can start to see what happened here as well. The compiler likes the code we write to work out. And when it sees a type incompatibility, it looks really hard for a way around that. This automatic conversion between types is called coercion. It uses roughly the same means as type casting would, but is done on our behalf by the compiler instead of explicitly by us.

One of the standard paths used in this coercion process is to call an acceptable constructor to make one object from another type of data. And that’s exactly what happens here. Since our constructor can be called with a single integer argument, the compiler takes it upon itself to do so for us and — voila — a ‘Rational‘ is born to be used in a call to the first overload!

The ‘operator=‘ used is the compiler-provided one, of course, since we didn’t write our own this time.⁶⁷

One might ask, can’t we take out the non-member overload, too, and let the compiler coerce the left integer into a ‘Rational‘? Unfortunately there is a rule in coercion that a calling object cannot be created. So for that left-hand value, we can’t rely on the coercion of a new object. Or can we..?

Yet Another Version

This pass will make two furthers changes to the last one. I’m going to take out the last member ‘operator+‘ and alter the parameters for the non-member overload:

The rationale here is that, since a calling object can’t be coerced into being but an argument can, we’ll make both of the objects arguments and none as a calling object! It’s brilliant!

Wow! Who’d have thought that with a little help from the compiler and its coercion via construction we could make our ‘class‘ interoperable with integers in one function?

What, Again?

But the above won’t always work. There are ‘class‘es where such a one-argument constructor would be undesirable or even harmful! What, for instance, would happen if the ‘string‘ ‘class‘ had a constructor that could take a single integer and turn it into some sort of horrible space-wasting run of null characters of length equal to the integer plus one?! This could have happened if the standards committee had allowed the ‘char‘ to repeat on this constructor:

Now, I’m not saying that we shouldn’t be able to construct ‘Rational‘ objects from integers — that would be silly! But what if there were such a situation for some other ‘class‘ we were designing: the caller can construct an object from a single value when they really want to but it should never happen on accident.

The standards committee has us covered! They have a keyword for that, in fact: ‘explicit‘. If we place this mark before the head of a constructor, it tells the compiler that this constructor should never be used in a coercive way. It can only be used when a programmer has ‘explicit‘ly asked for such a construction, you see.

This keyword only has useful meaning on a single-parameter constructor, but I have seen compilers accept it on any constructor. I’d skip that sort of thing, though, if I were you.

So what would this look like on the ‘Rational‘ ‘class‘? And what impact would it have on our ‘operator+‘ experiment? Let’s take a look:

Note that we are back up to three overloads of ‘operator+‘ to make this all work. This is because no ‘Rational‘ objects can just spring forth from integers mixed into the ‘+‘ operations anymore!

Summing Up

In summation,⁶⁹ we’ve seen that a normal, one-parameter constructor — or at least a constructor that can be called with only a single parameter — can be used along with as few as one ‘operator‘ overload to make your ‘class‘ intercompatible with a built-in type. But if your constructor needs to be ‘explicit‘, you’ll need up to two more ‘operator‘ functions to make that intercompatibility happen.

This count is per built-in type, btw. If making your own ‘class‘ to handle string-like operations, you’d want many of them intercompatible with both C-string and ‘char‘ if not also the real ‘string‘ ‘class‘. That would need at least four extra or up to six extra overloads for each of those ‘operator‘s!

+= and Its Kind

Just as commutativity is only understood for the built-in types, short-hand ‘operator‘s like ‘+=‘ are understood only for the built-in types as well. So, just coding a ‘+‘ and an ‘=‘ won’t automatically garner you the ability to ‘+=‘ at will!

How do we code for these situations? Let’s take a look at a mythical ‘Complex‘ ‘class‘ for an example. One way might be like this:

I went ahead and coded ‘operator=‘ even though no dynamic members were involved, just for completeness of the example code. The only thing mine does that the compiler-provided one wouldn’t is to check the address of the calling object against that of the provided argument reference. The compiler’s version would just blithely overwrite the member data without a care!

Anyway, so what’s going on here is that both ‘operator=‘ and ‘operator+‘ are doing their own work and ‘operator+=‘ is relying on that work to be done well in its implementation. I also decided to avoid using ‘this‘ frivolously and just called the other ‘operator‘s manually.

Some people don’t like this version, however, because of the simple fact that there is a hidden temporary being created during the process. The result of ‘operator+‘, after all, is an anonymous object and has no permanent residence. But we pass it to ‘operator=‘ all the same. It has to be held onto, therefore, for the length of time that ‘operator=‘ needs it until it can be thrown away at the end.

Here we’ve switched the roles of ‘operator‘s ‘+‘ and ‘+=‘ in terms of who relies on whom. This time ‘operator+=‘ is doing all the actual work and ‘operator+‘ is reliant on the other.

Does this actually do better vis-á-vis temporaries? Yes and no. The temporary from before has been made more explicit here — no relation to the keyword — by making a temporary variable. But that just seems to me to hide the other temporary even better! The other one is still the result of ‘operator+‘ itself. If used in other situations, it will still hold dearly to life instead of just being used and tossed aside as was intended. So there is still only one hidden temporary, but there is also an explicit temporary as well.

Our only saving grace might be if the compiler decides to use the facility of named-RVO. This optimization is much like RVO except here the compiler must realize that the local named variable is just being acted on and then used as the ‘return‘ value and thus can be conveniently created in the ‘return‘ area for efficiency. While RVO is a mandate by the C++ standard (C++17), named-RVO is still just a strong suggestion by the committee.

Subscript operator

There are still a few special ‘operator‘s to talk about, though. First up is ‘operator[]‘ — the subscript or indexing ‘operator‘. This can be applied to an object to access its members. And those members don’t have to be encapsulated arrays, ‘vector‘s, or ‘string‘s, either! They can just be normal fields/members of the ‘class‘.

How? Well, the subscript parameter doesn’t have to be a ‘size_t‘, you see. You can designate that it be a ‘char‘ or ‘string‘, if you like. Then the programmer using your ‘class‘ can request a member by name or initial. Let’s examine this on our ‘Rational‘ ‘class‘, for example:

Here we’ve let the caller indicate they choice of member by a ‘char‘ parameter instead of a ‘size_t‘. This makes more sense because designating 0 or 1 or whatever for numerator versus denominator is odd to say the least.

Also remember that this ‘operator‘ must be a member function by standard mandate!

I’ve lowercased the initial to make sure we don’t worry about capitalization preferences of other programmers. Then I check if it is an ‘’n’‘ or a ‘’d’‘ — for numerator or denominator respectively, of course. The -42 is just an error flag in case they mistype and ask for member ‘’z’‘ or ‘’?’‘ or something other than ‘’n’‘ or ‘’d’‘.

A Second Form

The really careful observer will notice that this ‘operator[]‘ is ‘const‘ and so won’t change the calling object. Well, we also know that such a mark is considered when determining whether two member functions are overloaded or not, right? So, it is possible to overload ‘operator[]‘ a second time without the ‘const‘ mark.

This is often done to ‘return‘ members by reference for mutation. After all, the above form was kind of like an accessor (getter) an so it might be nice to have a mutator (setter) form as well.

This is done, for instance in the ‘string‘ and ‘vector‘ ‘class‘es so that programmers can change the elements in those containers if they are not ‘const‘ant containers.

Since this form will need to ‘return‘ a reference to whichever member, all those members need to be the exact same type. We’ll also need some way to ‘return‘ a reference when they don’t send a proper parameter. This bit is tricky, but we’ll find a new tool invaluable here!

Our first instinct is probably a local variable in the function. But this won’t work because that variable’s memory will cease to exist when the function ‘return‘s and we will no longer be able to refer to it!

The second typical thought is a member variable. This would work, but it would make there be an extra member for every object created of the ‘class‘ type. That seems excessive.

Okay, then a ‘static‘ member variable? This might work, but would still be ‘class‘-wide and generally accessible unless we made it ‘private‘. Hmm...what to do?

Well, it turns out that member variables and ‘const‘ants aren’t the only things that can be made ‘static‘. We can also make functions’ local variables ‘static‘. This has the effect of making them stay around even between function calls. They are generally only accessible within the function.⁷⁰ But we can ‘return‘ a reference to them since they do stick around between calls!

I could initialize it to, say, -42, but the user is unlikely to notice since they are using this for mutation purposes. And since a ‘static‘ local variable is only initialized once on the first function call, it wouldn’t stick, either. So I’m just leaving it as garbage bits. It’s the thought that counts, right? *grin*

Actually, I over-stated something just now. I said the caller was using this function for mutation. But that might not be the case. It turns out that the non-‘const‘ version of a method is used for any non-‘const‘ object calling the overloaded method name. It doesn’t have to be the ‘const‘ overload for access, after all!

But Haven’t We..?

Haven’t we broken encapsulation now? Why yes, yes we have. Note that the programmer using the ‘Rational‘ ‘class‘ can not only put new values in their object with this non-‘const‘ overload, but they are bypassing our old normalization code that made the rational be in lowest terms and look nice in a display. Se here, for instance:

Further, the programmer could put any old crazy thing in our members. As long as it looks like a ‘long‘ integer or some subset type, it’ll go right in there:

Now the ‘Rational‘ object has 53 as its numerator and some memory address in ‘long‘ form as its denominator! *eek*

Is this a good idea? Of course not! Then why allow it? Well, if we did have a member ‘vector‘ or the like and we were passing through our ‘operator[]‘ to it, we could do so by reference unless we were also protecting the content of the container as well as the container itself. What? That is, thinking as if you were actually coding the ‘vector‘ ‘class‘ itself, you sometimes want to protect the overall container but don’t care about the content inside so much. A ‘vector‘, after all, doesn’t care what is inside other than the type of it. What the ‘class‘ itself protects is the underlying dynamic array that holds the data — beginning and ending pointers, perhaps. The content themselves are irrelevant to the ‘vector‘.

A Further Twist

Furthermore, there are also members known as ‘mutable‘. These members are part of the ‘class‘ but don’t need to be treated as ‘const‘ant even when the rest of the data in the ‘class‘ is being so treated. What? Sorry, I’m just confusing you for fun now, aren’t I? Well...

Let’s take a new version of a ‘string‘ ‘class‘ that allows the programmer to choose whether comparisons are to be case sensitive or not at run time. This would be a great feature and we’d love to have it, right? But, in designing it, we think about ‘const‘ant objects. Should they be locked in to being whatever default case was set up or should they be allowed to change their case sensitivity on the fly even while maintaining the ‘const‘ancy of their content?

So, we have a ‘const‘ant ‘string‘ with, say, ‘"Jason"` in it. And we are comparing that to another value, say, ‘"jason"`. If the original object were set in case sensitive mode, this comparison would fail to see their equality. The human using the program would become very irate and we’d lose their business! But if we allow the object to change its sensitivity without changing the contents, this becomes a breeze. Just change the sensitivity and compare away! The contents would not be affected by this shift, but the comparison result is all better now.

Another example is that of a ‘class‘ representing a set of values. When we go to print these values, we might want to use standard notation like curly braces around the elements and commas between them. But in another application, we might want to change that up to be spaces between and angles around the list. Hmm... These new ‘string‘ members we’ll add to configure these things will possibly even change from place to place in the same application. Thus, we’ll want them to have mutators and not just constructor parameters.

But should they be able to change when the object is ‘const‘ant? Should the object’s display be able to change when the set content is locked into certain values? I can’t see a reason to do so, so we can set those members to be ‘mutable‘.

This new keyword is merely placed before the member’s type and let’s the compiler know that — even in a ‘const‘-marked function — this member can be changed. So we might have:

This tool is often used for any kind of maintenance data. Data that is about how we use the real data of the ‘class‘ but not that data itself.

Some of you are wondering why this is in the section on ‘operator[]‘. Well, it was the most opportune place to put it. It does, after all, fall into the discussion of encapsulation and member protection here. Also, you could use a ‘const‘ mutator version of ‘operator[]‘ to allow these things to change with some sort of name/initial to do so. At least if they are all the same type like the ‘string‘s for the set example.

What About any?

Some students say to me "I found this type on cppreference.com the other day called ‘any‘. Couldn’t we use that to allow for different types to be changed from the same ‘operator[]‘ with a reference ‘return‘?" Wow! You do come up with stuff when you go a-searching!

‘any‘ is a type found in the ‘any‘ library. It allows you to store literally any type of information in a single variable albeit still just one value at a time. To retrieve the information, you must use an ‘any_cast‘ which will either give you the value back or ‘throw‘ the ‘exception‘ ‘bad_any_cast‘.

This cannot be used here because an ‘any‘ contains a copy of its initializer rather than a reference to it. If you make a change to the ‘any‘, it won’t affect the original. So you can’t make changes to the original member variables — ‘mutable‘ or not — via the ‘return‘ed ‘any‘ object.⁷¹

Typecast operators

All this talk about encapsulation violations with the reference ‘return‘ing or non-‘const‘ version of ‘operator[]‘ gets me thinking about another possible issue we’ve faced in C++ over the years. And it also happens to be an ‘operator‘ issue!

This time it is a whole classification of ‘operator‘s known as the typecast ‘operator‘s. These ‘operator‘s have a distinct look and feel all their own and are used when you, for instance, do a ‘static_cast‘ of your ‘class‘ object into another data type.⁷²

For instance, we could overload a typecast ‘operator‘ to approximate our ‘Rational‘ ‘class‘ objects as ‘double‘ values. This would look like this:

as a member function. As a non-member function, it would need a single ‘Rational‘ argument — preferably passed by ‘const&‘.

Here we see that there is no ‘return‘ type listed as it is inferable from the name of the ‘operator‘ function. This, in turn, is the usual keyword ‘operator‘ followed by the desired type to turn the object into. The definition should be fairly self-explanatory by now.

(I assume we’ve overloaded insertion (‘<<‘) for ‘Rational‘s as above as well.)

A Side Issue

There is a continuing debate about ‘operator ‘ and a ‘Rational‘ ‘class‘. Should it be reciprocal — flipping the numerator and denominator like it flips the bits in a built-in integer — or should it be approximate — like the tilde over an

\approx

symbol?

I’ll leave that for you and your colleagues to decide, but just wanted to point it out and this seemed like a decent time to do it. (It also speaks to the "make your ’clever’ overloads ... as clear and obvious as possible" rule. Your idea of clear and obvious might not be the same as that of other programmers on the team.)

So What Was That About Encapsulation?

Some might think this would be a good way to get out a C-string representation from a ‘string‘ object and wonder why we use the ‘c_str‘ method instead. Well, let’s say ‘string‘ had this ‘operator‘:

If this mythical method existed, a particularly lazy or vengeful programmer might do something like this:

Here they’ve gotten out a C-string version and cast that pointer to a writable variation with C-style casting! Truly evil! But further, they pass that writable version to ‘strcpy‘ as the destination and store there something that the current ‘string‘ object doesn’t have room for. Diabolical!

What’s to stop them from doing the same thing with ‘c_str‘? Well, there are two provisions in the ‘c_str‘ description within the standard that rule this out. First, it says that the pointer will be invalidated (as with ‘iterator‘s) by passing a non-‘const‘ version of the pointer to any standard library function. Second, it says that writing to the pointed to array is undefined behavior. This is a biggie and implies horrible things for the perpetrator.

Function Call operator

First of all, ‘operator()‘ is the function call ‘operator‘. It is not the parentheses used to change order of operations or to group operations. It is not the parentheses used to prototype or define a function and delimit its arguments list. It is the parentheses used after a function name to pass actual arguments to the formal arguments and evaluate/run that function’s code.

By overloading this ‘operator‘, we make our ‘class‘ objects look and act like a function at times. After all, we’d place the ‘operator()‘ right against our object’s name just like we would to call a function by name:

(I didn’t put a semi-colon on that because I’m not limiting this function to a ‘void‘ ‘return‘. We could further use its ‘return‘ value in a surrounding expression, after all.)

Since our objects now look and act like functions, many will call them function objects and our ‘class‘ a function object ‘class‘.⁷³

But you may ask, "Why would we do such a thing? Why make a ‘class‘ object look and act like a simple function?" Well, the answer is three-fold:⁷⁴

So what is ’state’? That’s the set of variables that tell where a certain process was last at. If you got to a certain point in a process, you’ll want to remember where you were so that when you come back after lunch or break or whatever, you can pick up where you left off. The note you leave for yourself — physical or mental — tells the state of the system in which the process works. Variables do the same thing in a computer program.

In the situations we are looking at here, a single function call represents but a step in the process — not the whole thing. The most common occurrence is that we are working on a list of information from either a file, a ‘vector‘, or some other data source like a sensor hooked to the computer with a USB cord or Bluetooth. A call to the function will process a single item from this source of information and the state will keep track of how far that item got us in the overall process.

If we were adding or averaging the values, we would keep track of the total and count of values we’d calculated so far between calls, for instance. When the entire list has been seen and sent through the function, we’ll look at the state to tell the results.

We’ll treat the two approaches — functions and function objects — separately and see how each tackles these issues.

Function Objects

As a ‘class‘ object, we can solve these issues easily. Let’s tackle each in turn. But first let’s take a high-level view of our plan:

A Function Object class

Let’s start with a function object ‘class‘ to keep track of the largest value seen in a list being processed. Remember that our ‘operator()‘ function will see only one value at a time from this list and some other part of the program will be calling on us to update the state with each new value brought into the program.

Note that the member variables ‘max_so_far‘ and ‘count‘ record the current state of the maximum/largest finding process as each ‘next‘ value comes into the ‘operator()‘. But don’t be confused! The ‘class‘ has two ‘operator()‘ functions!

The second one with no arguments is for retrieving the most recently updated value of the ‘max_so_far‘ member variable. ‘count‘ is mostly for show and really just helps us at the first element of the list to make the right decision. After that it is not really useful/used. We relegate retrieving that value to a plain getter.

We also have two ‘reset‘ functions to parallel the two constructors. These take care of resetting to an empty or nothing-yet-processed list situation and resetting to a we’ve-seen-the-first-item-in-the-list situation, respectively. Both ‘return‘ the previous ‘max_so_far‘ to mimic the way the stream formatting functions would ‘return‘ the prior formatting settings before making the update to your requested new formatting setting. I always thought that was a cool technique and thought I’d use it here.

To determine a maximum for a set of values, let’s say they are in a ‘vector‘, we need a loop and a call to a function object of this ‘class‘:

If we wanted to find another maximum for another ‘vector‘, we don’t even need a second object. As long as they aren’t simultaneously coming in, we can just ‘reset‘ this object (‘max‘) and use it again:

Typical Usage

There are actually two typical usages of function objects when processing lists. One is as above as a countr or skimmer or just processor of the list’s data. It skims off information about the data as it rolls by.

Another usage is as a producer, generator, or originator of list data. A producer takes data from a source and relays it one piece at a time to the rest of the program. This source could be a file, a ‘vector‘, or some other source like an Internet connection to a server or user’s browser somewhere.

As we’ve already explored the ‘Largest‘ countr, let’s look at a typical producer. This one’s source is a random sequence!

Producers

I’ve decided to make the members ‘const‘ants to ensure that once a set of dice is created it won’t get changed arbitrarily. (Wouldn’t do to have that fancy magic axe suddenly doing less damage than it has all along, now would it?) Because of them being ‘const‘, these members must be set in the member initialization list. Not everything can be done with the in-‘class‘ initialization syntax! (Also remember that every object created can fill in these members with different values. The ‘const‘ just says that, once filled in via the constructor, the values can’t change for the rest of the object’s life.)

Although it might seem that the ‘const‘ on the ‘operator()‘ function itself relates to the above decision, it really is more a separate design decision to make it so that any object — ‘const‘ or not — can be used to roll the dice.

To produce a new value from the random sequence that is rolling a set of dice, just call ‘operator()‘ on the object:

Here I’ve set up a standard die roll of one 6-sided die by default and another object representing, perhaps, a battle axe wielded by someone who is a little too small to handle it. Then I set up the random number generator from ‘cstdlib‘ used in the ‘class‘ for simplicity.

Rolling these dice is just a matter of calling ‘operator()‘ as you can see. I even rolled a new die created on the spot which has 17 sides and no adjustment and is rolled alone.

An Example: Preserved State and Multiple Access Paths

If we combine these with the ‘Largest‘ object from before, we can find the largest value rolled over several experiments (as the statisticians would say):

Here the ‘max‘ object tracks the state between calls in the ‘for‘ loop and then reports with a different ‘operator()‘ call afterwards.

Then it is ‘reset‘ for another run with a different source/producer. Multiple access paths are also shown here with the multiple uses of the ‘reset‘ function and its overloads.

Further, we see preserved state in the producers themselves since the parameters for the production of new values is stored inside the objects and doesn’t need to be passed again to the function each time a new value is desired.

An Example: Multiple Concurrent States

Let’s add two more ‘Largest‘ finding objects — one each for the two producer objects. I’ll name them cleverly so we can easily keep track of which is for which.

Once again we roll the various dice of the producers 5 more times and find the largest rolls during this time. But this time the producers are run side-by-side in lock-step as it were. Their results aren’t separated in time but interleaved. We’ve tracked their sequences simultaneously instead of in tandem.

But this is hardly the way sources of information would arrive from, say, Internet connections or external data sensors. Those would be more sporadic and less regimented. How can we check for that? Well, I’ve now added a third producer called ‘flip‘ representing a coin rather than a die. It will tell us which source has come in this time over the course of the experiment:

As we can see, we have multiple access paths to preserved state in two sets of state simultaneously. A rousing success! (The two other sets of state show preservation but not really anything else at this point in the code.)

But Can’t a Plain Function?

Let’s try to do the largest-finding scraper we’d done first with the function object approach. That’ll be a minor challenge compared to a producer like dice rolling.

Round One

Remember that a ‘static‘ local will only be initialized on the first call to that function, so that we will store ‘next‘ directly only that first time. Thereafter, the ‘?:‘ will naturally test things before updating.

An application could now call ‘largest()‘ to retrieve the largest value seen thus-far — assuming that 0 was less than the current largest state, of course. Or they could call ‘largest(new_value)‘ to test and set a new value into the largest state.

But there is currently no way to reset the largest state for a new sequence of values.

Allowing Reset

Since the state within this algorithm continues to grow bounded only by the data type itself, we need an altered approach. Let’s try it this way:

But our guess of 0 for retrieval is not general enough, as was hinted at before. If they may have sent a sequence of negative values and then calling ‘largest()‘ would actually disrupt their accumulation of the largest value by arbitrarily setting it to 0. We’ve gotta try again...

Fixing the All Negative Case

Let’s beef up that ‘bool‘ parameter again with an ‘enum‘eration:⁷⁵

*whew* Now they can call ‘largest(new_value, Reset)‘ to reset the accumulation sequence or ‘largest(???, Retrieve)‘ to merely look at the current state — the ‘???‘ indicating that the value passed is arbitrary and will be ignored. Calling ‘largest(new_value)‘ will as usual evaluate the ‘new_value‘ into the state — i.e. test and update if necessary.

But having to pass a first parameter for ‘Retrieve‘ seems clunky at the least. Not cool...*sigh*

Less Clunky But...

Let’s give this another shot with a major overhaul. I didn’t want to, but I’ve had to bring out the big guns:

With a global variable marked ‘static‘, only the file in which this code resides can use it. Thus, this would go in an implementation file and the functions’ prototypes would go into an interface file.⁷⁶ We can’t even ‘inline‘ these functions since they need to access this global variable which has to be in a separate compilation unit to hide it from other programmers and their nefarious purposes!

This nearly covers all the features of the ‘Largest‘ function object ‘class‘ we originally defined. But there is still one hurdle to overcome, and it’s a doozy!

Multiple, Simultaneous Sequences of Data?

We still suffer the problem of only being able to handle a single sequence’s state at once. The function object ‘class‘ can be instantiated multiple times — concurrently — so that our program has multiple largest calculations happening simultaneously.

To do something like this with the function [set], we’d have to have much more book-keeping such as a ‘vector‘ of maximums. Further, we’d have to relate to the caller an index position to tell each of us which maximum was being looked at, reset, or adjusted on this call. This would not only be tricky for us, but also for the caller as they’d have to track the indices for each of their sequences.

This tracking could also be foisted off on the caller, of course. At first this option seems in poor taste, but it actually leads to cleaner design. They know the purpose of their concurrent streams and can, for instance, make an ‘enum‘eration or even use a container with both ‘string‘ names and maximums in each position so that the maximums have a nicely associated name rather than have us assign them arbitrary numbers. It will make our caller’s code bulkier, but more readable at the same time!

Either way, they’ll have to set a maximum state into the function with ‘largest_reset‘, update with any number of ‘largest(new_value)‘ calls available at this time, then retrieve and put into their container of maximums this state with ‘largest()‘. And they’ll have to do that for each maximum they want — that is for each sequence they need to track a maximum on — every time throughout the program!

In Summation

I think the winner is clear: the function object approach. It easily handles not only the state preservation issue for processing a list or sequence and allows for multiple access paths to that state, but it can be super easily made to handle multiple, simultaneous sequences in a single program without lots of bookkeeping mumbo jumbo.

And, not to foreshadow anything, but when we get to ‘template‘s later (chapter [defn:templ]), we’ll start to see the true power of this technique!

operators for Types Other Than classes

There are three other ways to form new data types in C++ that can be used in ‘operator‘ overloading: ‘struct‘s, ‘union‘s, and ‘enum‘erations. It does not, sadly, work on a simple ‘typedef‘ or ‘using‘ alias.

structs and unions

It is possible to overload ‘operator‘s for these kinds of types. But I say so with a couple of warnings...

As mentioned in the previous volume, due to their by default ‘public‘ nature, ‘struct‘s are only typically used when we have complete control over the use of those ‘public‘ members. We used it there, for instance, to house elements for ‘private‘ ‘vector‘ members so that we could remove a parallel ‘vector‘ situation. We kept the ‘struct‘ usage internal to a ‘class‘ and never used it to interface with the other programmers in the program.

A ‘union‘, if you’ve never heard of it, is like a ‘struct‘ where all the listed members overlap one another in a single memory space big enough for the largest member. So only one value is stored in the space at a time and we have to take care which one is valid for any particular instance of the ‘union‘. This kind of thing is used sometimes in language translation.

‘union‘ is even less used in modern designs than ‘struct‘s. Instead we focus on the use of the ‘variant‘ type introduced to the standard library in C++17. This is a type-safe way to store one of a set of possible types in a single object.

enumerations

It is also possible to overload ‘operator‘s for ‘enum‘erations. Perhaps a slight discussion of this type creation mechanism is in order before we talk to overloads for it. I’ll stick to basic ones here, but there are now other variations on this theme from recent standards releases.

Enumer-what?

An ‘enum‘eration is a way to create a set of ‘const‘ants under a single type name. They are a subset of integers and we rarely care what their values are — although we can control them if we wish.

For instance, if we wanted to create a set of ‘const‘ants to represent the days of the week, we might do so like this:

Here ‘THURSDAY‘ would take on the value 0 by default and each ‘const‘ant after that would increment by 1 so that ‘WEDNESDAY‘ would have the value 6.⁷⁷ All of these ‘const‘ants exist under the auspices of the type ‘WeekDays‘ and so could be stored in a variable of this type or passed to a function of this type and so on.

An advantage to this method of ‘const‘ant creation over just typing them up yourself is that they work nicely with ‘switch‘es. If you use a ‘WeekDays‘-typed expression in the head of a ‘switch‘, the compiler will check the ‘case‘s and make sure that you didn’t miss any of the possible values for this type.

If, on the other hand, you wanted sequential values starting somewhere else, like the months of the year, you could do something like this:

Here ‘January‘ is given the value of 1 explicitly and each ‘const‘ant thereafter is incremented by one more than the one before it such that ‘December‘ has the value of 12.

In an extreme case, you can give separate values to each of the ‘const‘ants like this:

You can also use specially valued ‘enum‘erations for bit values. We’ll see this in the appendix on bit manipulations ([defn:bitmanip]).

Overloading operators for Them

Let’s take as a quick example the case of simple traffic lights. These take on the possible colors green, yellow, and red. We could define this idea as an ‘enum‘eration like so:

Now that we have these, we might want to display them for the user. But if we just display ‘Red‘ on ‘cout‘, for instance, we’ll see 0 instead of the word Red or red. So we should overload an ‘operator<<‘ for this type like so:

Here we see an array of ‘string‘ objects paralleling the ‘enum‘eration values and being subscripted by the ‘TrafficLight‘ argument itself. This automatic conversion to ‘size_t‘ is simple coercion and comes naturally to the compiler — no typecast ‘operator‘ required.

Another situation might arise as well. What if we are moving from one state of the traffic light to another and want to do this conveniently with ‘++‘? This would be troublesome if we let the compiler work it out as ‘Red‘ would be followed by ‘Yellow‘ instead of ‘Green‘. Worse, ‘Green‘ would be followed by 3 instead of any true ‘TrafficLight‘ color!

Wrap Up

In this chapter we’ve learned about the idea of overloading ‘operator‘s for a programmer-defined type. Such types include not only ‘class‘es but ‘struct‘ures, ‘union‘s, and even ‘enum‘erations.

We learned both general and special patterns for overloading various ‘operator‘s and even what some new-to-us ‘operator‘s do in C++.

Further, we’ve explored the fascinating topic of function objects. It might not seem as glorious now as it will in chapter [defn:templ] on ‘template‘s, but we’re getting there!

Don’t forget to check out more on ‘operator‘s in the appendices [defn:bitmanip] and [other:opers] on bit manipulation and more advanced ‘operator‘s, respectively.

Other Tools

This chapter deals with some tools that will help you with design, debugging, and finding elegant solutions to programming conundrums. Let’s start with debugging, though, and work from there!

Assertions

Assertions come in two approaches from the two languages in our purview: C and C++. C used assertions at run-time to find problems as they were occurring. C++ decided to add the ability to use assertions to find problems before they occurred — during compile-time!

C: At Run-Time

Note that we already spoke in depth to the use of the ‘cassert‘ library’s ‘assert‘ technique for debugging help at run-time in the previous volume. Please see there for more on this technique.

C++: At Compile-Time

C++11 added the compile-time assert capability called ‘static_assert‘. This tool complements the C-style ‘assert‘ macro by allowing programmers to test things that are known at compile-time early in the system development rather than at run-time when it might fall on the user to see the problem.

It has two forms, either just ‘static_assert(bool_expr)‘ or ‘static_assert(bool_expr, "string")‘. If using the string variant, the string becomes the error message when the assertion fails.

Scope of a static_assert

Unlike the C-style ‘assert‘, which goes inside functions, the C++ ‘static_assert‘ can go even at global scope, ‘namespace‘ scope, or even ‘class‘ scope, as well. This can be useful to test things like the number of bits in a data type to make sure the plans you have for the system are acceptable or if they need to be changed:

(Of course, if you want a 32-bit integer, why not just go with ‘int32_t‘? This and other similar types can be found in the ‘cstdint‘ library header.)

This, of course, depends on the library supplying a nice ‘constexpr‘ value called ‘Version‘ in their ‘namespace‘. For more on ‘namespace‘s, see below.

Beyond Standard Tests

The idea of type traits is pervasive in ‘template‘ metaprogramming, but we won’t talk to that until much later (the end of chapter [defn:templ]). But we can start small with the ‘numeric_limits‘ Boolean property ‘is_integer‘. We can use this to test if a type that’s been defined by a ‘template‘ ‘typename‘ or by ‘auto‘ is of integral type or not:

Here I’ve also used the new C++11 tool ‘decltype‘ that tells us what type a program entity or expression has without compiling it or executing it. This type is usable in declaring further variables, constants, argument, or ‘return‘ types.

exceptions

An ‘exception‘ is there to handle the exceptional case. That is, the case that should never happen but we coded to look out for it anyway. The ‘throw‘ing of ‘exception‘s is more of an art form still than a science. There are lots of blogs and FAQs around the web that will tell you one thing or another. I’ll try to distill the best of this advice here for you.

When to throw

You ‘throw‘ an ‘exception‘ when something bad happened that the local function can’t really do much more with. This could be that you received an argument that can’t be processed with (‘invalid_argument‘, ‘domain_error‘, or even ‘out_of_range‘). It could be that something happened when you were processing that can’t be handled (‘range_error‘ or ‘overflow_error‘).

Whatever it is, you simply place the ‘exception‘ ‘class‘ you’ve selected in the ‘throw‘ statement inside your ‘if‘ that detected the problem. Make sure to give a descriptive string in the argument to the constructor:

And don’t forget that there are consequences to ‘throw‘ing an ‘exception‘ such as stack unraveling…

Stack Unraveling

When an ‘exception‘ is ‘throw‘n, the function call stack will unravel to where an appropriate ‘catch‘ clause can be found. That is, function after function will be tossed off the stack and all their progress laid to waste until a function where a ‘try‘/‘catch‘ group with a ‘catch‘ that matches the ‘throw‘n ‘exception‘ is found.

If this sounds expensive, it is. It is not something to be done lightly. Don’t just ‘throw‘ an ‘exception‘ because you think it’s cool to do so. Make sure you really can’t handle the problem locally or in some less intrusive way first.

trying to catch

We’ve talked about this previously (in the earlier volume). But if you run code that might be ‘throw‘ing an ‘exception‘ at you, it is important that you surround that code with a ‘try‘ block. This sets you up so you can actually ‘catch‘ the ‘throw‘n ‘exception‘ if one comes.

The ‘catch‘ block comes after the ‘try‘ block and lists at least the type of the ‘exception‘ that you expected to be ‘throw‘n at you. This should be listed in the documentation for the function. See, for instance, the listings for functions like string::at at a site like cppreference.com. It has a clearly marked section called "Exceptions" that tells which ‘exception‘ type is ‘throw‘n and under what conditions it might occur.

catching by Name

If you want to ‘catch‘ the ‘exception‘ with a name, you can do so by either value or reference — even ‘const&‘. The value is frowned upon as it will call the copy constructor and that might further ‘throw‘ if it is a memory situation that is the problem. But, once you give the caught ‘exception‘ a name, you can use the ‘what‘ method on it to recover that string that the ‘exception‘ was constructed with back at the ‘throw‘ site:

But that isn’t much better than letting the program crash from not ‘catch‘ing the ‘exception‘ in the first place. At least your program isn’t dead. But the user is no more enlightened, so…

Multiple catch Blocks

If you have several possible ‘exception‘s that might be ‘throw‘n in a ‘try‘ block, you can ‘catch‘ them one at a time with sequenced ‘catch‘ blocks all following the initial ‘try‘ block.

There is even a catch-all syntax! If you put ‘...‘ inside the ‘catch‘ head, it will ‘catch‘ any ‘exception‘ regardless of type. This is generally a bad idea, so avoid it as it could do you more harm than good!

It is also important to order these ‘catch‘ blocks correctly so that they don’t keep another related ‘exception‘ that you’ve coded for from being caught. The reason is that they follow the rules of inheritance (talked about later in chapter [inh:defn]).

Although we haven’t talked to this specifically yet, inheritance is a way for us to tell the compiler that two ‘class‘es are related to one another.⁷⁸ And then, if we ‘catch‘ the primary of those ‘class‘es before we ‘catch‘ the secondary, the secondary ‘class‘ will never be caught. All instances of the secondary ‘exception‘ ‘class‘ will be caught by the primary ‘catch‘ block. Be wary of this!

Re-throwing

If you don’t want to completely handle an ‘exception‘, but do want to add your two cents to the mix, you are more than welcome to ‘catch‘ the ‘exception‘ and then re-‘throw‘ it. To do so, just ‘catch‘ like normal and then end your ‘catch‘ block with the single statement:

This will re-‘throw‘ the same exact ‘exception‘ you caught for someone upstream in the function call flow to ‘catch‘ again.

This can be useful if you need to clean up something that you’d been doing before things get too out of hand — like closing a file to free up its file handle to the OS or deleting some dynamic memory to avoid a memory leak. That sort of thing. This might, in fact, be the only place to use the ‘...‘ ‘catch‘ syntax.

namespace Management

We’ve been using ‘namespace‘s since the first day of C++ — literally! Remember these?

But what is a ‘namespace‘? Could we use one for our own code somehow? When would that be a good idea?

What’s in a namespace?

As we learned so long ago, a ‘namespace‘ collects together many identifiers that belong together so that they don’t clash with other names than may look identical but refer to something totally different.

For instance, what if you were programming a game and there was a function to activate a player’s power called ‘pow‘. If the ‘cmath‘ library weren’t surrounded in the ‘namespace‘ ‘std‘ we could have trouble!

What kind of names can go in a ‘namespace‘ you might ask? Any kind! ‘const‘ants, functions, ‘class‘es and other data types, even variables if you really needed to – but that’s borderline tacky, so let’s just not. Heck, you can even put other ‘namespace‘s in there!

std versus Global

First, let’s talk about the two ‘namespace‘s that are given to us: ‘std‘ for the standard libraries and the global ‘namespace‘ to hold main and all our regular work. The committee works hard to make sure there are no name conflicts within and between these spaces — at least when they come straight out of the box. Since everything we write goes into the global ‘namespace‘, they can’t guarantee everything.

But, if we cause a conflict, we can use scope resolution to get around it. That is, we can use ‘std::‘ on a library symbol to force the compiler to use that name if we make one that looks just like it for some odd reason:

But this kind of conflict was caused because we let the ‘std‘ ‘namespace‘ out of the bag in the first place! It was that ‘using‘ directive’s fault!

using Directives

Let’s try to understand the ‘using‘ directive better — at least such as it pertains to ‘namespace‘s. First, its purpose in the above form is to tell the compiler that we will be using lots of names from a particular ‘namespace‘ and so we don’t want to have to scope them all. This brings all those names from the specified ‘namespace‘ out and mixes them with the global ones for faster and easier lookup.

If there were just one or two names from the other ‘namespace‘ we wanted to use, we could instead use this form of a ‘using‘ directive:

This brings just the specified name into the global ‘namespace‘ mix instead of all the names from the other space. This can be really handy when only a few names need to be used instead of five or more.⁷⁹

So far, though, I’ve shown the ‘using‘ directive placed at global scope — outside the main function or any function. We’ve also seen this directive placed inside a function when using ‘inline‘ functions in a library interface file. This doesn’t mix the brought in names with the global ‘namespace‘ but rather with the local scope names.

In fact, you can use this directive in any scope you like except ‘class‘ scope — or the like.⁸⁰ It will only mix at the current scope level and then disappear from view when that scope ends.

Writing Your Own

Writing your own ‘namespace‘s is pretty simple. You just have to remember the basic tenet from designing libraries: keep it all to a single theme! Don’t cluster a lot of stuff into a single ‘namespace‘ just because it’s convenient or all by the same coder or whatever.

To make a ‘namespace‘, just place the name of the space after the keyword ‘namespace‘ and enclose everything to be inside it in a pair of curly braces:

Note that unlike a ‘class‘ definition, a ‘namespace‘ doesn’t end in a semi-colon.⁸¹

Starting and Stopping

A ‘namespace‘ doesn’t have to be unique to a single file or place in a file, either. You can place some things inside a ‘namespace‘ at line 20 in a certain file and then close that off at line 30. Then, later on at line 70 you can reopen the exact same ‘namespace‘ and add more items to it. This isn’t necessarily the way we always do it, but see below for more on designing with ‘namespace‘s.

Actually designing with ‘namespace‘s can be a hurdle initially. What is typically done is to enclose the elements given in a library’s header ( or ) in the ‘namespace‘ and then enclose any non-‘inline‘ function definitions in the same ‘namespace‘ within the library’s implementation ().

Anonymity

Some ‘namespace‘s are left without a name! This is called making an anonymous ‘namespace‘. What could such a beast be used for?!

It makes the symbols global but only within the current file. This can be used instead of a ‘static‘ global declaration, for instance.

The main difference here is that only variables, ‘const‘ants, and functions can be marked ‘static‘ whereas a ‘namespace‘ can also include type definitions.

Nesting

When I said you can put anything in a ‘namespace‘, I wasn’t kidding! You can even nest one ‘namespace‘ inside another:

This can be done for organizational purposes like when a technical report feature is added before ratification. In many library implementations we had ‘std::tr1‘ well before technical report 1 was finalized and brought into the ‘std‘ ‘namespace‘ outright.

If you want a group of symbols to be collected together for organizational purposes but don’t want to force anyone to have to place ‘using‘ declarations all about to use them, you can ‘inline‘ your ‘namespace‘. This will mix the ‘namespace‘ content with that of the surrounding space for lookup purposes.

One nice place to use this is for library versioning. You can place the entire library in one ‘namespace‘ for general logistics and then place the current version of the library in an ‘inline‘, nested ‘namespace‘ within that.

Aliasing

If the name of a ‘namespace‘ gets too hairy to type — either by initial design or by nesting, you can use a ‘namespace‘ alias to give it a shorter name. The syntax is like a ‘using‘ alias but with the ‘namespace‘ keyword instead:

Here the ‘original_namespace_name‘ can include any scope qualifiers necessary to refer to a global or nested ‘namespace‘ as well.

string_views

We have two ways to represent strings of information in our programs now: the ‘string‘ ‘class‘ and the C-style string stored in a C-style array with a ’\0’ at the end. While the ‘string‘ ‘class‘ can take in a C-string, this copying is expensive in both time and space. So sometimes we might be wasting time writing overloads of a function to handle strings of information in two different forms. To simplify this and get rid of the excessive overloading, the C++ committee accepted the ‘string_view‘s proposal into C++17.

Compatibility

The ‘string_view‘ ‘class‘ provides a way to view a part of a string of information from afar, as it were. Often used on function parameters when a function needs to look at/examine but not change a portion of a string, it can represent a substring within either a ‘string‘ ‘class‘ object or within a C-string as necessary.

It does this by holding a pointer to the beginning of the view and a length/count of how many characters are in the view. This is sufficient and significantly cheaper to copy to a function than a whole ‘string‘ ‘class‘ object.

Utility

The ‘string_view‘ ‘class‘ provides much of the ‘const‘ utility of the ‘string‘ ‘class‘ such as iteration, element access, ‘length‘ determination, ‘substr‘ access, comparison, and ‘find‘ing functions. This means that a ‘string‘ ‘class‘ object passed to a ‘string_view‘ parameter won’t suffer loss of functionality within the function just as if it were passed as ‘const&‘.

In addition, when passing a C-string to a ‘string_view‘ parameter, the C-string data will suddenly gain these capabilities of the ‘string‘ ‘class‘. While they are available in the form of ‘cstring‘ library functions, that requires the duplication of overloading. Having just the one function taking a ‘string_view‘ instead of one each for ‘string‘ ‘class‘ and C-string is far preferred.

In Action

In each case the ‘string_view‘ parameter is initialized with an appropriate pointer and count for the initial string of characters. It’s on par for the ‘string‘ but a step up for the C-string which could not normally use a range-based ‘for‘ loop!

And Also

And there are a couple of extra functions available in the ‘string_view‘ ‘class‘ as well: ‘remove_prefix‘ and ‘remove_suffix‘. These advance the pointer or decrement the count respectively and so are quite efficient at their jobs. And they don’t disturb the underlying data and so are still part of a constant view into the other string source.

Lambda Expressions

Lambda expressions are like tiny functions. We generally use them to pass as arguments to functions like ‘find_if‘ or ‘for_each‘ from the ‘algorithm‘ library. Such functions take a function and use it on each element in a container to either process those elements or identify properties of those elements. We’ll talk about how to write such functions in chapter [defn:templ] on ‘template‘s.

But, back to lambdas, let’s look at various features of these tiny functions. And even how they can be as powerful as the function objects we discussed during ‘operator‘ overloading in chapter [defn:oper-ovrld].

Basics

The basic syntax of a lambda is a pair of square brackets followed by a pair of parentheses where you list any parameters the lambda takes followed by a pair of curly braces where statements go to do the lambda’s job. We’ll start with the arguments and work our way around.

Arguments and returns

The arguments to a lambda look like normal function arguments. They can be by value or by reference or even constant reference as necessary/desired. For instance, we could have the following situation:

This would take each ‘string‘ from the ‘vector‘ and display it on a line by itself. The ‘string‘ comes into the lambda by ‘const&‘ to protect it from change and avoid copying as usual. Notice that ‘for_each‘ takes an ‘iterator‘ range instead of the ‘vector‘ straight up. This makes it more general and allows us to pass pointers as well as ‘iterator‘s thanks to the ‘template‘ mechanism.

Similarly, ‘find_if‘ can be used to identify elements of a container that meet certain criteria:

Notice that the ‘return‘ type of the lambda is deduced automatically by the compiler from the ‘return‘ statement if any. If you want/need to, though, you can explicitly state it by using this syntax:

This use of the arrow is odd and so is its placement, but you’ve gotta follow the rules, right?

Captures

What if, though, the year wasn’t set by the application but entered by the user? How do we get our information into the lambda along with its parameter? We have to capture it! This is where the square brackets come in. Inside them you can list any local variables you’d like to use within the lambda comma separated and they will be available to the lambda throughout its lifetime. For instance, we could do this:

Okay, that was a silly use — we could have asked ‘words‘ for its ‘size‘. But it proves the concept, so…

You can also capture all local variables by value with ‘[=]‘ or all of them by reference with ‘[&]‘. This isn’t usually necessary and we prefer to list specific variables and types of capture.

If you’d like, you can call the captured variable by a different name within the lambda with a fairly simple syntax as well:

We can also capture by constant reference since C++14 with the syntax: ‘[&x=as_const(x)]‘. Or with renaming: ‘[&y=as_const(x)]‘.

Throw-Away versus Multi-Use

If you feel the need to use a lambda more than once, you can store it in a variable. The key to this working easiest is to use ‘auto‘ for the type:

Note the extra semi-colon after the lambda body to end the variable declaration and initialization. Now we can use predicate to pass to, say, ‘find_if‘ but not just for a single container, but over and over.

Is this better than writing an ‘inline‘ function? If you aren’t capturing anything, it is probably a draw. But if you are capturing something, it is a definite improvement. More on this below…

auto Parameters

For some extra power, you can even mark the type of parameters as ‘auto‘ and the compiler will figure out what they are as the lambda is called each time. So, for instance, you could do the following:

Here the ‘totaler‘ lambda is used first to add many ‘double‘ values and then to add some ‘long‘ values.

Mini Function Objects

The examples above with captures bring to mind the power of function objects to remember state and update it along the way. This is, indeed, true. So you might be able to get away with a lambda where a function object would have otherwise been the best deal. But note that the lambda mechanism cannot do multiple, concurrent states. Only the function object tool achieves this goal!

Wrap Up

In this chapter, we worked with ‘assert‘ions to debug program issues more quickly. Then we switched gears and used ‘exception‘s to report errors at run-time effectively.

Next up was managing code with the idea of ‘namespace‘s. And finally we learned about the quick tool lambdas to make function and even function object-like entities for those many throw-away situations where crafting a nice function or function object ‘class‘ would be too time-consuming.

Polymorphism

Run-Time Polymorphism

We’ve used polymorphism in the form of overloading and defaulting arguments for some time now. But run-time polymorphism is different in that it waits to take effect until the program is actually run by the user. The other two were handled by the compiler when the program was initially being put into binary form.

Run-time polymorphism requires the use of a tool called inheritance — so we’ll explore the basics of this first. After that we’ll look at run-time polymorphism itself. Then we’ll wrap up the chapter with a look at more advanced features of inheritance.

Basics of Inheritance

Inheritance is a means to reuse code from old ‘class‘es in new ‘class‘es.

Well, that’s a little shallow of a view, I suppose. But it is a useful view. And it is a place to start!

Concepts

When one ‘class‘ chooses to inherit from another for code reuse, it is known as the child ‘class‘ and the old one is the parent ‘class‘. We use these biological terms because the idea comes from biological inheritance, of course. When you were born, after all, you were a bit your mom and a bit your dad. Now that you’ve grown, though, you’ve become more than that — and you’ve improved on the base you were given, of course. *smile*

Although programming inheritance does allow for multiple parent ‘class‘es for a child ‘class‘, we’ll start with single parent examples. The child ‘class‘ chooses a ‘class‘ to be its parent because it it wishes to build upon or mould that ‘class‘’ behaviors and/or attributes. (Behaviors are methods (functions) and attributes are data members (member variables), of course. More on the building and moulding later.) The parent doesn’t even know there is a child ‘class‘. But the child has seen its parent in action and knows that those are acts it can use as a basis for its own. (Okay, the designer of the child ‘class‘ saw the parent’s code or usage and made the decision. Six of one, half-a-dozen of another…)

For instance, a color printer driver would probably start from a standard printer driver and then add the capabilities dealing with color. Most of the standard printer driver’s behaviors of sending information down the wire and such don’t need to be changed. Just enhanced with color information where appropriate/necessary.

Vocabulary

In terms of vocabulary, we will sometimes call the parent ‘class‘ the base, ancestor, or super ‘class‘. Likewise we’ll call the child ‘class‘ the derived, descendant, or sub ‘class‘. The last of each list of names comes from set theory. The first of each trio is from the computer programming usage of an existing code base and deriving new code from it. The middle terms are just a formalizing of parent and child.

Reasons

As the child specializes the parent’s provided code to its needs, it describes a finer subset of the objects its parent describes. Adding detail leads to a smaller set of possible objects, but this specificity was what the child was after. (We don’t write color printer drivers for use on black-and-white laser printers, now do we?)

This follows the model of a typical classification system like from biology or geology or such. Think of the overall group animals — quite large but very vaguely described. At each level of the classification system — Animalia, Mammalia, ..., Canis, lupis — we add descriptive requirements that narrow the focus of each group. This makes each group smaller than the previous groups — a subset of it. But the number of descriptors has increased. So the more accurate/detailed your description, the fewer things in the resulting group.

Design Perspectives

From a design perspective, the main question facing us is composition or inheritance? That is, should you design as a ‘class‘ which contains its data or which derives from its data?

To help you decide, let’s look at a classic example of the problem: point, circle, and cylinder.

Is a circle an object that has a point at its center or is it a point that has been extended out thru a radius?

Is a cylinder an object that has a pair of similar circles at each end or is it a circle that has been extended out thru a height?

This classic conundrum is one that has plagued programmers since the mid-late 60s. It isn’t an easy argument and no clear winner has yet been found. The debate rages on.⁸²

But how does it relate to composition and inheritance? Composition is a design by ownership: one ‘class‘ is composed of or owns an instance of another ‘class‘. Inheritance is a design by specialization: one ‘class‘ is a more specific kind of thing than a previous ‘class‘.

Therefore viewing a circle as an object that "has a" point at its center or a cylinder as an object that "has a" circle at each end is using a composition approach. Versus saying that a circle "is a" point extended thru a radius or a cylinder "is a" circle extended thru a height which is clearly inheritance.

In fact, software engineering uses these two catch phrases — "has a" and "is a" — to describe this predicament of composition vs. inheritance.

Syntax

Theory is all well and good — and it’s good for you to know — don’t get me wrong! But I know you all want to get to the syntax: how do we do this in code? So here it is:

We start with a parent ‘class‘ named cleverly ‘base‘. This has a member ‘a‘ of type ‘double‘, a constructor, and two printing methods. Then we have the child ‘class‘ named ‘derived‘. This ‘class‘ adds to the parent’s goodness a new member variable named ‘a‘ of type ‘long‘, its own constructor, and a printing method that looks suspiciously like one of its parent’s methods.

Note the syntax with which the derived ‘class‘ makes its inheritance known to the compiler. We use a colon followed by the keyword ‘public‘ and the name of the parent ‘class‘. This all follows the name of the child atop the ‘class‘ definition. There are several modes of inheritance, but we’ve chosen ‘public‘ here. This is not the default mode, but it is the most common. We’ll see the other modes and their effects a little later (Section [inh:modes]).

What Doesn’t Come Along

Note also that, although we said in the introduction to this chapter that the child inherits everything from the parent, this is theoretically and is not exactly the case in C++. In C++, inheritance doesn’t bring with it any constructor-like methods. This includes constructors and destructors. These have an intimate relationship with their ‘class‘ in assigning and cleaning up its members and won’t go along for the ride.

Some people will tell you that the parent’s ‘private‘ members are not inherited by the child ‘class‘. This is not exactly true. The child ‘class‘ gets memory copies of data members from the parent that were ‘private‘,⁸³ but cannot access any of the parent’s ‘private‘ members directly. This makes accessors and mutators for parent members particularly important! However, since you cannot use the parent’s ‘private‘ methods, you might as well not have inherited them, I suppose. *smile*

To make sure the parent’s data members are properly initialized, we must call the parent’s constructor or mutators to do so. If we take no action, the compiler will call the parent’s default constructor on our behalf as a child object is constructed. But, if we want to construct our parent data members differently, we need to call a different parent constructor. It would be awkward and inefficient, after all, to let the compiler default construct the parent members and then turn around and mutate them in our constructor body!

To call a different parent constructor, then, we place the call in the member initialization list of the child constructor. An example of this is given above for ‘class‘ ‘derived‘ calling its parent’s constructor with a non-default parameter.

What’s It Look Like?

There are two ways to look at inheritance. One is via a memory diagram. This diagram shows what data members are inherited and where they reside in terms of which ‘class‘ brought them to the party. The other is an inheritance diagram which shows the organization of the ‘class‘es involved from a base down to any derived ‘class‘es.

Memory Diagram

The memory diagram to the side represents an object of the ‘class‘ type ‘derived‘ as coded above. It consists of two member rectangles and two containing boxes around them. The member rectangles are labeled with the name of the member and its type in parentheses. The containing boxes are simply labeled with the ‘class‘ type of that container in parentheses.

The ‘base‘ box is smaller as it contains just one member from that ‘class‘. The box for ‘derived‘ is much larger as it contains not only its member variable rectangle but also the ‘base‘ box to represent its inherited memory.

The two containing boxes share an upper-left corner because they coincide in memory at the same address. This is, of course, because they are part of the same object. We’ll see this more explicitly by running some of the upcoming code examples by printing the ‘this‘ pointer for our objects.

Inheritance Diagram

At right here you see another diagrammatic view of inheritance. This one is generically called an inheritance diagram or tree. So called because we modeled it after a family tree used to visualize human inheritance. The parent is placed at the top of the diagram — ‘base‘ here — and child ‘class‘es are linked below this by lines sometimes called edges. These would spread out wider if there were more children immediately below the parent class. It would extend deeper if there were children of the top parent’s children. We’ll see examples of this shortly (section [inh:hier]).

Optionally we mark off to the side the members of each class. Here we list the data members on top of a bracketed area and methods below a dotted line. This can get quite bulky so we don’t always do it. But it is sometimes illustrative as it helps us visualize what members are inherited from above.

Some people like to put arrows on the edges connecting ‘class‘es. I’ve not done that here because there is a bit of a tussle as to which end the arrowheads should be at. The most popular at this time is the UML-style diagrams which place the arrowheads at the parent showing that the child’s methods and such are inherited from the parent. I’ll leave it to you to decide which way the arrows should point in your diagrams.

Inheritance Hierarchies

Although the above is considered a minimal inheritance hierarchy, generally we talk about such a thing when there are 3 or more ‘class‘es involved. For instance, we could give the ‘class‘ ‘base‘ above a second child like so:

I’ve eschewed the sidebars on ‘class‘ members to keep it clean and readable here. Again, arrows are preferred, but I’m leaving the direction of them up to you. It could depend on the project leader for your team or on corporate mandates or who knows!

Another way to have a decent hierarchy would be to have the third ‘class‘ be a child of the first child like so:

These two varieties can, of course, be combined to have almost any size and shape of hierarchy you can imagine. But, the larger the hierarchy, the harder it becomes to visualize for the human programmer. It becomes difficult to remember what capabilities you’ve received from parents and grandparents and so on, for instance.

It is also important to remember that a child chooses a parent in this system! This means the parents’ don’t know anything about their children or those ‘class‘’ members. And each child knows nothing about any siblings they might have, either!

Overriding and Hiding

When a child makes a method with the same name as a parent’s method, there are two possibilities and two vocabulary terms we use: hiding and overriding.

When the signatures of the methods differ, we say that the child method hides the parent method. This makes it so that when a child tries to call the parent’s version of the method, the compiler gives an error and says there isn’t such a signature for that method name. Basically, from the compiler’s perspective, it looks at the child class from the bottom of the hierarchy. With all functions having the same name aligned atop one another, the compiler can’t see the parent’s version through the child’s.

We can get around this issue fairly easily, but the syntax is ugly. Let’s say that our ‘derived‘ ‘class‘ from above had another ‘print‘ method with this signature: ‘void print(void) const‘. This method would hide the parent’s ‘print‘ method. We could still call the parent’s method, however, but we have to add scope resolution to the method name in the call like so:

Here we can call the ‘base‘ ‘print‘ method directly off a ‘derived‘ object. Without the scope resolution on the method name, the ‘derived‘ version of ‘print‘ would be called.

Implementing Overriding Functions

When you implement an overriding function that needs to call to its parent’s version, you can’t just call that function within your own. That would look like a call to yourself and this kind of unintentional recursion would be BAD! Instead, use the scope resolution notation to call your parent’s method:

You can’t use the parent’s ‘a‘ directly, either, — even with a scope resolution like ‘base::a‘ — because that variable is ‘private‘ to you as well!

Another Way to [Un]Hide

Another way for the ‘derived_too‘ ‘class‘ itself to unhide an ancestral method is to have a ‘using‘ declaration like so:

This instructs the compiler that the parent’s ‘print‘ method is to be visible to users of the current ‘class‘ as well. This makes it so that the caller doesn’t have to do a scope resolution at all:

The Full Story on Overriding

What about the other ‘print‘ method that ‘derived‘ already has? Well, some would say that, since the signature is identical to that of the parent’s method, the child method overrides the parent method. But others — particularly those in C++ — still use the hides terminology. I believe the reason for the schism is that most object-oriented programming languages are automatically polymorphic but C++ is not. C++ allows you to choose polymorphism or plain inheritance.⁸⁴ We’ll see the effects of this in just a minute, in fact (section [inh:weird]). C++, you see, reserves the override terminology for polymorphic functions only. But as the bulk of the OO community assumes polymorphic functions, they don’t distinguish this possibility.

Weirdness

If you code up the functions for the above hierarchy, you can see the effects of the scope resolution on the ‘print‘ call on object ‘od‘ of ‘derived‘ type. Let’s say that the ‘base‘ method was coded as such:

So what happened to the rest of the data? Where is the ‘long‘ member in this reporting? Hold on! One question at a time…

Normally, when ‘od‘ calls its own methods, its ‘this‘ pointer is a ‘derived*‘ typed pointer. But when we did the scope operation on the ‘print‘ method to call specifically the ‘base‘ version, this turned ‘od‘’s ‘this‘ into a ‘base*‘ version of itself. The compiler had to do this because the ‘base‘ scoped function expects a calling object that is a ‘base‘, after all. How does this work?

Note how the ‘base*‘ pointer points to the address of the ‘derived‘ object in memory. But it doesn’t see the whole object! It only sees the upper-left corner where the data inherited from ‘base‘ is located. This restricted view allows any child ‘class‘ to be seen as a parent object.

The opposite is, however, not true and is disallowed by the compiler. That is, you cannot look at a ‘base‘ object with a ‘derived*‘ pointer. This would try to view more memory space than was actually allocated/initialized at that address. This would be disastrous and so even the compiler says "No!" with a solid error.

In fact, this issue won’t just creep up when the compiler has to change the pointer type of ‘this‘ for a scope-resolved call. It also happens at three other times. See the below functions, for instance:

Then the ‘base‘ version of ‘print‘ will be called in all cases. This is because the arguments to all three functions believe themselves to be ‘base‘ objects or references/pointers to such. The only one that is correct is ‘app_print_two‘, of course. It implicitly copies the ‘base‘ portion of the ‘od‘ object to a new ‘base‘ memory location. But with just inheritance, C++ can’t tell. How can we fix this issue?!

Polymorphism

This section could equally well have been called Overriding II — Overcoming the Weirdness. But polymorphism is the more proper terminology. What is polymorphism? Well, as we learned in the first book in the series, polymorphism means literally "many forms". By this we mean that a function name can behave in different ways depending on the types/arguments involved with it. And we’ve seen examples of polymorphism in other aspects of C++ already: function and ‘operator‘ overloading, default arguments, and even basic ‘template‘s from the earlier volume. But those are all handled at compile-time. This type of polymorphism that allows a function to have multiple forms based on an explicit inheritance relationship is also known as run-time polymorphism. This is because it doesn’t take place during compilation like the others but while the program is being run by a user.

[Run-Time] Polymorphism in C++

We also mentioned above that many object-oriented languages are automatically polymorphic and C++ allows the programmer to choose such behavior. This is done by placing the keyword ‘virtual‘ on the prototype or ‘inline‘ definition of a function we would like to be polymorphic. For instance, let’s say we had a ‘class‘ ‘A‘ that had two methods — ‘f‘ and ‘g‘. Let’s let ‘f‘ be a regular function and let’s make ‘g‘ polymorphic:

Nothing changes with respect to ‘A‘ ‘class‘ objects in the program at all. We call ‘f‘ and ‘g‘ normally and they act in no way differently here.

So what’s the point? It’s from the inheritance-based polymorphism. Let’s take ‘class‘ ‘B‘, for instance, that inherits from ‘A‘:

Note that ‘B‘ also has ‘f‘ and ‘g‘ with the same call signatures as in the parent ‘class‘ (‘A‘). Now when we look at some calls to ‘B‘’s functions, we start to see the difference:

Oh, no, those are normally like that. I meant when we call ‘B‘’s functions with respect to a parent pointer or reference:

Wait, that still didn’t do anything new! That’s the same weirdness as before. Well, of course — that was the non-‘virtual‘ function. Let’s call ‘g‘:

Now that’s a horse of a different color! Suddenly, with that ‘virtual‘ keyword on it, calls to ‘g‘ remember whether the calling object — even when pointed or referred to — is of the parent or child type and calls the proper overridden method! This takes a lot of effort on the compiler’s part, so it isn’t necessarily something we do with all functions, but it is nice when we need it to work this way.

Again, this all works only via a parent pointer or reference and only when the keyword ‘virtual‘ is placed on the function.

What’s in a Keyword

One other thing, once a function [signature] is made ‘virtual‘, all children’s overrides of this function are automatically ‘virtual‘. The child ‘class‘ doesn’t have to mark the function ‘virtual‘ again. The compiler (nor other programmers) won’t be upset, however, if you continue to mark ‘virtual‘ further on in the hierarchy.

Also note that the ‘virtual‘ keyword doesn’t have to appear on a non-‘inline‘ definition of the function. The compiler remembers and hooks things up correctly anyway.

Another way to go is to use the keyword ‘override‘. This is marked at the end of the function head instead of the beginning, however. This keyword also need not be repeated on non-‘inline‘ definitions. (And it may be combined with the ‘virtual‘ mark if that rocks your boat!)

The Destructor

And, BTW, of all the functions you could ever make ‘virtual‘, the destructor is the most important one! If the destructor is not ‘virtual‘, after all, the pointer we ‘delete‘ will just call the base ‘class‘ destructor which wouldn’t clean up all the object’s data were the pointer pointing to a descendant object rather than an actual base ‘class‘ object. With the destructor made ‘virtual‘, however, the correct destructor is called and a lovely chain of destructor calls is automatically started which will follow our ancestry back up to the uber-parent. (This is akin to the chain of default constructor calls that would have happened had we not called appropriate constructors from our parentage in our member initialization lists for ourselves.)

What if you don’t have any dynamic members? Who cares! Make an empty ‘virtual‘ destructor. If you care anything about your children or their children, you’ll go to this minimal extra effort!

Okay, if you control the entire design of this hierarchy and you know that a certain ‘class‘ is terminal — at the bottom of the hierarchy — then you don’t have to worry with it. But if you are just making a ‘class‘ in a library that someone using said library might derive from, please be considerate and make a ‘virtual‘ destructor.

The VMT

The way this all works is a ‘virtual‘ method table (VMT). The basic idea is that there are entries in a 2D table where each row’s first part indicates a type from the inheritance hierarchy and the last part indicates the address of a method in memory that is associated with that type. Such a table is only made with respect to ‘virtual‘ functions, of course, as it is a bit more expensive to have the program sift through this table for the right row to find the function address than to just fit the function address right into the code when it is called.

A full understanding of how this works is best left until you’ve taken assembly language and done a proper dispatch table.⁸⁵

A Full Example

On the companion website, you can find this example program which not only demonstrates the above points with runnable code, but further points out how the chain of constructors and destructors works as mentioned above. This is done by printing the ‘this‘ pointers of the objects in constructors and destructors. As noted in the comments, you can place this code side-by-side with your terminal and note which hexadecimal addresses correspond with which objects and trace what is happening there.

Polymorphic Dispatch

An interesting side-effect of ‘virtual‘ function processing is that a polymorphic function called from a non-polymorphic function still acts polymorphically! This is known as polymorphic dispatch for odd historical reasons.⁸⁶ I prepared an example, but it is a little long so I’ve placed it on the website here.

As you watch it run, note that the calls to ‘base_only‘ for the derived object ‘doo‘ are still polymorphic. There are other good notes in the comments as well. Be sure to read through those and maybe take some of the experimentation suggestions!

Abstract classes

When we inherit functions from a parent ‘class‘, we effectively inherit a little language. It is a language that people who see a parent pointer or reference know will be spoken by the object at the end of that pointer. And if the functions are ‘virtual‘, the objects will even be able to use the language appropriately to their current circumstances — rather than just mimicking what their parents always said.

Some ‘class‘es even serve only a purpose of specifying the language all their children should speak. And if such a ‘class‘ has nothing specific to say about/in this language, it shouldn’t introduce erroneous verbiage. By this I mean that we shouldn’t put in a body to the function that does something incorrect or silly. Instead, it should make such ‘virtual‘ functions "pure". This way the language will be specified, but uncorrupted.

A pure ‘virtual‘ function is a ‘virtual‘ function with no definition. Not just a function prototyped to be defined later. Not just a function never called that we forgot to define. But a function which is planned to never be defined.

Take, for example the concept of a random distribution. This phrase brings to mind a set of values with corresponding probabilities. The primary functionality (spoken language) of such sets is generating (picking) a variate (value) from the set probabilistically. But, just knowing this doesn’t give us a reasonable way to do this activity in a general way. We have to know more about the particular random distribution in order to generate a value from it. Is it uniform? Is it Poisson? Is it Weibull? More information will be necessary.

Similarly, all shapes have a concept of area: area under a curve, area of a 2D figure, surface area of a 3D figure, etc. But just knowing that something is a shape doesn’t give us enough information to calculate its ’area’. I need to know what kind of shape it is in order to calculate the proper area.

A ‘class‘ having a pure ‘virtual‘ function, BTW, is known as an abstract ‘class‘. A ‘class‘ which is abstract cannot be instantiated. That is, you cannot declare an object of an abstract ‘class‘ type. After all, if you did, what would the compiler do if you were to try to call that pure ‘virtual‘ function — the one without a definition! That would be all kinds of bad, right? So that isn’t allowed — ever.

So an abstract ‘class‘ is just there to guarantee a language for all of its descendants as we mentioned before. How do you know they speak this language? Well, we can still point to or refer to them via the abstract ‘class‘ type. That is guaranteed by inheritance itself.

Oh, and any ‘class‘ that isn’t abstract is sometimes also called concrete. If it is derived from an abstract ‘class‘, it must have defined the pure ‘virtual‘ function in order to have become concrete.

I’ve prepared another long example. It is based on the random distribution idea above. As you look it over, you’ll see that the ‘RandDist‘ ‘class‘ is abstract having the pure ‘virtual‘ function ‘generate‘. In fact, that’s almost all it has other than the ‘virtual‘ destructor — not even any member variables! That’s because it doesn’t know enough yet to need any data — it is just the concept of a random distribution. It just needs what it can describe to the compiler and that is that generating a random variate creates a ‘double‘ from nothing at all without harming the object doing the generating.

Note especially how the pure ‘virtual‘ function is designated: at the end of the function head an equal sign and a zero are appended. This ‘=0‘ syntax is the official C++ way to designate a function that will never be defined.

When you run it, you’ll see that values are randomly generated in the proper way for the given [concrete] distribution type provided. Note also that the pointer used to ‘generate‘ variates doesn’t have to be pointing to a dynamically allocated object. This is shown with the final two examples in main.

Polymorphic Containers

Let’s build something from our last example that’s a little bit more powerful. As you recall, we’d added the concept of an abstract base ‘class‘ to our repertoire allowing our entire hierarchy to speak a common language based on the [pure] ‘virtual‘ functions provided by the [abstract] base ‘class‘.

But before, we just created and used individual objects of our children’s types. Now let’s go for something more ambitious.

to create a polymorphic container of various ‘RandDist‘-derived objects. This one is a C-style array for simplicity, but it could very well be an ‘array‘ ‘class‘ object or a ‘vector‘ as well.

This array can contain pointers to any object derived from ‘RandDist‘. Its elements can’t point to ‘RandDist‘ objects, of course, since that ‘class‘ is abstract. But any concrete descendants can be pointed to since the base type pointer can point to any derived type. This makes us able to ’contain’ (indirectly) multiple [related] types of data!

How do we fill in this container? We would normally have the user choose which distribution they wanted to add from a menu, but let’s keep it simple and just prove the concept. We’ll place ‘new‘ ‘Uniform‘ objects in all even slots and ‘new‘ ‘DieRoll‘ objects in all odd slots like so:

Then, when we go to display the results, we’ll describe the variates ‘generate‘d with a simple ‘?:‘ operation:

(Recall the ‘make_em‘ function needs a description ‘string‘ to display above the ‘generate‘d variates.)

Why didn’t I use a ‘?:‘ to fill the array in the first place? Well, that would have required some typecasting and we don’t quite have the proper tool for that. We’ll get to it in the next section, though, so stay tuned!

(Recall that the ‘done_with_it‘ function will ‘delete‘ each pointer and set it to ‘nullptr‘ for us as well.)

To save you having to code up these tweaks to the earlier program, I’ve posted it to the website here.

Enter dynamic_cast

The ‘dynamic_cast‘ ‘operator‘ will check if an actual object pointed to is of the type we are testing for. This looks something like so:

If the object pointed to is of the desired type, the address will be returned in the guise of a properly typed pointer. We can store this and use it to call functions more specific to the desired type.

In fact, ’is’ here is quite general. To the compiler, a pointer is of the correct type if the true type is any descendant of the requested type. So when we store the new pointer, we can narrow our knowledge further and use a more limited language shared in a sub-tree of our hierarchy!

What if the actual pointer points to a different type? ‘nullptr‘ is returned instead of the original address.

How can we use this? Well, we can use it as a traditional typecast for dynamic objects like so:

This greatly shortens the allocation of our objects from before. Also, technically only one of the pointers needs to be cast to ‘RandDist‘ here. Once that is done, the compiler realizes that the other half of the ‘?:‘ can be that type, too. But without the cast at all, the compiler is at a loss that the two seemingly unrelated types of ‘Uniform‘ and ‘DieRoll‘ can be made synonymous. I just put in both casts to make it look nicer.

But we don’t have to keep it all regimented to even/odd positions anymore! With the testing facility of ‘dynamic_cast‘, we can place the objects’ pointers more randomly. Let’s use a coin-flip to do it like so:

Here we have to cast the ‘generate‘ result to ‘short‘ to account for the fact that all ‘generate‘d values are ‘double‘ but that type doesn’t like to be exactly ‘==‘ anything. When the ‘flip‘ comes up a ‘1‘, we’ll store a ‘Uniform‘ pointer. And when it’s a ‘2‘, we’ll store a ‘DieRoll‘ pointer.

But now, since we don’t know where the various children objects are at in the array, we’ll have to test for them with ‘dynamic_cast‘ when deciding the descriptions:

Here we test for a ‘Uniform‘ object at the end of the ‘dists[d]‘ pointer and if it really was of that type, we describe with ‘"uniform"`. Otherwise we use ‘"dice"`.

This is still not the full power of ‘dynamic_cast‘ being used, but it is a good start. We’ll see more on this in a couple of sections, too.

Oh, and here’s the full example for you so you don’t have to tediously make all those changes to the previous example.

But What About typeid?

The ‘typeid‘ ‘operator‘ can be used to access run-time type information.⁸⁷ But this is available only if you the ‘typeinfo‘ library. This ‘operator‘ can identify any type in the entire program with a ‘type_info‘ ‘struct‘ure object. In particular, it is typically used with its ‘==‘ ‘operator‘ to tell if the object at hand is of a particular type like so:

This seems at first a little weird as wouldn’t we already know something was a ‘double‘ or not? Yes, most of the time. But in a ‘template‘ (which you saw in the previous volume) or in a dynamically typed polymorphism situation (like we’ve been dealing with), it can make sense.

So, for instance, we could check if our ‘dists[d]‘ pointer from before actually pointed to a ‘Uniform‘ object with a bit of code like so:

Some people are fond of using the ‘type_info‘’s ‘name‘ method which returns a ‘string‘ representing the data type. But it is not just the name as you’ve typed it into your program. It has undergone a process called name mangling. This adds other information to the ‘string‘ that the compiler felt vital to know. For instance, on my compiler, the mangled ‘name‘ of the ‘DieRoll‘ ‘class‘ is ‘"7DieRoll"` and that of ‘Uniform‘ is ‘"7Uniform"`. But when I change the name of the ‘DieRoll‘ ‘class‘ to ‘Dice‘, the ‘name‘ comes out as ‘"4Dice"`. It would seem my compiler wants to know how many characters are in the name of the ‘class‘ without having to count them..? Weird…

Anyway, here is an example on the website with the ‘typeid‘ implemented in the ‘for‘ loop instead of ‘dynamic_cast‘. Notice that ‘dynamic_cast‘ is still used to initialize the polymorphic array elements in the first ‘for‘ loop. Be sure to play around with the currently commented out ‘.name()‘ variation to see what your compiler does with name mangling!

Although the ‘typeid‘ code is shorter than the ‘dynamic_cast‘ approach, it does require more heavy lifting behind the scenes. Because by having to the ‘typeinfo‘ library we’ve incurred a huge expense to gather and have available ‘type_info‘ data for all the data types used in your entire program — builtin types and all.

The ‘dynamic_cast‘ mechanism, on the other hand, is managed via the VMT mechanism that the compiler is already using to manage your polymorphic function calls. This is no more overhead and can lead to faster runs and smaller binary sizes. Also, as mentioned before, ‘dynamic_cast‘ has another property that ‘typeid‘ just can’t replicate. More on that next…

When Not to be Polymorphic

But it occurs to us that the whole ‘dynamic_cast‘ versus ‘typeid‘ thing to determine the description could be avoided by just implementing a ‘string‘ member in the base ‘class‘:

Now the programmer using any descendant of ‘RandDist‘ will be able to just ask it to ‘describe‘ itself. And it doesn’t even need to be polymorphic! We just make sure the children pass their desired description ‘string‘ to their parent’s constructor call like so:

Here we’ve even let the programmer making the object name it differently if they so choose.

Now it doesn’t need a ‘string‘ parameter and just calls the ‘describe‘ method to get the description for the values being generated. And, calling it is far simpler:

But why do we have all this polymorphism still hanging around? Well, we do still need the polymorphic behavior for ‘generate‘, after all. What I’m saying with this change is that you need not rely on polymorphism for everything in your hierarchy. Sometimes simple solutions are best.

However, I also promised to finally show you that other use of ‘dynamic_cast‘, so here that is. I added another function to the ‘DieRoll‘ ‘class‘ to retrieve the number of dice in the roll. Then I changed creating the child objects a little to change up how many dice were in each roll:

Now the ‘DieRoll‘ children will all have different numbers of dice in them — anywhere from 1 to 10.

Here ‘count_sum‘ and ‘dice_count‘ are earlier initialized to zeros and are responsible for tallying the total dice in all rolls and the total number of ‘DieRoll‘ versus ‘Uniform‘ objects. Note that I can only call the ‘get_count‘ accessor via a ‘DieRoll‘ pointer. I could not have directly called ‘dists[d]->get_count()‘ because ‘dists[d]‘ is a ‘RandDist*‘ instead. The ‘dynamic_cast‘ not only vets the pointer as pointing to a particular descendant type, but also allows you to call type-specific member functions from that result.

I’ve placed the full example with a few extras and notes on the website for your enjoyment and further study.

Advanced Inheritance

We’ve talked at length about basic inheritance and how polymorphism helps to round out those sharp corners. But there are also more advanced topics in inheritance that might rear their ugly heads to your design process. In this section we’ll look at those including inheriting from multiple parents, a new member access classification, non-‘public‘ inheritance modes, and how ‘static‘ ‘class‘ members interact with inheritance.

Multiple Inheritance

Sometimes it becomes useful or some would even say necessary to inherit from multiple existing ‘class‘es. For instance, a hovercraft might inherit from both land and air vehicles. Or a seaplane might inherit from both aircraft and boats.

Let’s look at a basic example of this where a ‘derived‘ ‘class‘ inherits from two base ‘class‘es at once:

Here we see that to inherit from multiple parent ‘class‘es, we just comma-separate them after the inheritance colon. Also, we call both of their constructors in our member initialization list. Just watch that you call those constructors in the same order you inherited from those ‘class‘es or you might suffer a warning about having to rearrange them!

Overcoming Name Clashes

So, how can a ‘derived‘ ‘class‘ object call the two inherited ‘print‘ functions? They have the same signatures! It’s as simple as using a scope-resolution override like we’ve done before:

Here you’ll get that the ‘derived‘ object contains a ‘base‘ value of ‘-6‘ and a ‘base_too‘ value of ‘18‘.

The Diamond Pattern

The only unfortunate thing about multiple inheritance is that sometimes you end up with a diamond pattern in your hierarchy. Often called the diamond of doom, it happens when two or more of a ‘class‘’ parents come from a common ancestor themselves. The simplest of these situations is modeled here:

At first glance, it seems ridiculous and unnecessarily complex. But look, for instance at the input/output streams hierarchy for C++. It has a diamond pattern in it! Turns out that ‘istream‘ and ‘ostream‘ both derive from the common ancestor ‘ios‘. Then, ‘iostream‘ derives from both ‘istream‘ and ‘ostream‘! This can be seen in any number of diagrams on the web like this one from a stackOverflow discussion.

Running this fragment we see that the first display has both ‘base‘’s ‘grandparent‘ value as ‘12.43‘ and ‘base_too‘’s ‘grandparent‘ value as ‘12.43‘. But in the second display, the ‘base‘ ‘grandparent‘ value has changed to ‘2.43‘ while that from ‘base_too‘ is still ‘12.43‘. This shows that, without some care, the two inheritance paths’ common ancestor data can become out of sync.

We could, of course, handle this manually by always making the same change to both paths back-to-back. But this becomes tedious and is prone to error. A better way must be found!

To solve this problem, the C++ committee decided to reuse the ‘virtual‘ keyword in a unique way. And the solution comes in a unique place, as well, some feel. It is hardly a change at all for the ‘derived‘ ‘class‘ at the bottom of the diamond, but the middle-layer parents need to make a change to their inheritance modes from the common ancestor:

Here we’ve added the ‘virtual‘ keyword to the inheritance mode ‘public‘. Note that it can go on either side of the ‘public‘ and the compiler doesn’t care. And this is all that is absolutely required to fix the problem. If we tweak this in our fragment of above, we’ll find that both ‘base‘ and ‘base_too‘ have ‘grandparent‘ values of ‘2.43‘ now.

The compiler does this by forcing the two middle-layer parent’s share their common ancestor when they are both inherited from at once. That’s quite the effort for our benefit, so kudos to the compiler team!

But that relies on the shared ‘grandparent‘ portion being initialized twice: once by the ‘base‘ and then again by the ‘base_too‘ ctors. To alleviate this double effort and to clarify what the grandchild (‘derived‘) wants to have in its now solitary ‘grandparent‘ memory, we can also change the member initialization list of its constructor to call to the ‘grandparent‘ constructor. This is different than before because the compiler never let us call to a constructor two levels back before — only one level! But in the case of a ‘virtual‘ized diamond pattern, the compiler allows it. It looks quite simply like this:

Note that the ‘grandparent‘ call must precede those of the parents or warnings may come forth!

You can find this full example on the website for further study, of course.⁸⁸

How Not to Use Multiple Inheritance

Another way to avoid the diamond of doom is to just not use multiple inheritance at all. Instead, choose one parent and make yourself a data member of the other proposed parent’s type. The problem with this solution is that you must take responsibility upon yourself to keep the common grandparent data synchronized. As mentioned above, this will be tedious and error-prone and is not the best choice.

protected Membership

It turns out that a ‘class‘’ members don’t have to be just ‘public‘ or ‘private‘. There is a third access classification: ‘protected‘. Members of a ‘class‘ whose access is marked ‘protected‘ are ‘private‘ from outside the inheritance hierarchy but ‘public‘ from inside the hierarchy. That is, they share some attributes with each of the other specifiers.

I wish I had a diagram to show how this worked, but the best one I’ve found is now lost in the web.⁸⁹ It had two neighboring houses with a single fence around the two yards. The houses themselves represented the ‘private‘ areas of two ‘class‘es. The shared yard represented some ‘protected‘ members that the two ‘class‘es had in common. And the fence kept the outside of the hierarchy from getting to the ‘protected‘ area. It was pretty slick. Hopefully my poor description can be enough for now. Maybe someday I’ll learn to draw.

A stern warning: Do NOT make data members ‘protected‘! Leave them ‘private‘. You can have ‘protected‘ functions that provide easier or special access to your children, but not to those outside the family. But ‘protected‘ data members caused the Java designers to design a whole new GUI hierarchy because their first one could be corrupted and made ridiculous and/or ugly by misuse of ‘protected‘ data members.

Non-public Inheritance

But it is also possible to use other inheritance modes besides ‘public‘. The two others are — perhaps not surprisingly — ‘private‘ and ‘protected‘.

A ‘private‘ mode of inheritance makes all ‘public‘ or ‘protected‘ members of your parent ‘private‘ when viewed from your objects.

A ‘protected‘ mode of inheritance makes all ‘public‘ members of your parent ‘protected‘ when viewed from your objects.

In other words, the access to your parent’s members is made more restrictive by the other two inheritance modes — when viewed from objects of your ‘class‘.

Here the row labels represent the inheritance mode used and the column labels represent the access mode of parent members. The boxes inside the table, then, represent the access mode that appears to those outside the child ‘class‘ with respect to those members of the parent. The ‘private‘ access in the last column is starred because, of course, not even you can reach your parent’s ‘private‘ members without using accessors or mutators — sort of uber-‘private‘.

This has limited applicability, of course. It is used fairly rarely. But we will see a good occasion to use it when discussing stacks and queues later on (chapter [def:stack-queue]).

static Members and Inheritance

One last nagging question involves ‘static‘ members of a ‘class‘ involved in inheritance. Clearly a child ‘class‘’ ‘static‘ members wouldn’t affect the parent since the parent doesn’t even know the child exists. But the other way around …how does the child ‘class‘ handle those members during inheritance?

One popular theory is that the child ‘class‘ will get copies of all the parent’s ‘static‘ members and all child objects will share those copies separately from the parent ‘class‘’ copies. Others believe that all objects of both ‘class‘es share one copy of the ‘static‘ members regardless of parent or child. Which is right?

We could try to use one of our conceptual models to ferret out the answer to this conundrum, but a fairly simple set of code should help us see what’s going on as well. Just download those files and load them into your favorite environment to follow along below. Be sure to pay attention to the text comments and feel free to test it by changing the code comments in the two files of the library.

Let’s look at the file first. The parent ‘class‘ creates three ‘static‘ members of increasing complexity to help with this example and to refresh our memories on handling ‘static‘ members of a ‘class‘.⁹⁰ The ‘MAX‘ constant is initialized right away as it is simple enough to do so and constant. The other two are not so lucky.

The ‘grades‘ array is considered too complex to initialize within the context of the ‘class‘ itself. It must therefore be initialized outside. Similarly the ‘name‘ member variable cannot be initialized inside the ‘class‘. This time, however, it is because it isn’t constant — one-dimensional arrays are simple enough normally.⁹¹

Since both of these need to be initialized outside the ‘class‘ definition, you might try to do that right below in the file. But this leads to multiple definition errors from the compiler when things are being put back together after the separate compilation phase of things.

Thus, they must be placed in a separate implementation file of their own to keep them unique — not d multiple times. This is done in the file, of course.

The next file is which simply defines a derived ‘class‘ with its own ‘static‘ constant member for good measure.

Both of these ‘class‘es have ‘output‘ methods to display their components. The ‘A‘ parent ‘class‘ prints a random grade for the ’student’ each time — just for fun.

The final file — — calls on both of these libraries to test how the ‘static‘ members behave. First we make a parent object and test it. We ‘output‘ it, change its ‘static‘ member variable contents, and ‘output‘ it again. This seems to go off without a hitch. (Remember that the grade printed is random!)

Next we test the derived ‘class‘ and how it interacts with the parent’s ‘static‘ member variable. (We don’t test the member constants because we wouldn’t be able to affect them anyway — even if they were copies in our local ‘class‘ space instead of being shared with the parent ‘class‘.) If the child ‘class‘ has a separate copy of the ‘static‘ members, then the initial name should still be ‘"Bill"` as that member was originally initialized in the file. If not, it should be ‘"Alma"` as changed by the parent ‘class‘ object above. After changing the ‘name‘ itself, the child object re-‘output‘s itself to see if the change took.

Then, just to be extra certain we know what’s going on, we ‘output‘ the parent object one more time to see if the child’s change of ‘name‘ affected it as well.

As you can see when you run the program, the output might look something like this:

You see that all changes were effective and so the parent and child ‘class‘es both share a common memory space for their ‘static‘ members.

This could have also been ferreted out fairly easily with the set theory model of inheritance. As a subset of the original ‘class‘’ objects, the child ‘class‘ objects would clearly share the same ‘static‘ members as all their surrounding parent ‘class‘ objects.

Wrap Up

In this chapter we’ve learned about the basic concept and syntax of inheritance between ‘class‘es. We’ve studied basic designs using inheritance and even making small hierarchies of related ‘class‘es. But there was something off-putting when we looked at a derived object with a base pointer or reference and called methods the child had reworked — the parent’s version was always called!

Thus we developed polymorphism to solve this issue. And with that power came further ideas like abstract ‘class‘es and polymorphic containers to solve even more interesting problems than basic code reusability like inheritance solved.

Afterwards, we went back to inheritance and studied its deeper and more esoteric corners. Here we learned how to inherit from multiple parent ‘class‘es at once and how to avoid/fix the diamond pattern that might creep up. We also learned the ‘protected‘ keyword and how it affected member access. And we reused this keyword and ‘private‘ to change up how members were inherited from a parent via alternative inheritance modes. Finally we looked at how a ‘static‘ member in a parent ‘class‘ is treated with respect to inheritance.

Compile-Time Polymorphism

Compile-time polymorphism is achieved through the use of ‘template‘s. So you should first review what we said of ‘template‘s in the first volume. You could also review what we said of them later in the ‘vector‘ chapter (both this section and this one).

As you’ll recall, ‘template‘s can make quick work of implementing an algorithm in a generic way that applies to most any base type. Then it is all on the compiler to make the various overloads that lead to the polymorphic behavior of that algorithm within our program. That last bit is called instantiation of the ‘template‘ and creates binaries for each base type chosen.

Now that we’ve motivated and refreshed, let’s deepen our appreciation of what this tool can do for us!

template Function Design

In learning to design well with ‘template‘s, we must dig deeper into the requirements list hinted at in the first volume when we looked at the ‘swap‘ ‘template‘ there. In fact, let’s start with the ‘swap‘ ‘template‘ and make an improvement.

It’s requirements list is commented off to the side, even. And this ‘template‘ will work for many data types but not all. Let’s take a fairly simple type like this one:

This is a perfectly lovely ‘class‘ with a lot of potential. But it can’t be ‘swap‘ped with our ‘template‘! Which of the requirements is missing? All ‘class‘es get an ‘operator=‘ we’ve learned and that only needs changing if we dabble in dynamic memory which isn’t happening here. When calling ‘swap‘, both arguments would be of the same type, so that’s not the problem. It must be the constructor thing. Why can’t this ‘class‘ default construct?

Remember the detailed rules about constructors: the copy constructor is always supplied and the default constructor is supplied unless you write one of your own. We’ve broken the "unless" clause here! We supplied our own constructor to divvy the arguments into the members properly. Now we can’t default construct. And what sense would that make for this ‘class‘, anyway, right?

Can we fix this issue? Sure we can! Looking back to the first volume’s own example ‘swap‘ overloads, we see that they weren’t written in quite this way, either. They were all copy constructing their temporary helper variable! Let’s do that:

Now we require the ‘template‘ type to be copy constructible and that will either always be supplied by the compiler or it will be supplied by the programmer with dynamic members in their ‘class‘! In fact, we now see that calling ‘swap‘ with two ‘Str_Pair‘ objects works just fine!

A Common Language

Like [run-time] polymorphism, ‘template‘s can basically enforce a common language amongst the different types which can be used to instantiate a ‘template‘. Polymorphism specifies the language all of a ‘class‘’ descendants will share via ‘virtual‘ functions. A ‘template‘ specifies a language via its ‘template‘ type requirements and then data types attempting to instantiate are merely tested to see that they indeed know the proper language before they are allowed a binary form. (The ‘template‘ mechanism is often referred to as compile-time polymorphism. That’s why some people don’t just call polymorphism that but rather run-time polymorphism.)

So the requirements list is the common language that all participants in the ‘template‘ must speak. We can use this much more general approach⁹² to allow a piece of code to work not just with types that are related via inheritance, but which all provide the same functionality. Like, if all the types had a ‘print‘ member that took an ‘ostream&‘ and didn’t change their calling object, it wouldn’t matter if they were part of the same hierarchy. We could provide something like this for them with ease:

Now all of them can be inserted into a standard stream like ‘cout‘! No need to retrofit all of them individually if our design were consistent in the first place. *smile*

Containers of a Feather

Let’s talk about a simple linear search algorithm and how we can make it work with all kinds of containers. We’ve got the original design built with C-style arrays and it looks like this:

This version of the linear search ‘template‘ function takes a built-in type array of elements and the element to find. The base type of the array must, of course, match that of the element.

But, not all containers of elements are built-in type arrays. ‘vector‘s and ‘string‘s are ‘class‘ types which overload ‘operator[]‘ to behave like built-in type arrays.

To alleviate this problem, we can try to add a new ‘typename‘ slot to our ‘template‘:

Yes, it is possible to have multiple ‘typename‘s on a single ‘template‘. I just opened up whole new worlds for you, didn’t I? *smile*

This now works with ‘class‘-type containers and yet doesn’t break our ability to work with built-in type arrays. "But wait," you say, "you cannot refer to an array!" Well, no, but I can refer to a pointer to the beginning of the array. And all arrays degrade to a pointer when passed to a function, so…grin*

But those worried about the compatibility of ‘string::size_type‘ and ‘size_t‘ (or its compatibility with various ‘vector‘ ‘size_type‘s) will want more. They’ll want a third ‘template‘ ‘typename‘ to represent the position/size type for the container:

The Empty Curly Syntax

But what’s the +PosT+? Oh, that default constructs an anonymous ‘PosT‘ object with which we then copy construct ‘p‘. Why not just ‘PosT p;‘? Because built-in types don’t default construct unless special circumstances — like us calling their default constructor explicitly — occur, remember?

That gets rid of it, but now they have to specify where to begin searching. If they really wanted to start at the beginning by default, we’d have to supply a defaulted argument like this:

And the const& on Arguments?

But why have I put all these ‘const&‘ on the arguments? Well, since they are going to be filled in by the compiler at the callers request, we don’t know how simple or complex of a type they may be. They may be extremely costly to copy. Therefore, I avoid the copy construction by passing them by ‘const‘ant reference. (‘const‘ because I don’t need to modify any of these arguments, of course.)

Whither Requirements

Before we move on, what exactly is required by the types in this ‘template‘? Its requirements list looks like this:

Although this seems at first a rather long requirements list, this is actually rather shallow as requirements list go. Even with its automatically deduced ‘typename‘ and all the hand-wavy ‘bool‘-like business. It could get much worse! What if ‘T‘ doesn’t actually have the ‘operator!=‘ but rather ‘ElemT‘ overloaded it as a non-member? That should work, too. What if either ‘operator!=‘ or ‘operator<‘ actually returns a new deduced type ‘U‘ that then overloads ‘operator&&‘ to combine itself with the other ‘operator‘’s result?! The compiler can handle all of this — and more! We may not even be able to conceive of the oddness that can result from this simple linear search construct. Luckily the compiler is infinitely patient and tedious if not clever.

Another Container Style

Another thing we can do is to allow for pointer/‘iterator‘ -styled access to a container for searches. This requires a separate ‘template‘, of course, with quite different requirements:

The ‘PosT‘ should be instantiable on either pointers or ‘iterator‘s. Pretty nice, eh?

Overloading templates

These two ‘template‘s can overload one another because of SFINAE — "substitution failure is not an error". Basically, the compiler tries both ‘template‘s one at a time and the first to work is the victor. If neither work, then we have a mismatch to a call that reports as ambiguity.

Functions as Arguments

But there is one more thing we can do to help this ‘template‘ function out. Right now it is dependent on ‘operator!=‘ for locating the object within the collection. If the caller wants to search on more complicated criteria than simple [in]equality or doesn’t have an ‘operator!=‘ available, they might want to pass a function returning a ‘bool‘ to tell us what to consider [un]equal.

Wait! Did you just say, "pass a function?!" Like, pass one function as an argument to another function?! What are you, nuts? That can’t be possible! Au contraire, mon frère! It is…

So what does ‘CompF‘ look like before it gets here? It could be a regular function. The caller would just use its name in the call without applying parentheses to call it there:

But as long as it meets the specifications above of taking a ‘T‘ and an ‘ElemT‘ — or types convertible from these — and returns a ‘bool‘-compatible value, we’ll be happy to use it in our linear search ‘template‘ function!

Function Objects

Thus it could be a function object which would, of course, be just a variable and not need parentheses. (It could even be a lambda as we talked about in section [defn:lambda]!)

So letting a ‘template‘ parameter be another function is amazing power. We can put off some of the decisions in our algorithm design until later and just take a function to handle that when it gets finalized.

Its purpose is to walk through a container and apply some function to each element. This can be used to work with the values individually or to even modify them since we took the container by pure reference instead of our usual ‘const&‘.

As an example, we could use this to display a list of ‘double‘ values in a special format like so:

For more on the use of single ‘|‘ to set up multiple format flags simultaneously, see chapter [defn:bitmanip].

Nice and neat! We could also use it to total the elements in a container like so:⁹³

Hey! What happened? Well, the ‘Total‘ object ‘sum‘ was passed to the ‘foreach‘ function by value instead of reference. This caused the total to update locally inside the function and then be thrown away as it returned! To fix this, we just need to adjust the ‘foreach‘ ‘call_me‘ parameter to be by reference:

There is a possible problem now, though, with our earlier ‘print_nice‘ example. Some older compilers won’t like passing a plain function to a reference parameter. And sometimes your company might be stuck on an older compiler due to pricing issues. To fix this is a bit of a kludge. We would have to take the ‘call_me‘ parameter by ‘const&‘ and then make the ‘Total‘ ‘class‘’ internal ‘s‘ member ‘mutable‘.⁹⁴ Then mark the ‘operator()‘ that updates ‘s‘ as ‘const‘ and you’ve got it all working again!

But using ‘mutable‘ to mark core data is not a good design. It’s just not the done thing! Best thing to do: negotiate a compiler update with IT. *smile*

Requirements Filling

So, just out of curiosity, what do the requirements look like as they are filled in? Well, it’s more than curiosity, it can also come in handy when reading error messages from compilers having trouble fulfilling requirements from a call. So let’s explore that.

We saw in our first volume that the ‘swap‘ ‘template‘ was filled in during instantiation with types that matched the requirements. So if I called ‘swap‘ with two ‘short‘ integers, I’d get ‘swap<short>‘ as a binary version of the function.

But how does this extend to multiple ‘typename‘ ‘template‘s like we’ve been doing? Just like you might expect, the instantiation just has multiple actual types filled in and comma-separated. So if we look at a call like this:

We’ll find that the instantiation is ‘linsearch<short,short*,size_t>‘. Notice that they follow the order specified in the ‘template‘ head and not their use in the function head.

But when it comes to a call with a function parameter, it can be more tricky. For instance, what is the type of a regular function? They have them, but we’ve never looked at them. I suppose it’s about time. First, a word of warning: a function resolves to a pointer type. After all, a function’s code must reside somewhere in memory. And so their data types are pointer types. But they look really weird at first glance and will require some explanation. But I think we should just throw you into the deep end and then teach you to swim if that doesn’t suffice. *smile*

Here we see that the function name is basically replaced with a parenthesized pointer. Once you get used to it, it isn’t so bad.

That call would be instantiated as ‘linsearch<double,double*,bool(*)(double,double),size_t>‘.

And what about the pointer/‘iterator‘ version of ‘linsearch‘? How does that one work? Let’s check it out:

The instantiation of this can come out as either ‘linsearch<char[5],vector<string>::iterator>‘ or sometimes as ‘linsearch<string,vector<string>::iterator>‘ depending on the compiler’s current whim. *smile*

This gives us a failure to compile! Clearly ‘s‘ is an array and should degrade to a pointer for the function call, but at least on my two compilers, it checks into the ‘template‘ as ‘char[10]‘ instead. There is a conflict, therefore, between this parameter and the ‘s+10‘ which is definitely a ‘char*‘. To fix this, we can either do explicit instantiation:

Full Disclosure

The C++ committee did update the function pointer syntax in C++11 to make it somewhat less daunting. For instance, now we could do the data type of ‘rand‘ as simply ‘int(void)‘ without having to deal with the parenthesized star syntax at all.

But note that if you want to store such a function address, you need to add the star back on:

Function Prototype	Function Pointer Type
‘char * strcpy(char * dest, const char * src);‘	‘char * ()(char ,const char *)‘
‘int toupper(int ch);‘	‘int (*)(int)‘
‘time_t time(time_t * result_now);‘	‘time_t ()(time_t )‘
‘void srand(unsigned int seed);‘	‘void (*)(unsigned int)‘
‘int rand(void);‘	‘int (*)(void)‘

And the compiler will implicitly convert the ‘rand‘ function to a pointer without need of the address operation.⁹⁵

Another Approach

Above we let the caller use our ‘linsearch‘ ‘template‘ that used ‘!=‘ directly for most types and just call the overload with a function argument when they needed to compare things like ‘double‘ that don’t like ‘!=‘. This approach is valid, but there is another.

The second approach would use a ‘template‘d helper function and only provide the equal-testing function parameter version of ‘linsearch‘. Such a helper could look like this:

But this would just give warnings for ‘double‘ and the like or crash if the type had no ‘==‘ operation. So what do we do with those types? We recall from the ‘vector‘ revisit of ‘template‘s in the first volume about specializing a ‘template‘ and make such for our ‘Equal‘ ‘template‘:

Note: if your compiler complains that ‘double‘ is converting to ‘int‘, make sure you ‘cmath‘ and think about adding ‘std::‘ on the call to ‘abs‘.

And it crashed at compile time! It said, basically, that it couldn’t use ‘Equal<unknown type>‘ in the call. That’s because we didn’t tell it what type to use in the ‘Equal‘ ‘template‘! We’ve got to explicitly instantiate it like so:

But That’s Too Accurate

Those who tried the above code already have noticed that the last search finds the 4.2 at position 3 and not the 4.199999 at position 2. This might seem odd at first, but at

10^{-6}

, there aren’t enough 9s to round us up. We’d need to cut the precision of the match to make it round. Looking over our available tool-set, we find that this function solution is inadequate to the job and so a function object approach might be the better tack.

The first two still find the value at position 3 but the last one is so slack that it rounds up and find the value at position 2.

But That’s Tedious

If, again, this ‘linsearch‘ is now our only ‘linsearch‘, then we could also change the head of the ‘template‘ to by default use ‘Equal‘ when a comparison function wasn’t specified:

Note that we’ve still used explicit instantiation, we just don’t know yet to what type! But there is one thing missing. We have to tell it that the ‘CompF‘ can default as well:

There, now that will compile and run. But the syntax of the ‘CompF‘ default is a bit much. We can make it a bit better with the ‘decltype‘ tool. Basically you give ‘decltype‘ an expression and it tells you the type you would use to declare that result. It looks much cleaner, as you can see:

To prove it all works, here is a short code sample. It even works with the specialization!

A Variation

This talk of defaulting ‘template‘ ‘typename‘s has me thinking that being ‘Equal‘ doesn’t mean you have the same data type all the time. For instance, ‘5 == 5L‘ is ‘true‘ even though the left value is ‘int‘ and the right value is ‘long‘. We can make a simple tweak to our ‘template‘ to make this possible:

Failure versus Success

So we’ve seen many successful ‘template‘ instantiations so far. And we saw one failure with the pointer/‘iterator‘ version of ‘linsearch‘. But what actually happens when an instantiation fails? The messages change from one compiler to another, of course, but hopefully they are readable and you can interpret them to head toward a solution.

What’s more concerning is what happens when an instantiation should have failed but didn’t? What do I mean? Let’s look at this ‘linsearch‘ call:

This function is clearly not comparing anything and doesn’t even return a ‘bool‘. But this call will instantiate and returns 4 as the answer. Position 4 is clearly not ‘"stuff"` but ‘"laughing"`. So what’s the deal?

We’ll have to analyze this a little more deeply. When ‘linsearch‘ reaches a position in the container, it passes that value and the searched for value to the function argument in that order. So we are generally looking at a call to ‘um_len("apple","stuff")‘ and so on — advancing to the next slot to change ‘"apple"` to the next and next and so forth.

Let’s look at this first call. First, the lengths of each ‘string‘ are subtracted and the absolute value of this difference is taken. For ‘"apple"` and ‘"stuff"`, the difference is 0 which is its own absolute value. This is then cast to a ‘string::size_type‘ for the ‘return‘.

When that call gets back to ‘linsearch‘, this result is used in a ‘!‘ operation. ‘!‘ expects a ‘bool‘, of course, and receives a ‘string::size_type‘ — an ‘unsigned‘ integer type! Instead of being ruffled, it just coerces the value into ‘bool‘ form. The 0 it received has no 1 bits and so it turns that into a ‘false‘. Then, applying the ‘!‘ to that, it gets ‘true‘, which pushes the loop around one more time.

This process continues with ‘"neato"` and ‘"stuff"` — both of which are the same ‘length‘ as the ‘"stuff"` to be found and so they just push the loop around again. Finally, ‘"laughing"` is reached which is longer than ‘"stuff"` by 3 and so we get a value in the ‘!‘ that has 1 bits.⁹⁶ This turns into a ‘true‘ which the ‘!‘ negates to ‘false‘ and this stops the ‘&&‘ which stops the loop!

Thus, we discover that the use of ‘um_len‘ in the ‘linsearch‘ function is not only a possibility, but it finds the first entry not of the same ‘length‘ as the searched for value in a container full of ‘string‘s.

Can we protect from this nonsense? Not easily. This might — MIGHT — be possible with the new C++ ‘concepts‘ mechanism in C++23. But I haven’t got a compiler to test it with. For now, we have to take it in stride as an accident waiting to happen or the most clever usage pattern ever invented!

Making a Whole class a template

‘template‘’ing a whole ‘class‘ comes in two basic variants: all ‘class‘ functions are ‘inline‘ and some ‘class‘ functions must be non-‘inline‘.

All inline Functions

If a ‘class‘’ functions are all ‘inline‘, making a ‘template‘ out of it is relatively easy. Note the following function object ‘class‘, for instance:

We just add the ‘template‘ head on top and use the new ‘typename‘ throughout where it might need to be. But one thing has changed for this type — its name. We don’t see it in this all-‘inline‘ definition, but when we instantiate a function object from it, we see:

Here we note that the ‘typename‘ must be filled in for instantiation to take place. The compiler has no context to deduce the type from otherwise. Even if we used an initializer on the construction like so:

The compiler will fail to deduce the ‘typename‘ for the ‘template‘ unless we are using at least C++17. Note also that the initializer for the ‘s_tot‘ object must be a ‘string‘ ‘class‘ object and not a literal C-string!

Examining the ‘template‘ ‘class‘ in more detail shows us that we’ve nested one ‘template‘ inside another here. The ‘operator()‘ that takes an argument takes one of a different type than the ‘SumT‘ for the ‘class‘ itself. The reason being that you can add say, ‘short‘ values to a ‘long‘ total or ‘char‘ or C-string values to a ‘string‘ total. The individual values don’t have to be the same as the result type. This nesting of ‘template‘s is also known as the ‘template‘ method pattern and is pretty useful in many places.

Some non-inline Functions

Not having an example handy with large enough functions to keep them not ‘inline‘, we’ll just assume that we want a couple of the above functions to not be ‘inline‘. Let’s start with the constructor. It raises two issues. One with the name of the ‘class‘’ scope and contrasting this with the name of special functions like constructors and destructors. The scope turns out to be the type of the ‘class‘ and the special functions’ name is just that of the ‘class‘ itself.

We’ve always just assumed these were synonymous, but we were mislead by the simplicity of the non-‘template‘ ‘class‘es we were working with. The scope being actually the type of the ‘class‘ means it needs the instantiation name used instead of just the ‘class‘ name. Since we won’t know a specific instantiation until the programmer using the ‘template‘ fills one in, when we write the code we use the ‘typename‘ from the ‘template‘ head like so:

Note, also, that the ‘template‘ head has to reappear on top of all non-‘inline‘ function definitions.

Another key example to look at is our nested ‘template‘ function for adding a new value to the ‘sum‘:

Here we see that the two ‘template‘ heads are stacked or at least space separated. The scope/type of the ‘Total_c‘ ‘class‘ hasn’t changed, though, with the new head added — it is still just ‘Total_c<SumT>‘. The second ‘template‘ head applies only to this function and so it is the function’s type that has changed to include that ‘typename‘. Luckily that won’t affect us since it is deducible from the argument.

Keep the type name rule in mind, though, when non-‘inline‘ function takes an argument of the ‘class‘’ type or returns a ‘class‘ type value/reference as well. For instance, if we had an ‘operator=‘ for the ‘Total_c‘ ‘class‘ for some reason, it would look something like this:

Separate Compilation

In the case where all the ‘class‘’ function are ‘inline‘, we have no problems putting it into a library. Just put the whole ‘class‘ definition in the interface/header file and away you go.

But when the ‘class‘ has non-‘inline‘ methods, we have an issue. Tradition holds that such methods need to be defined in a separate source file (the implementation file) to maintain encapsulation and to keep our implementation secrets safe from the competition. If we were to eschew these concerns, we could define the non-‘inline‘ functions outside the ‘class‘ definition — below it, in fact — within the header file and we’d be done.

But we don’t want to eschew tradition so easily! We want it to separate! So, we went to a lot of research as a community and found that we couldn’t get the compiler to recognize separated code as ‘template‘ code and still be able to instantiate the ‘template‘ later. Thus we did what any self-respecting programmer would do: we tricked the compiler.

We invented a new file extension/type and a new kind of inclusion guard. Unfortunately these aren’t standard throughout the community, but they are fairly easy to spot and understand once you know what to look for.

Let’s start by talking about the new extension. This needs to indicate that it contains ‘template‘ function definitions that is to be d into another file and so many go with extensions like , , or . We avoid extensions like or because those are considered temporary files on many systems and might be deleted by file cleaners!

We take all the non-‘inline‘ method definitions and place them alone into the file. They can any other libraries they need, but they cannot employ a ‘using‘ directive as they will end up d themselves!

Once we have picked an extension, we need a new inclusion guard symbol. This one is to tell the compiler that it cannot separate ‘template‘s into multiple files.⁹⁷ This can be wordy or terse, as long as it is readable. I often go with something like ‘TEMPL_CANT_SEP‘ — possibly with full words on the first and last.

Then we put the following inclusion-guarded code into the bottom of the interface file:

Now to compile the whole application, we have two possible approaches. The most portable is to simply the include guard symbol before each of the library containing the ‘template‘ ‘class‘’ definition. This would look something like so:

Or, instead of ’ing this all over the place, we can define it once for the whole compile process. The most common way to do that is with the ‘-D‘ command-line parameter which most compilers seem to support. This command-line parameter to the compiler will define a symbol that follows it — typically with no spaces separating it — in all separately compiled source files this build. So it would look like ‘-DTEMPL_CANT_SEP‘ on your command-line. If compiling in a GUI IDE, however, finding where to put such a global define or command-line parameter is difficult sometimes. Hopefully your instructor will be able to help with this.

Note that a separate file is not required for this library as it would end up empty anyway.

Since this process is a bit complex, I’ve placed a complete example on the website.

Making friends

If you feel the need for your ‘class‘ to make ‘friend‘s for efficiency at the cost of data security, you can still do that with a ‘template‘d ‘class‘. The process is a little tedious to implement but guaranteed to work in any compiler as it has been part of the standard since C++98. There are three steps:

Let’s look at another ‘class‘ that’s been turned into a ‘template‘: a ‘Set‘ ‘class‘. This ‘class‘ has also overloaded ‘operator<<‘ for printing to an ‘ostream‘-derived output stream. Let’s make this a ‘friend‘ as an example. The steps look something like this in the interface file:

Note that the ‘friend‘ship declaration doesn’t have the function’s ‘template‘ head on it since it had the same one as the ‘class‘ it has been made ‘friend‘s with. Since the function takes a ‘class‘ argument, it would pretty much have to have the ‘class‘’ ‘template‘ ‘typename‘(s) in tact and so be itself a ‘template‘.⁹⁸

Note that the ‘Set‘ argument has the full type name here as it did during declaration of this function. It did not have the full type name during the declaration of ‘friend‘ship…

An Alternative Approach

In newer compilers, there is a shorter syntax that allows you to combine the not only the declaration of the ‘friend‘ function with the declaration of ‘friend‘ship, but even define the ‘friend‘ function right there ‘inline‘, too! This would look like so:

Here is an example program for you to check out since it got a little long to publish here.

Overloading vs. Specializing

When using a library with ‘template‘’ing going on and you aren’t getting the results you expect, how can you best resolve it? There are a couple of approaches, let’s explore which is best. I’ve worked up two sample programs on the website for you to download and play with. They can be found here and here.

In the first one we see a ‘template‘ for finding the ‘min‘imum value of two arguments followed by a similar function. The similar function might look at first like it is overloading the ‘template‘ as we did above. But it doesn’t really. Remember that the plain function here is just called ‘min‘ whereas the ‘template‘ is really called ‘min<Data>‘ — very different names. But we see this kind of thing a lot and sometimes just call it overloading because we are human and we are a little sloppy. Just be aware that you might need to defend yourself to others on such grounds.

Here we see three calls to a ‘min‘ function of some sort. One is ’plain’ where we give just the argument names and let the compiler figure everything out. The second is an explicit instantiation of the ‘template‘ version. And the third uses typecasting to tell the compiler what we think the arguments should look like for the function.

On this first program that utilizes ’overloading’, we see the following results:

Here the right answer (the capitalized value) is given by both the plain call and the typecasted call. The ‘template‘ version failed because it just compared the addresses of the variables instead of their contents.⁹⁹

In the second code, we see two ‘min‘imum functions and again one is a ‘template‘. The other this time is a ‘template‘ specialization for ‘const char *‘. We run these again through the main above and now get this result:

Again, the one that called the ‘template‘ gets the wrong answer, but this time it is for different calls! This time the plain call does the ‘template‘ whereas last time it was only the explicit instantiation that did it! In both programs, though, the typecasting worked beautifully to get us to the function that would work.

Lesson learned: use typecasting to help guide the compiler to the right function for the job.

Non-Type template Parameters

The things that the compiler deduces to match a ‘template‘ don’t have to be just types. There are also non-type ‘template‘ parameters. These do have to be 1D discrete types like for a ‘switch‘ head, but they can still be useful all the same — just like a ‘switch‘!

In the provided example, you can see two functions to ‘swap‘ the contents of two C-style strings. The non-‘template‘ uses an arbitrarily large constant to size the temporary helper C-string. This is, of course, a poor design as it takes up way too much space most of the time and wouldn’t work on some cases, still. We also fear that the calls to ‘strcpy‘ will overrun at some point and that’s never good!

This version is at first commented out as it provides a more exact match to the compiler’s taste than does the ‘template‘ version despite the ‘template‘ being listed first. If you uncomment it and run the program, you’ll see that when it prints the sizes of the parameters with the ‘sizeof‘ ‘operator‘, you get something smaller than the arrays were declared to be. This isn’t because the contents is shorter as it doesn’t vary from value to value. It is actually the size of a pointer on your system. Remember that arrays degrade to pointers when passed to functions.

Commenting that function back out, we recompile to use the ‘template‘ with the non-type parameter. This looks like so:

Here a ‘size_t‘ is deduced and used in the ‘template‘ instantiation. In this case, the deduced value is the declared size of the C-string parameter. The compiler can do this because the array was declared in the same scope as the call — main. And we also took special care with the formal parameters themselves and made them refer to the original arrays. This reference mark had to be parenthesized because it would otherwise have been assumed part of the array’s base type and arrays of references are illegal.

Once deduced — and note we are requiring that both parameters have the exact same size! — we can use this constant known at compile time to size the temporary helper array inside the ‘swap‘ function itself! Now it won’t waste memory or fall short. It will always be the exact right size! And we can use ‘strcpy‘ without fear of overrun, too!

The careful observer will note that just putting the references on the array parameters was enough for the compiler to remember the sizes of the declared arrays. But to do so we must fill in the ‘[]‘ on those parameters with some literal or constant number — making ‘sizeof‘ redundant. Having the ‘template‘ deduce the array size makes the function work with a variety of arrays again.

Metaprogramming Basics

‘template‘ metaprogramming is the use of ‘template‘s to run code at compile time. Think about it. That’s just insane, right?!

But it’s true. When deducing ‘template‘ information — especially non-type ‘template‘ information — we can make use of it to write bits of code that will run during compilation. I’ve got two examples of this that seem tractable at this level. For future exploration, I’d recommend something like the boost libraries site. They have tons of examples and do amazing work that sometimes finds its way into the standard libraries!

Improving the swap Function

Let’s start where we left off above. There was a test in the main for the ‘swap‘ example that couldn’t run because it tried to ‘swap‘ two C-style strings that were of different physical lengths. Let’s make that sort of thing work!

We’d need two non-type ‘template‘ parameters to deduce the separate sizes of the C-string arrays coming in like so:

And then we’ll need to decide which of these is larger to size the temporary array with. This can be done with a simple ‘?:‘ test, of course, like so:

Since both ‘N‘ and ‘M‘ are compile-time constants, either is able to be used to size the temporary array.

This kind of decision being run by the compiler at compile-time is just amazing and allows us to do all kinds of coding we couldn’t do before. And it isn’t limited to just this. We can encapsulate this decision into a package for easy reuse — even at compile-time:

Here we’ve coded up a ‘template‘d ‘struct‘ with a single ‘enum‘erated constant called ‘value‘. This constant was initialized with our ‘?:‘ decision from before. Now we can use this in the declaration of ‘c‘ from before like so:

Since both ‘N‘ and ‘M‘ are compile-time constants, when they are used by ‘Max_of‘, they can initialize the constant ‘value‘ member. Then this compile-time known constant is used to declare the size of the temporary array. It’s like magic!

Another Approach

Some folks wonder if another approach isn’t better. Some folks would want to just ’overload’ a normal ‘swap‘ ‘template‘ with a ‘char *‘ version that uses ‘strlen‘ and dynamic allocation to do the temporary array sizing like so:

Improving Random long Generation

Another classic way to use ‘template‘ metaprogramming is specialization on the values a deduced type can have. This could be something simple like ‘bool‘ or something more interesting like an ‘enum‘eration. We’ll take it easy and use ‘bool‘. Since ‘bool‘ has only two possible values, we’ll only need the main ‘template‘ and one specialization.

What will we do with it? Let’s endeavor to make random ‘long‘ values more appropriately spread for our system. You see, most systems today use 32-bit ‘rand‘ values but ‘long‘ might be 64-bit. If we try to use the built-in ‘rand‘ to make those kinds of values, it won’t be able to spread out far enough to generate all the desired data.

We’ll start with the code to call our ‘template‘ so the ‘template‘’s code itself will make more sense. A basic ‘rand_range‘ overload would be written like so:

Here I’m testing the ‘RAND_MAX‘ constant from the ‘cstdlib‘ directory against the maximum value of ‘long‘ as defined in ‘climits‘. (I could have used ‘limits‘’ ‘numeric_limits‘ facility, but I chose to go this way since I had to use the old C constant for the random maximum anyway.)

Now we call our ‘template‘ — ‘rand_range_helper‘ explicitly instantiated with the value of this comparison which will be either ‘true‘ or ‘false‘. We just need to put the right code into the main ‘template‘ and the specialization to match the situation we find ourselves in.

In the ‘true‘ case, the random number system has enough range to handle the possible requested bounds. But in the ‘false‘ situation, we’ll have to do something special to make it work. I’m going to make the main ‘template‘ the ‘true‘ version like so:

Here we just generate a random value as we usually would when things are going right. But in the ‘false‘ specialization we’ll do something new:

Notice that to make this a specialization, we must list the special value in the angle brackets after the function name. There is no other context for the compiler to deduce it from.

Here we use bit manipulations as from chapter [defn:bitmanip] to splice together two randomly generated values. (There is a bit of an assumption that the random values are exactly half as wide as the ‘long‘ values on the system. This isn’t without merit, but might bite you in the butt on some platforms. Be careful!)

If we download this sample program and compile/run it, we should see proper values generated. If you see ‘true version called‘ on the screen, then you’ve got a large enough random number generator for your ‘long‘s already. If you see ‘false version called‘ many times, you have had to use the bit-manipulation method to meet the spread. Either way it should work. But in the case of the ‘true‘ version being called, you won’t see any values outside the ‘RAND_MAX‘ boundary whereas for the ‘false‘ version you should.

static template Members

As we learned in section [use:static-inh], ‘static‘ members of a ‘class‘ can sometimes behave oddly under new tools like inheritance or ‘template‘s. So let’s explore how they work under ‘template‘s with this simple test ‘class‘:

As we can see, we’ve got two ‘static‘ data members and we’ve initialized them below the ‘class‘. The ‘double‘ member is initialized once and the ‘template‘d member is specialized for both ‘short‘ and ‘string‘. Now we just need a main to test it all with:

And running the test program, we see that all of the instantiations’ ‘double‘ members were initialized to ‘0.42‘ and that their ‘DataT‘ members were properly initialized and changed. But in the final phase, we see that changing the seemingly shared ‘double‘ member via the ‘string‘ instantiation, only its member changed — the one for the ‘short‘ instantiation remained ‘0.42‘ as before.

Thus we find that each separate instantiation of the ‘class‘ will have its own memory location for each ‘static‘ member — whether that member is of known type or is ‘template‘d itself.

templates and Inheritance

How can ‘template‘s be mixed with inheritance? Just about any way imaginable! Well, at least as many as I could come up with. Let’s start with a basic ‘class‘:

(I’ve placed members inside the ‘class‘ to test instantiation, but it doesn’t affect the results — it works either way. So don’t sweat it. But if you want, you can tweak a full example by just adding a main that declares objects of the various types to these ‘class‘ definitions and adding members to them.)

So you can get a ‘template‘ inheriting from a non-‘template‘. We can also inherit a non-‘template‘ from a ‘template‘:

Here ‘D2‘ is a non-‘template‘ because we’ve explicitly instantiated our ancestor ‘template‘ to ‘D1<long>‘ making it no longer a ‘template‘.

Here the ‘D3‘ ‘class‘ is of the same kind of ‘template‘ as the parent ‘class‘ — both ‘T‘ as specified by the programmer using the ‘template‘. But the child ‘template‘ need not be the same instantiation type as the parent as we see here:

Here ‘D4‘ has a ‘template‘ type but the parent ‘template‘ has been instantiated to ‘D1<double>‘ so that it doesn’t share the same type necessarily.

I’ve also tested this with a main declaring variables of all these types, but that didn’t seem to prove anything beyond just making sure the above compile. Still, that’s a lot of variability in inheritance patterns as you mix in ‘template‘s in different ways!

Wrap Up

Compile-time polymorphism — or ‘template‘s in C++ — is an amazing tool that leads to all sorts of powerful techniques. We learned things from basic function design and using the common language model to making a ‘template‘ out of an entire ‘class‘ their ‘friend‘ships. And we talked about separate compilation of ‘template‘s. We focused some on both overloading and specializing techniques and calls and looked at ‘template‘ parameters that weren’t even data types at all! Then we dabbled in meta-programming techniques and reviewed how the ‘template‘ mechanism interacted with prior things like ‘static‘ members and inheritance.

Data Structures

Algorithm Analysis

Over the years, we’ve developed many algorithms to solve various problems. In fact, we’ve developed many algorithms at times to solve the same problem. This might have happened via independent development or in competition with one another. However it happened, we need a way to compare two algorithms that accomplish the same task to see which is right for us to use in our current application. In steps algorithm analysis.

Just the Basics

There are two main types of analysis: time analysis and space analysis. Time analysis looks at an algorithm with respect to how long it takes them to run on a problem of a certain size. Space analysis looks at an algorithm with respect to how much memory it takes while running on a problem of a certain size — beyond the actual problem data, of course. Space analysis isn’t as hard so we’ll leave that as an exercise and focus on time analysis.

These comparisons are troublesome to perform on hardware since machines run at different speeds and programmers code with more/less efficiency than one another. Not to mention compilers optimize better/worse than one another. Oh, and there may be multiple processes/users taking up CPU time and throwing the timings off. And... Well, let’s just say it would be easier if we could analyze algorithms ’offline’ — mathematically rather than on a particular coded version. Besides, if we analyze them mathematically, it gives us a single analysis to cover any implementation language chosen.

To do a time analysis we count the number of critical operations performed on a problem of a certain ’size’.¹⁰⁰ Exactly what is a critical operation is given to you at this stage and will become increasingly intuitive as your experience grows. This counting gives us a function of problem size that determines how long — in terms of critical operations — a particular algorithm will take to solve the problem.

Best, Worst, Average

Further, when looking at these critical operations, we are interested in the algorithm’s behavior in three different cases: best case, worst case, and average case behaviors. In the best case behavior, everything in the input will be perfect to give the most efficient run of the algorithm. In the worst case scenario, exactly the opposite is true — we’ll look at the least efficient run of the algorithm for the inputs given. And then we consider what will happen in the rest of the input situations — what will happen on average.

To do these analyses, we can break down the different possible inputs and assign them each a probability of occurrence. Then we multiply the critical count for each input by its probability and add them up:

This will give us the proper weighted critical counting function for each of the three cases. What, then, are the probabilities? Well, for best case analysis, we set the probability of the best possible input to 1 (aka 100% likely) and all other inputs’ probabilities to 0 and the sum reflects just the count for the most efficient input. Similarly for a worst case analysis.

For an average case analysis, we typically assume all inputs are equally likely to occur and then we can factor out the

p_i

multiplicand to outside the summation and just add the critical counts and multiply by the common probability once.

BTW, average and worst are the most sought after measurements. Best case situations are deemed rare and inconsequential. Average case happens a lot and worst case can affect your operations deeply and so those are the most impactful to our decisions.

A Quick Example

Let’s take a quick look at linear search, for instance. Typical code for this was found in our discussions from chapter [defn:templ]:

But for analysis purposes, we typically look at a more generic view called pseudocode like this:

Very like the C++ code, but more general and easily implemented in any high-level language. Only the ‘<=‘, subscript, and ‘!=‘ notations from C++ remain and are often substituted with math notations instead. I’ve not done that here for simplicity of typography.

We take as our critical operation here the comparison of the ‘value‘ to the ‘container‘ element. The integer assignments, updates, and comparisons are going to be done at CPU speeds and only comparing the target data might be more expensive and thus affect our timing much more.

A quick thought process will let us know that the best case for this algorithm is when we find the data in the first position and are immediately done. Thus the count for this situation is 2 comparisons and that is our result:

T(length) = 2

in the best case. But this is a little bulky for our math kin so we usually term the ‘length‘ parameter as

n

for the math:

T(n) = 2

. Thus, for the best case, we achieve a constant amount of work no matter the number of data in the ‘container‘.

Similarly, we look at the worst case possibility and find that the most comparisons of target data is done in the case where we either find the target ‘value‘ in the last position or can’t find it in the ‘container‘ at all. Both of these lead to a count of

n+1

comparisons. Setting the probabilities of each to

0.5

(so that they add to 1 as probabilities must), we find that the worst case is:

T(n) = n+1

. So in the worst case, we take a linear amount of work and it will totally depend on the number of data in the ‘container‘.

For the average case, we make each outcome equally likely. But what are the possible outcomes? They are the positions from

0

n

where the last position indicates a not found condition. These take from

1

n+1

comparisons with the

n+1

comparisons being repeated twice. Doing some math, we see that there are

n+1

outcomes and so, if each is equally likely, their probabilities are all

\displaystyle\frac{1}{n+1}

. So our summation looks like this:

You’ll learn lots about how to tackle this kind of thing in discrete mathematics when you get there. Suffice it to say that this boils down to:

\displaystyle T(n) = \frac{n}{2} + 1

. So it takes just over half the list on average to find the data we are looking for. This is still linear, but less so than the worst case due to the slope being shallower.

Sequence versus Nesting

An easy way to estimate the critical count function of an algorithm is to look over its loops. Count out how many times each loop operates. If loops are sequenced — one right after another, add their counts. If loops are nested, multiply their counts. (This is due to the fact that the inner loop will complete all of its iterations for every one of the outer loop’s iterations.) For instance, the following algorithm:

When the interesting bit in the center of the loop(s) changes the loop condition variable(s), things get a little tricky. We’ll talk more about that kind of thing in discrete math.

Oh, Omega, Theta

So now that we have a critical operations counting function, how do we compare it to that of another algorithm? This is done with some mathematical techniques to categorize the algorithms with respect to several standard reference functions. Once we know to which reference function each algorithm compares, we will then know how the two algorithms compare with one another.

The categorization works by placing our algorithm functions into one of three sets:

O

\Omega

, or

\Theta

also known as Big-Oh, Big-Omega, and Big-Theta, respectively. Let’s look at each in turn.

If a function falls into the set Big-Oh with respect to a reference function, then we know that our function’s output grows at worst as rapidly as the reference function. That is, it will grow that rapidly or less rapidly as the size of the problem input grows larger.

If a function falls into the set Big-Omega with respect to a reference function, on the other hand, then we know that our function’s output grows at least as rapidly as the reference function. That is, it will grow that rapidly or even more rapidly as the size of the problem input grows larger.

When a function falls into the set Big-Theta with respect to a reference function, it specifies that the two functions grow at approximately the same rate.

These three sets split the first quadrant of the x-y plane into three areas. We focus on the first quadrant only, of course, because neither time nor problem size can be negative. Here is a diagram of the split:

All of these can be off by a constant multiplier and it is considered fine. Also, we generally don’t see ’lower-order’ terms as significant in the process. Both of these things come about due to limits from calculus so your mileage may vary. What we mean is that

3n^2+4n+8

will still be Big-Theta of

n^2

. In more appropriate notation:

3n^2+4n+8\in\Theta(n^2)

.¹⁰¹

Big-Theta is clearly ideal, but it can be tricky to arrive at such a classification for certain algorithms. Many research hours have been spent making this easier for larger and larger classes of functions, but it doesn’t always apply to your algorithm. When you are in such a situation, you will find that a Big-Oh classification will be much easier.

In fact, this was so for so long, that many of our analyses are in terms of Big-Oh instead of Big-Theta. Many [older] programmers will be more interested in the Big-Oh just because that is what they are used to seeing available. So if you know something is Big-Theta from your current studies, don’t fret that your colleagues keep saying Big-Oh. Just nod and smile and know you are more correct. *smile*¹⁰²

Standard Reference Functions

Function	Notes
$n!$	factorial
$c^n$	exponential
$n^c$	polynomial
$n\log(n)$	$n$ times logarithm of $n$
$n$	linear
$\log(n)$	logarithmic
$c$	constant — often listed as just $1$

The line between exponential and polynomial draws an arbitrary line that delineates the reasonable-speed algorithms — below — and the unacceptable-speed algorithms — above. In fact, this line is often drawn amidst the polynomials between

n^3

and

n^4

. Some would draw it even lower! But it can depend heavily on your hardware and the expected size of problems you are going to encounter.

A Look Back at Linear Search

So where does linear search fall in all of this. Well, from the above, we see that linear search’s average time analysis of

\displaystyle T(n) = \frac{n}{2}+1

falls in

\Theta(n)

as does its worst time analysis of

T(n) = n+1

. At least its best time analysis is

\Theta(1)

Validating an Analysis

Once you have an analysis — and let’s face it, it isn’t very accurate with our estimation techniques — you’ll probably want to verify it. Well, we can take a tip from Calculus (limits and convergence) to do it with implementation timing data.

The idea is to time your algorithm implementation¹⁰³ at several different sizes of input. (Remember to take a median of times at each input size instead of just a single reading!) Then divide each by the analysis function you came up with in your Big-Oh or Big-Theta work evaluated at that input size.

Wrap Up

In this chapter we have discussed algorithm analysis basics. We’ve covered a wide range of terminology related to the task and basic estimation and validation techniques. We looked at classifying our analysis function into categories based on standard reference functions. And we tried it all out on linear search — a classic and yet simple algorithm.

Hopefully it gave you a healthy respect for the purpose of analyzing algorithms and how tricky it can be. But you will learn more in future studies, never fear!

Recursion

Recursion is when a function calls itself — either directly or indirectly! We learned this briefly in the first volume, but only to say don’t do it because it was too tricky. Now we’ll learn how to do it properly.

Why is it so difficult/tricky? Well, think about when do you stop calling yourself? Ah… This is especially true for indirect recursion which often happens accidentally when two functions call one another in a cyclic fashion.

Further, recursion directly correlates to two mathematical techniques: proof by induction and recurrence relations.¹⁰⁴ As in those systems, you will need to identify three things:

Design

This leads us to the question of how to design such an algorithm? It can be viewed as the following generic steps:

The three important things are listed there in the side labels along with a fourth. This fourth is not a classically listed part but is nevertheless a crucial step in the process — putting the recursive call’s answer and the current step’s information together to form the current step’s answer. Without it all else would be for naught!

Factorial

The classic example to start with is calculating a factorial. It looks something like this:

I’ve spaced it out to label all the parts because they are rather tightly woven together in this simple algorithm. The first branch is actually an addition to the general plan — catching issues caused by using a ‘signed‘ type for the argument. Factorials are only defined for non-negative inputs, after all. But the magnitude of the produced factorials grows so quickly that a 32-bit ‘long‘ can only hold up to 12! correctly. So I just went lazily with ‘short‘ for the input type.¹⁰⁵

The next branch is the base case but I enhanced it since the first two factorial values are actually identical. So both 0! and 1! are 1 and why make a separate step for this.

The final branch is the rest of the algorithm. It reduces the size of the problem by subtracting 1 from our input value, passes that off to our recursive call, and then takes that answer and multiplies it by our original input to get our answer.

Visualization

To visualize recursion in action, many people swear by many methods. But the most common seem to be index cards and memory diagrams. I’ve tried to combine those below in a side-by-side fashion with the current index card on the left and the memory diagram of the function call stack on the right. Let’s start with one of the most mundane cases: ‘factorial(1)‘.

You can see that on the left the code has been filled in with 1 everywhere the parameter ‘n‘ would have been. (These ones were colored red to differentiate them from the regular code which also had ones.) We can then trace which branch actually executes and get the return value for the function listed on the right side (alongside the function’s memory area on the call stack).

But there wasn’t any recursion happening with this call, so let’s explore a deeper case: ‘factorial(3)‘.

It at first seems nothing has changed, but note that as we evaluate the branches, we end up in the recursive branch hanging on or waiting for the result of the recursive call. This leads us to call for ‘factorial(2)‘. And now things look more like this:

Now there are two calls on the stack and both have reached their third branches and are waiting on a recursive evaluation! Let’s keep going to the next call — this one to ‘factorial(1)‘. Although it seems like we’ve been here before, this time we are at a call stack depth of three:

Finally, we reach the base case branch and find an answer. We then return the 1 to the previous function and this happens:

Here we receive the 1, multiply it by our waiting 2, and get the return value to be 2 overall. This leads us back to the earlier function call state but with an answer:

Now we multiply our waiting 3 by the returned 2 to get our return value of 6. This goes back to the caller and we are done.

Using index cards, you would also get a feel for the function call stack because you could stack the recursive call cards on top of the previous cards as you went. If you do one deep enough, like ‘factorial(6)‘ or so, you’d really get the feel and vision of the function call stack getting deeper and deeper as you went.

Other Examples

There are many other examples we could do of recursive functions. And we will see more in upcoming chapters. But for now let’s just look at a few more that add something interesting to the picture.

A Mystery Guest

This first function is a bit of a mystery for you. (No reading ahead to find out who done it! *smile*)

What do you think it does? Take a few minutes to puzzle with this for real. Don’t just read on. It’s an important skill to be able to reason out what a piece of poorly documented code does at times.

So, once you’ve got your idea, let’s talk it through. We’ve clearly got a C-string pointer coming in. We know this due to the ‘const char*‘ argument type and the null character (’\0’) test. This value for the first ‘char‘ in the C-string — an empty C-string — leads to a 0 result as a ‘size_t‘. And so does a pointer value of ‘nullptr‘ — a C-string that isn’t just empty but nowhere! Then, for the recursive case, we see that the ‘char‘ that we just saw that wasn’t the ’\0’ is counted in as 1 more than the recursive call.

But that recursive call is weird, right? We pass ‘s+1‘ instead of a subtraction! Thinking about this for a second, though, we realize that adding to a pointer will move it toward the end of the array it points into. So this is advancing the pointer through the C-string toward the eventual null character and that base case!

So, putting it all together, we see that an empty (or non-existent) C-string results in 0 and any non-null ‘char‘ adds 1 to a running total. Sounds like a ‘strlen‘ replacement if there ever were one!

What’s interesting about this example? Well, two things. Firstly, it has an additive reduction step instead of being subtractive or divisive. That’s pretty unusual. Secondly, it gives us a ‘strlen‘ replacement that respects the presence of ‘nullptr‘s in our world. Turns out that the standard library ‘strlen‘ will crash if given a ‘nullptr‘ instead of just reporting 0 like it probably should!

Binary Search

Another classic start is binary search. I thought we’d explore it as a pointer-based version:

So what’s going on here? Why the deep nesting, anyway? It looks like the ‘target‘ not being found and being before the middle are pretty straightforward. But when the ‘target‘ is after the middle, we have an issue.

Returning a ‘size_t‘ offset is hard this time because on the recursive call, the ‘beg‘ parameter would have been mapped from the prior call’s ‘mid+1‘ pointer. Thus, if we just returned the recursive call result directly, it would be offset by over half the container’s length! So, we have to adjust it by adding ‘half_way+1‘ to it. But we can’t just add this because the ‘target‘ might not have been found at all! If we added ‘half_way+1‘ to ‘max_size_t‘, we’d end up returning just ‘half_way‘ due to integer wrap-around! Thus the hideous nesting…

So what have we learned from this example? One thing is that sometimes you can have more than one recursive branch. Like here we have before middle and after middle calls. And another is that you sometimes don’t have additional work to do with the recursive result. Like here we just return the recursive result when the ‘target‘ is potentially before the middle somewhere.

Single versus Multiple

But that last example also brings to mind that sometimes we do have the need to perform multiple recursive calls to get our current answer put together. In binary search, only one of the two calls would execute. But here I’m talking about a situation where we need two or more calls at the same time. A classic example is the Fibonacci numbers sequence. It is defined mathematically as:¹⁰⁶

\begin{array}{lclcl} fib(0) = fib(1) & = & 1\\ fib(n) & = & fib(n-1) + fib(n-2) & , & n > 1\\ \end{array}

As you can see, each number after the second requires two prior numbers to be added together. A visualization might help. Here is a visual depiction for

fib(5)

Depending on the context of the problem, this repeated recursion can be quite expensive in terms of recalculation. This goes against our fourth directive from before: don’t waste previous calculations!!!

Costs

The above discussions lead us to the idea that recursion is sometimes rather costly. So why do we do it? Is it needed? Is it useful?

Turns out recursion is exactly as powerful of a solutions tool as our old friend repetition — aka looping. So we’d need tools to turn recursive definitions and algorithms into looping solutions to remove it. Sometimes this is simple and sometimes not.

In the meantime, there are other tools to remove the redundant calculations involved in some recursions. So we’ll explore those, too.

But first we’ll look at the last big cost in recursion: the limits of the function call stack.

Stack Overflow

As I said, recursion is limited by the size of the function call stack, of course. And there is a limit on this size in all compiled systems and many interpreted systems as well.

To show this, I’ve created a broken recursive function which has no base case to stop it. It will therefore overflow its function call stack after so many calls. To track this, I’ve installed a ‘static‘ ‘size_t‘ variable to count how many times this function has been called. To help cut down the number of ‘calls‘ it takes to experience a stack overflow, there is an ‘array‘ declared on each invocation of the function. The size of this ‘array‘ can be changed easily between runs to make there be 2, 200, or even 20000 ‘double‘s as you desire. You’ll find that the larger the ‘array‘, the less calls it takes to eat up the function call stack.

When compiled on my system, I get a warning about ‘array‘ being unused. That’s by design, so I’m ignoring that for now. When I then run it on my system with an ‘array‘ size of ‘20’000‘, I get 52 runs before the crash (which is called a Segmentation fault on my Linux box). At a size of 200, I get 5185 runs before the crash. So it is approximately proportional. Also note that your mileage may vary as your macOS or ChromeOS or Windows may have a different maximum stack size. The point is that all systems have a maximum and so this is a perpetual issue with recursion and large problem sizes.

We’ll look at tackling this issue in chapter [def:stack-queue] when we talk about the stack data structure.

Tail Recursion

Tail recursion is an optimization performed by many compilers. When they see a recursive function coded in just the right way, they can automatically transform it into a looping form. The basic idea is that the last statement in the function needs to contain the recursive call. Our ‘factorial‘ function from before would be a prime example:

Since the recursive call is in the last statement, many compilers will rewrite this automatically as something akin to:

We won’t study here how this transformation is accomplished, but we’ve now learned to benefit from it.

Full Disclosure

In the interest of completeness, there is a fancier form of tail recursion that many people espouse on the web. It involves transforming the function to have a helper argument which often means having a helper function for the recursion itself like so:

Another approach in C++ would be to supply the ‘ans‘ parameter with a default argument. Then the second function wouldn’t be needed.

This slightly fancier version of tail recursion is another learning curve up, though, so I wouldn’t expect you to master it just yet. *smile*

Memoization

Although it looks like we just misspelled memorization, the title of this section is a real word. It historically comes from a process involving interoffice memos (or memorandums). The use of these memos to remember that something had already been done and therefore didn’t have to be done again — rather we just looked at the answer from the memo that recorded it.

So we memoize results in programming by remembering all of the previous calculations so that they need not be re-computed! In C++ this is easily accomplished with a ‘static‘ local ‘vector‘ or the like:

See how we initialize the ‘vector‘ with our base cases and then in the second branch (the first still being error checking) we look to see if the answer should be inside the ‘vector‘ already. If it is, we just use it right away. If it isn’t, we proceed to the third branch and calculate and ‘insert‘ it.

The ‘insert‘ is worth extra care in reading, though. It puts the new value in front of the ‘end‘ position of the ‘vector‘ — thereby extending the ‘vector‘ by one. Then, ‘insert‘ returns the position just added by ‘iterator‘ which we dereference with the ‘*‘ operation and return.

Another, but slightly trickier way, would be to use ‘push_back‘ followed by subscript:

Note, though, the terrible use of the comma ‘operator‘ here to separate the ‘push_back‘ from the subscript in the last branch. This is terrible code, of course, and really hard to read! We could make it work like so:

But this is a little clunkier and doesn’t use the tools we’ve learned to their utmost.

At any rate, this memoization causes any second call for a value to be more-or-less instantaneous and that makes a huge difference! This is especially true for multiple recursions like the Fibonacci numbers above.

Wrap Up

In this chapter, we learned about the idea of recursion and its components. We learned to design recursive functions from scratch. And we saw recursion in action!

But, we also saw that recursion doesn’t come without costs. Luckily for it, there are many places in upcoming data structure processing where recursion is just so elegant and hard to avoid, we use it regularly. So don’t fret! Your new-found knowledge will not be wasted!

Linked Lists

Why study a new data storage technique? Aren’t arrays working fine for us? Especially the dynamically growing ones stored inside ‘vector‘s and ‘string‘s? Yes and no. There are things arrays do well and things that can be improved — sometimes drastically.

By holding each piece of data separate and linking them together with some sort of link like maybe a pointer, linked lists aim to provide for improved memory management patterns as well as speedier insertion and removal compared to a traditional array. However, they have slowed access patterns due to having to follow pointers from element to element.

But there are other attributes that cause us to think critically about our decision to use a linked list versus an array in a particular application.

Above the double line are the standard things folks talk about with regard to linked lists as compared to arrays. But those below the double line are less well disseminated.

Data Structures Algorithms

Clearly the three vital activities of insertion of new data, removal of old data, and locating data are those above the double line. And as clearly, linked lists win on the first two of these and lag behind on the third. So, just from the standpoint of "what am I doing with the data?" in the application, we have some hint at what we should be choosing.

If we are doing a lot of insertion and/or removal of data, but less search, a linked list is great! But if our goal is to load all the data once and then search it lots and lots, a linked list might not be so ideal.

Memory Issues

In terms of memory usage patterns, the linked list wins by leaving behind all the same sized holes. This will aid in later dynamic lookups for new places to store data. The array, on the other hand, grows in multiples — for amortized growth — and leaves behind holes only useful if shrinking later as well.

But from the point of view of how often do these allocation requests come up, the array seems to win the day. It amortizes the need for allocation down to a constant feel whereas the linked list is clearly linear as every individual piece of data will require a separate allocation.

Finally, in terms of memory waste, we have a cautionary tale. The array is potentially wasting a lot of space depending on how much of the allocated space is utilized to store true data. In the linked list we only allocate space for true data except for some pointers we use to link them together. As long as these pointers are not ridiculously larger than the data themselves, we are wasting relatively little memory, therefore. (Note to self: don’t make linked lists of simple ‘char‘ data! After all, ‘char‘ is only 1 byte in size and pointers are at least ‘4‘. By that token, a list of ‘short‘ is not that good of an idea, either.)

There are two different ways to put this together, of course. We’ll look first at dynamic management of a linked list.

Dynamic

In a dynamic linked list, we’ll be using pointers to link together the data. We call each memory block a node and the two basic fields are the data field and the next pointer that tells us the location of the next piece of data. When we reach the last node, a ‘nullptr‘ is used to signal the situation that there is no next piece of data. Such a design leads naturally to ‘class‘es as below.

In Code

There are actually two approaches to designing a linked list in ‘class‘ form. The classic approach uses two ‘class‘es and another uses just a single ‘class‘. We’ll look at the classic approach now and the other after some experience has brought us to a revelation.

The Classic Approach

The classic approach uses separate ‘class‘es for each of the node and the whole list. The node is quite simple with just two members: the data and the next pointer. The list also holds just two members: a counter for how much data we’ve got in the list — totally optional, but useful and easy to maintain — and a pointer to the first item in the list — the so-called head of the list:

The only real improvement to be made to this design is to nest the node ‘class‘ inside the ‘private‘ area of the list ‘class‘ so that its details are hidden and the application programmer knows that they don’t need to know how the nodes are stored — just that the list holds their data. This is a minor design change, though, so it is left as an exercise. *smile*

In Memory

The diagram to the right shows a linked list of nodes without the initial ‘head‘ pointer. Just imagine a ‘class‘ object off to the left whose ‘head‘ member — with another ‘count‘ member alongside — points to the first node in this list.

But in that diagram, you see nodes with relatively large ‘data‘ paired with a ‘next‘ pointer. And the first of these has its ‘next‘ pointer actually pointing at the second node in the list. The second node’s ‘next‘ pointer is, though, ‘nullptr‘ to show that there is no following item in the list.

Typical Actions

The typical actions mentioned above in the intro were inserting new data, removing old data, and searching for data within the storage structure. Let’s examine each in turn.

Insertion

Insertion into a linked list at a known insertion point is actually quite simple. For instance, let’s say we had the list at right containing 15 in one node and 67 in the next. And let’s say we wanted to insert a 42 between the 15 and the 67. Well, we’d just need to take a few simple steps:

Compare to insertion into an array where all following elements have to be shifted over to make room for the new data and you see the improvement!

This can be coded pretty simply if we have a good constructor for the node ‘class‘ that takes the data and the next pointer (probably defaulted to ‘nullptr‘). It can look like so:

Of course, ‘short‘ is a bit small compared to a pointer on most systems as well. Again, we should really only consider linked lists for larger data items like ‘string‘s and other ‘class‘ objects.

Removal

The process of removing a piece of data is almost as simple. Once you find the data and have a pointer to the node that precedes it, you just follow the next few steps. Let’s say you had the situation from above with 15, 42, and 67 in the list currently and wanted to remove the 42 this time.

Again, compare this to the process for removing data from an array and it is amazingly simple and efficient! Recall that in an array, we’d have to scoot all remaining elements down to overwrite the removed data (and one another).

Searching

The question then becomes, how do you get to the node with 42 or to the position you want to add the 42 in the first place? Let’s tackle these one at a time.

For removal, we need to find the node with the value to remove and the node before that in the list. This requires two pointers and we just need to advance them along until we reach the proper positions. Of course, the pointer for the target node should be initialized to the ‘head‘ of the list so we are looking at nodes actually in the list. But the pointer to the previous node should be initialized to ‘nullptr‘ and should then update as the target pointer does but a step behind:

Just be careful if the ‘target‘ value isn’t found in the list — don’t try to remove it!

Also notice how we advance to each ‘next‘ position with the assignment of the removal pointer to its ‘next‘ pointer. This is the mainstay of linked list processing!

Not too hard, but definitely slow. Of course, if you didn’t know the location of the value in an array, it would take at least a binary search to find it. So we aren’t too far behind. *smile*

But what if the 42 wasn’t in the list yet and we wanted to put it in order. We’d need a loop something like this one:

Here, again, caution should be taken to make sure the insertion will happen inside the list or at its ‘head‘. If we are inserting in front of the ‘head‘, we need to update the ‘head‘, too!

Another thing we can do that is like searching is just visiting every node in the list and doing something — maybe we print the ‘data‘. That is even simpler as we don’t need to track the previous node’s location:

Again, a simple ‘next‘ pointer update to advance in the list from node to node until the ‘nullptr‘ is reached.

Recursion and Linked Lists

In staring at the linked list diagrams from earlier (section [list:mem]), we notice that each node of the linked list has a ‘next‘ pointer that points, effectively, at a slightly shorter linked list of nodes — a sublist of sorts. This lends itself naturally to the thought of recursively processing the linked list. We can even come up with elegant solutions to some problems like the printing problem in the last section.

Printing Forward

To print a linked list from ‘head‘ to ‘nullptr‘, we can utilize a recursive routine like this:

It’s a nice ‘private‘ method for the node ‘class‘, in fact. It would probably have a ‘public‘ interface method associated that had no ‘head‘ argument.

Printing Backward

And what if we wanted the content backwards instead of forwards? Maybe they’ve asked to see our sorted list sorted the other way round? We wouldn’t re-sort it, of course: just walk it backwards! This can be done effectively by just reversing the order of the recursive call and the ‘cout‘:

Now it walks to the end of the list before it starts printing and prints on each return from a recursive call — placing it back at a previous node.

But A Long List...

Yes, if a list is really long, we’ll likely run out of [function call] stack space and overflow. So many want to see how to print the list in reverse without recursion. Turns out we need a little help with this. Help from another data structure known as — of all things — a stack! We’ll come back to this issue in that chapter ([def:stack-queue]).

A Different Approach to Design

The recursive processing above leads to a natural second approach to linked list design. In this approach, we utilize the sublist idea from above. To emphasize sublist relationship, we merge the two ‘class‘es of the above design into a single one. This loses the ‘head‘ pointer and the ‘count‘ of held elements. The ‘next‘ acts as the ‘head‘ of every sublist. And storing the ‘count‘ in each node would be highly wasteful and terribly inefficient to keep maintained.

But it begs the question of who holds the true ‘head‘ of the list?! Turns out it doesn’t matter. Anyone holding a node holds the ‘head‘ of a linked list. If you want the uber-‘head‘, talk to the main function! *smile*

A Clear Design Winner?

On one hand, the classical approach has two ‘class‘es to maintain — even if one is nested inside the other versus the recursive approach which has only the one ‘class‘ to work with. But recursive thinking is a bit of a pain for many programmers and so that approach comes with some mental overhead as well.

Also, the classical approach has a clear delineation of duties as to which ‘class‘ is responsible for what. The recursive approach makes this less clear-cut and, again, can cause some programmers mental anguish.

Overall, I’d say there isn’t a clear winner between these two approaches to linked list ‘class‘ design. But keep both in mind just in case they come in handy someday!

Static

The static in the title of this section is not referring to ‘static‘ ‘class‘ members or ‘static‘ local variables or any of that. It is simply referring to a non-dynamic situation. That is, we are going to make a linked list without pointers. We’ll store it in a fixed-sized array for good measure. That way this approach will allow us some of the improvements of the earlier linked list but in non-dynamic form for when there just isn’t enough memory to do dynamic allocation.¹⁰⁷

How can we get by putting our data in a fixed-sized array and still get the benefits of linking? Won’t we just have to shift everything around like normal? Not like this! We’ll keep some sort of link attached to each piece of data in the nodes. And each array element will be a whole node.¹⁰⁸

Fixing the Pointers

So how do we replace the pointers if our data are stored in an array? Well, with ‘size_t‘ position markers, of course! So each piece of data is paired with a ‘size_t‘ which tells what position holds the next piece of data. In fact, we call this member of the node ‘class‘ ‘next‘ as well!

And what of the ‘nullptr‘ ending? That is taken care of with a special flag value that can’t be a position in the array: the maximum ‘size_t‘ value. Since ‘size_t‘ is ‘unsigned‘, this can be gotten in one of two ways. Old-style programmers would cast a ‘-1‘ to ‘size_t‘ knowing it will wrap around to the largest value. But modern designers will use the ‘numeric_limits‘ ‘template‘ — yep, that’s a ‘template‘, too! — to get the ‘max‘ value.

Let’s say we had the same data as in our earlier linked list: 15 and 67. It would look like this:

And the ‘head‘ member of the list would be 0, of course. (Note that I’m being lazy and coding the end-of-list marker as ‘-1‘ even though it is old-fashioned. This is because the actual value is unknown to us until it is compiled so I’m not sure if it is 32-bit or 64-bit on any given system.)

Inserting New Values

So how would we get the 42 in there as we did before? Well, once we know it should go between 15 and 67, we simply add it to the ‘count‘ position of the array and fix up the ‘next‘ links. We don’t have to insert it between the two current pieces of data because they have links to tell you what order things are really in. So we simply overwrite the ‘0,-1‘ in position 2 of the array with ‘42,1‘ and update 15’s ‘next‘ link to be 2. (Then increment ‘count‘, of course.)

As always, watch for having to update ‘head‘ when the new data should go before our current ‘head‘ (15 here)!

An Aside

BTW, this technique can also be used to organize large data with sorting that doesn’t move the actual data around. We simply sort the links into the right order and leave the data items where they fell originally. Note how the data in our array are not in order, but the linked list of them actually is.

Removing Old Values

To remove an old value — like that 42 there, we just take the ‘next‘ links and fix them and then decrement the ‘count‘. So, once we know that 42 came from 15, we change 15’s ‘next‘ to be that of 42 and then set 42’s ‘next‘ to ‘-1‘ for safety. Decrementing the ‘count‘ completes the process:

We don’t actually have to remove the 42 unless we want. (Just like ‘delete‘’ing that node earlier didn’t change the data in that memory location from the heap…)

Free versus Taken Spots

What if things were more extreme than this? What if the list looked like so before 42 was removed:

Further, ‘count‘ is 4 so we’ll be putting the next insertion into slot 4 in the array. That would overwrite the 9 from the second list position! Not cool!

We need a way to track what positions are actually not in use instead of just placing the new data in the same position as the value of ‘count‘. Thus is born the free list. It is a second linked list interwoven amongst the slots with the data linked list. It has its own head but we call it ‘free‘ or ‘free_head‘ instead to distinguish it from the actual data ‘head‘. In this scenario, we’ll know where the new piece of data can be laid down with ease.

Revisiting Insertion

After removing 42 above, we’d set the ‘free‘ head from ‘-1‘ as it was when the list was just full to 2 where the 42 had resided. Then, when we go to insert the new data wherever it may land in the data list, it will be put into ‘array[free]‘ and ‘free‘ will be updated to whatever that slot’s ‘next‘ used to be. That is, we use the head of the free list to tell us where to put any new piece of data and just move it along. Thus, when we create the empty list to start the above process, it would look like so:

And ‘free‘ would be 0 and ‘head‘ would be ‘-1‘. As 15 arrives, we put it in position 0 since that position is free and change ‘free‘ to be that slot’s ‘next‘ which was 1. Also we change the ‘next‘ there to what ‘head‘ was since this is an insertion at the start of the data list. Then we update ‘head‘ to position 0, as well. Finally, we update ‘count‘, as well:

But now the ‘free‘ list starts at 2 and we know to put the next piece of data that arrives there. Let’s say it was -2. It would look like this after insertion:

And ‘free‘ would be ‘-1‘! (And ‘head‘ would be 2, but we already covered that idea.)

Revisiting Removal

Well, how did 42 get removed in the first place? It would have looked like so beforehand:

And we found that 15’s ‘next‘ links to the 42, so we change its ‘next‘ to be 1 like 42’s is. And then we change 42’s ‘next‘ to what ‘free‘ currently is since it’s going in front of the ‘free‘ list now. Finally, change ‘free‘ to be 2 where 42 is located. (Yes, yes, and decrement ‘count‘.) This leaves us with this array:

	‘top=0‘	‘top=-1‘
‘push‘	store then increment	increment then store
‘pop‘	decrement then copy	copy then decrement
‘get_top‘	‘top-1‘	‘top‘
‘empty‘	‘top==0‘	‘top==-1‘
‘full‘	‘top==size‘	‘top+1==size‘

	‘tail=head=0‘
‘enque‘	store then increment ‘tail‘
‘deque‘	copy then increment ‘head‘
‘peek_front‘	‘head‘
‘empty‘	‘tail==head‘
‘full‘	‘(tail+1)‘%‘size==head‘

User Types	Buffer Contents
Hello	‘’H’‘, ‘’e’‘, ‘’l’‘, ‘’l’‘, ‘’o’‘, ’\n’
4213	‘’4’‘, ‘’2’‘, ‘’1’‘, ‘’3’‘, ’\n’

Next ‘char‘	Translated	‘accumulator‘ Shifted	Accumulation
‘’4’‘	‘4‘	‘0‘	‘4‘
‘’2’‘	‘2‘	‘40‘	‘42‘
‘’1’‘	‘1‘	‘420‘	‘421‘
‘’3’‘	‘3‘	‘4210‘	‘4213‘


Mode
Member
Access	‘public‘	‘protected‘	‘private‘
‘public‘	‘public‘	‘protected‘	‘private‘*
‘protected‘	‘protected‘	‘protected‘	‘private‘*
‘private‘	‘private‘	‘private‘	‘private‘*

Preface

Reader Background

Styles

Typography

Exercises

Code Availability

Self-Study

Viewing

Coming Soon!

Copyright and License

Acknowledgements

Storage

C-Style Arrays and Strings

Basics of Arrays

Versus vectors

Declaration

Initialization

A Brief Example

Lots of Loops

General

If Full

Input Is Special

Passing Arrays to Functions

Note on sizeof

Array Subrange Processing

A Comparative Example

Basics of C-Strings

C-String Initialization

C-String Library Functions

Protecting from Overrun

Case-Insensitive Comparison?

Output of C-Strings

Input of C-Strings

C-String Input with Embedded Spacing

Standard C-String Processing Loop

Arrays as class Members

2D Arrays

Declaration

Initialization

Arrays of C-Strings

Sub-Arrays

Passing to Functions

But Why?

An Example

Arrays Beyond 2D

Wrap Up

Memory Management

Pointers

Versus References

Declaration

But isn’t the Asterisk..?

Terminology

To Point, but to where?

Basic Pointer Operations

Passing to Functions

C-Style Referencing

Pointers to class Objects

Arrays Revisited

Four Types of Constant

Pointer Math

More From C-Strings

Weirdness!

Iterators

Essential Usage

Intermediate Usage

Odds and Ends

Invalidation

For the Adventurous

Dynamic Memory

Allocation and Deallocation

try/catch vs nothrow/nullptr

Use of Dynamic Memory

Cleaning Up with delete

nullptr Assignment After delete

Reallocation of a Dynamic Array

Chunks vs Multipliers

Growth vs Shrinkage

Dynamic class Members

Where the Parts Go

Destructors