TDD and Performance

Acceptable performance is an important requirement in any system. It’s also likely that many of you are programming in C++ expressly because of its potential for high performance. Throughout the book I’ve dismissed concerns about performance and directed you to read this section, but that’s not because performance isn’t important. It is.

Most of what falls under the umbrella of performance testing is neither TDD nor unit testing. This section presents a test-focused strategy for performance optimization and then discusses how unit-level testing can help you execute that strategy. It also discusses how design and performance relate, emphasizing that you should seek optimal design before you attempt to address performance concerns.

Performance considerations are generally nonfunctional requirements. The system needs to respond within half a second to user interaction under a load of up to 10,000 concurrent users. The system needs to process a batch within a four-hour overnight window. And so on. These are integration-level concerns (see Unit Tests, Integration Tests, and Acceptance Tests) that require an integrated and deployed system. You can’t test these concerns with tests that focus on isolated pieces of logic.

From a (unit-level) test-driven standpoint, you will almost never have the knowledge up front to be able to say, for example, “This function must respond in five microseconds or less.” Determining that need would require that you know how the performance characteristics of the function relate to an end-to-end behavioral need. Even if you could derive a specific micro-level performance specification, you’d find it difficult to determine a consistent measurement that would support all your platforms (development, integration, production, and so on) equally well, given variant machine characteristics.

A Performance Optimization Test Strategy

The general strategy for performance optimization is as follows:

  • Using a test framework, build and execute driver code that baselines the existing performance of your system for the underperforming case.

  • Ensure you have tests that demonstrate the proper behavior of the feature functionality—it’s fairly easy to break things when optimizing a system.

  • Change the driver code into a test that specifies the current performance baseline. This baseline test should fail if an attempted optimization degrades performance.

  • Add a second, goal test that executes the same functionality but passes only if the desired new performance is met. (This might be a second assertion in the baseline test. A sketch of both tests appears after this list.)

  • Determine the performance bottleneck.

  • Attempt to optimize code in the area of the bottleneck. You should be able to discern whether an algorithmic-level optimization is possible. (For example, replace an O(n²) algorithm with one that’s O(n log n)). If so, start there. Otherwise, start with optimizations that retain high-quality design and expressiveness. Often, suboptimal use of C++ can be a culprit (for example, how you pass arguments, use assignment, construct new objects, and make misguided attempts to do better than STL containers and/or Boost).

  • Ensure your unit and acceptance tests still pass.

  • Run the baseline test; if it fails (in other words, if the new performance is worse), discard the modifications and try again.

  • Run the goal test; if it passes, ship it!

  • Otherwise, you might be able to solve the performance challenge by identifying the next-biggest bottleneck, attempting to improve its performance, and so on. It’s also possible that your optimization attempt was simply an inappropriate choice. Either way, note the relative improvement it produced and shelve the code changes. Seek another optimization and repeat, checking each time whether the accumulated optimizations add up to the performance goal.

    If you do incrementally incorporate an optimization, ensure you update the criteria in the baseline test.
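
Here’s a rough sketch of what the baseline and goal tests might look like. It assumes a CppUTest-style test, a hypothetical respondToQuery() entry point into the underperforming behavior, a hypothetical millisecondsToRun() helper, and made-up thresholds (a measured 900ms baseline and the half-second goal mentioned earlier); none of these names or numbers come from the GeoServer example.

#include "CppUTest/TestHarness.h"
#include <chrono>

// Hypothetical: stands in for driver code that exercises the real, integrated behavior.
void respondToQuery() { /* exercise the underperforming feature */ }

// Times a single run of the supplied behavior, in milliseconds.
long long millisecondsToRun(void (*behavior)()) {
   auto start = std::chrono::steady_clock::now();
   behavior();
   auto stop = std::chrono::steady_clock::now();
   return std::chrono::duration_cast<std::chrono::milliseconds>(stop - start).count();
}

TEST_GROUP(QueryPerformance) {};

// Baseline test: fails if an attempted optimization makes performance worse.
TEST(QueryPerformance, DoesNotDegradeBelowMeasuredBaseline) {
   CHECK(millisecondsToRun(respondToQuery) <= 900);
}

// Goal test: passes only once the desired new performance is met.
TEST(QueryPerformance, MeetsHalfSecondGoal) {
   CHECK(millisecondsToRun(respondToQuery) <= 500);
}

Remember that these remain end-to-end concerns: the numbers mean something only when the tests run against an integrated, deployed system on production-like hardware.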

Here are some extremely important themes as you attempt to optimize:

  • Run the performance tests on a machine with the same characteristics as the production target. Results from tests run elsewhere may not accurately depict the impact of optimizations in production, making such optimizations potentially a waste of time or worse.

  • Don’t assume anything. Your notions as to what should be optimized are often wrong. Always measure before and after.

  • Get the design right first, and only then introduce optimizations. Introduce optimizations that sacrifice maintainable design and readability only if you absolutely must. Get the design right first!

Relative Unit-Level Performance Tests

Unit-level performance tests can help you along the way, but you can’t use them to determine whether you’ve met the performance goal. Instead, you’ll use them as tools to help you probe at pieces of the puzzle.

In this section, you’ll learn a simple technique for obtaining the average execution time of a tested function. The time will only have meaning as it relates to optimization attempts against that same function.

In the rare case where you are able to define a unit-level need up front, you can test-drive that need using the Relative Unit-Level Performance Tests (RUPTs, I’ll call them). Otherwise, you’ll be in the realm of Test-After Development (TAD).

The steps for a RUPT are much as you would expect.

  1. Create a loop that repeatedly executes the behavior you want to time, perhaps 50,000 times. Looping should smooth out any aberrations due to startup overhead or clock resolution. You’ll want to make sure the compiler does not optimize away any of the behavior you want to time. (A skeleton of steps 1 through 3 appears after this list.)

  2. Just prior to the loop, capture the current time in a variable called start.

  3. Just after the code that executes the behavior, capture the current time in stop. Your relative measurement is the elapsed time of stop - start.

  4. Run the RUPT and note the elapsed time. Seek an elapsed time of a few seconds, and alter the number of loop iterations if needed.

  5. Increase the number of iterations by an order of magnitude. Run the test and ensure that the elapsed time similarly increases. If not, your RUPT cannot accurately characterize your optimization attempt. Determine the reason and fix it.

  6. Run the RUPT a few more times. If the elapsed times vary wildly, you do not have a valid RUPT. Determine the reason and fix it. Otherwise, note the average.

  7. Attempt to optimize the code.

  8. Run the RUPT several times and note the average.

  9. If the improvement was considerable, run your performance and goal baselines. Otherwise, discard the change.
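
Here’s a minimal skeleton of steps 1 through 3, assuming a CppUTest-style test and a made-up computeSomething() function standing in for the behavior you want to time; the volatile sink is one way to keep the compiler from discarding the loop body.

#include "CppUTest/TestHarness.h"
#include <chrono>
#include <iostream>

// Hypothetical behavior to probe; substitute the function you care about.
int computeSomething(unsigned int i) { return static_cast<int>(i * i); }

TEST_GROUP(RelativePerformance) {};

TEST(RelativePerformance, ComputeSomething) {
   const unsigned int iterations{50000};
   volatile int sink{0}; // keeps the loop body from being optimized away

   auto start = std::chrono::system_clock::now();   // step 2
   for (unsigned int i{0}; i < iterations; i++)     // step 1
      sink += computeSomething(i);
   auto stop = std::chrono::system_clock::now();    // step 3

   auto elapsed =
      std::chrono::duration_cast<std::chrono::milliseconds>(stop - start);
   std::cout << std::endl << "elapsed time = " << elapsed.count() << "ms" << std::endl;
}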

The RUPTs are probes that you should discard or relegate to a slush pile of meaningless code that you might plunder later. By no means should they appear in your production unit test suite.

Seeking to Optimize GeoServer Code

Let’s work through a short example of creating a RUPT.

c9/24/GeoServerTest.cpp

TEST(AGeoServer_Performance, LocationOf) {
   const unsigned int lots{50000};
   addUsersAt(lots, Location{aUserLocation.go(TenMeters, West)});

   TestTimer t;
   for (unsigned int i{0}; i < lots; i++)
      server.locationOf(userName(i));
}

The TestTimer class is a simple class that spits out a performance measurement on the console once it goes out of scope. Refer to the following section (The TestTimer Class) for its implementation.

Here’s the code we’re testing. Both locationOf and isTracking execute a find call. Is this an unacceptable performance sink?

c9/24/GeoServer.cpp

bool GeoServer::isTracking(const string& user) const {
   return find(user) != locations_.end();
}

Location GeoServer::locationOf(const string& user) const {
   if (!isTracking(user)) return Location{}; // TODO performance cost?
   return find(user)->second;
}

We set the number of iterations to 50,000 and run the test a few times. We note an average time (50ms on my machine).

We bump the number of iterations up to 500,000 and run the tests another few times, again noting the average. We expect to see the average correspondingly increase roughly by an order of magnitude, and it does. The average of three runs clocks in at 574ms. If it hadn’t increased, we would have needed to figure out how to prevent the C++ compiler from cleverly optimizing the operations executed in the loop. (Under gcc, you can add an assembler instruction: asm("");.)
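
For example, you might drop it into the timed loop:

for (unsigned int i{0}; i < lots; i++) {
   server.locationOf(userName(i));
   asm(""); // gcc: an empty assembler statement the optimizer won't remove
}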

We change the code to eliminate the second call to the find function.

c9/25/GeoServer.cpp

Location GeoServer::locationOf(const string& user) const {
   // optimized
   auto it = find(user);
   if (it == locations_.end()) return Location{};
   return it->second;
}

Yes, a comment is prudent (you might provide a bit more explanation, though). Programmers in a good TDD shop should always be seeking to improve the quality of the code. Without a comment to indicate why you coded it that way, a good programmer is likely to clean up a messier, performance-optimized chunk of code. And since you don’t typically run performance-related acceptance tests continually, it may be difficult to discern the code change that caused a goal performance test to fail.

We rerun the performance tests and note a new average of 488ms, which is 86ms faster than before. The math says that introducing the redundant call to find incurs a cost of almost 18 percent performance degradation per request. It sounds substantial and may well be, but remember that we’re running half a million requests. Per request, we’re talking 0.17 microseconds difference.

These are facts about the changes in behavior from a performance perspective. While they provide only relative, isolated meaning, they’re not suppositions. We know that our attempt at code optimization was successful: it improved the execution time of this small unit of code. That’s more than we knew before. It’s also more than most developers know after they attempt to optimize a solution.

The question becomes, is it useful? At this point, we would run our baseline and goal performance tests and determine whether the optimization is necessary. If not, it serves only to make the code more difficult, and we happily discard it.

The cost of retaining the optimization appears minimal. The locationOf function increases by only a line of code, to three simple lines. Many useful optimizations create code that’s considerably harder to decipher and maintain.

Yet there’s another potential optimization route that would be easy to apply, given that we have a clean design. In a GeoServer that tracks tens or hundreds of thousands of users, a user cache might make a lot more sense. During any given time period, the server will likely be asked the locations of a much smaller subset of users, and many requests will duplicate a prior request. Currently, the lookups into the locations_ map all funnel through the accessor function find. We could change the code in find to use a cache. Client code would retain its current, expressive design. In contrast, introducing a cache in a class where code always directly accesses member variables can represent a prolonged effort.
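
Here’s a rough sketch of what that might look like, assuming a hypothetical mutable cache_ member declared in GeoServer.h (an unordered_map from user names to previously located iterators); it’s not part of the GeoServer code, and cached entries would need to be discarded whenever locations_ changes.

std::unordered_map<std::string, Location>::const_iterator
GeoServer::find(const std::string& user) const {
   auto cached = cache_.find(user);                // hypothetical cache_ member
   if (cached != cache_.end()) return cached->second;

   auto it = locations_.find(user);
   if (it != locations_.end()) cache_[user] = it;  // remember the lookup
   return it;
}

Client code (locationOf, isTracking, and anything else that calls find) would not change at all.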

A clean design helps with performance optimization in a couple of ways. First, it’s easier to pinpoint performance problems using a profiler when you have small functions. Second, small classes and functions increase your potential to consider creative optimizations. They also increase the ease of making the changes once you’ve identified the problem. In contrast, imagine a 500-line function that hides a performance bottleneck. It will take you longer both to determine the problem and to resolve it. (And a 500-line function will almost never have sufficient tests to give you the confidence to make appropriate optimization changes.)

The TestTimer Class

The TestTimer class is a hastily coded, simple tool that you can place at any appropriate point in your test. It prints the elapsed time when it goes out of scope, as well as explanatory text passed to the constructor. Using the no-arg constructor results in the name of the current test being printed.

c9/25/TestTimer.h

#ifndef TestTimer_h
#define TestTimer_h

#include <string>
#include <chrono>

struct TestTimer {
   TestTimer();
   TestTimer(const std::string& text);
   virtual ~TestTimer();

   std::chrono::time_point<std::chrono::system_clock> Start;
   std::chrono::time_point<std::chrono::system_clock> Stop;
   std::chrono::microseconds Elapsed;
   std::string Text;
};

#endif

c9/25/TestTimer.cpp

#include "TestTimer.h"
#include "CppUTest/Utest.h"

#include <iostream>

using namespace std;

TestTimer::TestTimer()
   : TestTimer(UtestShell::getCurrent()->getName().asCharString()) {}

TestTimer::TestTimer(const string& text)
   : Start{chrono::system_clock::now()}
   , Text{text} {}

TestTimer::~TestTimer() {
   Stop = chrono::system_clock::now();
   Elapsed = chrono::duration_cast<chrono::microseconds>(Stop - Start);
   cout << endl <<
      Text << " elapsed time = " << Elapsed.count() * 0.001 << "ms" << endl;
}

You can and should enhance the timer class to suit your needs. You might want to make it thread-safe (it is not). You might prefer using a different platform-specific timing API, or your system might provide a separate implementation for C++11’s high-resolution clock. You might be able to measure using a smaller duration (nanoseconds!), or perhaps you need to use larger durations. Or you might choose to simply insert the three or four lines required directly into your tests, though that seems like unnecessary effort.
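
For example, here’s a rough sketch of a steady-clock, nanosecond-reporting variant (a hypothetical stand-alone class, not a drop-in replacement for TestTimer):

#include <chrono>
#include <iostream>
#include <string>

struct SteadyTestTimer {
   explicit SteadyTestTimer(const std::string& text = "timer")
      : Start{std::chrono::steady_clock::now()}, Text{text} {}

   ~SteadyTestTimer() {
      auto stop = std::chrono::steady_clock::now();
      auto elapsed =
         std::chrono::duration_cast<std::chrono::nanoseconds>(stop - Start);
      std::cout << std::endl
                << Text << " elapsed time = " << elapsed.count() << "ns" << std::endl;
   }

   std::chrono::time_point<std::chrono::steady_clock> Start;
   std::string Text;
};

A steady clock also sidesteps surprises if the system clock gets adjusted in the middle of a measurement.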

Performance and Small Functions

C++ programmers burn billions of anxiety calories annually over the performance cost of a member function call. For that reason, many programmers resist the notion of creating small functions and classes. “I don’t want to extract that code into a separate function; it might represent a performance problem.” Yet compilers today are very smart beasts, able to optimize code in many cases better than you ever could by hand.

Rather than base your resistance to small functions on old wives’ tales, consider real data.

c9/26/GeoServer.cpp

Location GeoServer::locationOf(const string& user) const {
   // optimized
   auto it = locations_.find(user);
   if (it == locations_.end()) return Location{};
   return it->second;
}

Before manually inlining the find function, the average execution time was 488ms. After inlining, the average execution time was 476ms, a statistically insignificant difference across a half-million executions.

Was the find function inlined by the compiler in the first place? If we force the issue and tell gcc to not inline the function, as follows, there is no substantial difference in execution time (474ms):

c9/27/GeoServer.h

std::unordered_map<std::string, Location>::const_iterator
   find(const std::string& user) const __attribute__((noinline));

One other interesting aspect of small functions is that C++ compilers are more likely to be able to inline them in the first place. With larger functions, you actually decrease the compiler’s chances of optimizing the code.

The reality is that not extracting code to smaller methods represents poor design, and it does virtually nothing to improve the performance of your application. Performance experts already know this.

And don’t trust me; I could well be an old wife. Trust your own measurements.

Recommendations

Many thoughts on performance optimization are based on folklore and the experience of others. Don’t trust the experiences of others. Then again, since most everyone is saying the same thing, it’s probably worth listening to what they consistently say. And I’ll add my experiences to the mix.

My Experiences with Optimization

As a programmer, I’ve been involved in a number of optimization attempts. As a consultant, I’ve worked with several programmers for whom optimization was their primary job (one on a system needing to consistently process 20,000+ transactions per second). In both realms, I’ve experienced and witnessed successes that stemmed from a disciplined approach similar to the previous recommendations. I’ve also seen a spectacular failure as one company hired high-priced consultants to desperately attempt to fix a live, production-scaling challenge by stabbing haphazardly at optimization attempts.

A few key elements appear to provide the best defense against performance challenges.

  • A solid architecture, where the word architecture means the layout of all those things that will be very difficult to change once the system is in place. Specifically, where are the communication points between components (distributed across clients and servers), and how does the architecture support scaling without requiring code changes (in other words, by beefing up hardware)?

  • A solid but flexible design with clean code, complete with tests that provide the flexibility to make confident, dramatic changes when needed.

  • Performance goal tests from day one that specify future scaling expectations. If you expect to deploy your application initially to a dozen users and then ultimately to a hundred, you want to know as soon as possible when new code puts the scaling target at risk.

As far as code-level optimization goes, I have yet to see evidence, or hear it from a performance expert, that refutes the classic advice of getting the design right before attempting optimization and then optimizing only if absolutely necessary.

I’ve witnessed many wrong-headed optimization attempts. In some cases, they were based on misguided or downright false folklore (sometimes even based on another language!). In other cases, the performance recommendations were once true, but later compiler and runtime improvements rendered them obsolete.

Some code-level optimizations do fall in the category of “free.” For example, passing by reference in C++ is usually more efficient than passing by value, and it costs nothing in expressiveness. Where such optimizations do not degrade readability or ease of maintenance, go for ’em. Otherwise, save the optimization attempts for later, much later.
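
A trivial illustration, using made-up functions rather than GeoServer code:

#include <string>

// Copies the string argument on every call.
bool isKnownUserByValue(std::string user) { return !user.empty(); }

// Usually cheaper (no copy is made) and no less expressive.
bool isKnownUser(const std::string& user) { return !user.empty(); }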
