Chapter 3. Scripting with Ruby

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 3. Scripting with Ruby

This chapter covers

Reading and writing CSV data
Generating daily reports
Producing a comparison report
Generating customized printable reports

One of the greatest assets of Ruby is its ability to scale small. As a software developer, you have learned about the difficulties of scaling large: applications with millions of lines of code, handling terabytes of data and serving billions of hits a day, taking advantage of multicore architectures and server farms. The rest of this book is about working with big services: messaging, web services, databases, and so on. Scaling up and scaling out are tough challenges that excite the imagination, but they’re not all that software can do for you. We thought it would be prudent to take a chapter and show you how Ruby can help you out doing everyday “stuff.”

In our daily lives, there are a lot of small problems waiting to be solved. Some solutions take a minute to develop and may fit in a single line, yet are just as important as much larger, attention-grabbing problems. Consider a commit hook that runs the test suite before allowing a commit, attaches the commit message to an open ticket, and sends a notification to the development team. It’s not rocket science, nor an opportunity for the next billion-dollar company, but it makes life easier. Simple solutions like that automate repetitive tasks, double-check what we’re doing, and smooth our workflow so we can get more done without spending the night at the office.

3.1. Scripting with Ruby

If you talk to developers proficient with scripting languages—command-line languages like bash, or programming languages like Ruby, Perl, and Python—you’ll learn that their environment is full of little scripts. Each script on its own does very little, but in combination they cut down on unnecessary workload and help us concentrate on the truly creative tasks.

In practice, scripts come in all shapes and forms. You can write a trivial Greasemonkey script to change the color scheme of a website or remove intrusive ads and annoying sidebar widgets. A lot of build systems use fairly complex scripts to build complex applications targeting multiple platforms. System administrators rely on an arsenal of scripts that run in the background, on a schedule or from the command line, doing everything from deploying to monitoring to controlling and alerting.

In this chapter, we’ll show you some simple, even throw-away, scripts. Not because we have limited space, but because we think writing such scripts is a good habit to pick up. There’s always room in your life for little scripts that do dumb work so you don’t have to.

To illustrate, we’re going to pick a fun project for our very first solution. We’re going to inflict our recent travel photos on all our friends. Instead of inviting them over to watch a slide show (we know they’d have an “emergency” at home that night), we’ll serve those photos one by one using Twitter.

Problem

You need to take a collection of high-resolution images, and scale them down to show fully in the browser, looking like Polaroid pictures. Next, you’ll make them available on the web for everyone to see (using our Amazon S3 account), and announce each image to the world using your Twitter account.

Solution

We’re going to write two different scripts:

The first will scale down and add a Polaroid effect to our images, upload them to our Amazon S3 account, and write list of URLs (one for each image) so we can check the images before unleashing them on the world.
The second will pick one image and post a link for that image to our Twitter account. We’re going to schedule this script to run once an hour, Twittering a different image each time it runs.

We’re going to need two Ruby gems for that, so let’s start by installing them:

$ gem install aws-s3 $ gem install twitter

We’ll start with the first script, shown in listing 3.1.

Listing 3.1. Turn our photos into smaller, Polaroid-like images, and upload to Amazon S3

First, we need images that are just big enough to show fully in a web browser. Our photos are high resolution, so we need to scale them down to 800 pixels wide. We also want to make them look like Polaroid pictures, with a white border, and tilt them randomly between -10 and 10 degrees. We’re going to use the convert program to do that transformation .

We connect to our Amazon S3 account , and since Amazon S3 stores files in buckets, we start by making sure we have a bucket ready . Next, we upload these images one by one (for simplicity, the originals are JPEG and the Polaroids are PNG), and tell Amazon to make them publicly readable . We get the URL for each of these images and store them in a file called images .

To use the script, we first need to supply our Amazon Web Services account credentials. We’ll put those in a separate file, so we can reuse them with different scripts, and so we can share this script without sharing our account information. You’ll need a file called .amazonws in your home directory that looks like this:

key_id: <your AWS access key id> key_secret: <your AWS secret access key>

This is the command to run the script and tell it which bucket to use:

$ ruby upload_images.rb ruby_in_practice

Now let’s turn to the second script, shown in listing 3.2.

Listing 3.2. Twitter a link to an image

We start by reading all the URLs from the images file , but we’re only going to Twitter the first image, so we’ll create a message for only one image . Next, we connect to our Twitter account and post that message . Finally, we’ll write the remaining URLs back to the file , so we can process the next URL when the script runs again.

As before, we’re going to use a configuration file that lives in the home directory. The .twitter file looks like this:

email: <email> password: <password>

We’re not going to run this script from the command line. Instead, we’ll schedule it to run once an hour using cron:

$ echo "0 * * * * $(pwd)/twitter_image.rb "Check out this photo "" > jobs $ crontab jobs

Discussion

There are several libraries we could use to resize images and apply interesting transformations. The first that comes to mind is RMagick. RMagick is a Ruby wrapper around the excellent ImageMagick processing library. It has everything we need to create fun images for our project, yet we decided not to use it. The trick to scaling down is always looking for the simplest, shortest solution to the problem.

A true script ninja will get the job done with the minimum amount of effort. Not that RMagick is all that complicated, but we decided to do something else instead. We typed convert --help on the command line, scanned the help page for the three settings we wanted to use, tested them (again, from the command line) against a couple of images, and pasted the command into our Ruby script. The convert program is a command-line interface to ImageMagick, and for our case it was simpler and faster to use than opening up the RMagick API documentation.

We didn’t build any sophisticated error handling into either script. The first one we’re going to run from the command line, and if it fails, we’ll just run it again. The second script runs as a background job, and because we don’t want to miss posting any image, we made sure it only discards an image’s URL after posting a message about that image. If it fails, it will pick up where it left off the next time it runs.

Eventually, it will run out of images to post and start failing. We’re going to notice that no new images appear on our Twitter stream, check that the images file is empty, and remove the cron job. If we wanted to be smarter, we could also build an auto-remove feature into the script itself.

You probably noticed that we took other shortcuts as well. We placed the original photos, the PNG Polaroids, the list of URLs, and the two scripts all in the same directory. We mixed code with data, original content with temporary files. That’s not modular or organized as you’d expect a large-scale application to be. Then again, this is not a large-scale application that will keep on running into the next millennium. It’s the simplest script we could write in the least amount of time; we’re going to use it once and discard it.

If this is your first foray into scripting, we do hope you’ll think this example is inelegant, maybe downright ugly. Like we said, scripts come in all shapes and forms. Some scripts manage critical systems and others are integral parts of larger applications—you’ll want to use your best development skills to write those. There’s a place for writing scripts that are well thought out, easy to maintain, thoroughly documented, tested, and tested some more.

But we also wanted to introduce you to a different world of scripting. One that’s more focused on getting mundane tasks out of the way by automating them. There are times when investing less is the best course of action, and simple is the best way to start.

We also showed you how easy it is to glue things together with Ruby. We kept things simple by using an image-transformation program instead of diving headfirst into an API, and by keeping our authentication credentials in a text file. Instead of using a database server, we stored state in a text file, and we used cron to schedule our tasks instead of using a scheduling component. Of course, we benefited from the simplicity of AWS-S3 and the Twitter gem.

In the next section, we’re going to take another look at scripting, this time scripting Windows applications using OLE Automation and Mac OS X applications using OSA.

3.2. Automating with OLE and OSA

Scripting languages are easier to program with than low-level programming languages. If you’ve ever written an Excel macro, created a workflow using Word and Outlook, or programmed with AppleScript, you know what we’re talking about. Application scripting—desktop and server applications alike—is done through APIs designed specifically for the task at hand, and is simplified to work from any programming language. In this section, we’re going to take a look at automating applications on the Windows platform using OLE Automation, and Mac OS X applications using Open Scripting Architecture (OSA).

OLE Automation provides scripting support for Windows applications. Microsoft Office is a set of desktop applications that use OLE Automation, typically in combination with the Visual Basic for Applications (VBA) scripting language. OLE is just as easy to use from Ruby programs running on the Windows platform, as we’ll demonstrate by using Ruby and Microsoft Outlook.

OSA provides scripting support for applications running on Mac OS X. It was designed specifically for AppleScript, so some experience with AppleScript is helpful, but it is easy enough to use from the more powerful Ruby language. We’ll use the same example to automate Apple’s iCal calendar application.

Let’s start with OLE Automation and Microsoft Outlook.

3.2.1. Automating Outlook with Ruby

The example we’ll use is a fairly trivial one, but one that would be easier to develop in Ruby than either VBA or AppleScript.

During development we often have to take shortcuts to get something done. Maybe we’re trying to get a demo running, or giving other developers a piece of code so they can start working again. We mark those things we haven’t finished with comments that say TODO or FIXME. And as much as we hate to admit it, we don’t always go back and fix that code—mostly we forget about it. So we’re going to create a simple script that will read these comments from the source code and adds tasks in Outlook or To Do items in iCal to remind us about them.