The data on first vs. second serve win frequency cannot be taken at face value because of selection problems that bias against second serves. The general idea is that first serves always happen but second serves happen only when first serves miss. The fact that the first serve missed is evidence that, at this moment, serving is harder than usual. In practice this can be true for a number of reasons: the conditions are windy, it is late in the match, or the server is just having a bad streak. In light of this, we can’t conclude from the raw data that professional tennis players are using a sub-optimal strategy on second serves.
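To make the selection effect concrete, here is a toy calculation in Python. Every number in it is made up for illustration, and it assumes something the raw data cannot tell us: that the server hits exactly the same serve on first and second attempts, so any gap in win frequency comes purely from when second serves get observed.

```python
# Toy model of the selection problem. All numbers are invented for illustration.
P_HARD = 0.5                              # chance that conditions are "hard" at a given moment
P_MISS = {"easy": 0.30, "hard": 0.50}     # chance that a first serve misses
# Chance that a serve attempt ends up winning the point.
# Assumed identical for first and second serves in the same conditions.
P_WIN = {"easy": 0.50, "hard": 0.35}


def prior(state):
    """Probability of each condition before any serve is observed."""
    return P_HARD if state == "hard" else 1.0 - P_HARD


def first_serve_win_rate():
    """First serves happen in all conditions, so weight by the prior."""
    return sum(prior(s) * P_WIN[s] for s in P_WIN)


def second_serve_win_rate():
    """Second serves happen only after a miss, so weight by P(state | miss)."""
    p_miss = sum(prior(s) * P_MISS[s] for s in P_MISS)
    return sum(prior(s) * P_MISS[s] / p_miss * P_WIN[s] for s in P_WIN)


if __name__ == "__main__":
    print(f"first serve win rate:  {first_serve_win_rate():.3f}")
    print(f"second serve win rate: {second_serve_win_rate():.3f}")
```

With these invented numbers the first serve wins about 42.5% of the time and the second serve about 40.6%, even though the serve itself is identical by construction. The gap appears only because second serves are drawn disproportionately from hard moments.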
To get a better comparison we need an identification strategy: some random condition that determines whether the next serve will be a first or second serve. We would then restrict our data set to the serves selected by that random condition. Sounds hopeless?
When a first serve hits the net and goes in, it is a “let” and the next serve is again a first serve. But if it hits the net and goes out, it is a fault and the next serve is a second serve. The impact with the net introduces the desired randomness, especially when the ball hits the tape and bounces up. Conditional on hitting the tape, whether it lands in or out can be considered statistically independent of the server’s current mental state, the wind conditions, and the stage of the match. These are the ingredients for a “natural experiment.”
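As a rough sketch of how that comparison might be computed, suppose we had already filtered the data down to first serves that clipped the tape. The record layout below (`NetcordServe`, `landed_in`, `server_won`) is hypothetical, not taken from any real data set, and `server_won` stands in for whatever success measure you choose for the serve that follows.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Tuple


@dataclass
class NetcordServe:
    """One first serve that hit the tape (hypothetical record layout)."""
    landed_in: bool    # True: let, next serve is a first serve; False: fault, next is a second serve
    server_won: bool   # assumed outcome measure for the serve that follows


def win_rate(outcomes: List[bool]) -> Optional[float]:
    """Share of successes, or None when there are no observations."""
    return sum(outcomes) / len(outcomes) if outcomes else None


def compare_after_netcord(
    serves: Iterable[NetcordServe],
) -> Tuple[Optional[float], Optional[float]]:
    """Return (win rate on first serves after a let, win rate on second serves after a netcord fault)."""
    sample = list(serves)
    after_let = [s.server_won for s in sample if s.landed_in]
    after_fault = [s.server_won for s in sample if not s.landed_in]
    return win_rate(after_let), win_rate(after_fault)


if __name__ == "__main__":
    # Made-up records, only to show the call.
    demo = [
        NetcordServe(landed_in=True, server_won=True),
        NetcordServe(landed_in=True, server_won=False),
        NetcordServe(landed_in=False, server_won=True),
    ]
    print(compare_after_netcord(demo))
```

Because the in-or-out bounce is treated as random, the two groups should face the same mix of wind, fatigue, and form, so a difference between the two rates can be read as the effect of serving a first rather than a second serve. The sample will be small, so the raw rates would need standard errors before anyone draws conclusions from them.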

3 comments
September 3, 2010 at 11:49 am
Jonathan Weinstein
Very good point. The netcord instrument is extremely clever, though it does cut the size of the data set hugely. I guess with a full year of data on a player you might actually get enough though.
September 4, 2010 at 1:32 pm
jeff
yes it limits the data a lot. this is the theorists’ version of empirical research. show the existence of a test and leave it at that 🙂
September 5, 2010 at 2:57 pm
MikeY
great idea!
that being said, you’re assuming the information the server takes from a let ball is equal in both cases. i would expect a strong outcome bias, so that servers after a let in will have more confidence than after a let out.
in both cases the server has learned exactly where to hit it (same as the last time, but up an inch) but psychologically, it’s easier to make that adjustment when you just got one in. i think you’re more likely to “start from scratch” after a miss.
In fact, I would expect better serves on a first serve after a let ball in, than after a typical first serve. (kind of like giving them a practice shot).