Archive for Research

Lesser CLIFFORD Candidates

When I originally published my findings around CLIFFORD — my metric for predicting players that are at a higher risk of experiencing a collapse in their wOBA (defined as a drop of at least .30 points of wOBA) — I presented a limited number of players for 2013. The list only included six players that qualified under the criteria. As a reminder, players that experienced a significant decline in three out of four metrics (Z-Contact%, FA%, UBR, Spd) were tagged as CLIFFORD candidates. These players had 3.4 times the odds of collapse (53% versus 25% for non-CLIFFORD players).

The single largest driver of collapse was change in Z-Contact% — the percent of pitches in the strike zone that a batter swings and makes contact with. Hitters who saw their Z-Contact% decline by at least 1.4% had 1.68 times the odds of collapsing than those that did not experience such a decline. Since there were far more players that qualified with their Z-Contact% than the full CLIFFORD criteria I thought it would be helpful to share that data with everyone.

Behold!

Read the rest of this entry »


Team-Specific Hitter Values by Markov

In my first article, I wrote about the limitations of the linear weights system that wOBA is based on when it comes to the context of unusual team offenses. In my second, I explained how Tom Tango, wOBA’s creator, also came up with a way of addressing some of these limitations by deriving a new set of linear weights for different run environments, thanks to BaseRuns. Today, I will tell you about the next step in the evolution of run estimators — the Markov model. Tom Tango created such a model that can be accessed through his website, and I’ve turned that model into a spreadsheet that I’ll share with you here.

I’ve told you that the problem with the standard run estimator formulas is that they make assumptions about what a hit is going to be worth, run-wise, based on what it was worth to an average team. That means it’s not going to apply very well to an unusual team. What’s so great about the Markov is that it makes no such assumptions — it figures all of that out itself, specific to each team. And when I say it figures it out, I mean it basically calculates out a typical game for that team, given the proportion of singles, walks, home runs, etc. the team gets in its plate appearances. It therefore estimates the run-scoring of typical teams better than just about anything, but it also theoretically should apply much, much better to very unusual or even made-up teams.
Read the rest of this entry »


Felix Hernandez’s Velocity

Last week, the Seattle Mariners inked their ace, Felix Hernandez, to a $175 million extension for the next seven years. The dominating righty will be entering his age-27 season this year, meaning the contract will through his age-33 season. That is, unless, he injures his right elbow.

Embedded within Hernandez’s contract is a clause that gives the Mariners a club option for an eighth season — at a paltry $1 million — should Hernandez miss at least 130 consecutive days due to any kind of procedure to his right elbow. The Mariners negotiated this clause after some concern over what their doctors saw in the pitcher’s MRI.

Apparently, the club was reassured enough by their medical staff to sign the mammoth deal, even though the track record for long-term pitcher extensions isn’t the greatest. But how confident should the team be?
Read the rest of this entry »


Linear Weights + BaseRuns = Good

In my last article, I explained how wOBA’s current implementation changes the value of walks, singles, home runs, etc., annually due to changing league characteristics.  Does this mean that the value of an event is the same for every team in the league each season?  Or in every park in the league?  No way.  If you’re talking about a weak offense in a high-offense era, then the overall constants for a weak offensive era are probably more applicable to that team.  However, it’s not really the point of standard wOBA to guess the run-producing contribution of a particular player to a particular team; I think it’s probably more accurate to say it’s about his probable productiveness in a typical team (although park effects aren’t taken into account, so not exactly… that would be more true of wRC+).

Anyway, Tom Tango realized this limitation, and produced a table that shows how the values change depending on a team’s runs scored.  He accomplished this system of “Custom Linear Weights” (“a necessary offshoot” of linear weights, he says) by making use of David Smyth’s BaseRuns formula, which is, in simplest terms, Runs Scored = base runners * (% of base runners that score) + home runs.  Home run hitters are not considered base runners, in this equation, by the way.  Makes perfect sense, right?

Tango realized that BaseRuns had a better handle on the team run-scoring process than his basic linear weights system (and all the other run estimators), so he translated the results of BaseRuns in various run environments into linear weights.  Specifically, the BaseRuns formula told him how many runs the team should score, and the linear weight value of each hit came from how many additional runs BaseRuns expected the team score if it had one more of that type of hit (the marginal value of each hit type).  Here are just the basics of his results, in graphical form:

Read the rest of this entry »


Adjusting Linear Weights for Extreme Environments

Well, it’s my first assignment as a real writer, having been promoted for my Community Research articles on pitcher BABIPs and ERA estimators, and I’ve been thrown into the deep end of the pool: linear weights.  It’s a tricky subject, but I’ll try to walk you through both the problems with linear weights and how they can be overcome.  This article series mainly draws from various works of Tom “Tango,” a.k.a. “tangotiger,” the creator of wOBA and FIP, as well as from David Smyth’s BaseRuns.  I’ll go deeper and deeper down the rabbit hole of stat geekishness as the series goes on, eventually emerging with a spreadsheet version of Tango’s Markov run modeler that I made for you all to play with.  Where the Markov mainly shines over wOBA is when it comes to extreme run environments, such as unusual offenses or extreme ball parks.

Who cares about extreme run environments?

Nerds like me, I guess?  Tom Tango cared enough to come up with ways to address the shortcomings his original wOBA formulation.  If you’ve ever wondered how valuable a certain player is to your favorite team, maybe you should care too; that low-OBP slugger might be more valuable than wOBA might suggest to your low-OBP team.  On the other end, a typical walk last year was worth considerably more to the high-OBP Cardinals than it was to the low-OBP Mariners (around 0.04-0.065 more runs each… which adds up over a season).

Read the rest of this entry »


Is Big Game’s Game Breaking Down?

James Shields was traded this off season from the Rays to the Royals. He has been known for his durability over the years. Spanning the last two seasons, he is first in complete games with 14. Also, he is second to Justin Verlander in innings thrown. The durability and consistency he is known for may be coming to an end. At the end of the last season, he showed signs of breaking down because he was not able to throw strikes and wasn’t able to maintain a consistent release point.

Read the rest of this entry »


How Can We Predict Stolen Base Talent?

Predicting the ability to steal bases is not something you think you need to do. You did not say to yourself over breakfast, “I wonder if Michael Bourn can steal bases?” You already knew he could. And maybe that’s what made breakfast so delicious.

But if we want to push the frontier of base running, if we want to see the end of the home run era become the beginning of the efficient base running era, we have to do this thing we thought we did not need to do. We have to be able to predict stolen bases.
Read the rest of this entry »


Cano, Granderson, and Other CLIFFORD Candidates for 2013

I recently wrote about my attempt to design an indicator that would predict when players were at a higher risk for having a collapse-type year. I named the metric CLIFFORD, referring to the fact that players identified by it were at risk of falling off a cliff offensively. My inspiration was Adam Dunn and his disastrous 2011, in which his wOBA declined by .113.

My initial research showed that 58% of collapse candidates identified by Marcel actually experience a wOBA decline of at least .03 (or 30 points)–2.43 times the likelihood of non-collapse candidates. Collapse candidates identified by CLIFFORD actually decreased by at least 30 points of wOBA 53% of the time–2.14 times the likelihood of non-collapse candidates.

Marcel initially appeared to do a better job identifying these candidates. If we knew nothing else outside of just the Marcel projection, our chances were better at identifying collapse candidates than if we used CLIFFORD (and, yes, the difference between the relative risk for both measures is statistically significant).

However, and here’s the bright spot, there was not much overlap between the two metrics.

Read the rest of this entry »


I Fought Marcel And Marcel Won

Like my work last year around pitcher aging and velocity decline, I am always looking for reliable indicators or signals of change in players. One thing I’ve been interested in trying to better understand are changes in performance that might signal or herald a large droop-off in performance in the following year.

Projection systems do a very good job of predicting performance, but my thought was there must be some way to better predict the 2011 Adam Dunns of the world.

So, one Saturday morning I decided to do some statistical fishing.

Read the rest of this entry »


1960 Salina Blue Jays: The Year Satchel Paige Came to Town

A small a bigger story sometimes hides behind a bit of information. That bit came in this line I read a few years ago in Larry Tye’s book, Satchel:

In 1960 he [Satchel Paige] threw for the Salina [Kan.] Blue Jays ….

I had no idea. Leroy Robert “Satchel” Paige was arguable one of the best 10-or-so pitchers who played baseball. He was a Hall of Famer on the field, but he was an even better showman. What was one of the greatest players doing playing on a team in Kansas?

I’m a Kansas native. Throughout my life, I’ve had a deep connection with Salina. I lived less than an hour away from the city when I was growing up. Some of my family members still live there. Heck, I was even married there. Because of that, I needed to know what brought Paige to the middle of nowhere to play baseball one summer so long ago.

Read the rest of this entry »