Archive for Research

The Superlative Kyle Hendricks

You know it’s almost time for baseball season when all of the major projection systems forecast Kyle Hendricks‘ ERA one run per nine innings too high.

As much as this sounds like a knock on those who develop projections, it’s not. What Jared Cross (Steamer), Dan Szymborski (ZiPS), Derek Carty (THE BAT), and the folks at Baseball Prospectus (PECOTA) do is no small feat. If I weren’t too cowardly to even try to create my own projection system, I would be too stupid to design one that is half as effective as theirs. Glass houses and all that.

That said, I am just smart enough to know that projected ERAs ranging from 3.84 to 4.42 for Hendricks, who boasts a career ERA of 3.12 and has never finished a season with an ERA above 3.46 (except that dastardly 3.95 ERA in 2015), are too high. It’s easy to poke holes in the obvious outliers, but projections succeed by describing and then predicting the talents of most pitchers, not the ones whose talents deviate dramatically from expectation. Hendricks is every projection system’s known blind spot.

It’s not just projections that struggle with Hendricks, either. We, the sabermetric community, frequently use ERA estimators as shorthand to characterize a pitcher’s talent level. If you frequent FanGraphs, you’re familiar with Fielding Independent Pitching (FIP), expected FIP (xFIP), and Skill-Interactive ERA (SIERA). By virtue of how they’re constructed, each metric makes assumptions about the skills a pitcher theoretically “owns”:

  • FIP: strikeouts, walks, and home runs allowed
  • xFIP: strikeouts, walks, and fly balls induced
  • SIERA: a complicated combination of strikeouts, walks, net groundballs (groundballs minus fly balls), and their squared terms and interactions with one another

While each estimator features a batted ball component, they focus on trajectory (launch angle), not on authority (exit velocity). This is a fair assumption, frankly. I have illustrated how a pitcher can influence hitter launch angle, operating under the assumption they bear little to no influence over hitter exit velocity. It’s not quite that bleak; certified baseball genius Rob Arthur found that the average pitcher’s effect on a baseball’s exit velocity: roughly five parts hitter, one part pitcher. Read the rest of this entry »


Musings at the Intersection of Launch Angle Consistency and Hard-Hit Rate

If you follow the work of Alex Chamberlain at all, you’ve heard of the value of launch angle consistency. I’m not going to recapitulate his body of work on the subject, but briefly: hitters with tighter launch angle distributions routinely run higher BABIPs, and you can think of launch angle consistency as roughly a proxy for “hit tool.”

Most of this comes down to avoiding terrible batted ball outcomes. The two worst things you can do when you put the ball in play are to hit it straight down or straight up. Given that balls are, on average, hit mostly forward and with a tiny bit of loft — breaking news, I know — launch angle consistency is a great proxy for how often you avoid those, because the more -80s and +80s you put in your sample of mostly 10s and 20s, the higher the standard deviation gets.

One thing I’ve often wondered is whether this idea of consistency holds up for subsets of batted balls. Intuitively, it seems like it might. Take hard-hit balls, for example. If you’re hitting the ball 95 mph or harder, you really don’t want to squander it by hitting the ball on the ground or straight into the air. The distribution peaks at 30 degrees, but anything between 10 and 35 is a solid outcome.

With this in mind, I decided to look for batters who grouped their hard-hit balls most tightly. Having a narrow distribution seems like a great way to maximize good outcomes. Which player, you ask, has the tightest launch angle consistency (I’m just using standard deviation here) on hard-hit balls? I’m glad you asked — it’s Dee Strange-Gordon. Read the rest of this entry »


The Seam-Shifted Revolution Is Headed for the Mainstream

Hey there! I want to give you a heads up about this article, because it doesn’t fit into a normal genre I write. Today, I won’t be telling you some new insight about a player you like, or creating some new nonsense statistic that tries to pull meaning from noise. This is a story about how baseball analysis is changing right before our eyes. A group of scientists and baseball thinkers are redefining the way we think about pitch movement, and I think it’s worth highlighting even if I don’t have anything to add to the conversation yet, because this new avenue of research is going to be front and center in Statcast-based analysis over the next few years.

“Seam-shifted wake,” as Andrew Smith, a student of Dr. Barton Smith (no relation) coined it, is a source of pitch movement that the first attempts at understanding the physics of a pitched baseball overlooked. It has already changed the way that coaches and pitchers approach pitch design, and due to recent data advances, it’s about to be everywhere. So let’s go over how we got here, to this newly observable way that pitchers deceive hitters, by starting at the beginning and working forward.

At its core, baseball is a game about one person trying to throw a ball past another person. There are other trappings — bases and baserunners, umpires, a strike zone, the mythology of Babe Ruth, and a million other sundry things. At the end of the day, though, everything starts with the pitcher trying to throw a ball past the batter.

Accordingly, baseball analysis over the years has focused on describing the flight of that ball. For a time, that simply meant describing the shape of pitches — they don’t call them curveballs for nothing. The next step was velocity — radar guns let us appreciate fastballs numerically rather than merely aesthetically.

In the past 15 years, the amount and scope of pitch-level analytical data has exploded. First, PITCHf/x quantified pitch location and movement. When we report a pitcher’s chase rate or how often a batter swings at pitches in the strike zone, it’s because the location where each pitch crosses the plate is recorded and logged. When we say a pitcher has eight inches of horizontal break on their slider, it’s because new technology allows us to measure it.

When Statcast debuted in 2015, it added another wrinkle: radar tracked the spin rate of each pitch in flight, putting a numerical value on something that had previously been only qualitative; a pitcher’s ability to generate movement through spin. Doctor Alan Nathan has written several authoritative studies discussing the value of this spin data. Read the rest of this entry »


The Costs and Benefits of Six-Man Rotations

Planning a starting rotation for 2021 carries innumerable pitfalls. Nearly every pitcher in the league saw a reduced workload last year, and they did it in strange circumstances to boot. It’s not merely that the short season set everyone’s innings back — though that’s a huge component. A large number of cancelations and postponements also meant more doubleheaders and more cobbled-together games, another way to throw pitchers off their rhythm.

Put it all together, and protecting arms sounds like an appealing plan for 2021. The Mariners announced that they’ll use a six-man rotation next year, a continuation of the plan they leaned on for all of 2020. The Red Sox are talking workload management. Since initially publishing this piece, Jeff Zimmerman pointed out that the Tigers will use a six-man rotation as well. Is an embiggened rotation the solution to this universal problem? Let’s do the math.

It depends, first of all, on what you give up. The innings tradeoff of a six-man rotation is straightforward. Giving your pitchers an extra day off limits their workloads, naturally enough. The math on that is straightforward if you assume it doesn’t affect their in-game workload. Take a pitcher who averages six innings per start. In a five-man rotation, that’s 192 innings of work. Adding a sixth pitcher to the rotation cuts that down to 162 innings.

How much do those 30 innings of work matter when it comes to health? I’ll level with you — I’m not sure. We simply don’t have the data to say with any amount of certainty, because the number of comparable situations is so small. Pitchers have light workloads all the time, but in most cases it’s due to age or injury. Looking at what a 21-year-old pitcher did in first building up stamina probably can’t tell us much about how many innings Jake Odorizzi, to pick a random example, should throw in 2021. Likewise, a pitcher’s workload in his first year back from Tommy John surgery can’t tell us how many innings Trevor Bauer can be effective for. Read the rest of this entry »


Do Successful Steals Apply Measurable Pressure?

Consider the plight of the base stealer. In the 1980s, their role was sacrosanct. Get on base first, then cause havoc. For fans of speed and baserunning, it was a veritable golden age. Rickey Henderson and Vince Coleman each stole 100 bases in three separate seasons. Since 1980, the 13 top seasons in terms of stolen bases per plate appearance were 1980 through 1992.

Alas, the scurrilous forces of math and efficiency conspired to dethrone the stolen base. As it turns out, advancing one base is less good than creating an out is bad. It’s bad enough, in fact, that you need about three successful stolen bases to make up for the downside of getting caught once. The very best thieves managed that level of efficiency, but in aggregate, the league only crested a 70% success rate once from 1980 to 1992. Steals simply weren’t advancing teams’ goal of scoring as many runs as possible.

For a time, there was a reasonable counter-argument: what if attempting a stolen base has positive value that isn’t solely contained in reaching second base? Perhaps the pitcher has steals on the brain, or the defense loses its cohesion while attempting to cover the base for a throw. It doesn’t need to add much edge to make the math add up.

In 2007, the authors of The Book took up this question. They found a large advantage to batters when a runner was on first — exactly what proponents of steals suggested. There was a big problem, however. That advantage was for all runners on first base. The faster the runner, the smaller the advantage. In addition, actually attempting a steal carried a huge hit to the batter, more than enough to offset the advantage of having a runner on base. Read the rest of this entry »


Where Vertical Approach Angle Seems to Matter Most

A couple of weeks ago, I was chatting with PitcherList’s Alex Fast about four-seam fastballs swinging strike rates (SwStr%) and their relationship to pitch height — or, perhaps more specifically, their lack of relationship. At the pitcher-season level (e.g., “2020 Clayton Kershaw“), the correlation between SwStr% and pitch height appeared weak at best. When you consider that no fastball is created equal and then introduce small-sample variance to the equation, the relationship could, understandably, become blurred at the pitcher level.

As a retort, I sent him the following graph, which shows SwStr% by pitch height for the three broad pitch classes as defined by Statcast, the source of the data. For reference, I’ve added black lines to indicate the average bottom, heart, and top of the strike zone:

If we zoom out and consider the question at the macro level, independent of context (what’s the average swinging strike rate for all fastballs by pitch height?), we can see that fastballs generate more swinging strikes up in the zone, a phenomenon our own Jeff Zimmerman touched upon here. This finding is mildly interesting in and of itself. But as I considered the matter further, the importance of swing frequency (Swing%) to SwStr% became clear (both use all pitches as a denominator). Regardless of efficacy, more swings will afford more chances for swinging strikes. As such, I anticipated that fastballs probably induce more swinging strikes up high than down low simply because hitters swing more frequently at high fastballs. Similarly (but inversely), non-fastballs would generate more swinging strikes down low instead of up high. The next graph all but affirmed my intuition:

Although the peaks of the bell curves cluster near the heart of the zone, we can see distinct differences in swing rate by pitch class at the thresholds of the strike zone. At its bottom edge, hitters are half as likely to swing at fastballs as they are at non-fastballs; at its top edge, twice as likely. Read the rest of this entry »


What Happens the Year After a Velocity Spike?

I didn’t want to write this article. One of my favorite things to do, back when I was a full-time Cardinals fan and part-time writer, was wait for the first few weeks of the season and then start ogling velocity changes. There’s almost nothing that made me feel so unabashedly happy as seeing an extra tick or two out of some arm I’d written off the previous year. Why spoil that magic by looking into whether it actually matters?

Nothing fun can come of using data to look at incuriously held beliefs, but that’s never stopped me before, so I decided to examine pitchers who experienced velocity gains from one year to the next. Do their fastballs grade out better? Do they strike out more batters? Walk more? Do they hold the gains from one year to the next? I had no clue, but I decided to find out.

First things first: 2020 goes right out the window. The season started in late July, and no one had anything approaching their normal offseason routine. Temperatures were weird, workloads were changed on the fly, and some teams were affected by COVID-related cancelations; trying to tease something out from that noise is pointless and unnecessary. I’ll just use 2015 through 2019 instead.

Why 2015? That’s when Statcast first arrived, and with it a new tracking system. I could, I suppose, use data since 2008, but I wanted to minimize the chances of false readings stemming from the change in systems. 2020 also featured a change — to camera-based readings instead of radar — but we’re already throwing it out anyway, so no big deal there.

In each year, I looked at the population of starters who threw at least 500 four-seam fastballs. I then found the year-to-year changes for each pitcher-season combination. Justin Verlander, as an example, averaged 93.4 mph in 2015, 94.1 mph in 2016, 95.3 mph in 2017, 95 mph in 2018, and 94.6 mph in 2019. That means his ‘15-’16 change was 0.7 mph, his ‘16-’17 change was 1.2 mph, his ‘17-’18 change was -0.3 mph, and his ‘18-’19 change was -0.4 mph. This gave us a database of 236 pitcher-seasons from 2016 to 2018 — I’m leaving out changes between 2018 and 2019 because I want to know what happens the year after a pitcher gains velocity. Read the rest of this entry »


The Diminishing (but Positive) Returns of Tanking

Before we get going, a heads up: this article is about a mathematical model that explains tanking. I’ll give you a preview. The way baseball is currently set up, it’s no surprise teams tank. If you want to fix that, the game’s competitive structure must change. I suggest a few ways to accomplish that change at the end of the article, but be warned: the bulk of this is a no-nonsense dissection of why teams keep tanking even as the returns go down. Personally, I think the game should make the changes I suggest, because the boom and bust cycle of team contention makes for a fraught fan experience. The way the game is set up now, however, it’s no surprise that teams do it.

The logic behind tanking is straightforward and solid. Being in the middle is the worst; there are no prizes for winning 85 games, flags fly forever, and so on and so forth. You’ve surely heard it enough times that you don’t need a repeat, but just for completeness’s sake, we’ll do it one more time.

By trading present concerns for future value, you make your team better in the future. As a byproduct of trading present concerns away, your major league roster gets bad — bad enough, hopefully, that you’ll move to the top of the draft. Additionally, with no pesky need to be competitive, you can use your major league roster to give borderline players extended tryouts. Hit on a few of them, and that’s even more tailwind for the future.

In essence, tanking is making a bet that taking a step back now will let you take two steps forward sometime in the future. Even if that isn’t the case, being quite bad for a while and then quite good for a while sure sounds better than being mediocre the whole time. Tanking works on both axes, which explains its continued appeal. Do you think your team will win something like 77 wins? Blow it up! 75 wins? Blow it up! 70 wins? You guessed it. Read the rest of this entry »


Two Easy Ways To Make Baseball a Better Game

Baseball is great, but it can be better. While earlier versions of this piece had an overwrought and overly long intro on the delicate balance between the intimacy of the pitcher-batter matchup and the frenzied multi-actor action across beautiful acres of wondrous expanse resulting from a ball put in play, let’s just get to my suggestions to improve the play on the field.

Shrink the Strike Zone

One of the unfortunate side effects of the balls-in-play discussion is that strikeouts and walks tend to get lumped together. In reality, walks are a pretty static feature through baseball history, while strikeouts have fluctuated. Here’s a graph showing walk and strikeout rates over the last 50 years.

Over the last 50 years, the average walk rate has been 8.6%, which is the same as it has been the last five years, and over the last 10 seasons, it is 8.2. Whatever hitters and pitchers are doing in recent history, it hasn’t caused more walks. Strikeouts, though, have soared, and on average, there are about 26 fewer free passes per year over the last decade as opposed to the previous 40 years, compared to an additional 352 whiffs per team per year over the same time frame. While you can argue that the increase is due to changes in hitting philosophy, the average fastball has gone from 89 mph in 2002 to 93 mph last season, while pitchers throw more and more offspeed pitches and fewer pitches in the strike zone. It’s not batter philosophy causing the rise in strikeouts; it’s the pitching just getting better and also being better aided by an increase in the size of the strike zone of about 10%. Read the rest of this entry »


Redrawing the MiLB Map: Visualizing the 2021 Landscape

Last year, as part of the negotiations over a new Professional Baseball Agreement (PBA) with Minor League Baseball, Major League Baseball introduced a proposal that would dramatically reimagine the minor leagues. The proposal included plans to shift the timing of the amateur draft and realign some parent-club affiliations, league geographies, and club levels. Most importantly, it proposed stripping more than 40 clubs of their affiliated status, though it also suggested that some of the newly unaffiliated teams would assume other formats, either as so-called professional partner leagues, or as amateur summer wood bat leagues. The plan got us thinking about how access to in-person baseball across the United States would change. We were interested in how many people would lose their ability to watch affiliated baseball in person, or would see that access shift from the relatively affordable confines of the minor leagues to more expensive major league parks.

Those studies relied on a New York Times list of teams reportedly slated for contraction, as well as Baseball America’s excellent reporting. Thirteen months, a pandemic, and one extremely contentious negotiation later, MLB has informed minor league teams of their proposed fates, with 120 franchises “invited” to be part of the new, MLB-developed minor league system. Many are still reviewing the terms of their “invitations”; several find themselves occupying a new rung on the minor league ladder, or with a different parent club than before.

Meanwhile, 25 clubs find themselves ticketed either for summer wood bat leagues, including the newly formed MLB Draft League, or for pro partner leagues for undrafted players and released minor leaguers. Eighteen teams face futures that are, as of this writing, uncertain, though as Baseball America’s JJ Cooper notes, “Major League Baseball has indicated that it will pay entry fees for these teams that were left out of affiliated baseball to join new leagues. MLB will pay their way in, but as a condition those teams are expected to waive a right to sue.” The complete list of the 43 franchises slated to lose their affiliated status can be found below. Of the 43, 11 are full-season clubs:

MiLB Teams Losing Affiliated Status
Team Previous League New League Format
Auburn New York-Penn TBD TBD
Batavia New York-Penn TBD TBD
Billings Pioneer Pioneer Pro partner league
Bluefield Appalachian Appalachian Summer wood bat
Boise Northwest Pioneer Pro partner league
Bristol Appalachian Appalachian Summer wood bat
Burlington Appalachian Appalachian Summer wood bat
Burlington Midwest TBD TBD
Charlotte Florida State TBD TBD
Clinton Midwest TBD TBD
Danville Appalachian Appalachian Summer wood bat
Elizabethton Appalachian Appalachian Summer wood bat
Florida Florida State TBD TBD
Frederick Carolina League MLB Draft Summer wood bat
Grand Junction Pioneer Pioneer Pro partner league
Great Falls Pioneer Pioneer Pro partner league
Greeneville Appalachian Appalachian Summer wood bat
Hagerstown South Atlantic TBD TBD
Idaho Falls Pioneer Pioneer Pro partner league
Jackson Southern TBD TBD
Johnson City Appalachian Appalachian Summer wood bat
Kane County Midwest TBD TBD
Kingsport Appalachian Appalachian Summer wood bat
Lancaster California League TBD TBD
Lexington South Atlantic TBD TBD
Lowell New York-Penn TBD TBD
Mahoning Valley New York-Penn MLB Draft Summer wood bat
Missoula Pioneer Pioneer Pro partner league
Northern Colorado Pioneer Pioneer Pro partner league
Norwich New York-Penn TBD TBD
Ogden Pioneer Pioneer Pro partner league
Princeton Appalachian Appalachian Summer wood bat
Pulaski Appalachian Appalachian Summer wood bat
Rocky Mountain Pioneer Pioneer Pro partner league
Salem-Keizer Northwest TBD TBD
State College New York-Penn MLB Draft Summer wood bat
Staten Island New York-Penn TBD TBD
Trenton Eastern MLB Draft Summer wood bat
Tri-City New York-Penn TBD TBD
Vermont New York-Penn TBD TBD
West Virginia South Atlantic TBD TBD
West Virginia New York-Penn MLB Draft Summer wood bat
Williamsport New York-Penn MLB Draft Summer wood bat

Read the rest of this entry »