Author Archive

Fly Balls and Groundball Pitchers

In today’s Hardball Times, Matthew Carruth did an analysis on extreme groundball pitchers and how they do not really give up more home runs-per-fly ball (HR/FB) than your typical pitcher. There has been some thought that extreme groundball pitchers do tend to give up more HR/FB because they’re only allowing fly balls when they throw a bad pitch, thus making it easier to hit the ball out of the park. Carruth’s analysis even suggests that the opposite might be true, though the correlation was quite weak.

I decided to run a similar analysis using data from 2002 to the present. The average HR/FB rate during that same period is 10.7%. If we look at the 2002-present totals of all pitchers with a groundball percentage (GB%) greater than 55% and more than 100 innings pitched, they have an average HR/FB of 12.2%. That 12.2% is not a weighted average, it’s just a simple average of each qualified pitcher’s HR/FB.

Using the same method, if you look at pitchers with a GB% less than 35%, they have an average HR/FB of 9.9%.

Now I’ll admit this is a much simpler approach than the route Carruth took, but the results seem to be considerably different and I wondered why this would be the case.

First off, if you use my approach with the 20 pitchers Carruth selected in each group, you come to the same conclusions as Carruth did. This leads me to believe the batted ball data from Retrosheet (which Carruth used) and the batted ball data from Baseball Info Solutions don’t quite match up.

Just taking a quick look at the top 10 players, their GB% don’t match. For exampe, Retrosheet has Brandon Webb with a GB% of over 70% and BIS has him at 65%. That’s the first difference.

The second difference is that the time period he used was between 1988 and 2006 where the HR/FB according to Retrosheet was 13.57%. This is considerably different than the HR/FB of 10.7% that Baseball Info Solutions reports between the 2002-present time period. Using the 13.57% for all pitchers over a 18 year period where there’s been some considerable influx in home run totals is probably going to cause some issues as well.

To me it seems there is at least some evidence that extreme groundball pitchers as a group do give up more HR/FB than your typical pitcher. The two most extreme groundball pitchers in the past 5 years have an average HR/FB of 15.5% (Brandon Webb) and 13.1% (Derek Lowe).

The other option is it really has nothing to do with GB% at all (sampling size issue maybe?) and it’s just that some extreme groundball pitchers tend to have higher HR/FB. In their case instead of regressing to the league average, you’d just regress to the player’s actual average; treating it more like you’d treat a batter’s batting average on balls in play (BABIP) than you would a pitcher’s BABIP.


Odds of Catching a Foul Ball

The other day I attended an Orioles game and just like every other baseball game I’ve been to in my entire life, I didn’t catch a foul ball. I didn’t catch a home run ball either, but that would have been impossible where I was sitting.

While I was watching a few lucky fans snag souvenirs, I wondered what my chances of catching a foul ball actually were. Doing some quick back of the envelope calculations, I figured there were maybe 30 balls a game hit into the stands, and maybe around 30,000 fans at each game. If that were the case, about 1 in every 1000 fans would walk away with a foul ball/home run.

In reality, there were 120,946 foul balls and home runs during the 2005 season and 74,915,268 fans in attendance that same year, which ends up being about 1 in every 619 fans end up with a ball. This is probably better than the actual odds since not all foul balls are hit into the stands.

If we were to say half of all foul balls were hit onto the field and the other half hit into the stands then the odds jump to 1 in 1189.

Finally, there are some places in a stadium where you couldn’t possibly catch a foul ball, and other places where you’d likely increase your chances, making that 1 in 1189 is hardly 100% accurate, but good enough for a rough estimate.

Odds are, the only way I’ll be getting a baseball at a game, is at the souvenir stand.


The Top 10 – Week of 4/30/2007

I haven’t been blogging much in the past couple weeks, but I’m hoping to turn over a new leaf with this past week’s FanGraphs top 10. There are more than a few surprises:

1. John Maine – The momentary Major League ERA leader (1.35) is also this week’s #1 player on FanGraphs. He’s been getting a lot of hype and some skepticism lately. I think everyone can agree he’s not as good as his 1.35 ERA, but I think there’s still a lot to like.

2. Barry Bonds – He hit 2 home runs this week and is now just 12 shy of breaking Hank Aaron’s record. He’s playing like the past 2 years didn’t happen.

3. Alex Rodriguez – Since hitting 2 home runs on April 23rd, A-Rod has now gone 41 at-bats without a single home run. I wonder if for every record breaking streak, there’s an equal (and opposite) record breaking slump?

4. David Wright – The other New York third-baseman has been in a season long slump. He’s both walking and striking out more, while his Isolated Power (ISO) has been cut in half. What’s the statistical culprit? A failure to get the ball off the ground.

5. Derek Jeter – The other and still “true” Yankee always seems to make the top 10. He’s having a pretty nice season which isn’t all that different from his MVP seas… oh wait… nevermind.

6. B.J. Upton – The rookie is having a phenomenal year and apparently everyone wants to see what all the hoopla’s about. Upon further inspection, he’s striking out only 38% of the time which puts him right there in Ryan Howard and Adam Dunn territory (sans the high walk rate). My magic 8 ball tells me this about his hot start: “Outlook Not So Good”. That’s how I do all my analysis, by the way.

7. David Bush – I’m a big David Bush fan and it could just be me visiting his page ever so frequently that pushed him up to #7 of the list. Despite his ERA over 6, he has the 3rd highest K/BB in baseball. Good things should happen soon.

8. Manny Ramirez – As soon as A-Rod started slumping, Manny started to pick things up. Since A-Rod hit his last home runs, Manny has batted just over .300 with 7 walks and 3 home runs, including a 2 home run game on May 3rd.

9. James Shields – This guy is looking really really good and should be on everyone’s radar. Few pitchers in the history of baseball have struck out over a batter an inning while walking under 2 batters per 9 innings. In fact, there have been just 11 starting pitchers to do it in a single season. Bruce Sutter, Tom Seaver, Sandy Koufax, Jason Schmidt, Johan Santana, Ben Sheets, Roy Oswalt, Pedro Martinez, John Smoltz, Curt Schilling and Randy Johnson make up the 11.

10. Ichiro Suzuki – He’s off to a slow start, which apparently is concern enough for him to make his first top 10 list. He’s still getting infield hits but his BABIP is considerably below his norm. Fortunately for him, there’s still some 5 months of baseball left to right the ship.


Team Win Probability Blogs

If you haven’t noticed, there are a few new team blogs currently on FanGraphs. Garth Sears (Royals), Erik Manning (Cardinals) and our latest addition Tito Crafts (Red Sox) have been doing a fantastic job looking at games from a Win Probability angle, among other things.

If you have an idea for a baseball related blog, whether it be team specific or something entirely different, and you’re looking for a place to get started, feel free to send me a note. From there we can talk in more detail. You can either fill out a contact form, or e-mail me at david@fangraphs.com.


Minor League Stats

The 2007 Minor League stats are up and available for viewing in both the stats pages and the Leardboards. They will be updated every morning no later than noon Eastern.

Also, I’ve taken down the projections and spring training stats by default in the stats pages. You can still view them by clicking on “Show Projections” or “Show Spring Training”. I figured now that there was almost a full month of the season underway, the projections weren’t being used as much and they were causing some unneeded clutter. I’m open to putting them back up by default, but as long as I don’t hear complaints, they’ll stay hidden by default.


WPA in Kansas City Star

Bradford Doolittle wrote about WPA today in the Kansas City star and gives FanGraphs’ new Live Win Probability quite the favorable review. You can check that out here: These stats add something to the game

There’s also a Q&A session with me on his blog for those interested in some of the behind the scenes happenings at FanGraphs: Q&A With David Appelman


Clutch Changes

Last night we rolled in some changes to the way Clutch is calculated. For the most part, the results remained the same, but there are a few key differences which you should be aware of.

Clutch is no longer calculated using OPS Wins. Instead it’s calculated using WPA/LI, which is WPA divided by LI on a play-by-play basis. Take Alex Rodriguez for example who has a WPA of 1.03 and a pLI of 1.51. His WPA divided by pLI on a full season basis would be .68. Unfortunately, that is weighting each play incorrectly and does not correctly neutralize WPA. When you correctly weight each plate appearance by LI, his WPA/LI is 1.16. To get the new Clutch score, you merely do: WPA – WPA/LI.

Tangotiger goes into the details of what’s going on here in comment #5: What Exactly Clutch Measures.


Is WPA Predictive for Batters?

One of the biggest complaints I see about WPA is that it’s not predictive. The mere mention of it’s non-predictability seems to be enough for many to write it off as a mere toy used by some of stats community.

So let’s see how it actually correlates from year to year compared to the stats we all know, like AVG, OBP, SLG, and OPS. I’ll throw in Batting Runs Above Average for fun too.

Looking at the r-squared from 2005 to 2006 for batters with over 300 plate appearances, here’s how WPA stacks up against the regulars:

AVG: .12
WPA: .27
BRAA: .35
OBP: .36
OPS: .36
SLG: .38

Here’s the same deal, 2004 to 2005.

AVG: .14
WPA: .24
OBP: .27
OPS: .30
BRAA: .31
SLG: .33

It’s true, WPA doesn’t correlate as well from year to year as OBP, SLG, or OPS, but it does have some correlation from year to year. In 2004, a players OBP was almost indicative of his 2005 OBP as his 2004 WPA was of his 2005 WPA. Yet, that wasn’t quite the case in 2005 to 2006. BRAA which is calculated by using Run Expectancy on a play-by-play basis (much like WPA uses Win Expectancy), holds its own against the regulars.

Anyway, the point is, let’s stop using the argument that WPA isn’t predictive as a crutch, because it does actually show some correlation from year to year.


The Top 10 – Week of 4/2/2007

This week’s top 10 most popular players on FanGraphs is devoid of Corey Hart and because of that it makes a lot more sense. Let’s get right to it:

1. Alex Rodriguez – You’ve not only managed to become number one in Win Probability Added (and nearly every other stat in the first week), you’re also the most popular.

2. Albert Pujols – Slipping from the top spot lats week, Pujols got off to a rough start by going 1-13 to start the season. But yesterday, he went 2-5 with one home run, signaling the end of the bizarro first week of baseball.

3. Derek Jeter – What has Jeter done this season to deserve such a prominent place on the FanGraphs top 10? Besides being Jeter, WPA says he’s the second most valuable Yankee thus far this season. Fitting he should be the second most popular Yankee.

4. Barry Bonds – His one home run this week puts him just 20 home runs shy of tying Hank Aaron’s all time best 755 home runs. At this rate he’ll have broken the record by September 3rd. Anyone know the vegas odds on that?

5. Felix Hernandez – His 8 inning — 3 hit — 12 strikeout performance gave him a Game Score of 87, which just happens to be the highest Game Score this season. Hopefully the weather will allow him make a second start sometime this season.

6. Todd Helton – He’s off to a decent start, but I’m not sure why he’s in the top 10. It could have something to do with the nosedive his Isolated Power has taken since the 2004 season.

7. Johan Santana – Why wouldn’t the best pitcher in baseball make the list?

8. Adam Dunn – The always intriguing Dunn is off to a hot start after having a scorching spring training. He’s batting an uncharacteristic .381 with 3 home runs so far this season.

9. Brad Lidge – Speaking of spring training, Lidge has been just as awful the first week of the season as he was during spring training. He’s pitched 1.2 innings, giving up 6 runs (3 of them earned) for an ERA just over 16. If he can’t get things together soon, he’ll definitely be out of the closers job, and possibly a job altogether.

10. Ryan Howard – Even though he’s batting just .217 with no home runs, he’s already drawn 7 walks, which ties him for the most in the National League with fellow teammate Jimmy Rollins. He only hit 1 home run in his first 11 games last year, so not much to worry about yet.

11-20: David Ortiz, Cole Hamels, John Maine, B.J. Upton, Jose Reyes, Ian Snell, Grady Sizemore, Roy Oswalt, Erik Bedard, Elijah Dukes.


Unlike 2006: A-Rod Wins the Game!

In yesterday’s 10-7 victory win against the Orioles, Alex Rodriguez’s game winning grand-slam was the second biggest hit he’s had in the pats 6 years according to Win Probability Added (WPA). It brought his team from a mere 28.8% chance of winning to a complete victory.

20070407_orioles_yankees_0_blog.png

In 2006 however, Rodriguez was about as far from being a clutch hitter as you could possibly get. But before we delve into why, let’s get familiar with two stats: REW and OPS Wins. REW is calculated much like WPA, except it uses Run Expectancy (as opposed to Win Expectancy), which doesn’t take the score or inning into account. It does however account for how well a batter does with runners on base. OPS Wins on the other hand is how a player would do in a completely context neutral environment.

Looking at 2006, Rodriguez’s 3.18 OPS Wins and his REW of 3.34 wins are fairly close, but in general he did a little bit better than expected with runners on base. When you take into account the inning and the score (or late and close situations), he accumulated just 1.18 wins. Basically he performed much worse than he should have in high leverage or “clutch” situations. This is measured by a stat called “Clutch” which is the difference between WPA and OPS Wins once leverage adjusted. Rodriguez’s Clutch was -2.16 wins; the third worst among qualified players in 2006.

Last season was the worst season he’s had in the past 5 years in terms of clutch hitting and probably his worst season ever. Yet in his previous two seasons with the Yankees he was actually a clutch hitter with a Clutch of .76 wins in 2004 and .41 wins in 2005.

Since joining the Yankees, he’s still the 9th most valuable player in baseball according to WPA. If we look at just the Yankees batters since 2004, he ranks first in terms of WPA.

Batter                WPA
Alex Rodriguez      11.27
Derek Jeter         10.54
Gary Sheffield       8.91
Jason Giambi         7.61
Hideki Matsui        6.46
Jorge Posada         2.92
Bobby Abreu          1.96
Johnny Damon         1.76
Tony Clark           1.05
Tino Martinez         .77

Whether you like him or not, he has been the most valuable Yankees batter according to WPA the past 3 seasons including the few games played this season. Of course, Mariano Rivera bests him by half-a-win with a WPA of 11.73.