Archive for March, 2007

Play-by-Play

We’ve just added play-by-play data to the Play Log section for each game. At this time, play-by-play data includes the Leverage Index, Run Expectancy, Home Team Win Expectancy, and the Batter’s Win Probability Added (WPA) for every play of the 2002-2006 seasons.

Win Expectancy is calculated as the result of the play, while Leverage Index and Run Expectancy are calculated before the play happened.

Everything is calculated before the play happened. We’ve also added BRAA, which is the difference between Run Expectancy at the start of the play and at the end of the play.
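
For anyone who wants to see how these per-play numbers fit together, here is a rough Python sketch. The `Play` record and its field names are made up for illustration, and the values are not real game data. It also assumes two things the post doesn't spell out: that a visiting batter's WPA is the home team's Win Expectancy change with the sign flipped, and that runs scoring on the play get added to the Run Expectancy difference.

```python
from dataclasses import dataclass


@dataclass
class Play:
    """One play-log row (hypothetical field names, not the site's schema)."""
    batter_is_home: bool
    home_we_before: float  # home team Win Expectancy before the play
    home_we_after: float   # home team Win Expectancy after the play
    re_before: float       # Run Expectancy before the play
    re_after: float        # Run Expectancy after the play
    runs_scored: int       # runs that scored on the play


def batter_wpa(play: Play) -> float:
    """Win Expectancy change, seen from the batter's side."""
    delta = play.home_we_after - play.home_we_before
    # Assumption: a visiting batter gets the opposite of the home team's change.
    return delta if play.batter_is_home else -delta


def batter_braa(play: Play) -> float:
    """Run Expectancy change over the play, plus any runs that scored."""
    # Assumption: the runs_scored term follows the usual run-expectancy
    # bookkeeping; the post only mentions the start/end difference.
    return play.re_after - play.re_before + play.runs_scored


# Made-up example: a visiting batter singles with nobody out.
example = Play(
    batter_is_home=False,
    home_we_before=0.60,
    home_we_after=0.52,
    re_before=0.50,
    re_after=0.90,
    runs_scored=0,
)
print(round(batter_wpa(example), 3))
print(round(batter_braa(example), 3))
```

The pitcher's numbers on the same play would simply be the negatives of the batter's.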

If you click on a play, you’ll get its pitch sequence in a little pop-up box. The pitch sequence for playoff games is a little screwed up right now: it “snakes” around, so the first line reads “pitch1, pitch2, pitch3, result” and the next line reads “result, pitch3, pitch2, pitch1”. We’ll try to get this cleared up soon.

We’ve also moved all the Win Probability graphs out of the Team section and into the Scoreboard section. We’re just trying to make things a little more organized, and it will eventually allow us to vastly enhance our team stats.

If you have any problems or suggestions on how to improve the new scoreboard or play-by-play sections, don’t hesitate to let us know!


Preview: Scoreboard

Thought I’d show a quick preview of our new scoreboard. This way you’ll be able to quickly see all the day’s graphs in one convenient place.

You’ll also notice that the box score and play log links aren’t quite working yet. I’m hoping they will be sometime next week.


THT Projections: A (Quick) Closer Look

Earlier this week the much-anticipated Hardball Times 2007 Season Preview was released, and with it a brand new projection system. I recently took a look at the Bill James, CHONE, ZiPS, and Marcel projection systems to see how they differed. Let’s throw THT into the mix and see where its major differences are.

First off, let’s see how THT fares against the other projection systems in OPS and ERA as a whole when compared to the Marcel projection system (the simplest of the five).

System        ERA-R^2    OPS-R^2
ZiPS             .725       .908
Bill James       .714       .875
CHONE            .699       .865
THT              .681       .837

And in English: when the other projection systems are compared to the Marcel projection system, THT’s system is the least similar. (This looks at batters with 300+ at-bats and pitchers with 100+ innings.)
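
For the curious, here is a minimal sketch of how an R^2 like the ones in the table can be computed. It assumes R^2 here means the squared Pearson correlation between two systems' projections for the same players. The `r_squared` helper is my own name, and the sample uses only the Marcel and THT OPS projections for the five batters listed in this post, so it will not reproduce the full-population figures.

```python
def r_squared(xs, ys):
    """Squared Pearson correlation between two equal-length lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of the same length (2+ values)")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("a list with no variation has no correlation")
    return (sxy * sxy) / (sxx * syy)


# OPS projections for the five batters in this post, in table order:
# Thomas, Ramirez, Cano, Duncan, Cabrera.
marcel_ops = [0.874, 0.843, 0.852, 0.891, 0.787]
tht_ops = [0.982, 0.714, 0.766, 0.753, 0.715]

# A five-player sample of the biggest disagreements, so expect a low value.
print(round(r_squared(marcel_ops, tht_ops), 3))
```

Running the same function over every qualifying player, once per system against Marcel, would give one column of the table.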

So which batters does THT disagree on the most in terms of OPS?

Name            Bill James    CHONE   Marcel     THT    ZiPS
Frank Thomas          .939     .853     .874    .982    .892
Hanley Ramirez        .801     .791     .843    .714    .777
Robinson Cano         .860     .842     .852    .766    .836
Chris Duncan          .862     .776     .891    .753    .803
Melky Cabrera         .766     .796     .787    .715    .800

Except for Frank Thomas, whom THT projects to have a phenomenal season, THT has the lowest projection for each of these players. It’s interesting to note that the other four are all first- or second-year major league players. There’s generally a lot of disagreement about Chris Duncan and Hanley Ramirez, but for Robinson Cano and Melky Cabrera, THT appears to be the lone dissenter. Let’s look at the pitchers:

Name            Bill James    CHONE   Marcel     THT    ZiPS
Tony Armas Jr.        4.85     4.64     4.96    5.81    4.88
Carlos Zambrano       3.40     3.47     3.48    2.77    3.46
Cliff Lee             4.43     4.20     4.48    5.04    4.55
James Shields         4.03     4.29     4.72    5.03    4.70
Brandon Webb          3.53     3.60     3.65    3.07    3.85
Randy Johnson         4.31     3.77     4.33    3.43    3.63

THT clearly hates Tony Armas Jr., projecting an ERA about a point higher than the others, while it loves Carlos Zambrano, whom it has at an ERA about .75 lower than the other systems. I threw in Randy Johnson since he was next on the list. The projections for him look pretty evenly divided between a 4.30-ish ERA and a 3.50-ish ERA.
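
One simple way to rank disagreements like these is to subtract the average of the other four systems from THT's number. The sketch below does that with the ERA figures from the pitcher table above. The ranking rule and the `gap_from_others` helper are my own illustration, not necessarily how the list was actually built.

```python
# ERA projections copied from the pitcher table in this post.
projections = {
    "Tony Armas Jr.":  {"Bill James": 4.85, "CHONE": 4.64, "Marcel": 4.96, "THT": 5.81, "ZiPS": 4.88},
    "Carlos Zambrano": {"Bill James": 3.40, "CHONE": 3.47, "Marcel": 3.48, "THT": 2.77, "ZiPS": 3.46},
    "Cliff Lee":       {"Bill James": 4.43, "CHONE": 4.20, "Marcel": 4.48, "THT": 5.04, "ZiPS": 4.55},
    "James Shields":   {"Bill James": 4.03, "CHONE": 4.29, "Marcel": 4.72, "THT": 5.03, "ZiPS": 4.70},
    "Brandon Webb":    {"Bill James": 3.53, "CHONE": 3.60, "Marcel": 3.65, "THT": 3.07, "ZiPS": 3.85},
    "Randy Johnson":   {"Bill James": 4.31, "CHONE": 3.77, "Marcel": 4.33, "THT": 3.43, "ZiPS": 3.63},
}


def gap_from_others(systems, target="THT"):
    """Target system's projection minus the mean of all the other systems."""
    others = [value for name, value in systems.items() if name != target]
    return systems[target] - sum(others) / len(others)


gaps = {name: gap_from_others(systems) for name, systems in projections.items()}

# Largest absolute disagreement first.
ranked = sorted(gaps, key=lambda name: abs(gaps[name]), reverse=True)
for name in ranked:
    print(f"{name:<16}{gaps[name]:+.2f}")
```

A positive gap means THT is more pessimistic than the rest (higher ERA), and a negative gap means it is more optimistic.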

Anyway, the THT projections are certainly similar to the others, but there are clearly a number of key differences that are worth a look. There’s also a lot more to projections than ERA and OPS, so I’m sure you’ll find many other unique aspects to THT’s projection system. As with any projection system, we’ll have to wait and see which one turns out to be the most accurate for 2007.


Bill James Projections – Updated

I thought I’d mention that the Bill James Handbook projections on FanGraphs have been updated with the latest and greatest.

“… many things happen during the offseason that change playing time for the coming season. That’s why we produce The Bill James Handbook: Projections Update with cutting-edge projections reflecting changes through the last couple days of February.

We adjust projections for many reasons, including:

-Playing time adjustments
-Free agent signings (including four Japanese rookies)
-Trades
-Injuries
-Ballpark changes”

I think FanGraphs now has three of the four rookies in the database, the exception being Daisuke Matsuzaka. He’ll show up the next time he makes a spring training start. For those who can’t wait, the Bill James Handbook has him at 19-2 with a 3.13 ERA in 190 innings.

As always, if you’d like to dice, slice, and sort the Bill James Handbook projections to your heart’s content, you’ll have to purchase them here.

While we’re on the topic of projections, I’d like to give a quick shout-out to the new Hardball Times 2007 Season Preview. Besides the great commentary on teams from many of your favorite bloggers, it has player projections through 2009. I’m still digging in, but it’s full of fun and useful stuff.


Win Probability Changes

You may have noticed the Win Probability numbers have changed slightly. Don’t panic! There have been a few changes, for the better.

First off, we’re now using Tangotiger’s updated win expectancy tables, which no longer assume a flat 5.0 runs-per-game environment. Instead, we’re using the league-average run environment for the home team’s league. This puts batters and pitchers on “equal footing”, and you should now be able to accurately compare batters and pitchers using WPA.

Second, we’re also using Tangotiger’s run expectancy tables to calculate Batting Runs Above Average (BRAA) for both batters and pitchers. Once again, the run environment is set to the league average for the home team’s league.

Next to BRAA there is a column titled “REW”, which stands for Run Expectancy Wins. This is a replacement for OPS Wins: we no longer need to estimate wins in a context-neutral environment, since we’re now using run expectancy.

Finally, Clutchiness has been shortened to Clutch (Clutchiness was excessively long) and is calculated as WPA/LI – REW.

Update (3/4/2007): Clutch has been switched back to being calculated with OPS Wins. More on this later.
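
As a quick illustration of the Clutch formula, here is a tiny sketch that reads “WPA/LI” literally, as a player’s WPA divided by his average Leverage Index. That reading is an assumption (a play-by-play sum of WPA/LI is another plausible one), and the function and parameter names are made up. The `baseline_wins` argument stands in for REW in the original formula, or for OPS Wins after the update.

```python
def clutch(wpa, avg_leverage_index, baseline_wins):
    """Leverage-adjusted WPA minus a context-neutral wins baseline.

    baseline_wins is REW in the formula as first posted, or OPS Wins
    after the 3/4/2007 update.
    """
    if avg_leverage_index <= 0:
        raise ValueError("average Leverage Index must be positive")
    return wpa / avg_leverage_index - baseline_wins


# Made-up season line: 1.8 WPA at an average LI of 1.2, 1.0 baseline wins.
print(round(clutch(wpa=1.8, avg_leverage_index=1.2, baseline_wins=1.0), 2))
```

A positive result says the player added more wins, after adjusting for leverage, than his context-neutral production would suggest.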

Typically players remain in the same order, but their values have changed slightly. Batters should be slightly more valuable and pitchers slightly less valuable based on WPA scores.


Heath Bell – Maybe This Year

I’ll admit, I’m a Heath Bell proponent. Last year I expected big things from the 28-year-old reliever, who ended up posting a 5.11 ERA in just 22 relief appearances. He didn’t quite live up to my lofty expectations:

“Don’t be surprised if he becomes an important piece of the Mets bullpen next season.”

Well, Heath, it’s a new year and you have a brand new team (Padres) with new fans to impress. Let’s see where things went wrong last year and whether they’re going to happen again this year.

He has pretty much everything you’re looking for in a relief pitcher: high strikeout rate, low walk rate, and he’s even a ground ball pitcher. He’s clearly mastered Triple-A where in 2006 his K/9 was over 14! Not to mention he posted an ERA of 1.29 in 35 innings.

Yet what has plagued him in the majors is his extraordinarily high batting average on balls in play (BABIP). His BABIP was .374 in 2005 and an insanely high .394 in 2006, which just happened to be the highest in the majors for pitchers with over 30 innings pitched. The same problem plagued him last year in AAA too, where he had a .378 BABIP, the 11th highest at the AAA level.

Typically, with a BABIP this high, you’d think he was just getting unlucky, but it’s hard to ignore the past two years’ worth of data. So despite his incredible peripherals, maybe this is just who he is?

It’s clear the Mets, at least at the major league level, never had a whole lot of confidence in him. Of the regular relievers, he was used in the least important situations. His average Leverage Index (LI) was a measly 0.35, with a Leverage Index of 1 being an average situation (the higher the leverage, the more important the situation). The previous year was not much different: his LI was 0.65, the third lowest on the team.

His 2006 ERA of 5.11 is mainly the result of three games that were already out of hand before he even entered.

– On 9/26 he entered the game with the Mets trailing by 6 runs and gave up another 6 runs.

– On 9/11 he entered the game with the Mets trailing by 6 runs and gave up 5 additional runs.

– On 7/2 he entered the game with the Mets trailing by 3 runs and gave up 4 runs and an additional 4 unearned runs.

So, if we take away these three horrible (meaningless) outings, his ERA ends up being 1.76. Maybe you question whether the game on 7/2 was completely meaningless. If we leave that one in, his ERA is still a pretty nice 2.72.
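
The recalculation is just the usual ERA formula (nine times earned runs, divided by innings pitched) applied to a trimmed game log. Here is a small sketch of it. The outings are made-up placeholder data, not Heath Bell’s actual game log, and the helper names are mine. Innings are stored as outs recorded so that thirds of an inning don’t cause rounding trouble.

```python
def era_from_outs(earned_runs, outs):
    """ERA = 9 * ER / IP, with IP expressed as outs / 3."""
    if outs <= 0:
        raise ValueError("need at least one out recorded")
    return 27 * earned_runs / outs


def era_without(outings, skipped_games):
    """ERA over the game log after dropping the named outings."""
    kept = [o for o in outings if o["game"] not in skipped_games]
    total_er = sum(o["er"] for o in kept)
    total_outs = sum(o["outs"] for o in kept)
    return era_from_outs(total_er, total_outs)


# Hypothetical four-game log (placeholder numbers for illustration only).
game_log = [
    {"game": "A", "outs": 6, "er": 0},
    {"game": "B", "outs": 3, "er": 1},
    {"game": "C", "outs": 3, "er": 5},  # the blowout outing
    {"game": "D", "outs": 6, "er": 0},
]

print(round(era_without(game_log, skipped_games=set()), 2))   # full log
print(round(era_without(game_log, skipped_games={"C"}), 2))   # blowout removed
```

With so few innings in a reliever’s season, dropping one or two bad outings moves the ERA a long way, which is exactly the effect described above.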

Heath Bell is getting a fresh start this year and despite his historically awful BABIP, his strikeout and walk rates are just too good to ignore. I’ll stick with my same prediction as last year: I’d be surprised if Heath Bell didn’t become an important fixture in the Padres bullpen.


Spring Training

Don’t worry, Cactus and Grapefruit Leagues; you have not been forgotten. 2007 Spring Training stats are now included in the FanGraphs player stats pages. Unfortunately, the stats are very basic, but at least you can get a feel for how your favorite players are doing.

These will be updated nightly, and Spring Training leaderboards should be up sometime tomorrow.