Author: Dan Szymborski | Page 85

Projection Hindsight Is 20/20 and It’s Totally Awesome

March 23, 2020

One of the things you have to get used to when you work with projections is being wrong. Like, All. Of. The. Time. While I’d like to believe that the projections are accurate and it’s just real life that mucked things up, that isn’t quite how they work. There are always events you didn’t see coming, assumptions you made erroneously, and just plain old irreducible error, all of which are going to thwart you.

On a basic level, you’re supposed to be wrong. Imagine a world in which you knew, for an exact fact, that every team was a coin flip to win every game. With this perfect knowledge, you’d still expect nearly a quarter of the league to win either 73 games or fewer, or 89 games or more, through nothing but luck. For the math-inclined, this is a hypergeometric distribution, not a binomial one; the coin flips are not independent because the win totals will still add up to 2,430 and one team’s win invariably is another team’s loss. Here’s a quick table for some of the win totals, showing the probability of a team winning exactly X games and how many of the teams you’d expect to have won up to X games:

Win Probabilities, Major League Coin-Flipping

Wins	Probability	1-in-X Chance of Occurring	Cumulative
70	1.4%	73	5%
71	1.8%	56	6%
72	2.3%	44	9%
73	2.8%	35	12%
74	3.4%	29	15%
75	4.0%	25	19%
76	4.6%	22	24%
77	5.2%	19	29%
78	5.7%	18	35%
79	6.1%	17	41%
80	6.3%	16	47%
81	6.4%	16	53%
82	6.3%	16	60%
83	6.1%	17	66%
84	5.7%	18	71%
85	5.2%	19	76%
86	4.6%	22	81%
87	4.0%	25	85%
88	3.4%	29	89%
89	2.8%	35	91%
90	2.3%	44	94%
91	1.8%	56	95%

As an example, you’d expect 3.4% of those coin flip teams to win exactly 74 games, with 15% of all teams winning up to 74 games.

But we don’t have anywhere near perfect knowledge about how good a team will be. We’re not even in the same zip code as “near perfect”; we just hope to be on the right continent. As a result, our error bars are going to be significantly larger than even the rather erroneous results you still get with omniscient projections. Read the rest of this entry »

Dan Szymborski FanGraphs Chat – 3/19/2020

by Dan Szymborski

March 19, 2020

1:03	Mac: Does ZiPS come with standard deviations? If so who has the largest assuming you’re able to leave playing time out of equation (Otherwise it would just be the large difference between healthy Trout and out for the season Trout)?

1:03	Dan Szymborski: Happy Thursday!

1:03	Dan Szymborski: I don’t specifically spit out a standard deviation, but I do it from the other side: specific events and the probability of those (like a .300 BA, 40 HR, etc)

1:04	Dan Szymborski: And I have projectile percentages in beta right now as I work out the kinks, mainly due to defensive volatility and projections

1:04	LAXTONTO: Can you do my online recordings for my grad students for me?

1:04	Aceman: Dynasty

Read the rest of this entry »

COVID-19 Roundup: Penny Stipends but No Dollar Answers

by Dan Szymborski

March 18, 2020

This is the latest installment of a daily series in which the FanGraphs staff rounds up the latest developments regarding the COVID-19 virus’ effect on baseball.

While there’s no big MLB update on any start to the 2020 season — nor will there likely be for awhile — the hunkering down of baseball teams, along with the rest of the country, continues. MLB announcing there wouldn’t be any games for at least a couple months has moved the focus, as it ought to be, towards the mitigation of the current situation rather than practical questions about how many games will be played, where, or when.

MLB Clubs Establish a Fund for Ballpark Employees

MLB clubs have committed $30M — $1M apiece — to assist the ballpark employees affected by the delayed start to our season. pic.twitter.com/ZzJOkxGt2e

— MLB (@MLB) March 17, 2020

Ballpark employees are some of the people most affected by the suspension of the 2020 season. There’s no telecommuting or even a skeleton crew still working as you see in many customer-facing businesses, so these employees are suffering de facto layoffs, even if hopefully temporary. With the hospitality industry one of the sectors suffering the quickest in this environment, simply finding another job isn’t an option for many of these workers. These employees tend to make up a very small percentage of a team’s costs, and keeping the team’s trained workforce around is at a minimum an exercise in enlightened self-interest. Read the rest of this entry »

How Much Do the Playoff Odds Change in a Shorter Season?

by Dan Szymborski

March 17, 2020

Will there be a 2020 baseball season? How many games will teams play? What will that mean for the 2020 baseball season? Normally, these would be extremely upsetting questions to contemplate; in the world in which we’re currently living, they’re somewhere around the 75,000th most important quandaries facing us. But as someone qualified to serve as a baseball writer rather than an epidemiologist, they’re also the kinds of questions I can actually seek to answer, and the differences between how baseball will eventually look versus what we’re used to are bigger than you might think. Assuming we have a season, that is; if no games are played, the projections will be 100% accurate.

So how much do the playoff races change in a shorter season? To answer this, I spent the weekend reconfiguring ZiPS so that it wouldn’t assume a 162-game season — an eventuality I had hoped not to have to deal with unless or until there was a strike — allowing me to run playoff probabilities for seasons of any length. Let’s start with the baseline projections, how ZiPS saw the races before the world turned upside down:

ZiPS Projections Pre-COVID-19 Delay

Team	W	L	GB	PCT	Div%	WC%	Playoff%	WS Win%
New York Yankees	96	66	—	.593	61.3%	29.2%	90.5%	12.7%
Tampa Bay Rays	92	70	4	.568	32.6%	44.6%	77.2%	7.8%
Boston Red Sox	85	77	11	.525	6.0%	25.9%	31.9%	2.0%
Toronto Blue Jays	73	89	23	.451	0.0%	0.8%	0.9%	0.0%
Baltimore Orioles	57	105	39	.352	0.0%	0.0%	0.0%	0.0%
Team	W	L	GB	PCT	Div%	WC%	Playoff%	WS Win%
Minnesota Twins	91	71	—	.562	60.9%	14.5%	75.4%	8.5%
Cleveland Indians	88	74	3	.543	30.3%	20.9%	51.2%	4.4%
Chicago White Sox	82	80	9	.506	8.7%	10.0%	18.7%	1.3%
Kansas City Royals	71	91	20	.438	0.2%	0.2%	0.3%	0.0%
Detroit Tigers	63	99	28	.389	0.0%	0.0%	0.0%	0.0%
Team	W	L	GB	PCT	Div%	WC%	Playoff%	WS Win%
Houston Astros	93	69	—	.574	69.2%	15.0%	84.1%	10.7%
Oakland A’s	88	74	5	.543	25.2%	27.3%	52.5%	4.4%
Los Angeles Angels	82	80	11	.506	5.3%	10.3%	15.6%	1.0%
Texas Rangers	74	88	19	.457	0.4%	1.2%	1.6%	0.1%
Seattle Mariners	62	100	31	.383	0.0%	0.0%	0.0%	0.0%
Team	W	L	GB	PCT	Div%	WC%	Playoff%	WS Win%
Washington Nationals	91	71	—	.562	42.1%	29.5%	71.7%	6.5%
Atlanta Braves	90	72	1	.556	34.8%	31.5%	66.3%	5.5%
New York Mets	87	75	4	.537	18.2%	28.1%	46.3%	3.2%
Philadelphia Phillies	82	80	9	.506	4.8%	13.4%	18.2%	1.0%
Miami Marlins	69	93	22	.426	0.0%	0.1%	0.1%	0.0%
Team	W	L	GB	PCT	Div%	WC%	Playoff%	WS Win%
Chicago Cubs	85	77	—	.525	38.1%	8.5%	46.6%	3.4%
Milwaukee Brewers	83	79	2	.512	23.5%	7.5%	31.0%	2.1%
St. Louis Cardinals	82	80	3	.506	20.9%	7.2%	28.1%	1.8%
Cincinnati Reds	82	80	3	.506	16.9%	6.2%	23.1%	1.5%
Pittsburgh Pirates	71	91	14	.438	0.6%	0.2%	0.8%	0.0%
Team	W	L	GB	PCT	Div%	WC%	Playoff%	WS Win%
Los Angeles Dodgers	101	61	—	.623	92.7%	5.9%	98.7%	18.5%
San Diego Padres	87	75	14	.537	6.0%	43.4%	49.4%	2.7%
Arizona Diamondbacks	82	80	19	.506	1.3%	17.7%	18.9%	0.8%
Colorado Rockies	72	90	29	.444	0.0%	0.7%	0.7%	0.0%
San Francisco Giants	69	93	32	.426	0.0%	0.1%	0.1%	0.0%

Read the rest of this entry »

Dan Szymborski FanGraphs Chat – 3/12/2020

by Dan Szymborski

March 12, 2020