Blog Archives

MLB over-unders: Can anyone beat Las Vegas?

Back in March, several dozen websites, written by either professionals, bloggers, or, in some cases, professional bloggers, came out with predicted MLB win totals.

A predicted win total represents the number of wins this website or individual predicted for each major league team. These numbers can be easily compared to the Las Vegas line for each team (I used the one set by the Hilton) to determine if these predictions are worth our time, and, in some cases, our money.

Here are the sites I used:

O/U: The Hilton’s over/under for each team

BP: Baseball prospectus

TR: Team Rankings (caveat on the linked page: the site stresses their MLB predictions are a work in progress)

DP: Davenport

Zips: ZIPS projection system (espn.com)

PM: Prediction Machine

TB: Trading Bases, an avid blogger and book-writer

Here are my metrics

MSE: Averaged squared error between the prediction and the win totals*

MAE: Averaged absolute error between the prediction and the win totals*

Corr: Correlation between the predicted and the win totals*

*For win totals, I’m use each team’s estimated win totals from here (I’m too excited to wait until the end of the season!)

Results

	O/U	BP	TR	DP	Zips	PM	TB
MSE	68.59	62.50	84.56	70.47	75.37	79.76	61.04
MAE	6.65	6.75	7.73	6.75	7.01	7.22	6.53
Corr	0.68	0.71	0.59	0.67	0.66	0.61	0.72

Baseball prospectus appears to offer the only clear advantage over the Las Vegas line, at least among these predictions, as judged by a higher correlation and a lower MSE between observed and predicted values. As for team rankings & prediction machine, their results were both disappointingly bad. (Note: Trading Bases came into the picture after the initial post, and also appears to be a clear winner).

TeamRankings does offer this disclaimer about their projections:

A word of caution — while our preseason projections for other sports have proven to be useful indicators of where values may lie among the various full season futures bets, we’re not nearly as confident in our MLB preseason ratings. We’re publishing these in the interest of full disclosure, so that you know what the initial rating in our projection system was for each team. We’re most definitely not recommending that you use these ratings and forecasts to go place preseason bets.

Here’s the table of predicted wins for each site.

Team	O/U	BP	TR	DP	Zips	PM	TB	Simulated Wins

Diamondbacks	82.5	85	83	81	85	76.8	80	82.5
Braves	86.5	83	85	85	91	86.6	82	95.8
Orioles	78.5	75	81	75	82	79.2	76	86.2
Red Sox	82.5	85	79	85	84	80.5	83	97.2
Cubs	72.5	77	73	76	74	75.8	69	67.5
White Sox	80.5	76	83	76	80	85	78	64.2
Reds	90.5	92	84	86	90	91.1	84	92
Indians	78.5	80	74	79	80	76.8	85	87.9
Rockies	71.5	71	75	74	70	77.5	70	72.9
Tigers	92.5	91	86	95	91	89.7	95	94.5
Marlins	63.5	67	75	65	65	65.3	64	60.1
Astros	58.5	63	67	72	57	62.5	66	54.9
Royals	78.5	76	78	80	79	75	77	85.1
Angels	91.5	91	86	91	93	93.3	88	79
Dodgers	91.5	91	83	88	90	90.6	91	92.5
Brewers	81.5	78	83	78	81	77.6	78	73.3
Twins	68.5	65	74	69	66	70.9	66	69.6
Mets	75.5	80	78	76	66	76.8	74	73
Yankees	86.5	91	90	86	83	84.7	87	84.9
Athletics	84.5	83	86	84	78	85.3	85	94.6
Phillies	85.5	81	84	81	82	81	86	75.7
Pirates	77.5	80	77	81	77	74.8	79	92.1
Padres	73.5	76	78	76	73	72.7	81	76.1
Giants	87.5	85	85	92	87	85.1	88	75.2
Cardinals	82.5	85	86	83	85	85.1	90	94.6
Rays	86.5	87	88	86	88	89.5	93	89.2
Rangers	86.5	89	88	85	91	86.8	85	88.1
Blue Jays	88.5	84	78	86	94	87.5	82	73.8
Nationals	91.5	87	86	85	94	92.5	90	86.3
Mariners	77.5	78	79	73	74	74	78	71.4

Posted in Uncategorized

3 Comments

Tags: Baseball, MLB, over under, win projections

What does MVP mean anyway?

Nov 2

Posted by statsinthewild

By James O’Connor

What does it mean to be the Most Valuable Player in Major League Baseball? Is it the player who added the most wins to his team? The player who added the most wins to his team, so long as they made the playoffs? Can it ever be a pitcher? Is it the player who contributed the most to his team down the stretch?

Admittedly, MVP is always going to be a subjective award, based on the perception of the members of Baseball Writers Association of America selected, in any given year, to vote for the award. But modern statistics give us some insight into how those writers make their decisions—even if it is not entirely clear to them when they are voting.

First, let’s look at a few possible methods of selecting the MVP. Wins Against Replacement or WAR is a fairly recent statistical calculation that measures (words). It quite literally measures what player had the most valuable impact on his team in a given year. Yet, only five times in the past 25 years has the American League leader in WAR been awarded the MVP.

How about awarding the MVP to the highest WAR player on a playoff team? This may help. 21 of the previous 25 AL MVP’s were on teams that went to the playoffs (including for the purposes of this article, Frank Thomas, whose Chicago White Sox were in first place in the AL West in 1994 before the player’s strike ended the season). But even this doesn’t completely answer our question. Many of the players who won the award were not even the top WAR players on teams that made the post season. In 2006, Justin Mourneau won the MVP with a WAR of 4, while his teammate, Johan Santana led the league in WAR with 7.3 wins against replacement.

Some may argue that a pitcher should not be eligible for the award, anyway. This is where things get particularly tricky. One issue is that the voters change on a yearly basis. If a voter doesn’t believe a pitcher should not win the award, he can simply leave the pitcher off the ballot and doom his chances. See Martinez, Pedro, 1999. But the reality is, pitchers, do win the award on occasion. It has been posited that this honor is saved for pitchers only when they have had a truly transcendent and historic year. This theory is simply not backed up by the numbers.

In 2011, Justin Verlander was the AL MVP with a WAR of 8.2, good for second in the league. It was a truly awesome season by any measure. The league leader in WAR? You guessed it: Ben Zobrist. Zobrist carried an 8.5 WAR and his Rays made the playoffs. He finished 16^th in MVP voting. You can’t entirely blame the writers for going with Verlander though. He was a monster.

Going back for a moment though, how good was Verlander’s year historically? Obviously, WAR is not set up to measure 2011 Verlander to 1999 Pedro Martinez, but the statistic does measure them against their competition for the award. Over the last 25 years, the following pitchers led the league in WAR, but did not win the MVP: Roger Clemens (1987, 1992, 1997), Brett Saberhagen (1989), Kevin Appier (1993), Randy Johnson (1995), Pedro Martinez (1999, 2000), Santana (2006), Zack Greinke (2009). In fact, Clemens’ 1997 campaign (11.8) and Martinez’s 1999 season (11.4), are the two best seasons as measured by WAR in the last 25 years. So, was Verlander’s 2011 season more deserving of the MVP? Probably not.

It’s not as if the MVP voters are picking completely randomly, though. In fact, over the last ten years, seven AL MVPs were in the top three in WAR. Of course, the other three (Mourneau, 2006; Vladimir Guerrero, 2004; and Miguel Tejada, 2002), were 10^th or worse in the league in WAR. What happened and how did they win the MVP award?

A couple things to look at here. First, all three of those teams made the playoffs. That helps. As I mentioned earlier, despite the fact that Mourneau was not even first on his team in WAR, he was second to pitcher Santana, who undoubtedly lost votes based on the perception that pitchers should not win the award (however inconsistent this theme is, it’s impossible to ignore that it often does affect voting). Same story for Tejada in 2002, who was behind teammate Barry Zito (6.4) in WAR.

In 2004, the league leader in WAR was Ichiro Suzuki (7.5), whose team won 63 games. You can understand why the voters might not associate “valuable” with a team that lost 99 games. Guerrero took home this prize that year despite a WAR of 5.2, which was good for only 10^th in the league. Likewise, in 2002, Tejada and his 5.3 WAR won the award over Alex Rodriguez and his 8.6 WAR. Of course, Alex’s Rangers won 72 games.

So, the argument against the WAR winners from those seasons makes (some) sense. Two played for dreadful teams, one suffered from anti-pitcher bias. But what about the rest of the players ahead of Mourneau, Guerrero and Tejada? This is where intangibles step in. The question in voters minds: Did you have a MOMENT?

Let’s look, for example, at Guerrero’s 2004 season. His OPS for the final month of the season was 161 points higher than his full season OPS. While his Anaheim Angels only went 17-14 over that stretch, they finished on a tear going 7-2 and slipping into the playoffs. While his full season WAR was not at the top of the league, it looked (probably accurately) like he single handedly dragged them into the playoffs in the end. He had a MOMENT. It was fresh in the voters’ minds and he got the MVP.

So is it a great September performance that puts a player on the top of the MVP head? Not necessarily. In 2006, Mourneau put up a beefy .926 OPS, but in the final month of the season his OPS was only .884. What gives? Mourneau’s moment was not the end of the season, but rather the eight week period in June and July when the world was really introduced to the Twins first baseman. In June and July, Mourneau put up 1.137 and 1.130 respectively. Meanwhile his team went 37-15, a win/loss percentage of .711. For the season, the Twins won at a .596 clip. Mourneau had eight weeks where he played his best ball while his team tore through the league. He had a MOMENT.

Ok, so what about Tejada? He must have had one stretch where he absolutely crushed it, right? Not really. Tejada had a relatively so start in 2002, but for the most part he was consistent throughout the season. His monthly OPS was as follows: .813, .807, .867, .910, .879, .888. Definitely stronger in the second half, but not the major swings of Guerrero and Mourneau. So what was his moment? You’ve read or seen Moneyball, right? Yea, the most memorable thing to happen in 2002 (probably even more so than who won the World Series), was the Oakland A’s 20 game winning streak from August 13 to September 4. In a season in which the A’s lost Jason Giambi (who, interestingly led the league in WAR the previous year, but did not win the MVP) and Johnny Damon, Tejada was widely considered the best position player on the most memorable team of the regular season. In other words, that winning streak? It was a MOMENT.

A statistics purist would, of course point out that if these players were more consistent like, say all of the people ahead of them in WAR, then they would not have needed these MOMENTS. Undoubtedly true, but also sort of missing the point. Baseball is a game of memories. Intangibles. It’s about that Oakland fan who will never forget for the rest of his life the run the A’s made in 2002 or the heroics of Vlad Guerrero seemingly dragging his team into the playoffs in 2004. Yes, Rodriguez and Ichiro had great years in 2002 and 2004, but chances are most people (their agents aside), would just as soon forget those seasons ever happened. They were the most valuable players for their teams, but they were not the Most Valuable Players for baseball.

MVP will always be a contentious issue. Lots of times it won’t make sense. I mean, who had more of a MOMENT then Pedro Martinez in 1999 or Roger Clemens’ “twilight year” of 1997? But perhaps, through all the fog, there is some semblance of a rationale for how the writers vote for this thing—whether they know it or not.

Posted in Uncategorized

MLB Playoff Probabilities – 9/25/2012

Sep 26

Posted by statsinthewild

The big move from last week is Detroit who jumps from a 23.3% to a 55.3% chance to make the playoffs. Moving in the opposite direction you have the White Sox who lost five game in a row at the end of last week including being swept by the Angels. This have dropped them from 88.2%, where they were on September 18, to 45.5% today. They also fell 4 spots in the rankings to number 11.

Philadelphia, after climbing to 6.1% last week, appears to be done with their hot streak as they drop back to 1.7% and are all but done.

Baltimore has more or less locked up a spot in the playoffs getting to 99.6%, which is made even more impressive since their run differential is currently -7 (Update: -11). How can you not root for these guys?

StatsInTheWild MLB rankings as of September 25, 2012 at 8:15am. SOS=strength of schedule

Team	Rank	Change	Record	Projected Record	Prob make playoffs	SOS	Run Diff
NYY	1	↑1	89-64	94-68	100%	5	+107
Texas	2	↓1	91-62	95-67	100%	11	+119
Tampa Bay	3	↑3	83-70	87-75	11.8%	7	+102
Washington	4	↓1	93-60	98-64	100%	24	+142
LA Angels	5	–	84-69	87-75	5.7%	6	+81
Oakland	6	↓2	86-67	89-73	73.2%	8	+72
Baltimore	7	↑1	88-66	92-70	99.6%	4	-7
Atlanta	8	↑2	88-65	92-70	100%	19	+89
Detroit	9	–	81-72	86-76	55.3%	13	+49
Cincinnati	10	↑1	92-61	96-66	100%	30	+85
Chi WSox	11	↓4	82-72	86-76	45.5%	14	+65
SF	12	–	89-64	93-69	100%	26	+68
St. Louis	13	–	83-71	85-77	81.9%	29	+100
Seattle	14	↑2	72-81	75-87	0%	2	-36
Arizona	15	↑3	77-76	80-82	0.1%	25	+46
Boston	16	↓1	69-85	72-90	0%	3	-34
Toronto	17	↓3	67-86	71-91	0%	1	-62
LA Dodgers	18	↓1	79-74	83-79	5.7%	23	+10
Milwaukee	19	–	79-74	82-80	10.6%	28	+41
Philadelphia	20	–	77-76	81-81	1.7%	21	+7
Kansas City	21	–	70-83	74-88	0%	12	-50
NY Mets	22	↑2	70-83	74-88	0%	15	-58
Pittsburgh	23	↓1	75-78	78-84	0%	27	-20
San Diego	24	↓1	73-80	76-86	0%	22	-48
Minnesota	25	↑1	64-90	68-94	0%	10	-122
Miami	26	↓1	66-87	69-93	0%	16	-101
Cleveland	27	–	64-91	66-96	0%	9	-176
Colorado	28	–	59-94	63-99	0%	17	-129
Chi Cubs	29	–	59-94	62-100	0%	20	-122
Houston	30	–	50-104	52-110	0%	18	-218

Past Rankings:

Cheers.

Posted in Baseball, Sports

MLB Playoff Probabilities – 9/25/2012

Sep 26

Posted by statsinthewild

Philadelphia, after climbing to 6.1% last week, appears to be done with their hot streak as they drop back to 1.7% and are all but done.

StatsInTheWild MLB rankings as of September 25, 2012 at 8:15am. SOS=strength of schedule

Team	Rank	Change	Record	Projected Record	Prob make playoffs	SOS	Run Diff
NYY	1	↑1	89-64	94-68	100%	5	+107
Texas	2	↓1	91-62	95-67	100%	11	+119
Tampa Bay	3	↑3	83-70	87-75	11.8%	7	+102
Washington	4	↓1	93-60	98-64	100%	24	+142
LA Angels	5	–	84-69	87-75	5.7%	6	+81
Oakland	6	↓2	86-67	89-73	73.2%	8	+72
Baltimore	7	↑1	88-66	92-70	99.6%	4	-7
Atlanta	8	↑2	88-65	92-70	100%	19	+89
Detroit	9	–	81-72	86-76	55.3%	13	+49
Cincinnati	10	↑1	92-61	96-66	100%	30	+85
Chi WSox	11	↓4	82-72	86-76	45.5%	14	+65
SF	12	–	89-64	93-69	100%	26	+68
St. Louis	13	–	83-71	85-77	81.9%	29	+100
Seattle	14	↑2	72-81	75-87	0%	2	-36
Arizona	15	↑3	77-76	80-82	0.1%	25	+46
Boston	16	↓1	69-85	72-90	0%	3	-34
Toronto	17	↓3	67-86	71-91	0%	1	-62
LA Dodgers	18	↓1	79-74	83-79	5.7%	23	+10
Milwaukee	19	–	79-74	82-80	10.6%	28	+41
Philadelphia	20	–	77-76	81-81	1.7%	21	+7
Kansas City	21	–	70-83	74-88	0%	12	-50
NY Mets	22	↑2	70-83	74-88	0%	15	-58
Pittsburgh	23	↓1	75-78	78-84	0%	27	-20
San Diego	24	↓1	73-80	76-86	0%	22	-48
Minnesota	25	↑1	64-90	68-94	0%	10	-122
Miami	26	↓1	66-87	69-93	0%	16	-101
Cleveland	27	–	64-91	66-96	0%	9	-176
Colorado	28	–	59-94	63-99	0%	17	-129
Chi Cubs	29	–	59-94	62-100	0%	20	-122
Houston	30	–	50-104	52-110	0%	18	-218