August Offense
Since I haven’t finished the second post on projecting players yet, but don’t want to have another day with no substantive posts, here’s a stub.
M’s offense, by month:
April: .266/.310/.413, 736 PA, .147 ISO, 5.3% BB, 14.5% K, .283 BABIP, 97 sOPS+
May: .291/.345/.433, 1162 PA, .142 ISO, 6.9% BB, 12.4% K, .309 BABIP, 108 sOPS+
June: .290/.348/.416, 1053 PA, .126 ISO, 7.1% BB, 16.2% K, .323 BABIP, 102 sOPS+
July: .262/.320/.374, 1045 PA, .112 ISO, 7.4% BB, 16.2% K, .294 BABIP, 84 sOPS+
August: .333/.374/.527, 748 PA, .194 ISO, 5.2% BB, 14.6% K, .354 BABIP, 131 sOPS+
For those not familiar with the terms, ISO is Isolated Slugging, or SLG-BA, which basically tells you the power that a player (or team, in this case) is hitting for. For instance, looking at SLG, you’d think the team hit for more power in May than in April, but that’s actually not true – the SLG was just inflated by an increase in singles. BB% and K% should be self explanatory, and help show just how aggressive this team is. BABIP is batting average on balls in play, essentially removing home runs and strikeouts from the equation, and show how often a ball that stays in the field turns into a hit. And sOPS+ is the team’s park adjusted OPS compared to the league for each month. So, in April, the team hit 3% worse than average, then improved to 8% above average in May, 2% above average in June, fell apart to the tune of 16% below average in July, but has owned August, hitting 31% better than the rest of the league.
As you can see, the increase in offense has been driven by two forces – an increase in power (I’m looking at you, Raul) and a huge increase in BABIP (Yep, that’s Vidro). With those two reaching unprecedented levels of production and the rest of the offense not taking a nosedive, the M’s have turned into a veritable run scoring machine.
We’ve talked about the sustainability factor of those performances, but here’s the thing – this hyper aggressive approach comes with significant variance, but that variance comes with a built in upside – over any period of weeks, the M’s can absolutely blister the ball. This is basically the blueprint the White Sox used to score their way to a World Series title – swing the bat a lot, get a lot of hits, have parade.
If the M’s make it to October, their offense is good enough to win the whole thing. On Groz’s show a few months ago, I commented that the big chore would be for this team to just make the playoffs, because if they got in, they’re built very well for October play. And I think that still holds very true.
Game 125, Mariners at Rangers
Weaver vs Loe, 5:35 pm.
Thanks to a brutal travel schedule that has the M’s playing 44 games in 45 days, which began last weekend with the White Sox series and has already included three games on the notoriously tough Metrodome turf and a couple of flights as the M’s head into the sweltering heat of Texas in August, John McLaren mentioned that this road trip would be a good time to use his bench to keep everyone fresh. Which explains tonight’s line-up. Wait, that’s the standard line-up? Oh.
Moving on to a topic that doesn’t make me want to run a cheese grinder across my forehead, the Mariners face Kameron Loe tonight. For my money, Loe is one of the more underrated pitchers in baseball. Over the last three years, he’s established himself as a heavy groundball guy who throws enough strikes and misses enough bats to be effective. However, he’s played in front of some lousy defenses in a park that isn’t exactly friendly to contact pitchers, so people haven’t caught on yet, but he’s got a similar skillset to Jake Westbrook, and I wouldn’t be surprised if things started clicking for him sooner rather than later.
2007 Fans Scouting Report
Tom Tango is back for another go around for his fan’s scouting report. Rather than just re-write the explanation, I’ll just quote last year’s post on the subject.
Friend of USSM and extremely smart guy Tom Tango is putting on his annual Fan Scouting Report. With all the work done to try to quantify defensive analysis, Tango is taking the Wisdom of Crowds approach. None of us are defensive experts, but if you’ve read the wisdom of crowds thesis, you realize that we all have our own small insights to add, and when you get a large representative sample, you can end up with some very good information.
By bringing together a lot of people who watch a lot of baseball, he’s compiled ratings based on the popular consensus of how good defensively players are. The data certainly isn’t the be-all, end-all of defensive analysis, but it’s very good information to have.
So, if you’d like to participate, go fill out the survey. However, please read the instructions. Tom takes a unique approach to evaluating defensive skills, and one that I whole-heartedly agree with; he wants to know what kind of skills a player has, and not how those skills compare to the relative merits of the other players who man that position. Or, as he states it:
Try to judge “average†not as an average player at that position, but an average player at any position. If you think that Chone Figgins has an average arm, then mark him as average, regardless if you’ve seen him play CF, 1B, 2B, or 3B.
DO NOT CONSIDER THE POSITION THE PLAYER PLAYS!
DO NOT CONSIDER THE POSITION THE PLAYER PLAYS!
DO NOT CONSIDER THE POSITION THE PLAYER PLAYS!
Position-independant defensive analysis is superior, in my opinion. Do we want to slag Derek Jeter’s defense at shortstop because his range isn’t good, relatively, to other shortstops, while giving Troy Glaus a pass because his defense at third isn’t as bad compared to other 3B? No – Jeter’s a significantly better defensive player than Glaus, but that gets lost in the shuffle of defensive position rankings.
So, let me encourage you guys to go fill out the survey. Tom’s been extremely helpful to us, and this is a great chance for us to return the favor.
Basically, what he’s asking you to do is rate the players you’ve seen play more than 10+ games this season in terms of different skills. Since we’ve all watched a lot of Mariner games, that’s where our opinion will be most useful. But if you live in another city and catch a lot of games from other teams, too, feel free to fill in the data for non-Mariner players.
This isn’t the be-all, end-all of defensive analysis, but it’s another little piece of evidence we can look at when trying to put the puzzle together. So, check it out.
Out with the old
In with the older. John Parrish will be DFA’d today, making room for 38-year-old Rick White on the roster.
Yep. Rick White. The M’s are hellbent on having someone with experience in their bullpen, and it apparently does not matter if they can get major league hitters out or not.
Game 124, Mariners at Twins
Batista vs Silva, 10:10 am.
As promised, a couple of changes in the line-up today. Bloomquist at second in favor of Lopez and Broussard DH’s for Vidro, so Beltre moves up to the #2 hole. The good news – Vidro and Lopez’s simultaneous days off means that we’re probably done with the Turbo as an infielder experiment. Thank God.
Game 123, Mariners at Twins
Washburn vs Baker, 5:10 pm.
I’m throwing the game thread up before the line-up gets announced so that I don’t have any chance to react on the off chance that Kenji’s playing center field and hitting leadoff today or something. Because, for this thread, I want to put down my analyst hat for a second.
The Mariners are hosting something called Lolla-Blue-Za on Monday – the M’s are attempting to leverage the first playoff run we’ve seen in years into some city wide fan excitement, and are asking everyone to wear blue to to the Mariners-Angels series next Monday-Wednesday.
The name is ridiculously corny, but that doesn’t matter – I’m on board. The M’s are winning, there’s a pennant race in Seattle, and some of the best memories of my life were made at the Kingdome during the summer months of 1995. There’s something legitimately exciting about being at the ballpark in a game that means something, cheering on every two strike pitch, and demanding curtain calls from the players who hit monumental home runs.
So, if you’re going to the M’s-Angels series next week, wear blue and yell a lot. It’s been a long time since games at Safeco mattered as much as those three games will. We can talk about sustainability of performances all day long, but in the end, we all just want the team to win. So go nuts next week rooting for a sweep.
Game 122, Mariners at Twins
Ramirez vs Garza, 5:05 pm.
HoRam on the road. Hilarity ensues.
Projecting Future Performance
Last week, Geoff Baker wrote a series of blog posts that dealt with the issue that has been dominating the blogosphere conversation for most of the past three months – the playing time of Adam Jones, Raul Ibanez, and Jose Vidro, and how it should be distributed. Don’t worry – this post is not about that topic. At least, not explicitly. This post is about a commonly accepted principle that was laid out very well by Baker in that trio of entries. The idea is summed up in this statement:
It’s going to be hard to keep Raul Ibanez out of the lineup now that he’s hit six home runs in nine games. Equally tough to sideline Jose Vidro now that he’s back to being a hits machine. I was all for playing Adam Jones every day when those other guys were struggling back in July. But things have changed. The veterans have stepped up and earned their playing time of late.
In July, Geoff was on board with the belief that Adam Jones would be able to help the Mariners as an everyday player, and the struggling veterans should be ceding playing time to the more talented youngster. He felt the struggles of guys like Vidro and Ibanez warrented a change, and Jones provided a superior option. He doesn’t feel that way anymore. Why? Because Raul Ibanez and Jose Vidro are hitting well recently, and Baker believes in the predictive power of the hot hand.
This isn’t a unique position. Almost everyone believes in the predictive power of the hot hand. The overwhelming majority of people in America base their future expectations – not just in sports, but in life – on their most recent experience. In sports, this is even more prevalent, as we’ve all witnessed players perform at a level far beyond what we expected them to do. Joe Dimaggio’s 56 game hit streak may be one of the most celebrated records in sports. Seattle saw Ken Griffey Jr hit home runs in eight consecutive games. Or, to bring it back to the current reason for this discussion, Raul Ibanez has seven home runs in his last 48 at-bats after hitting six bombs in his first 372 at-bats. He’s on fire. He’s swinging the bat well. Each pitch looks like a beachball. Pick your cliche`.
We all know a hot streak when we see one, even if we don’t know why they occur. There’s a debate about whether hot streaks are random fluctuation of events or an actual change in skills for a temporary period of time. I don’t even begin to know the answer to that question, and I can see the validity of both arguments. But that’s not what this post is about.
No, this post is about the predictive power of the hot streak and how that should affect our expecations. As Geoff laid out in the three linked blog entries above, the common wisdom is recent success should be a huge factor in determining playing time. Raul Ibanez is on fire (over 48 at-bats) and Adam Jones hasn’t earned his playing time (over 23 at-bats), and those performances were enough to change Geoff’s mind about who should be taking the field for the rest of the year. Getting away from that specific discussion, the issue I want to address is how much credence we should give recent performance in developing our expectations for how a player should perform going forward, even in the very near future.
And, you know me, I’m not a big fan of developing opinions on anecdotal evidence. I know there are random examples that we can cite to support any cause we want, but I don’t particularly care about that kind of analysis. I want to know what a large swath of history tells us about the predicitve power of recent performance. Do hot hitters actually perform better, even for short periods of time, once we’ve identified that they’re hot hitters?
Keep in mind – this is a statistical argument. This isn’t one of these cases where all the people who think I’m an idiot who needs to care less about the numbers can tell me to get my head out of a spreadsheet and go watch a game, because the hot streak supporters are making an argument based on numbers. All I’m doing is testing the hypothesis of whether the numbers they’re choosing to believe in actually have any meaning.
Okay, so now that the overly long introduction is out of the way, let’s look at the evidence. The best research done on this issue that I’ve ever read comes from The Book: Playing the Percentages in Baseball, written by Tom Tango, Mitchel Lichtman, and Andy Dolphin. For people who care at all about baseball statistics, The Book is a must read. These guys are among the very best researchers on baseball issues alive, and The Book is a comprehensive review of almost any question relating to statistics you’d want to see asked. While it’s not the easiest reading you’ll ever have, it still comes highly recommended.
In the second chapter of The Book, the guys tackled the very question this post deals with – do hot streaks present any kind of real information that is useful in understanding how a hitter is likely to do going forward? To test this, they pulled in every play from the 2000 to 2003 seasons and identified hot and cold streaks as the upper and lower 5% of all performances over any five game sample that included at least 20 plate appearances. The best 5% of performances went into a hot bucket and the worst 5% went into a cold bucket. That gave them 543 unique players creating a total of 6,408 “hot streaks”, and 633 players creating a total of 6,489 cold streaks. With nearly 13,000 streaks in the sample, they eliminated nearly any bias complaint you could happen to have with the study, and created a sample large enough to give us a conclusive answer – do the players who have been identified as “hot hitters” perform better than expected based on their historical averages, and vice versa, do the slumping hitters perform worse than expected in their next few games?
Without getting too deep into the statistical minutae (for that, you should buy The Book), here are the numbers (from page 56, for those of you who already own it) – for offensive performance, they use a metric called Weighted On Base Average, of wOBA for short, which essentially sums up total offensive performance and scales it to look like on base percentage. Think of it like OPS, just better, and on a different scale. .340 is average, .400 is great, .300 is bad. Just like OBP – but as a total sum of offensive production.
Average wOBA of hot hitters during streak: .587
Expected wOBA of hot hitters in 1 game after the streak: .365
Average wOBA of hot hitters in 1 game after the streak: .369
Expected wOBA of hit hitters in 5 games after the streak: .365
Average wOBA of hot hitters in 5 games after the streak: .369
As you can see, the production of the hitters in their sixth game after being identified as being hot (and hot doesn’t even begin to describe a .587 wOBA – that’s scorching), the players performed .004 better than expected if we had just used a three year average of their past performance and had no knowledge of what they’d done in their previous five games. Statistically significant? Yes, but by the thinnest of margins.
Since I’m wary of overstepping fair use and giving away too much copyrighted material, rather than spelling out the actual numbers of the cold hitters, I’ll tell you that the result in basically the same on the opposite end – the players performed worse than expected by an ever so tiny margin immediately after a five game super slump. They also re-ran the data over a seven game sample and looked at the performance in the following three games after being identified as hot or cold and found the numbers consistent with the five game samples.
But, I know, there will be some protests about how not all hot streaks are the same, and averaging 543 players together will be unfair to those who were really, truly hot. Thankfully, the guys included a list of the 10 hottest hitters over a seven game stretch. Marcus Giles had the most success run, going 18 for 25 with 7 extra base hits from July 25th through July 29th of 2003, good for a .720/.731/1.160 line. 18 for 25! His next 5 games? 0 for 4, 2 for 4, 0 for 4, 2 for 3, and 0 for 4, a grand total of 4 for 19 and a .211/.348/.368 line.
Giles was not alone. Of the ten hottest hitters from 2003, nine of them then proceeded to hit worse than expected (again, based on historical averages and ignoring the recent hot streak) in their next three games, with only Magglio Ordonez bucking the trend and continuing to hit well. From July 20th through July 24th, Ordonez went 13 for 19 with seven extra base hits, then went 12 for 20 with five more extra base hits in his next five games. That gave him a 25 for 39 stretch where he ran an 1.850 OPS over 46 plate appearances and is one of the best runs in recent baseball history. From July 31st through August 3rd, Ordonez followed this 10 game hot streak with an 0 for 14 series of hitless games, and in the 47 plate appearances (spanning 11 games) after we could identify him as one of the hottest hitters in recent memory, Ordonez hit .244/.340/.366.
The first sentence of the conclusion of the chapter, quoted from The Book:
Knowing that a hitter has been in or is in the midst of a hot or cold streak has little predictive value.
Historical evidence suggests that knowing that a player is on fire should do essentially nothing for our expecations of what he’ll do going forward, even in the very near future. In fact, given the choice of being totally ignorant of recent performance or knowing exactly how each player performed in a small sample, you would, in almost every case, be better off being totally ignorant. The natural tendancy to overstate the value of the predictive power of the hot streak (or cold streak) outweighs the sliver of actual useful information that is included in hot streak analysis. Because of our own biases, we’d make more correct decisions if we had less data.
Of course, the ideal isn’t to have less data, but to understand our biases and compensate accordingly, allowing us to live in a data-filled world and still make optimal decisions as often as possible. That’s part of what we’re trying to do here, and what statistical analysis does a good job of explaining – identify where human error leads us to drawing conclusions that are unsupported by the realities of life.
Going back to the Mariner-centric discussion that started this all, we have the Raul Ibanez/Adam Jones situation. If you, like Geoff Baker did, believed at the end of July that Adam Jones was a better player than Raul Ibanez and should be taking the field everyday, then nothing that has happened on the field since then should change your opinion. Raul Ibanez isn’t any more likely to hit well tonight than he was three weeks ago. His expected performance should be, for all intents and purposes, exactly the same. Whatever you thought of him on July 31st, you should also think of him now.
History paints a clear picture. Again, quoting from The Book (page 45):
One of the running themes of this book is that, very frequently, fans and analysts make too much from too little.
This is an important bias to keep in mind when performing any kind of analytical exercise. Our natural emotional reactions lead us to overvalue what has happened recently, and too often, we draw incorrect conclusions about what is going to happen based on things that have little or no real predictive value.
I actually have a lot more to write on the subject of correct player evaluations and projections (including talking about longer hot streaks, such as Jose Vidro’s, and how to evaluate a real change in performance), but for time and space reasons, I’m going to have to make that a post for another day.
Before I go, I’m going to make a request – please don’t turn the comments into another chance to rehash the same old argument we’ve been having for the last three months in the comment threads. If you feel that Ibanez should be starting due to clubhouse chemistry, veteran experience, or if you never felt that Jones was better than Ibanez, that’s fine – that’s also not what this post is about. The topic is about the predictive power of hot and cold streaks. I’ll be a much happier author if that’s what we talk about in the comments.
Game 121, White Sox at Mariners
Felix Day! It’s Felix Day!
Standard lineup except with Burke catching.
Game 120, White Sox at Mariners
BATTLE OF TEH OCCASIONALLY EFFECTIVE STARTERS!111ONEONEONE!!
Danks v Weaver!
What kind of a lineup would you run out against Danks? No no no no. No!
No.
CF-L Ichiro
DH-B Vidro
RF-R Guillen
LF-L Ibanez
3B-R Beltre
1B-R Sexson
C-R Johjima
2B-R Lopez
SS-R Betancourt
Yup! That’s right! You like that bench, Jones, you more-talented option that could improve the defense hugely with a fly-ball pitcher on the mound? You better, because that’s where you get to sit. Yeah!
