HomeSportsBaseballzStats for Hitters, June Update

zStats for Hitters, June Update

Daniel Kucin Jr.-USA TODAY Sports

Among the panoply of stats created by Statcast and similar tracking tools in recent years are a whole class of stats sometimes called the “expected stats.” These types of numbers elicit decidedly mixed feelings among fans – especially when they suggest their favorite team’s best player is overachieving – but they serve an important purpose of linking between Statcast data and the events that happen on the field. Events in baseball, whether a single or a homer or strikeout or whatever, happen for reasons, and this type of data allows us to peer a little better into baseball on an elemental level.

While a lucky home run or a seeing-eye single still count on the scoreboard and in the box score, the expected stats assist us in projecting what comes next. Naturally, as the developer of the ZiPS projection tool for the last 20 (!) years, I have a great deal of interest in improving these prognostications. Statcast has its own methodology for estimating expected stats, which you’ll see all over the place with a little x preceding the stats (xBA, xSLG, xwOBA, etc). While these data don’t have the status of magic, they do help us predict the future slightly less inaccurately, even if they weren’t explicitly designed to optimize predictive value. What ZiPS uses is designed to be as predictive as I can make it. I’ve talked a lot about this for both hitters and for pitchers. The expected stats that ZiPS uses are called zStats; I’ll let you guess what the “z” stands for!

It’s important to remember that these aren’t predictions in themselves. ZiPS certainly doesn’t just look at a hitter’s zBABIP from the last year and go, “Hey, sounds good, that’s the projection.” But the data contextualize how events come to pass, and are more stable for individual players than the actual stats. That allows the model to shade the projections in one direction or the other. And sometimes it’s extremely important, such as in the case of homers allowed for pitchers. Of the fielding-neutral stats, homers are easily the most volatile, and home run estimators for pitchers are much more predictive of future homers than actual homers allowed are. Also, the longer a hitter “underachieves” or “overachieves” in a specific stat, the more ZiPS believes the actual performance rather than the expected one.

A good example of this last point is Isaac Paredes. There was a real disconnect between his expected and actual performances in 2023 and that’s continued into 2024. But despite some really confounding Statcast data, ZiPS now projects Parades to be a considerably more productive hitter moving forward than it did back in March. Expected stats give us additional information; they don’t give us readings from the Oracle at Delphi.

One thing to note is that bat speed is not part of the model. The data availability is just too recent to gauge how including it would improve the predictive value of these numbers. It’s also likely that even without the explicit bat speed data, the model is already indirectly capturing a lot of the information bat speed data provides.

What’s also interesting to me is that zHR is quite surprised by this year’s decline in homers. There have been 2,076 home runs hit in 2024 as I type this, yet before making the league-wide adjustment for environment, zHR thinks there “should have been” 2,375 home runs hit, a difference of 299. That’s a massive divergence; zHR has never been off by more than 150 home runs league-wide across a whole season, and it is aware that these home runs were mostly hit in April/May and the summer has yet to come. That does make me wonder about the sudden drop in offense this year. It’s not a methodology change either, as I re-ran 2023 with the current model (with any training data from 2023 removed) and there were 5,822 zHR last year compared to the actual total of 5,868 homers.

Let’s start with the over/underachievers for OPS. This is OPS calculated knowing only a player’s zBABIP, zHR, zSO, and zBB.

zOPS Overachievers (6/12, min. 150 PA)

zOPS Underachievers (6/12, min. 150 PA)

He didn’t make the top 20, but one of the most depressing things about Spencer Torkelson’s struggles this year is that zStats think he should have actually hit worse than his already anemic triple-slash line (.201/.266/.330). Matt Olson was basically lapping the underachievers field in mid-May — when he was underperforming by nearly .300 (!) points — but he’s been hitting more like he’s expected to over the last month or so. His seasonal 2024 line is still well below what it was last year, but it’s a huge improvement over where it was about a month ago.

One of the most interesting hitters by zStats is one who didn’t make either chart, Aaron Judge. He’s been on an absolute tear, and for the season his zStats are in the neighborhood of his actual numbers. Shohei Ohtani led the league in zOPS early, but he’s hit like a mere mortal for the last calendar month, at .248/.321/.446. To better illustrate the ridiculousness of Judge’s performance thus far, let me just throw in another chart, the overall zOPS rankings.

zOPS Leaders (6/12, min. 150 PA)

In zOPS, there’s as large a gap between Judge and Ohtani, as there is between Ohtani and Jake Cronenworth in 36th place! It’s also worth noting that while Guardians utilityman David Fry is one of the larger overachievers, ZiPS still thinks he’s been a very solid hitter this year.

zBABIP Overachievers (6/12, min. 150 PA)

Name BABIP zBABIP zBABIP Diff
LaMonte Wade Jr. .436 .328 .108
Connor Wong .389 .291 .099
Isaac Paredes .315 .236 .079
Kerry Carpenter .327 .249 .078
Ezequiel Tovar .380 .303 .077
Tyler O’Neill .329 .254 .076
Elias Díaz .350 .275 .076
Wilyer Abreu .348 .277 .071
Spencer Torkelson .253 .183 .070
Christian Yelich .382 .314 .068
Jose Miranda .289 .224 .065
J.D. Martinez .358 .296 .062
Steven Kwan .400 .338 .062
Brenton Doyle .353 .301 .052
Ozzie Albies .297 .245 .051
Daulton Varsho .254 .205 .049
Christian Walker .299 .252 .047
Jurickson Profar .356 .309 .047
Salvador Perez .333 .287 .047
David Fry .362 .316 .046

zBABIP Underachievers (6/12, min. 150 PA)

zBABIP includes information such as sprint speed in order to get a better idea of whose stats are out of whack with the inputs. CJ Abrams makes an appearance in the underachievers, suggesting that his weak May is probably more of an outlier than his blazing hot April. If Reddit conversations are a representative cross-section of fans, I expect no Mets fans will trust zStats as a result of its thinking Francisco Lindor ought to have a much higher average than his actual mark of .231.

zHR Overachievers (6/12)

Name HR zHR zHR Diff
Gunnar Henderson 21 12.8 8.2
Josh Naylor 17 10.9 6.1
Kyle Tucker 19 12.9 6.1
Teoscar Hernández 17 11.3 5.7
José Ramírez 18 12.9 5.1
Alec Burleson 9 4.2 4.8
Marcell Ozuna 18 13.3 4.7
Max Muncy 9 4.5 4.5
Nolan Schanuel 7 2.6 4.4
Paul DeJong 13 8.6 4.4
Adley Rutschman 13 8.8 4.2
Isaac Paredes 10 5.8 4.2
Daulton Varsho 10 6.1 3.9
Mookie Betts 10 6.2 3.8
Nolan Gorman 15 11.3 3.7
Ezequiel Tovar 11 7.5 3.5
Anthony Santander 14 10.5 3.5
Bryce Harper 15 11.8 3.2
Mark Canha 6 3.0 3.0
Jackson Chourio 7 4.0 3.0

zHR Underachievers (6/12)

Name HR zHR zHR Diff
Austin Riley 3 8.4 -5.4
Salvador Perez 10 15.1 -5.1
Bobby Witt Jr. 11 16.0 -5.0
Vinnie Pasquantino 7 11.7 -4.7
Ian Happ 6 10.5 -4.5
Charlie Blackmon 2 5.9 -3.9
Alec Bohm 6 9.9 -3.9
Javier Báez 1 4.8 -3.8
Mike Tauchman 5 8.7 -3.7
Jackson Merrill 3 6.7 -3.7
Matt Olson 9 12.6 -3.6
Elehuris Montero 3 6.6 -3.6
Vladimir Guerrero Jr. 7 10.5 -3.5
Colt Keith 2 5.4 -3.4
Wyatt Langford 1 4.3 -3.3
Corbin Carroll 2 5.2 -3.2
Davis Schneider 7 10.2 -3.2
Ronald Acuña Jr. 4 7.0 -3.0
Matt Chapman 8 11.0 -3.0
LaMonte Wade Jr. 2 5.0 -3.0

I’m from Baltimore, so as much as it pains me to note that Gunnar Henderson probably isn’t actually a 50-homer hitter, which is his current pace, I certainly wouldn’t mind if, like the O’s as a team last year, he defied what my projections say! Even if you wring some homers out of Gunnar’s line, he’s still an ultra-elite shortstop and dare I say, the team’s biggest star, rather than Adley Rutschman.

Three of the top four underachievers in home runs here are Royals, including Salvador Perez, who is already having a dynamite season without adding any more blasts to the butcher’s bill. Vinnie Pasquantino is one of the larger underachievers overall, which ought to be a boon to Kansas City as the season goes on given that the team’s offensive depth is rather unimpressive. zStats do include their own park factors, so Kauffman Stadium isn’t putting its figurative thumb on the scale.

zBB Underachievers (6/12)

zSO Underachievers (6/12)

These stats aren’t as important as their counterparts for pitchers, but they do provide additional value in predicting the future over the raw strikeout and walk totals. Strikeout and walks stabilize very quickly for hitters, but components of zSO and zBB stabilize even more quickly. It’s interesting that for both stats there’s a lot of non-overlapping explanatory variables. Contact information is really important for strikeout rate whereas swing-decision information isn’t, nor is called-strike percentage. But swing-decision data are far more important for modeling walk rate than is contact information. The r^2 for zBB% vs. BB% is just under 0.7, and for zSO% vs. SO%, a hair under 0.9. I’m most interested to see how bat speed data will interact with these numbers, but alas, that may be an article for 49-year-old Dan to write, not the current one.

I’ll run down the zStats one more time this season, in late August, and we’ll evaluate again how zStats performed vs. the actual numbers with two more months of data.

Content Source: blogs.fangraphs.com

Related News

Latest News