Why advanced metrics are important: A guide to following the statistics we use on this blog

Let me start off by saying that I am no statistical guru but I think it’s important to understand the value of some of the breakthroughs statisticians have made when it comes to evaluating the game.

This isn’t going to be a detailed study and there are some of you out there that know a lot more than I do, but my aim here is to try give a basic primer on some of the stats we use and why they’re important.  I’ve done this before but the blog has grown exponentially since then and I thought it was a good time to revisit the topic.  For some, this is new and for others it’s old hat, but I hope it makes for good discussion either way.

We will use Fangraphs rather than Baseball Reference for the purposes of this article because I feel they do better at isolating individual production from luck.

On Repeatability, Predictability, Luck, and Myth

Few things are more important in evaluation than separating what is within a player’s control and what is due to luck and environmental factors.  The reason this is important is because you want to be able to find out what is repeatable and thus what can be used to predict future success.  In many ways, it’s what we talk about often when we talk about rebuilding in that we are stressing things that tell us about process rather than results.  Process is repeatable, results can vary based on luck and environment.


When it comes to hitters we can look at plate discipline factors, contact rates, and the quality of contact.   These numbers tend to stay somewhat constant, though they can also improve or regress over time — and they often tell us a lot more about  where a player is headed than statistics that more directly incorporate outside factors, such as RBI and even batting average.

We can take a look at Cubs first baseman Anthony Rizzo to illustrate.  Where Rizzo has shown impressive improvement is with his plate discipline.  If we look at his O-Swing% (percentage of swings on pitches outside the strike zone), we can see that it has dropped to an outstanding 23.1% — this is a remarkable improvement over his 2012 rate of 38.5%.  Not coincidentally, his walk rate has increased to 16.5% over his 7.3% mark in 2012.

But that plate discipline goes beyond chasing bad pitches, Rizzo has become more disciplined within the strike zone (Z-Swing%)– and that has manifested itself in a different way — a higher contact rate and more quality contact.  We see a big jump here over last season.  His K rate has gone down to 14.8% from 18.4% last year.  Rizzo’s line drive rate (LD%) has gone up to 23.4% from 19.6% last season, while his pop-ups (IFFB%) have dropped from 9.9% to 3.6%.  We can take this to mean that Rizzo is laying off pitcher’s pitches he can’t hit hard and waiting for a specific pitch he can drive — and so we’ve seen his HR rate per fly ball increase,  We all know how good Rizzo has been and we can directly trace it back to the improvement in his plate discipline, contact rate, and quality of contact, all of which are intertwined.

We often use BABIP which stands for batting average on balls in play.  We use it because it filters out luck (both good and bad) such as bloop hits or line drives hit right at someone.  The league average BABIP tends to hover around .300, but we have to be careful not to take that number too literally.  BABIP can vary from player to player because of things like speed (more infield hits) and quality of contact (i.e., more line drives, less pop-ups) .  Good hitters tend to have higher BABIPs and we see that when we look at the average BABIP of the top 10 NL hitters last year, which was .362.  Some of that is luck, some of it is better consistent contact and/or speed.  It can often tell us how well a hitter is doing more than batting average.  An unusually high BABIP for that player often means we can expect his average to go down, while an unusually low one often means we can expect the hitters average to improve over a larger sample size.

We also like to talk about  P/PA, which is pitches per plate appearances.  Players like Anthony Rizzo (4.26) and Luis Valbuena (4.48) see a ton of pitches, so it’s not surprising that both players take a lot of walks,  Also not surprisingly, Starlin Castro sees almost one less pitch on average per AB (3.51), a significant difference for this particular statistic.

This explains (in part) why we talk about hitters having good ABs vs. bad ones — though again, nothing is set in stone.  If you know a pitcher likes to get ahead with his fastball and you jump on it and deposit onto Waveland Ave., then I think we can agree that’s a pretty good AB.  But when we talk about stats, we speak in terms of generalities and trends.  There will always be shades of gray.  The important thing to take is that a good approach is repeatable even if the results are subject to outside factors, though good process should yield good results much more consistently than bad process.


With pitchers it used to be common to focus on wins and ERA, but those two statistics depend at least in some part to the quality of the offense and defense behind them.  With wins, run support is an obvious factor.  You can’t win many games if you’re offense isn’t giving you runs.  The idea that pitcher pitch to the score is a myth (see Jack Morris).  It’s less intuitive with ERA, but if your defenders get to less batted balls (resulting in a high BABIP for that pitcher), then the pitcher is going to give up more hits, and in general, if he gives up more hits, he’s going to give up more runs.  That is because a pitchers LOB% (left on base percentage, also known as strand rate) tends to remain pretty constant (around 70-75%).  A pitcher can be unusually lucky or unlucky with men on base and that can affect his ERA, so both BABIP and LOB% have some basis in luck and that extends to the ERA statistic.  Obviously the size of the ballpark can affect a pitchers ERA as well.  For example, Andrew Cashner’s ERA in spacious Petco is 1.77, while it is a more mundane 4.16 on the road.  Cashner is still a good pitcher overall, but if we use xFIP, we can see it is higher than his ERA, in part for this reason (3.39 FIP vs. 2.72 ERA).

So we focus on what pitchers can control by isolating what he does individually from what is affected from his offense, defense, and ballpark.  This is why we use FIP (Fielder Independent Pitching) and xFIP.  FIP assumes pitchers can control walks, strikeouts, and the number of HRs he gives up while xFIP is the same statistic but feels some luck/environment is involved with HRs, so it normalizes the pitchers rate to that of the league average.

This is why we always talk about pitchers K rates and BB rates (and thus FIP and xFIP) more than the amount of hits they give up.  Those rates tend to be better predictors of future success than hits allowed, wins, or ERA because they tend to stay relatively constant throughout a player’s career.

Another note: It doesn’t necessarily account for the quality of contact a pitcher gives up (i.e. a pitcher could be having command issues within the strike zone and thus throwing a lot of hittable pitches), so again…shades of gray — and it’s a reason why we must always look at multiple factors (i.e, a pitcher’s line drive rate LD%) and also combine statistics with scouting.

Out Avoidance 

This is a subject of much debate because some are fans of what is commonly known as “small ball” — and that means things like bunting, stealing bases, and so-called productive outs.

The problem there is that all of those situations create outs when, in general, you only have 27 throughout the game.  Outs are a limited commodity and giving them away has been shown to result in fewer runs scored.  I should note that stolen bases do not cost runs, of course, but getting caught stealing does.  Estimates put a successful percentage of stealing bases at anywhere between 75-80%.  Anything lower has been shown to prevent runs.

Take a look at the chart below…


We can see from above that a team is more likely to score with a man on first and nobody out, then they are with a man on 2nd and two outs.  Knowing this, why would anyone sacrifice bunt except for certain situations (i.e the pitcher at bat late in a close game)?  The extra base tends to be less important than the out squandered.

Of course, the chart also exemplifies how important it is to get men on base in addition to avoiding outs — and this is why we so often talk about OBP (on-base percentage).  Obviously, every time a player gets on base, he has avoided making an out. More baserunners and less outs leads to more runs.  You can read about that correlation here.

I sometimes get criticized for my support of Luis Valbuena’s presence in the lineup but his .356 OBP — which is above league average, helps the Cubs score runs despite his .211 batting average.

Run Prevention

Valbuena doesn’t just help score runs, he helps prevent them with excellent defense.  While errors give an immediate reaction  and fielding percentage was once the standard measuring stick of a good defender, both are far too limited to give you an accurate gauge of a defender’s overall ability.

On an intuitive level, this makes sense.  All things being equal, would you rather have a guy who fields 100 ground balls and makes 3 errors or a guy who fields 90 and makes one error?  The first player has a 97% fielding percentage while the second player is at just under 99% — but yet the first fielder has made plays on 97 ground balls while the second fielder has made 89.  That’s 8 potential hits saved — and save enough hits and that eventually translates to runs, save enough runs and that eventually translates to more wins.

The defensive statistic often used at this blog is UZR/150, which breaks the field up into zones and tracks the number of plays made both in and outside that zone (and naturally incorporates errors into the equation).  That number is translated into a rating, which in turn is translated to runs saved. Using Valbuena as an example again, we can get a feel for just how much value he adds on defense.   Last season, Valbuena was a slightly below average offensive player overall — yet was considered a fringe average starter overall in part because of his defense.  An average defender will have a 0.0 rating.  Valbuena’s UZR/150 last year was an outstanding 18.6.  For frame of reference, NL 3B Gold Glove winner Nolan Arenado’s UZR/150 was 22.4.    2012 winner Chase Headley’s UZR/150 was 8.2 while Valbuena checked in at a much better 27.6 that season.  It’s not a stretch to say that Valbuena ranks among the best defensive 3Bs in baseball.

A more clear cut example is Darwin Barney in 2012 and for him we’ll focus on Runs Saved.  The Fielding Bible calculated that Darwin Barney saved 28 runs — 10 runs saved translates to a win, making Barney’s defense alone responsible for nearly 3 wins over the course of that season.  We won’t get into his offense for the purpose of this discussion, which is on run prevention.  It’s well documented that Barney doesn’t create enough runs on offense.  But we’ll get to that in the next section.

The bottom line, though, is that run prevention is the mirror to stats like OBP, which help create runs.  If getting on base leads to more runs on offense, then it logically follows that any time your defender can prevent opposing runners from getting on base, then you will in turn limit their ability to score runs.

Finding unity in offensive statistics

As mentioned, statistics like RBI tend to give a false gauge of individual offensive value.  It is, in many respects, a team statistic because it can be affected by teammates performances and even where an individual bats in the lineup.  There is no such thing as a clutch hitter, though intuitively this does makes sense to most people — but at that high level of play, you don’t see as much variance in terms of a player’s ability to stay focused and calm as you might at lower levels of baseball.  These guys have climbed to the top of their profession for a reason.  Take a large enough sample size and you will consistently find that a major league player’s average with RISP is right in line with his overall average.

You have seen us use two statistics to measure the all-around offensive ability of a player.  The first is wOBA, which stands for Weighted On Base Average.  What that means is that it is a derivative of OBP but puts greater weight on extra base hits.  It’s more accurate than OPS because OPS factors batting average in twice (it is a central component of both OBP and slugging percentage).

The second stat we like to use is RC+, which stands for Runs Created. It is similar but preferable for some because it uses 100 as the MLB average — so anyone over 100 is above average is over 100 and anyone under is below average.  So far this year, Valbuena has been roughly average on offense (RC+ of 99) while Anthony Rizzo is well above average at 150.  Darwin Barney prevents runs, but he also creates far less than the average player (51 last season and a tragically low 5 so far this year).

We will often use both so that the reader can choose the one he prefers.


 You’ve seen us use WAR, which means Wins Above Replacement.  What WAR does is take all parts of a player’s game: hitting, running, defense — and assigns it a value in terms of wins.

The base value is 0 wins and that is the value of what is called a “replacement player”.  A replacement player is loosely defined as an average AAA player or the sort of player you can readily find on waivers.  Because anything can happen in any particular game, a team of replacement level players would not be expected to win 0 games.  They would be expected to win about 47-48 games in a 162 game season.  As an illustration, Darwin Barney has been around a replacement level player the past 2 seasons.

So Wins Above Replacement measures how many additional wins any given player would have over the Darwin Barneys of the world, though keep in mind that WAR is position specific.  The offensive threshold for a 1B to be a replacement level player is much higher than it is at 2B or catcher.

To give you an idea, a fringe average starter is a 2 WAR player.  Luis Valbuena’s 2013 season is an example.  A good starter is at around 3, Starlin Castro’s 2010-2012 seasons are an example of that, while Jeff Samardzija has been at that level the past two seasons.  I’d say 4 WAR is an all-star level player.  No current Cubs are good examples (Anthony Rizzo projects close to this level this season) but Aramis Ramirez was at this level during his prime Cub years.  A superstar player is in the 6-7 range — Derrek Lee’s 2004 season fits in that category and 10 WAR is Mike Trout level.

So I hope this is helpful for many of our newcomers when it comes to getting a sense of how we evaluate players here at Cubs Den from a statistical standpoint.  We obviously put a great weight on scouting too, but that is an article for a different day.

Filed under: Uncategorized


Leave a comment
  • Just had to say,.... have always loved that particular Gary Larson cartoon.

    And the rest of the article is useful to my non 'advanced' metric mind as well.

  • In reply to drkazmd65:

    Thanks Kaz.

  • For a little extra credit, Fangraphs predicts the Cubs to post an atrocious 16.1 WAR as a team for the remainder of the season. That would put them in triple digit losses territory.

  • In reply to Eddie:

    Only the Mets and Astros are projected to be worse from now through the end of the season.

  • fb_avatar

    Thank you for the very informative article. I'm mostly familiar with the terminology, but I'm far from an expert. I've also LONG been a proponent of using deeper stats to judge pitchers, going back to the 90s. However, while I think that almost all stats can be useful to judge what hitters have done in the past, I'm not as much a fan of the advanced stats to predict the future. I'm one of those "small ball" advocates, and think that the situation and the quality and type of players that you have has a huge effect on this. You simply cannot apply a league average number to everything and say "this is how it always is and should be". Just my opinion. Again, thank you!

  • In reply to Bender13:

    Thanks. I tried to qualify the small ball stuff and statistics in general as trends. They aren't meant to be absolutes, but more of a guide that should be followed in most situations. But to be clear, you'll be more successful playing the odds than continually going against them. Those special situations should be selected very carefully.

    Statistics like FIP and BABIP predict future seasons, for example, far better than batting avg or ERA.

  • fb_avatar

    Just watched the last inning of the no hitter. Matt Szczur saved it on the last play of the game with a phenomenal sliding catch of a ball that had bloop single written all over it.

  • fb_avatar
    In reply to Mike Moody:

    Here's a video:


  • John - the stat I have the most discussions with people about is WAR. We get what WAR is trying to tell us, but seems very few people (that I talk with at least) understand how WAR is calculated. Its great to say someone is a 3 WAR player, but can you try and explain how the conglomeration of offensive and defensive stats are factored and weighted in developing the actual WAR calculation?

    Or more simply - why should a stats novice trust that 3 WAR really means a player won us 3 more games than a replacement level player? Thank you

  • In reply to Charlieboy:

    I'm on my way out, perhaps a stat person can explain or link an explanation. If not, I'll be back later tonight.

  • In reply to Charlieboy:


  • fb_avatar
    In reply to Charlieboy:

    My main opposition to WAR is how it can say that the same player is more valuable at one position than at another. For example, lets say that Baez comes up next season and plays 3B vs 2B. As I understand it, if he has the exact same stats, his WAR is higher, perhaps much higher, at 2B. I don't really agree that he's automatically more valuable. I understand that the league average is higher at some positions, but production is production.

  • In reply to Bender13:

    he is more valuable compared to other 2b. its a position based stat

  • fb_avatar
    In reply to Bender13:

    Second base and third base actually have the exact same positional adjustment in Fangraphs. Shortstop, however, is higher. That reflects the fact that shortstop is the hardest position to play on the field and most teams are required to take a hit to the offense of the player in order to get passable defensive shortstop on the field. Ergo, if the shortstop hits like a left fielder (i.e., ARod) he provides a lot of value to the team, who can additionally get average left field production out of an average left fielder.

  • In reply to Bender13:

    Having a bigger bat at 2B, where the average player is not as good offensively at 3B, that in turns allows you to put a better bat at 3B.

  • K Bryant with an oppo taco 3 run shot!

  • In reply to CubfanInUT:

    Bruno also on fire with a double. How awesome would it for Bryant, Baez, and Soler all be brought up in Chicago at the same time?

  • In reply to Paulson:

    it be like Xmas.

  • fb_avatar
    In reply to CubfanInUT:

    And a strikeout in his other at bat. As much as I love the power, I do wish he'd get the Ks under control.

  • Great article John- the manner in evaluating baseball talent has obviously changed over the past 10-15 years and you definitely covered plenty of ground in that area with this blog.

    I found the "Sacrificial Silliness" article quite informative and revealing, and just for the heck of it I checked out the following stats on team sacrifices:


    As I expected, RR has been far too eager to bunt so far as the Cubs are ranked 9th with .35 sacrifices per game. Sveum was actually very good in this regard as the Cubs were last in the NL for sacrifices the past two seasons at #17... obviously the AL teams sacrifice much less. Smart teams like the As are perennial leaders in not giving away outs while over-managing dumb-asses like Dusty Baker and Tony LaRussa usually lead the league in sacrifices. But Tony was a genius for hitting his pitcher 8th...

  • In reply to Paulson:

    Thanks Paulson! Agree I'm not too crazy about the bunting this year.

  • Interesting. I'm not sure how much I buy into all this, but it's an interesting slant. Just curious, has anyone done a study to determine how often the statistically superior teams have won playoffs/championships?

    John, your comment about Rizzo led me to seek more info on the team from this link:

    Am I reading this right? Rizzo 1.1 War? Team rating equals something out of AA?!!

  • In reply to xhooper:

    It's not a slant. It's math.

  • fb_avatar
    In reply to John Arguello:

    This was hilarious.

  • In reply to John Arguello:

    Yes you're right. It is math... but one way of assessing player performance.

  • In reply to xhooper:

    And one that uses mathematical probabilities that have been shown to have predictive value and that evaluates players on things that they can control, Old stats like RBI, wins, ERA, etc, do not There is a qualitative difference. They are not equal and no MLB team except for maybe the Phillies clings to outdated statistics.

    It's not a matter of opinion as to which statistics (new metrics or traditional stats) have greater evaluative value. It's very clear and demonstrably so.

  • In reply to John Arguello:

    Any Cubs fan would love to trade places with a Phillies fan.

    World Series title. Multiple playoff appearances and they're even currently in the playoff hunt.

    I could have stopped after WS title and it would have been true.

  • fb_avatar
    In reply to Drizzy:

    I'd certainly take the Phillies last 10 years over the Cubs...but I wouldn't trade futures with them! And, I believe the point was that 29 other teams have accepted advanced metrics as a better method of predicting future performance, including many teams with better pays, presents, and futures than the Phillies.

  • In reply to Matt McNear:

    When did everyone accept it and the Phillies didn't?

    I'm really asking, I'm not trying to be a smart ass. I'm new to all of this and want to soak up as much information as I can.

    Thanks for the response. I bookmarked this page for future use.

  • In reply to Drizzy:

    Wow.... another troll. I now have a partner in the troll box.

  • In reply to Drizzy:

    The Phillies GM, Ruben Amaro, is just very old school. So is Drayton Moore of the Royals and Kevin Towers of the Padres, but Phillies have become the most notorious for clinging to old stats.

  • In reply to Drizzy:

    Many Phillies great seasons pre-date the statistical revolution and the recent ones are ones in which they stumbled into guys who had good advanced statistics -- the Phillies found the formula, accidentally, but didn't have the sense to repeat that process -- in part because they didn't understand how they got there to begin with.

  • In reply to John Arguello:

    The Phillies were competitive for about 10 years. They won their division 5 years straight.. all well into the new age of metrics. Not bad for being clueless and with results, arguably, as effective as that of Boston. If that is the chosen benchmark.
    It's nice to embrace the new metrics but I don't see any evidence that shows a system based on new age statistical analysis has performed any better than others. The important thing in managing anything effectively is to have a plan and stick with it. Stats are one tool to assist in the process. Dismissing the success of teams like the Phils and Cards as luck or lack of understanding fails to give them due credit.

  • fb_avatar
    In reply to xhooper:

    what Team Rating are you talking about?

  • In reply to Giffmo:

    I'm looking at all of the individual WAR ratings. None higher than Rizzo's.

  • fb_avatar
    In reply to xhooper:

    Maybe some of your confusion lies in the fact that WAR is cumulative? Rizzo's WAR is 1.1, or, he has accounted for 1.1 wins so far this season. If he continued at the same pace, he would wins up close to a 5 WAR season.

  • fb_avatar
    In reply to Matt McNear:

    *wind up

  • In reply to Matt McNear:

    Thanks. I think that's it. So if a player.is at 1.1 and has played 1/4 of the season, he would project out to 4.4 for the year?

  • First look at Vizcaino. I like what I see. His curve tonight not killer but effective after batters deal with the fastball.

    A Cubs bullpen with Rivero and Vizcaino in late innings should be on par with just about any contending org out there.

  • it is probably not welcome to talk about the big leage club on these posts, but for those who are anot watching:
    - Abreu has a way better series than Rizzo so far

    and Wood seem to be regressing to norm

    and the highly touted infield defence has feiled at least twice today

    ...just another day in Cub fandom...

  • In reply to Csanad:

    Key word there is "day". Just one game in a long season. What fires me up is what happened in Iowa and TN today... A no- hitter by Rusin and total dominance by the Smokies. Bryant, Bruno, Andreoli, Rhee, Vizcaino looked great and the Smokies score 12 runs without Soler.

  • In reply to Paulson:

    yes, thats nice, but to me it is just a very distant, very uncertain future that cannot outweigh the unbearable mlb product. these guys just suck soo so much, that it is painful

  • fb_avatar

    If Valbuena's offense is geared towards getting on base, it makes sense for him to bat first or second. Lake should be towards the bottom of the order. Does Renteria use advances stats to fill out his lineup?

  • In reply to Denvil Farley:

    The thinking is Lake struggles with the off-speed pitches but does well against fastballs. Batting him in front of Rizzo and Castro puts him in a position to see a lot more fastballs than he would hitting behind them. Renteria's just putting him in a position to succeed.

    He's also had several line-ups where he's stacked the top of the line-up with the best OBP guys.

  • I think you meant to see Derrek Lee's 2005 season, but it was a very good article nonetheless.

    I understand that advanced metrics are for the baseball geeks, but I think that some basics (like WAR) are important for fans to understand. I know that commentators and analysts don't really refer to advanced metrics, but I think they definitely tell a whole different story (i.e. Luis Valbuena).

  • Someone with zero swings total could amass an "O-swing %" 23 points better than Rizzo with only 2 less doubles.
    Good lord! Drive the baseball. Abreu has been exploding on far more baseballs. His "SBU %" (square ball up percentage) is superior.

  • What's a good FIP or xFIP as compared to an ERA?

  • fb_avatar
    In reply to HackWilson09:

    They're scaled to imitate ERA. So a good ERA is also a good FIP.

  • In reply to HackWilson09:

    They are on the same scale...FiP/xFIP should reflect what an ERA should be for the pitcher if he were on a team with an average MLB defense.

  • In reply to John Arguello:

    Thanks for the excellent summary, John.

    It would also be helpful to give some measures of average and range for each of these stats, similar to what you did in discussing WAR.

  • Good article. To pick a nit, the 75-80% threshold for success of stolen bases you referred to for increasing or decreasing expectation of scoring is old info. With the decline of slugging/XBH in the past few years, the current threshold number is closer to 67%, making the SB a much more viable skill again for many players to increase their positive output.

  • Why don't homers count toward babip? If an outfielder can go over the fence and take back a homer, doesn't that mean that area is actually in-play?

  • In reply to SFToby:

    Haha ;)

Leave a comment