SB Nation - Login for mobile commenting

Lookout Landing

Sabermetrics 101: Park Factors

Park effects are one of the easiest concepts to understand but actually deriving them is one of the most difficult processes I can think of

Prerequisites for understanding: Environment, win/run conversion

Prerequisites for derivations: Data

Star-divide

Derivation

It's a little strange what effect on the game the playing field can have. We're all aware of the ballpark's impact on home runs, due to different dimensions, elevations, and temperatures. What's harder to wrap one's head around is a stadium lowering strikeout rates, or raising the run value of line drives. There are mechanisms for explaining some of these things physically (pitches break less in dryer air, for example, lowering the efficacy of curves and sliders), but some effects are probably psychological - pitchers might be more inclined to pitch up in the strike zone in a park with a deep outfield, raising fly ball rate and strikeouts. No matter the cause, the varied park effects are real (or they're doing such a good job of pretending that we should just run with it). This means we have to deal with them, because otherwise they're skewing our measurements.

How do we do remove park effects from our evaluations of players? The first step is to measure them. This is more problematic than one might think, mainly because you have to tease out what's being caused by the park from what's caused by the talents of the home nine, and the latter typically dominates. So we take multi-year samples, do some recursive analysis to strip away some of the bias, and regress heavily, and we're typically left with a blanket park factor which we're fairly confident with. We then apply this equally to all players, which works in one way, but not really so well in another.

Applications

There are two reasons to have good park factors when we're looking at players. The first is to gauge their worth in that specific environment, and the second is to gauge their talent level independent of environment (i.e. their expected worth if they move elsewhere). We can use blanket park values  - specifically the run factor - to determine which run environment to use for our run to win conversion, or we can go the slightly easier route of dividing the run contribution of a player by the run factor of the park to normalise our results. This tells us about a player's value. What it absolutely does not do is tell us about the player's talent.

One of the biggest headaches in baseball analysis is that parks play differently depending on both handedness and hitting or pitching style. We've already discussed blanket factors above, and I hope the reader has a grasp on how difficult they are to compute. Imagine complicating an already very difficult analysis by adding handedness and batter type in order to still not be convinced your numbers are perfect. You've just discovered why this sort of analysis hasn't gained much traction yet! Still, it's pretty clear that this is probably a fruitful area of analysis for projecting how players will do in different environments, something currently we have a qualitative handle on while the numbers lag far behind.

Examples

My favourite park factors were derived by The Hardball Times several years ago. Although they're blanket factors, they go into a level of detail that's fairly useful in adjusting hitting and pitching statistics.

What Follows

Park-adjusted statistics.

0 recs  |  9 comments

Comments

Interesting stuff,

Thanks Graham.

I almost want not to read these and then just read them all when you're done.
Strictly looking at home runs

The hardball times had a nice set of factors dividing each park into five slices, for better application to pull hitters, etc. There’s also a calculator to further adjust for atmospherics, but that’s cutting things awfully fine IMO. Still, when you’re confining yourself to home runs and want to talk about which parks are better for which kinds of hitters (finding a perfect home for Jose Lopez in Houston, say, or how bad Safeco really is to LF), it’s interesting stuff.

Hit F/X:

That’s the motion/camera based tool being instituted this season, correct? Won’t that have massive implications for park factors?

It will probably have massive implications for every way we look at baseball

Although I haven’t heard yet if they will make their data public.

Partly

What they’ve instituted so far is to make use of the same camera system they’re using for Pitch F/X. This gives them angle, inclination, and velocity off the bat, which would be enough to tell us where the ball would go if gravity were the only external force (ie, in a vacuum). Unfortunately, the current camera system is insufficient to give us the spin of the ball as it comes off the bat, which can have a significant effect on the path it takes through the air. Consequently we’re still going to be waiting on “real” Hit F/X (though it’s possible that using the HitTracker data for the final impact of the ball and its hang time, plus the atmospheric data, you could backsolve for it to some extent for HRs at least), and the real thing is coming eventually, just not this year (there are apparently other efforts as well, ie “Trackman”).

There’s quite a good discussion of this over at the Book blog.

Are we ever going to be able to have a system that tells us the spin of the ball?

It seems an impossible task for cameras located 100 or more feet away from a small round object spinning at 1800 revolutions to be able to discern something like that. How exactly would they even attempt that unless they’re using a sophisticated radar system using the Doppler effect or something? I’m skeptical that Hit F/X will be able to give us that kind of specific information.

Spin matters because spin causes deviation from a straight path. If you know the initial launch angle, you can figure out the spin based on the deviation.
One of the limits on our ability to make good park factors is that we are unable to measure player intent

I’ll take an extreme example. Lets say there is a 50 mph wind going out to LF. A hitter notices this wind and figures he can hit a flyball to LF and the wind should take it easily over the fence. The hitter changes his approach completely to hit flyballs to LF. In doing so, he reduces the power he hits the ball to better control the direction as well as increasing the angle to loft it into the wind. He is able to hit 1 HR per 10 AB with home runs going 375 ft. When adjusted for wind speed however we only expect these home runs to go 300 ft which would be short of a HR.

The question is, do we give him credit for those home runs? They were definitely valuable but if he was hitting in a different environment they wouldn’t have become home runs. His raw stats would look amazing but his adjusted stats would show him with 100% flyball to LF producing no home runs. Figuring out how to fairly quantify his talent as a hitter is severely hampered by his altered approach. Granted, this is an extreme example but this is a major worry when coming up with any park factors.

You must Login with your SB Nation account and be a member of Lookout Landing to post a comment.