Monday, July 20, 2009
John Dewan (and research assistant) speak!
Great job over at SOSH for the Q&A. Here are their detailed answers, including the first one on UZR:
Getting back to your question, what’s the difference between my system and UZR? While I don’t know for sure if the current version of UZR is an extension of my original UZR, or if it was independently developed, the bottom line is that they are based on the exact same concept. Both systems break the field into small areas and look at the probabilities of plays being made in those areas. The differences lie in the various adjustments that are made.
My research assistant, Ben Jedlovec, prepared the following:
Based on my understanding of both systems,
Similarities
* Both use BIS Data. UZR started with STATS data, but the most commonly referenced version uses BIS data.
* Both have the same idea- break down balls in play by type, location, velocity.
* Both are measured on an above/below average scale.
* Both have runs saved systems with components for GDP, OF Arms, Range.
* We use similar run value multipliers at each position.
* Both are available online (Fangraphs or Bill James Online).
Technical Differences
* UZR uses multi-year samples, while Plus/Minus adjusts for year-to-year league changes. As teams are increasingly recognizing the importance of a strong defense, the league as a whole will be stronger defensively. It is important to handle this trend appropriately.
* Plus/Minus uses smaller, more precise zones, or “buckets� of plays.
* UZR has several minute adjustments, such as batter hand, pitcher hand, base/out state, and pitcher groundball/flyball tendencies. We remain focused on the value contributed to the team in the player’s specific context.
* Park adjustments are handled differently- I believe UZR applies blanket adjustment across all buckets, while Plus/Minus has park factors in form of more precise buckets. A ball hit 395 feet to Vector 190 that stays in the park is only compared to all other balls hit 395 feet to Vector 190 that stay in the park. If it leaves the park, it neither helps nor hurts the fielder. Also, we added the “Manny Adjustment�, which removes fly balls hit unreachably high off a wall. We named the system after the Green Monster’s most notable victim, who went from being by far the worst left fielder in baseball before the adjustment to being only arguably the worst left fielder after the adjustment.
* Plus/Minus accommodates plays where the first baseman holds the runner and middle infielders are covering second on hit-and-run plays. UZR adjusts for all base/out states.
* The two systems apply the run values at different stages in the calculations. UZR applies runs right away, while we convert to Enhanced PM then apply the Run Factors.
* Plus/Minus is a little more aggressive in awarding credit/penalty. An example: 100 balls in a ‘bucket’ (specified type, velocity, location), 30 fielded by the 2B, 20 by the 1B, 50 go through for singles. On a groundout to the second baseman, we give +50/(50+30) = 5/8 = +.625. UZR gives +50/100 = +.50. On a single through both fielders, Plus/Minus gives -30/80 = -.375 to the 2B, and -20/70 = -.29 to the 1B. UZR gives -30/100 = -.3 to the 2B, and -20/100 = -.2 to the 1B. You could make an argument for either method of accounting, but neither one is better than the other. The differences are the greatest at the middle infield positions, where overlap between fielders is the highest.
Fundamental Differences
* Runs Saved includes Bunt Runs Saved for corner infielders, pitcher fielding (Plus/Minus and holding runners), and catcher fielding (handling the pitching staff and the running game).
* Runs Saved measures the extra impact of HR Saving Catches. Runs Saved will add other Defensive Misplay/Good Fielding Play runs in the future.
A fairly descriptive and fair review, considering the kind of competing nature of the two systems. Good job to Ben.
When it comes down to it, I give our overall plus/minus numbers similar credibility as other baseball numbers, like batting average or on-base percentage. In my new book, The Fielding Bible—Volume II, we developed Runs Saved. I think of Runs Saved as the Runs Created of defense in that it encompasses a wide variety of methods. I give Runs Saved similar credibility to Runs Created.
Batting Average is a fair comparison for reliability. OBP is not. I prefer my answer: OBP tells you as much after 200 PA (50 games) as UZR tells you after 400 BIP (100 games). Both of which tell you half the story at those points in time.
On offense I believe we’re measuring 80-90 percent of the true ability of players. On defense, I believe we’re at about the 60 percent level. But we’re still at the tip of the iceberg in terms of precision and a ton more can be done, especially defensively. As new forms of data become available, we’ll be able to enhance our defensive systems. One example: BIS has now developed a batted ball timer, which we believe will greatly improve the accuracy of our system.
Hang time! Finally. I wonder if that’s in the feed that Fangraphs or HardballTimes gets.
And then.. the chat ended. I don’t know what happened. Too bad, because there was alot of good questions out there.
John’s traveling and having trouble finding time to polish off the rest of his answers and post them. There are plenty of excellent (and tough!) questions in the queue, so check back at SOSH in the coming days…