Friday, November 14, 2025

RBIs, Opportunities and Power Hitting

Opportunities significantly affect a hitter’s RBI totals 

 

This appeared in the 2002 Baseball Research Journal (Volume 31), published by SABR, the Society for American Baseball Research.

 

Thanks to Ted Lukacs and Jerry Wachs for their comments an early draft of the paper 

 
Part 1

RBIs have long been one of the staples of measuring a hitter’s contribution to his team’s success. Sometimes a player is said to be “a good RBI guy.” Newspapers and record books list the annual RBI leaders, scoreboards and broadcasters tell us how many RBIs a hitter has, almost as if getting them is a special skill, separate from power hitting or hitting for average. But RBIs are also often criticized as being misleading since all hitters don’t get the same number of opportunities to drive in runs. One hitter might get more RBIs than another because he had more opportunities and not because he is somehow better at driving in runs. So the important question is: “Exactly how much difference do RBI opportunities make?”

They make a big difference and exactly how big can be learned from statistical analysis. The following equation, derived using the linear regression technique, explains a hitter’s RBIs per at bat and the value of opportunities:

(1) RBI/AB = .187*OPP + .196*AVG + .468*ISO - .303

where

OPP = a hitter’s number of RBI opportunities per at bat, with each man on base being an opportunity as well as the batter

AVG = a hitter’s batting average

ISO = a hitter’s isolated power which is slugging percentage minus batting average

How does this work? Equation (1) predicts that Juan Gonzalez would get .206 RBIs per at bat because:

.187*(1.72) + .196*(.297) + .468*(.271) - .303 = .204

Gonzalez actually had .205 RBIs per at bat. The equation is also generally very accurate (I explain the statistical results and the data below).

But first, what does the equation mean from a baseball perspective? With the coefficient on OPP being .187, two players who differ by, say, .082 OPP (49 RBI opportunities for a 600 at bat season), will end up with an 9.17 difference in RBI’s over a 600 at bat season (9.17 = .082*.187*600). This is significant in baseball terms as well as statistically. Why look at a .082 difference in OPP? This study includes all players (61) who had 6000 or more plate appearances during the 1987-2001 seasons and whose situational statistics were listed on the CNN/SI website.1 Juan Gonzalez had the highest OPP/AB at 1.724 for his career. More than half of the other players were at least .082 less than this, including other power hitters like Rafael Palmeiro, Gary Sheffield, and Barry Bonds. Bonds was even lower at 1.587. Gonzalez would get 15.338 (or .137*.187*600) more RBIs than Bonds solely as a result of having more opportunities.

For a single season, the differences in OPP can be even greater. In 1995, for example, Paul O’Neill was the leader at 1.85 while Barry Bonds had 1.61. Everything else being equal, O’Neill would get about 26.93 more RBI’s over a 600 at bat season. So opportunities play a big role in RBI totals.

Hitters vary quite a bit in RBI opportunities. For example, the two lowest in OPP/AB were Rickey Henderson and Kenny Lofton, at 1.46 and 1.47, respectively. The two highest were Juan Gonzalez and Ruben Sierra at 1.72 and 1.71, respectively. Of course, Henderson and Biggio are both primarily leadoff men while Gonzalez (usually fourth) and Sierra (usually third through sixth) have been largely used in the middle of the lineup. But the difference between Henderson and Gonzalez (.26 OPP) would be 156 more RBI opportunities over the course of a 600 at bat season. Just about half of that, say .13, would mean about 78 more.

An actual example supports the importance of opportunities. Juan Gonzalez has a career AVG of .297 and an ISO of .271. Ken Griffey Jr. had .296 and .270, almost identical numbers. Yet Gonzalez had .205 RBI/AB or 123 RBIs over a 600 at bat season. Griffey had .187 RBI/AB or 112 over a 600 at bat season. The difference results largely from Gonzalez having 1.72 opportunities per at bat while Griffey had 1.64.

As for the data, opportunities include one for every time at bat and one for each runner that was on base during an at bat. This means that OPP does not include opportunities from plate appearances when the batter walked. (A regression was run that included these opportunities and the results were similar).2 Isolated power is a hitter’s slugging percentage minus his batting average and is a better measure of power hitting since it only includes bases on hits beyond singles.

As for the statistical results, the r-squared is .974, which means that 97.4% of the difference in RBIs per at bat across players is explained by equation (1). The standard error, which measures dispersion in the equation’s predicted RBI/AB for each player, is .005651 or just 3.39 RBIs for a 600 at bat season (600*.005651 = 3.39). The numbers in front of the variable abbreviations are referred to as coefficient estimates. So, for example, a .010 increase in batting average means a .00196 increase in RBI/AB (.196*.010 = .00196). That is 1.18 RBIs for a 600 at bat season. A .010 increase in ISO would add 2.81 RBIs for a 600 at bat season.

The T-values, which indicate statistical significance, are:

OPP = 13.58
AVG = 5.02
ISO = 28.25

This says that the three variables are all significant at the 1% level (or lower), meaning that there is less than a 1 in 100 chance of getting the coefficient estimates in equation (1) if their true value were zero.

Equation (1) also shows the bigger role played by power hitting in driving in runs. Consider Players A and B who have the following statistics:

Player

AB

HITS

2B

3B

HR

AVG

SLG

ISO

A

600

192

40

8

16

0.320

0.493

0.173

B

600

162

20

4

32

0.270

0.477

0.207

Who will drive in more runs? Using equation (1) and assuming they each get 1.6 OPP, Player A will drive in 83.93 runs while Player B will drive in 87.6 runs. Player B’s edge in home run power gives him the edge in RBI’s despite a much lower batting average and a deficit in doubles and triples. For Player A to get up to 87.6 RBIs, his average would have to jump to .351 (assuming all additional hits are singles). If Player A had just 20 doubles and 4 triples, along with a .320 AVG, he would drive in just 76.55 runs. To get up to 87.17 RBIs, he would then have to raise his AVG to .461!! (Again, assuming all additional hits are singles)

But, are all RBI opportunities of the same quality? No, a runner on third is better than a runner on first. So a runner on third counted as a four-point opportunity, a runner on second as a three-point opportunity, a runner on first a two-point opportunity and the batter as a one-point opportunity. So I ran another linear regression with points per at bat replacing opportunities per at bat.

 

The following equation shows the results:

 

(2) RBI/AB = .069*POINTS + .206*AVG + .475*ISO – .189

 

The r-squared is .973. The standard error is .005771 or just 3.46 RBIs for a 600 at bat season. This result is about as good as the one summarized in equation (1). Notice that the value of ISO is still much greater than the value of AVG, so power hitting is still the dominant force. The three variables were all statistically significant at the 1% level. Also, a regression was run that included opportunities from walks, as converted into points, with similar results. 3

 

A hitter’s RBIs are determined by his ability to hit for average, hit for power and the quality and quantity of his opportunities. There probably is no special “RBI ability.”  The vast majority of hitters will get about the number of RBIs predicted by their general hitting ability and opportunities.  Any deviations are probably just random chance. That would be consistent with the well-known research on clutch hitting. 

  

Table 1: Predicted RBIs vs. Actual RBIs
 

Player

AVG

ISO

RBI Opportunities per AB

RBI per 600 AB*

Predicted

Difference

Harold Baines

0.291

0.172

1.67

94.28

87.35

6.93

Wally Joyner

0.289

0.149

1.64

84.48

77.85

6.63

Delino DeShields

0.270

0.190

1.49

52.73

47.52

5.20

Tino Martinez

0.274

0.270

1.70

103.93

99.13

4.80

Frank Thomas

0.319

0.258

1.66

117.90

113.67

4.23

Jeff Bagwell

0.330

0.251

1.65

113.06

108.93

4.13

Andres Galarraga

0.291

0.219

1.65

102.89

99.01

3.88

Tim Raines

0.288

0.134

1.54

65.42

61.61

3.82

David Justice

0.280

0.227

1.65

103.19

99.40

3.80

B.J. Surhoff

0.281

0.135

1.64

75.98

72.23

3.75

Robin Ventura

0.271

0.176

1.67

9.57

86.85

3.72

Dante Bichette

0.299

0.200

1.67

99.95

96.46

3.50

Jose Canseco

0.268

0.252

1.68

111.68

108.35

3.33

Mark McLemore

0.260

0.800

1.58

51.36

48.68

2.68

Mark Grace

0.370

0.140

1.61

76.18

73.86

2.31

Rickey Henderson

0.274

0.140

1.46

55.40

53.57

1.83

Kenny Lofton

0.320

0.123

1.47

55.16

53.33

1.83

Mark McGwire

0.263

0.327

1.65

127.26

125.99

1.27

Gary Sheffield

0.295

0.226

1.61

98.15

97.07

1.08

Juan Gonzalez

0.297

0.271

1.72

123.21

122.18

1.03

Greg Vaughn

0.245

0.232

1.67

10.19

99.20

0.99

Todd Zeile

0.267

0.162

1.65

8.47

79.73

0.74

Marquis Grissom

0.270

0.134

1.54

6.58

59.92

0.66

Tony Gwynn

0.342

0.127

1.58

71.58

7.93

0.65

Will Clark

0.340

0.196

1.65

93.75

93.23

0.52

Cal Ripken

0.271

0.163

1.64

79.41

78.98

0.43

Tony Fernandez

0.286

0.112

1.58

6.07

59.86

0.20

Paul O'Neill

0.288

0.182

1.71

94.80

94.60

0.20

Gregg Jefferies

0.289

0.132

1.58

66.30

66.63

-0.32

Eric Karros

0.268

0.194

1.66

89.70

9.06

-0.36

Craig Biggio

0.291

0.145

1.49

59.81

6.21

-0.40

Ken Griffey Jr.

0.296

0.270

1.64

112.30

112.79

-0.49

John Olerud

0.300

0.176

1.66

87.83

88.35

-0.51

Jay Bell

0.267

0.153

1.54

64.50

65.27

-0.77

Sammy Sosa

0.277

0.265

1.64

108.32

109.11

-0.79

Jay Buhner

0.254

0.240

1.71

106.52

107.32

-0.80

Fred McGriff

0.287

0.228

1.66

10.76

101.56

-0.80

Chuck Knoblauch

0.293

0.118

1.49

51.53

52.46

-0.93

Matt Williams

0.269

0.222

1.68

99.05

99.98

-0.93

Dave Martinez

0.279

0.114

1.55

55.81

56.83

-1.02

Travis Fryman

0.278

0.171

1.69

86.69

87.88

-1.19

Ron Gant

0.256

0.212

1.61

86.37

87.66

-1.29

Omar Vizquel

0.274

0.770

1.58

47.01

48.42

-1.41

Brady Anderson

0.257

0.170

1.51

63.65

65.15

-1.50

Edgar Martinez

0.319

0.211

1.65

97.80

99.38

-1.58

Ken Caminiti

0.272

0.175

1.68

85.88

87.52

-1.64

Larry Walker

0.315

0.257

1.63

107.72

109.37

-1.65

Roberto Alomar

0.360

0.149

1.58

71.11

72.90

-1.78

Ray Lankford

0.274

0.290

1.60

86.15

87.97

-1.82

Devon White

0.264

0.156

1.55

64.56

66.85

-2.28

Barry Larkin

0.300

0.155

1.55

68.58

7.98

-2.40

Luis Gonzalez

0.286

0.198

1.63

87.50

9.16

-2.66

Barry Bonds

0.295

0.299

1.59

111.72

114.49

-2.78

Rafael Palmeiro

0.295

0.225

1.64

96.95

99.77

-2.82

Bernie Williams

0.350

0.194

1.68

93.49

96.43

-2.94

Bobby Bonilla

0.280

0.200

1.67

9.35

93.97

-3.63

Ruben Sierra

0.270

0.184

1.71

89.29

93.54

-4.26

Benito Santiago

0.260

0.151

1.67

71.44

78.15

-6.71

Steve Finley

0.275

0.164

1.58

66.49

73.81

-7.32

Wade Boggs

0.317

0.117

1.58

56.42

64.67

-8.25

Ellis Burks

0.292

0.220

1.68

93.29

102.72

-9.43

 

Explanatory notes: The * indicates that RBIs from sacrifice flies and bases-loaded walks are not included. The number in the predicted is based on equation (1).

Part 2

I also ran a regression in which all RBIs per plate appearance were a function of the following variables:

H/PA = hits per plate appearance
XB/PA = extra bases per plate appearance
PTS/PA = points per plate appearance
K/PA = strikeouts per plate appearance

RBIs from sacrifice files are included in this recession. Walks are included in plate appearances and Points from walks are included. But excluded are intentional walks and points from intentional walks. I included strikeouts to see if players who stuck out more hurt their RBI totals by perhaps driving in fewer runs with sacrifice flies or groundouts. Of course, many players who strikeout often also hit many of home runs, so that would increase their RBIs.

Here are the results:

RBI/PA = .163*(H/PA) + .492*(XB/PA) + .068*(PTS/PA) – .033*(K/PA) – .165

The r-squared was .968. The standard error was .005681 or 3.98 RBIs per 700 plate appearances.

The t-values were:
H/PA = 3.16
XB/PA = 20.86
PTS/PA = 13.09
K/PA = -1.33

Now, notice that the t-value is not significant for K/PA (it needs to be 1.96 or more in absolute value). Strikeouts do not significantly affect RBIs. In fact, their coefficient value is the smallest, too. The highest K/PA was about .24, for Jay Buhner. The average for this group was about .144. The difference between Buhner and the average hitter would be about 2.24 RBIs over 700 PA.

Part 3

This part shows a regression that uses players (N = 114) who had between 3000 and 6000 plate appearances during the 1987-2001 period. The regression equation is

RBI/AB = .198*OPP + .227*AVG + .445*ISO - .323

The r-squared was .962. So the results are similar to equation (1) in terms of the coefficient estimates:

(1) RBI/AB = .187*OPP + .196*AVG + .468*ISO - .20

I wanted to do this to see if the analysis worked for another group of players. The standard error was higher in this case, .00663, but that is expected since the players have fewer plate appearances. Randomness will play a bigger role. When I applied equation (1) to this group of players, it predicted all of them to with ten RBIs of their actual total for a 600 at bat season. 87 were predicted to within five RBIs.

End Notes

1. Some outstanding hitters of recent times, Manny Ramirez and Mike Piazza, for example, were not in the study since they had not achieved 6000 plate appearances through the 2001 season. Both were high in opportunities per at bat at 1.71 and 1.68, respectively. Ramirez had 4.89 more RBIs per 600 at bats than expected and Piazza had about 5.91 more.

2. RBIs from sacrifice flies are also not included. Neither are opportunities that were available when the player hit an SF. For the average player in this study, sacrifice flies make up less than 1% of his plate appearances and no more than 1.5% for any one player. So excluding SF’s matters very little. RBIs from bases-loaded walks were not included in the equation (1) or equation (2) results. They were included in the unreported regressions that included opportunities from walks. In those regressions, all variables were divided by plate appearances rather than at bats. HBPs were also included in those cases. But again, the results were similar with basically the same meanings as the two regressions reported here.

3. If I used walks, plate appearances and the point system, the regression results show that opportunities alone would give Juan Gonzalez 11.67 more RBIs than Barry Bonds over a 660 plate appearance season. That is less than 15.338, but still very high.

No comments: