Analytics Magazine

Basketball genomics

November/December 2012

Evaluation of performance: Evolution of the official box score.

The evaluation of performance in basketball analytics parallels the application of methodology utilized in the analytics of genomics.

By William Cade

“When a player looks at the game, they begin with the least important statistic and that’s playing time. … They think that if they play a lot of minutes then everything will work out in their favor … not just our guys, but I think players across the country think it’s just all about playing time. ‘If I play a lot, then I’ll play well’ … and well, you have to earn that!”
– Jim Larranaga, head coach, University of Miami men’s basketball team

What if basketball analytics could formulate an “end-all” value that could justly evaluate team and/or player performance? Regardless of the complexity of its formulation, those immersed in the world of basketball analytics are challenged with this mission: to translate a game of interdependent factors into simple measures of player and team performance. The “Four Factors of Basketball Success,” established by pioneering basketball statistician Dean Oliver, have long played a role in the understanding and evaluation of team success. Simply put: If you’re better than your opponent at making field goals, creating turnovers, grabbing rebounds and getting to the foul line, then you’re going to win many more games than you will lose.
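As a rough illustration, the four factors are often computed from box-score totals along the following lines. This is a minimal sketch, not a formulation taken from this article: the class and function names are hypothetical, and the specific formulas (effective field goal percentage, the 0.44 free-throw weight in turnover rate, and free throws made per field goal attempt) are commonly cited conventions assumed here.

```python
from dataclasses import dataclass


@dataclass
class TeamTotals:
    """Hypothetical container for one team's box-score totals in a game."""
    fgm: int       # field goals made (two- and three-pointers)
    fga: int       # field goals attempted
    tpm: int       # three-point field goals made
    ftm: int       # free throws made
    fta: int       # free throws attempted
    tov: int       # turnovers
    oreb: int      # offensive rebounds
    opp_dreb: int  # opponent's defensive rebounds


def four_factors(t: TeamTotals) -> dict:
    """Return the four factors under the assumed, commonly cited formulas."""
    return {
        # Shooting: a made three counts as 1.5 made field goals.
        "efg_pct": (t.fgm + 0.5 * t.tpm) / t.fga,
        # Turnovers per estimated play; 0.44 is an assumed free-throw weight.
        "tov_pct": t.tov / (t.fga + 0.44 * t.fta + t.tov),
        # Share of available offensive rebounds the team collected.
        "oreb_pct": t.oreb / (t.oreb + t.opp_dreb),
        # Getting to the line, measured as free throws made per shot attempt.
        "ft_rate": t.ftm / t.fga,
    }


totals = TeamTotals(fgm=30, fga=60, tpm=6, ftm=15, fta=20,
                    tov=12, oreb=10, opp_dreb=30)
factors = four_factors(totals)
```

Comparing each value with the opponent's corresponding value gives the “better than your opponent” reading described above.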

Genetics is the study of the variations between humans and how those variations are passed through a family. Described as the “cookbook of recipes” that tells our body how to grow and how to develop, DNA is the basis of genetics. In genomics, team research is conducted to investigate the complex interactions between multiple environmental and genetic risk factors.

Interestingly, the same advanced statistical methods used to discover and map the genes responsible for disease in families and populations can be modified to identify and evaluate the “basketball DNA” genes associated with success on a basketball team. In principle, the evaluation of performance in basketball analytics parallels the application of methodology utilized in the analytics of genomics. So then, what is your favorite team’s basketball DNA?

“Whereas a coach looks at, playing well … earn your playing time
by your performance on the court … by how well you defend, how
well you rebound, how well you guard your man, how well you run
the floor, how well you make good decisions on offense (make your
shots, make your passes correctly, don’t turn the ball over) …”

– Jim Larranaga

A fundamental building block for the measure of team performance is time, as a function of minutes (“playing time”). Within the analytics of “basketball genomics,” the genetic makeup of playing time is sequenced and coded as a “possession.” How much information is observed and collected on a particular possession is fundamental in identifying the basketball DNA associated with a team (player and/or lineup).

The Official Box-Score (play-by-play), for example, serves as an invaluable guide to analyzing the game on a fundamental level. Compiled with unbiased precision, the Official Box-Score has a two-fold composition: it provides a quarter-by-quarter description of events, along with descriptive measurements that indicate how well or how poorly a player and/or team has performed. Statistics included in an Official Box-Score are field goals made and field goals attempted (“FGM” and “FGA”), three-point field goals made and three-point field goals attempted (“3PM” and “3PA”), free throws made and free throws attempted (“FTM” and “FTA”), offensive rebounds (“OR” or “OREB”), defensive rebounds (“DR” or “DREB”), total rebounds (“TREB”), assists (“A” or “AST”), steals (“S” or “STL”), blocked shots (“B” or “BS”), personal fouls (“F” or “PF”), turnovers (“TOV” or “TO”), minutes (“M” or “MIN”) and points (“P” or “PTS”). Validated by the play-by-play component of the Official Box-Score, the methodology for evaluating player and/or team lineup performance is displayed in the truncated example of game charting shown in Table 1.

Table 1: Game charting.

With the use of advanced statistical software tools (SAS Version 9.3), I have extended the Official Box-Score and established an open-ended framework, entitled “Official Box-Score DNA,” that can measure the offensive and defensive prowess of a basketball team by lineup (per game, seasonally, etc.). The principal areas of extension within Official Box-Score DNA are possession, field goal, rebound and free throw. Derivative and unique (*) statistics provided in Official Box-Score DNA are two-point field goals made and two-point field goals attempted (“2FGM” and “2FGA”), offensive and defensive possessions (“OP” and “DP”), three-point field goal attempted offensive rebounds (“O3REB” or “O3RB”), three-point field goal attempted defensive rebounds (“D3REB” or “D3RB”), free throw offensive rebound (“FTOREB” or “FTORB”), free throw defensive rebound (“FTDREB” or “FTDRB”) and potential free throws (“PFT”).
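The derivative field goal statistics follow directly from the standard box score, since two-point attempts are simply the field goal attempts that were not three-point attempts. The sketch below illustrates that arithmetic in Python rather than the SAS used for the original work; the function names are hypothetical and it does not reproduce the Official Box-Score DNA code.

```python
def two_point_splits(fgm: int, fga: int, tpm: int, tpa: int) -> tuple:
    """Derive (2FGM, 2FGA) from FGM, FGA, 3PM and 3PA."""
    # Reject totals that cannot come from a valid box score.
    if tpm > fgm or tpa > fga or fgm > fga or tpm > tpa:
        raise ValueError("inconsistent shooting totals")
    return fgm - tpm, fga - tpa


def points_from_shooting(fgm: int, tpm: int, ftm: int) -> int:
    """Rebuild PTS from makes: 2 per two-pointer, 3 per three, 1 per free throw."""
    return 2 * (fgm - tpm) + 3 * tpm + ftm


two_fgm, two_fga = two_point_splits(fgm=30, fga=60, tpm=6, tpa=18)
points = points_from_shooting(fgm=30, tpm=6, ftm=15)
```

Rebuilding points from the makes is a quick consistency check on a charted game before any lineup-level statistics are derived from it.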

Subsequent basketball analytics, executed with advanced statistical software tools, produces measurements of frequency, efficiency and precision relative to team performance. Complementary to the notable “plus/minus” basketball statistic, which looks at the point differential when players are both in and out of the game, the example in Table 2 illustrates team lineup performance for the entirety of a single basketball game.
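To make the lineup view of plus/minus concrete, here is a minimal sketch that totals the point differential for each five-player lineup. It assumes the game has already been charted into “stints” (a lineup plus the points scored for and against while it was on the floor); that data layout, the player labels and the function name are all hypothetical.

```python
from collections import defaultdict


def lineup_plus_minus(stints):
    """Sum the point differential for each distinct lineup.

    Each stint is (players, points_for, points_against). Lineups are keyed
    by frozenset so the order in which players are listed does not matter.
    """
    totals = defaultdict(int)
    for players, points_for, points_against in stints:
        totals[frozenset(players)] += points_for - points_against
    return dict(totals)


# Hypothetical stints; letters stand in for players.
stints = [
    (("A", "B", "C", "D", "E"), 10, 6),
    (("A", "B", "C", "D", "F"), 4, 9),
    (("E", "D", "C", "B", "A"), 3, 5),  # same lineup as the first stint
]
result = lineup_plus_minus(stints)
```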

Table 2: Team lineup performance.

This impartial approach to quantifying team chemistry identifies the qualities shared by players whose play on the court seems simply to flow. It can examine a variety of combinations of players on the court and show which have the biggest effect: the most impactful two-player, three-player and even five-player combinations for each game.
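One simple way to look at two-player, three-player or five-player combinations is to credit each stint's point differential to every combination of that size that was on the floor. The sketch below does this under the same assumed “stint” layout of (players, points for, points against); the names are hypothetical and point differential is only one of many measures that could be accumulated this way.

```python
from collections import defaultdict
from itertools import combinations


def combo_plus_minus(stints, size):
    """Total point differential for every player combination of a given size.

    Players are sorted inside each stint so that a combination always
    produces the same tuple key, however the lineup was listed.
    """
    totals = defaultdict(int)
    for players, points_for, points_against in stints:
        margin = points_for - points_against
        for combo in combinations(sorted(set(players)), size):
            totals[combo] += margin
    return dict(totals)


# Hypothetical stints; letters stand in for players.
stints = [
    (("A", "B", "C", "D", "E"), 10, 6),
    (("A", "B", "C", "D", "F"), 4, 9),
    (("E", "D", "C", "B", "A"), 3, 5),
]
pairs = combo_plus_minus(stints, 2)
fives = combo_plus_minus(stints, 5)
best_pair = max(pairs, key=pairs.get)
```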

For example, suppose your favorite team’s opponent is entering a tournament or playoff game, and game preparation requires exact knowledge of which team lineups have defended successfully. Game strategy and decision-making to implement the best team defense would naturally lend itself to some of the following questions: What lineup has played the most defensive possessions together? What lineup defends the two-point field goal attempt the best? What lineup rebounds the three-point field goal attempt the best? What lineup fouls the least? The answers to these defensive questions, as they relate to game preparation, are illustrated in Table 3.
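Each of these questions amounts to ranking lineups on a single defensive measure. The sketch below shows one possible way to do that; the record fields, the sample numbers and the choice to rate fouls per defensive possession are assumptions made for illustration and are not taken from Table 3.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LineupDefense:
    """Hypothetical defensive totals for one lineup."""
    lineup: str
    dp: int             # defensive possessions played together
    opp_2fgm: int       # opponent two-point field goals made
    opp_2fga: int       # opponent two-point field goals attempted
    d3reb: int          # defensive rebounds of opponent three-point misses
    o3reb_allowed: int  # offensive rebounds allowed on those misses
    fouls: int


def most_possessions(rows):
    return max(rows, key=lambda r: r.dp)


def best_two_point_defense(rows):
    # Lowest opponent two-point percentage, skipping lineups with no attempts.
    rated = [r for r in rows if r.opp_2fga > 0]
    return min(rated, key=lambda r: r.opp_2fgm / r.opp_2fga)


def best_three_point_rebounding(rows):
    # Highest share of opponent three-point misses rebounded by the defense.
    rated = [r for r in rows if r.d3reb + r.o3reb_allowed > 0]
    return max(rated, key=lambda r: r.d3reb / (r.d3reb + r.o3reb_allowed))


def fewest_fouls(rows):
    # Assumption: "fouls the least" is read as fouls per defensive possession.
    rated = [r for r in rows if r.dp > 0]
    return min(rated, key=lambda r: r.fouls / r.dp)


rows = [
    LineupDefense("L1", dp=40, opp_2fgm=10, opp_2fga=25,
                  d3reb=6, o3reb_allowed=2, fouls=8),
    LineupDefense("L2", dp=25, opp_2fgm=5, opp_2fga=15,
                  d3reb=3, o3reb_allowed=3, fouls=3),
    LineupDefense("L3", dp=10, opp_2fgm=4, opp_2fga=6,
                  d3reb=2, o3reb_allowed=0, fouls=1),
]
answers = {
    "most_possessions": most_possessions(rows).lineup,
    "two_point_defense": best_two_point_defense(rows).lineup,
    "three_point_rebounding": best_three_point_rebounding(rows).lineup,
    "fewest_fouls": fewest_fouls(rows).lineup,
}
```

In practice a minimum number of defensive possessions would likely be required before ranking, since a lineup such as the hypothetical L3 can top a rate-based list on very little playing time.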

Table 3: Team defense.

In essence, the information provided by Official Box-Score DNA statistics offers an effective way of showing the best combination of players on the court. Though basketball analytics comes with its limitations and imperfections, the pursuit of greater knowledge of the game incites ongoing analyses and a penchant for better statistics!

William Cade, who holds a master’s degree in public health, is a senior data analyst at the John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, and an institutional staff member of the University of Miami men’s basketball team.
