Analytics Magazine

Viewpoint: Did Nate Silver beat the tortoise?

July/August 2013

O.R. vs. analytics … and now data science?

By Brian Keller

In a 2010 survey [1], members of the Institute for Operations Research and the Management Sciences (INFORMS) were asked to compare operations research (O.R.) and analytics. Thirty percent of the respondents stated, “O.R. is a subset of analytics,” 29 percent stated, “analytics is a subset of O.R.,” and 28 percent stated, “advanced analytics is the intersection of O.R. and analytics.” The remaining 13 percent were split between “analytics and O.R. are separate fields” (7 percent) and “analytics is the same as O.R.” (6 percent).

The emergence of data science only adds to the confusion. Is data science just another clever marketing term popularized by the math illuminati?

INFORMS has developed working definitions of both O.R. and analytics through surveys of INFORMS members and Analytics magazine readers. O.R. is the “application of advanced analytical methods to help make better decisions.” Analytics is the “scientific process of transforming data into insight for better decision-making.”

Data Science: An Emerging Field

Data science is an emerging field with no standard definition yet. An early description can be found in “Data Science: An Action Plan for Expanding the Technical Areas of the Field of Statistics” [2]. I think of data science as an interdisciplinary field combining mathematics, statistics and computer science to create products based on data. The delivery of data products is the key idea. More on that later.

The definitions of all three sound similar. Differences begin to emerge when looking at O.R., analytics and data science in terms of the focus of the discipline and the types of techniques applied.

Operations research tends to focus on the solution of a specific problem using a defined set of methods and techniques [3]. Classic examples of O.R. include facility location problems, scheduling and deciding how many lines should be opened at a service center, all of which are problem-solution focused. Techniques tend to be model-driven: analysts select a reasonable model, fit the model parameters to the data and analyze the results. Based on survey data in “ASP: The Art and Science of Practice” [3], the top O.R. quantitative skills are optimization, decision analysis and simulation.
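To make the service-center example concrete, here is a minimal sketch of the model-driven approach. It assumes an M/M/c queue (Poisson arrivals, exponential service times, identical servers), which is my choice for illustration and not a model named in the survey. The function names and the sample rates are hypothetical.

```python
from math import factorial


def erlang_c(arrival_rate, service_rate, servers):
    """Probability that an arriving customer has to wait (Erlang C formula)."""
    load = arrival_rate / service_rate      # offered load in erlangs
    utilization = load / servers
    if utilization >= 1:
        return 1.0                          # unstable queue: everyone waits
    top = load ** servers / factorial(servers) / (1 - utilization)
    bottom = sum(load ** k / factorial(k) for k in range(servers)) + top
    return top / bottom


def mean_wait(arrival_rate, service_rate, servers):
    """Mean time spent waiting in line, in the same time unit as the rates."""
    if arrival_rate >= servers * service_rate:
        return float("inf")
    p_wait = erlang_c(arrival_rate, service_rate, servers)
    return p_wait / (servers * service_rate - arrival_rate)


def lines_needed(arrival_rate, service_rate, max_wait):
    """Smallest number of open lines that keeps the mean wait within max_wait."""
    servers = 1
    while mean_wait(arrival_rate, service_rate, servers) > max_wait:
        servers += 1
    return servers


# Hypothetical inputs: 10 customers arrive per hour, each line serves 4 per hour,
# and the target is a mean wait of at most 0.1 hours (6 minutes).
print(lines_needed(10, 4, 0.1))  # prints 4
```

In practice the arrival and service rates would be fitted from observed data, and the analyst would check whether the queueing assumptions are reasonable before trusting the answer.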

Analytics tends to go beyond solving a single problem and focuses on overall business impact [3]. Classic examples of analytics include business intelligence to summarize operations and customer segmentation for improved marketing and sales. The same survey identified the top analytics quantitative skills as statistics, data visualization, data management and data mining.

Data science tends to focus on data as a product. For example, Amazon records your searches, correlates them with other users and offers you suggestions on what you might like to buy. Those suggestions are data products that personalize the world’s biggest market, which drives sales. Google Now presents the results of your search before you even think to search for the information. Google Now is a data product that increases use of Google services, which delivers added revenue to Google.
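The idea of correlating one shopper's activity with that of others can be sketched with simple co-occurrence counts. This is only an illustration of the general idea, not Amazon's actual method; the function names and the sample histories are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import permutations


def build_cooccurrence(histories):
    """Count how often each pair of items appears in the same user's history."""
    counts = defaultdict(Counter)
    for items in histories:
        # set() ensures a repeated item in one history is counted once
        for a, b in permutations(set(items), 2):
            counts[a][b] += 1
    return counts


def suggest(counts, item, k=2):
    """Return up to k items most often seen alongside the given item."""
    seen_with = counts.get(item, Counter())
    # Sort by descending count, then by name so that ties are deterministic
    ranked = sorted(seen_with.items(), key=lambda pair: (-pair[1], pair[0]))
    return [other for other, _ in ranked[:k]]


# Hypothetical search histories, one list per user
histories = [
    ["tent", "stove", "lantern"],
    ["tent", "stove"],
    ["tent", "lantern", "boots"],
    ["stove", "kettle"],
]
counts = build_cooccurrence(histories)
print(suggest(counts, "tent"))
```

A production recommender would also account for item popularity, recency and scale, but the data product is the same in spirit: the suggestions are computed from what other users did.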

Amazon product recommendations and Google Now may sound like analytics, which focuses quantitative effort on a broader business impact. However, the results of data science are not just competitive advantages; they are the products of the company. The data is the product.

Creating data products requires creativity and diverse perspectives. As such, data scientists hail from a variety of academic backgrounds including O.R., statistics, computer science, engineering, biology and physics. The common themes across data scientists are creativity, the curiosity to ask bigger questions, and skills in data analysis and programming.

Data science often relies on combining multiple types of data for analysis. Some data may be company proprietary; other data comes from one of the many public data sets available on the Web. These data sets often are too large to analyze using desktop tools, have missing or erroneous data, vary in structure across data sets, and may lack structure entirely (e.g., free-form text in maintenance repair logs). The combination of data size and structure adds a challenge on top of data analysis: the data itself becomes part of the problem.

Leveraging Diverse Skills

Because so much of the effort in data science goes into parsing, cleaning and managing the data, data scientists often must leverage diverse software development skills. One project may use Python for data acquisition and parsing, R for exploratory analysis, Hadoop for data storage and MapReduce via Java for production analytics, with results delivered through Ruby on Rails. Analytics practitioners share in many of the data management challenges of data scientists, although usually at a smaller scale. In contrast, O.R. applications tend to focus on problem solution, and O.R. analysts usually use fewer tools during a project.

Visualization is key to the success of data science projects since the information must be consumable to users. Who would want to use Google Now if it presented results in a table with p-values? Similarly, analytics practitioners value data visualization, whereas visualization is much less important to O.R. practitioners [3].

Analysis techniques may also differ because of the large amounts of data collected. O.R. and analytics approaches generally assume a model and then fit the model to the data. The large amounts of data collected in many data science projects enable an alternative, model-free, data-driven approach. For example, automated language translation relied predominantly on manually written rules until increases in storage and compute power made it possible to store and process large bilingual text corpora, from which statistical models could infer the translation rules.
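As a rough illustration of inferring translation rules from bilingual text, the sketch below trains a stripped-down version of IBM Model 1, a classic statistical word-alignment model, on a three-sentence toy corpus. The choice of model, the function names and the corpus are mine for illustration; real systems train on millions of sentence pairs and use far richer models.

```python
from collections import defaultdict


def train_ibm_model1(sentence_pairs, iterations=10):
    """Estimate t[(target_word, source_word)] by expectation-maximization."""
    t = defaultdict(lambda: 1.0)  # uniform start; the E-step normalizes it
    for _ in range(iterations):
        count = defaultdict(float)
        total = defaultdict(float)
        for source, target in sentence_pairs:
            for tw in target:
                # E-step: split each target word's credit across source words
                norm = sum(t[(tw, sw)] for sw in source)
                for sw in source:
                    share = t[(tw, sw)] / norm
                    count[(tw, sw)] += share
                    total[sw] += share
        # M-step: re-estimate translation probabilities from expected counts
        t = defaultdict(
            float, {pair: c / total[pair[1]] for pair, c in count.items()}
        )
    return t


def best_translation(t, source_word):
    """Most probable target word for a source word, or None if unseen."""
    candidates = {tw: p for (tw, sw), p in t.items() if sw == source_word}
    return max(candidates, key=candidates.get) if candidates else None


# Toy English-German corpus; no dictionary or grammar rules are supplied
corpus = [
    ("the house".split(), "das haus".split()),
    ("the book".split(), "das buch".split()),
    ("a book".split(), "ein buch".split()),
]
table = train_ibm_model1(corpus)
for word in ["the", "house", "book", "a"]:
    print(word, "->", best_translation(table, word))
```

No one tells the program that "book" means "buch"; the pairing emerges because the two words keep appearing in the same sentence pairs.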

Duolingo [4], a free language learning website, has created a data product based on a data-driven approach. As users progress through lessons, they help translate websites and documents that others have submitted to be translated for a fee. In other lessons, users vote on the correctness of translations. Statistical models based on user skill then choose the best translation of each document.

O.R., analytics and data science are closely related – all apply math to gain insights – and the fuzzy descriptions of the three disciplines above have boundaries as porous as the borders of countries in the European Union. However, just as a person in Germany is most likely a German (although he or she could be French or Italian), an O.R./analytics/data science practitioner will most likely fit the description outlined in this article.

Brian Keller (keller_brian@bah.com), Ph.D., is a data science practitioner and lead associate at Booz Allen Hamilton. He is a member of INFORMS.

References

 

  1. Matthew Liberatore, Wenhong Luo, “INFORMS and the Analytics Movement: The View of the Membership,” Interfaces, Vol. 41, No. 6, November-December 2011, pp. 578–589.
  2. W. S. Cleveland, “Data Science: An Action Plan for Expanding the Technical Areas of the Field of Statistics,” International Statistical Review, Vol. 69, 2001, pp. 21–26.
  3. Matthew Liberatore, Wenhong Luo, “ASP: The Art and Science of Practice,” Interfaces, Vol. 43, No. 2, March/April 2013, pp. 194–197.
  4. www.duolingo.com
