Share with your friends










Submit

Analytics Magazine

Executive Edge: Why data science projects fail

Delivering brilliant ideas is the easy part; execution is often where things go awry.

David MamanBy David Maman

Thousands of companies all over the world are competing for a finite number of data scientists, paying them big bucks to join their organizations – and setting them up for failure.

For most organizations, data science is not the be all and end all. It cannot be the answer if company leadership has not formulated the strategy or questions that data science needs to answer or paved the route to production.

Even organizations that have already invested millions in integrating their complete corporate data infrastructure to create the ultimate data repository – data lakes, data pools, common data sources or whatever pleasant definition your organization has chosen – for integration and analysis will not be able to see results at the rate and accuracy they are demanding once the system is in production.

As with the long-term adoption and implementation of any business process or procedure, fully benefiting from data science takes time. You cannot just jump in head first.

At most companies, data science fails for very specific reasons:

1. Undefined business goals/processes. Just because your data science team can map and predict customer satisfaction rates for the next six months doesn’t mean that information is significant. If the results demonstrate broad-scale customer dissatisfaction, simply predicting the results is woefully inadequate without resolving the roots of that dissatisfaction. You can predict an untold number of data points, but without clearly defined goals accompanied with the will to fix your problems, those data points are meaningless. Many organizations keep following the misconception that data scientists (no matter how smart they are) can correctly define the business goals. Most of the time they can’t, because it’s not their job.

2. Inability to build and apply a uniform data set across the organization. While the politics in your organization may have inspired silos and fiefdoms, each department in your organization is ultimately driving toward the same goal – increasing ROI. To maximize the value of data science, you need to create a uniform data set that will deliver actionable results that every part of the organization can implement.

Just because your data science team can map and predict customer satisfaction rates for the next six months doesn’t mean that information is significant. Photo Courtesy of ThinkStock.com

Just because your data science team can map and predict customer satisfaction rates for the next six months doesn’t mean that information is significant. Photo Courtesy of ThinkStock.com

3. Investment in “sexy” algorithms instead of useful ones. You have just hired the data scientist who graduated in the top of her class. Having just joined your company, she’s extremely excited to get started. Because of her in-depth mathematical and academic background, she has some great algorithms that can get some really cool results. However, are those algorithms going to deliver the results that can focus your organization and drive it forward? Just because something is old, tired, tried and true doesn’t mean it won’t work.

Your team needs to examine the full range of algorithms you are using for your modeling and preprocessing with an objective eye. Is the information those algorithms are delivering actually useful? Do they deliver the actionable intelligence that will increase ROI? Often not.

In most cases, the challenge you are facing isn’t new. A simple Google search will lead you to some great research and case studies to see how others tackled the challenge and how they tailored the solution. Be open. Start thinking from a problem perspective and not just from a solution perspective.

4. Inappropriate data or infrastructure. Most organizations are built in silos. Large organizations have both an ERP system and a CRM system and many others, but those systems may not be in sync. They may not even be touching the inbound marketing and sales software. Until everyone in the organization can work with the same data and analyze the same data, siloed data science results aren’t as applicable or effective. Sometimes it takes much longer to determine whether specific data or the volume of data is sufficient for the specific task.

5. Failure to differentiate between academia or research and the real world. Data scientists generally go from the ivory tower directly into the data silos. They generally deliver great results in most properly defined challenges. Remember, though, at the end of the data scientist’s job, the hard work may only be beginning. The engineers need to implement it into production – many times from scratch. Don’t expect your data science team to learn how businesses actually work. It’s up to the engineers and systems analysts to determine your specific system requirements to get their work into production.

The most critical result of any data science activity is relevancy. Every “answer” generated by your data science team needs to answer specific questions derived from the overall corporate strategy. While investment in the “bells and whistles” of AI is fun and exciting, organizations need to make sure that they have a defined strategy, the right idea and a properly mapped roadmap before the first dollar of data science investment is spent.

David Maman is CEO, CTO and co-founder of Binah.ai, whose out-of-the-box data science solutions leverage signal processing combined with machine learning and AI to create models and accelerate delivery of the right answers to critical business questions.

Analytics data science news articles

Related Posts

  • 91
    With the rise of big data – and the processes and tools related to utilizing and managing large data sets – organizations are recognizing the value of data as a critical business asset to identify trends, patterns and preferences to drive improved customer experiences and competitive advantage. The problem is,…
    Tags: data
  • 90
    The Internet of Things (IoT) is considered to be the next revolution that touches every part of our daily life, from restocking ice cream to warning of pollutants. Analytics professionals understand the importance of data, especially in a complicated field such as healthcare. This article offers a framework on integrating…
    Tags: data
  • 87
    Businesses are greatly expanding the autonomous capabilities of their products, services and manufacturing processes to better optimize their reliability and efficiency. The processing of big data is playing an integral role in developing these prescriptive analytics. As a result, data scientists and engineers should pay attention to the following aspects…
    Tags: data
  • 86
    Frontline Systems releases Analytic Solver V2018 for Excel Frontline Systems, developer of the Solver in Microsoft Excel, recently released Analytic Solver V2018, its full product line of predictive and prescriptive analytics tools that work in Microsoft Excel. The new release includes a visual editor for multi-stage “data science workflows” (also…
    Tags: data
  • 85
    Today, we live in a digital society. Our distinct footprints are in every interaction we make. Data generation is a default – be it from enterprise operational systems, logs from web servers, other applications, social interactions and transactions, research initiatives and connected things (Internet of Things). In fact, according to…
    Tags: data


Headlines

Does negative political advertising actually work?

While many potential voters dread campaign season because of pervasive negative political advertising, a new study has found that negative political advertising actually works, but perhaps not in the way that many may assume. The study “A Border Strategy Analysis of Ad Source and Message Tone in Senatorial Campaigns,” which will be published in the June edition of INFORMS’ journal Marketing Science, is co-authored by Yanwen Wang of the University of British Columbia in Vancouver, Michael Lewis of Emory University in Atlanta and David A. Schweidel of Georgetown University in Washington, D.C. Read more →

Meet Summit, world’s most powerful, smartest scientific supercomputer

The U.S. Department of Energy’s Oak Ridge National Laboratory on June 8 unveiled Summit as the world’s most powerful and smartest scientific supercomputer. With a peak performance of 200,000 trillion calculations per second – or 200 petaflops – Summit will be eight times more powerful than ORNL’s previous top-ranked system, Titan. For certain scientific applications, Summit will also be capable of more than three billion billion mixed precision calculations per second, or 3.3 exaops. Read more →

Employee engagement a top concern affecting customer experience

Employee engagement has surfaced as a major concern in delivering improvements in customer experience (CX), with 86 percent of CX executives in a Gartner, Inc. survey ranking it as having an equal or greater impact than other factors such as project management and data skills. “CX is a people issue,” says Olive Huang, research vice president at Gartner. “In some instances, the best technology investments have been derailed by employee factors, such as a lack of training or incentives, low morale or commitment, and poor communication of goals." Read more →

UPCOMING ANALYTICS EVENTS

INFORMS-SPONSORED EVENTS

INFORMS Annual Meeting
Nov. 4-7, 2018, Phoenix

OTHER EVENTS

Making Data Science Pay
July 30-31, 12:30 p.m.-5 p.m.


Predictive Analytics: Failure to Launch Webinar
Aug. 18, 11 a.m.


Applied AI & Machine Learning | Comprehensive
Sept. 10-13, 17-20 and 24-25


Advancing the Analytics-Driven Organization
Sept. 17-20, 12-5 p.m. LIVE Online


The Analytics Clinic: Ensemble Models: Worth the Gains?
Sept. 20, 11 a.m. -12:30 p.m.

CAP® EXAM SCHEDULE

CAP® Exam computer-based testing sites are available in 700 locations worldwide. Take the exam close to home and on your schedule:


 
For more information, go to 
https://www.certifiedanalytics.org.