Share with your friends


Analytics Magazine

Executive Edge: Why data science projects fail

Delivering brilliant ideas is the easy part; execution is often where things go awry.

David MamanBy David Maman

Thousands of companies all over the world are competing for a finite number of data scientists, paying them big bucks to join their organizations – and setting them up for failure.

For most organizations, data science is not the be all and end all. It cannot be the answer if company leadership has not formulated the strategy or questions that data science needs to answer or paved the route to production.

Even organizations that have already invested millions in integrating their complete corporate data infrastructure to create the ultimate data repository – data lakes, data pools, common data sources or whatever pleasant definition your organization has chosen – for integration and analysis will not be able to see results at the rate and accuracy they are demanding once the system is in production.

As with the long-term adoption and implementation of any business process or procedure, fully benefiting from data science takes time. You cannot just jump in head first.

At most companies, data science fails for very specific reasons:

1. Undefined business goals/processes. Just because your data science team can map and predict customer satisfaction rates for the next six months doesn’t mean that information is significant. If the results demonstrate broad-scale customer dissatisfaction, simply predicting the results is woefully inadequate without resolving the roots of that dissatisfaction. You can predict an untold number of data points, but without clearly defined goals accompanied with the will to fix your problems, those data points are meaningless. Many organizations keep following the misconception that data scientists (no matter how smart they are) can correctly define the business goals. Most of the time they can’t, because it’s not their job.

2. Inability to build and apply a uniform data set across the organization. While the politics in your organization may have inspired silos and fiefdoms, each department in your organization is ultimately driving toward the same goal – increasing ROI. To maximize the value of data science, you need to create a uniform data set that will deliver actionable results that every part of the organization can implement.

Just because your data science team can map and predict customer satisfaction rates for the next six months doesn’t mean that information is significant. Photo Courtesy of

Just because your data science team can map and predict customer satisfaction rates for the next six months doesn’t mean that information is significant. Photo Courtesy of

3. Investment in “sexy” algorithms instead of useful ones. You have just hired the data scientist who graduated in the top of her class. Having just joined your company, she’s extremely excited to get started. Because of her in-depth mathematical and academic background, she has some great algorithms that can get some really cool results. However, are those algorithms going to deliver the results that can focus your organization and drive it forward? Just because something is old, tired, tried and true doesn’t mean it won’t work.

Your team needs to examine the full range of algorithms you are using for your modeling and preprocessing with an objective eye. Is the information those algorithms are delivering actually useful? Do they deliver the actionable intelligence that will increase ROI? Often not.

In most cases, the challenge you are facing isn’t new. A simple Google search will lead you to some great research and case studies to see how others tackled the challenge and how they tailored the solution. Be open. Start thinking from a problem perspective and not just from a solution perspective.

4. Inappropriate data or infrastructure. Most organizations are built in silos. Large organizations have both an ERP system and a CRM system and many others, but those systems may not be in sync. They may not even be touching the inbound marketing and sales software. Until everyone in the organization can work with the same data and analyze the same data, siloed data science results aren’t as applicable or effective. Sometimes it takes much longer to determine whether specific data or the volume of data is sufficient for the specific task.

5. Failure to differentiate between academia or research and the real world. Data scientists generally go from the ivory tower directly into the data silos. They generally deliver great results in most properly defined challenges. Remember, though, at the end of the data scientist’s job, the hard work may only be beginning. The engineers need to implement it into production – many times from scratch. Don’t expect your data science team to learn how businesses actually work. It’s up to the engineers and systems analysts to determine your specific system requirements to get their work into production.

The most critical result of any data science activity is relevancy. Every “answer” generated by your data science team needs to answer specific questions derived from the overall corporate strategy. While investment in the “bells and whistles” of AI is fun and exciting, organizations need to make sure that they have a defined strategy, the right idea and a properly mapped roadmap before the first dollar of data science investment is spent.

David Maman is CEO, CTO and co-founder of, whose out-of-the-box data science solutions leverage signal processing combined with machine learning and AI to create models and accelerate delivery of the right answers to critical business questions.

Analytics data science news articles

Related Posts

  • 91
    With the rise of big data – and the processes and tools related to utilizing and managing large data sets – organizations are recognizing the value of data as a critical business asset to identify trends, patterns and preferences to drive improved customer experiences and competitive advantage. The problem is,…
    Tags: data
  • 90
    The Internet of Things (IoT) is considered to be the next revolution that touches every part of our daily life, from restocking ice cream to warning of pollutants. Analytics professionals understand the importance of data, especially in a complicated field such as healthcare. This article offers a framework on integrating…
    Tags: data
  • 87
    Businesses are greatly expanding the autonomous capabilities of their products, services and manufacturing processes to better optimize their reliability and efficiency. The processing of big data is playing an integral role in developing these prescriptive analytics. As a result, data scientists and engineers should pay attention to the following aspects…
    Tags: data
  • 87
    Data science has seen a dramatic rise in the last decade. The LinkedIn 2017 US Emerging Jobs Report revealed the two fastest growing jobs as “machine learning engineer” and “data scientist.” Universities are struggling to keep up with this trend, assembling new programs to address the growing need for data…
    Tags: data, science
  • 86
    Frontline Systems releases Analytic Solver V2018 for Excel Frontline Systems, developer of the Solver in Microsoft Excel, recently released Analytic Solver V2018, its full product line of predictive and prescriptive analytics tools that work in Microsoft Excel. The new release includes a visual editor for multi-stage “data science workflows” (also…
    Tags: data


Fighting terrorists online: Identifying extremists before they post content

New research has found a way to identify extremists, such as those associated with the terrorist group ISIS, by monitoring their social media accounts, and can identify them even before they post threatening content. The research, “Finding Extremists in Online Social Networks,” which was recently published in the INFORMS journal Operations Research, was conducted by Tauhid Zaman of the MIT, Lt. Col. Christopher E. Marks of the U.S. Army and Jytte Klausen of Brandeis University. Read more →

Syrian conflict yields model for attrition dynamics in multilateral war

Based on their study of the Syrian Civil War that’s been raging since 2011, three researchers created a predictive model for multilateral war called the Lanchester multiduel. Unless there is a player so strong it can guarantee a win regardless of what others do, the likely outcome of multilateral war is a gradual stalemate that culminates in the mutual annihilation of all players, according to the model. Read more →

SAS, Samford University team up to generate sports analytics talent

Sports teams try to squeeze out every last bit of talent to gain a competitive advantage on the field. That’s also true in college athletic departments and professional team offices, where entire departments devoted to analyzing data hunt for sports analytics experts that can give them an edge in a game, in the stands and beyond. To create this talent, analytics company SAS will collaborate with the Samford University Center for Sports Analytics to support teaching, learning and research in all areas where analytics affects sports, including fan engagement, sponsorship, player tracking, sports medicine, sports media and operations. Read more →



INFORMS Annual Meeting
Nov. 4-7, 2018, Phoenix

Winter Simulation Conference
Dec. 9-12, 2018, Gothenburg, Sweden


Making Data Science Pay
Oct. 29 -30, 12 p.m.-5 p.m.

Applied AI & Machine Learning | Comprehensive
Starts Oct. 29, 2018 (live online)

The Analytics Clinic
Citizen Data Scientists | Why Not DIY AI?
Nov. 8, 2018, 11 a.m. – 12:30 p.m.

Advancing the Analytics-Driven Organization
Jan. 28–31, 2019, 1 p.m.– 5 p.m. (live online)


CAP® Exam computer-based testing sites are available in 700 locations worldwide. Take the exam close to home and on your schedule:

For more information, go to