Share with your friends


Analytics Magazine

Healthcare Analytics: Data governance and management

The fundamental building blocks for a sustainable data analytics program

Rajib Ghosh healthcare analyticsBy Rajib Ghosh

The healthcare analytics industry is making great strides. As part of my work I talk to many data analytics companies who report they are very busy with implementation projects. One large company told me that its implementation staffs are booked until end of the first quarter of 2017. This is evidence that the demand for healthcare analytics is strong.

I predicted the rise in demand during the latter part of 2016 in my previous articles. Payment reform has started to accelerate, with more states and commercial payers moving their contracts with providers from volume to value. Without a strong data analytics program, it is impossible for many provider organizations to stay viable in this new environment.

Many data analytics companies report their clients are missing the basic building blocks for a successful data analytics program: data governance and data management. Healthcare organizations jumped on the data analytics bandwagon without establishing a process of data governance. In this article I will outline the need for data governance and my experience leading such an initiative within a complex organization.

Data Has a Time Value

Data has a time value, and this is not a secret. Information and insights from data have the best value when the business is attempting to understand why something just happened. The longer the process takes, the less this value becomes. For data analytics to produce the best return on investment, it is important for the business to have the relevant data available at the fingertips of the analyst so that the analysis can be produced quickly to help decision-makers make decisions in a timely fashion. Therefore, the longer it takes to collect all the data and conduct the analysis the less effective the action becomes for the business (see Figure 1).

Figure 1: The longer it takes to collect data and complete analysis, the less effective it becomes in informing decisions. Source: Rajib Ghosh

Figure 1: The longer it takes to collect data and complete analysis, the less effective it becomes in informing decisions. Source: Rajib Ghosh

According to Health Catalyst, a leading U.S. healthcare analytics company, about 80 percent of an analyst’s time is spent in searching for the relevant data. That is a huge waste of expensive resources and valuable time for any organization.

Data Quality: The Other Big Issue

Access to data alone is not enough. Access to good quality data is extremely important to make the work of analytics worthwhile. According to IBM, one in three business leaders today in the United States do not trust their own data. Poor data quality costs the U.S. economy around $3.1 trillion a year. What is the point in conducting sophisticated analysis with expensive data analysts and scientists if that is the case?

In my experience data quality analysis is not always done adequately within healthcare organizations. It is one of the most tedious tasks that seldom bears any glamor. It is, however, a key function of what the data industry calls the role of a data steward. A data steward needs to have the necessary knowledge about the content and the metadata to assess the quality of the data, and then work with business units to correct what is wrong. Sometimes it may lead to fixing issues with the data acquisition process and even standardizing the data vocabulary.

Data Governance: Key Piece of the Puzzle

Timely access and quality bring us to data governance. Organizations need to invest time and resources to build a well-defined data governance process. It begins with identifying key people to participate in an organizational data governance committee. Many times the committee can be appointed by the CEO or the board of directors to ensure that the data in the organization is treated like an asset. In the case of healthcare organizations, members of such a committee may include the chief data officer, chief analytics officer, chief financial officer, chief medical officer, chief information officer, chief operating officer, etc. In other words, fairly senior members of the organization or top business unit leaders.

A data governance committee has to ensure that the organization’s data is governed appropriately, maintenance of metadata definitions and business rules are followed, and appropriate levels of data privacy and security audits are in place. Healthcare organizations benefit when they bring their clinical, operational and financial data together to develop a single definition of truth. To achieve that goal, the data governance committee needs leadership representation from all those functional areas.

Data quality analysis is not always done adequately within healthcare organizations. Photo Courtesy of | scanrail

Data quality analysis is not always done adequately within healthcare organizations.
Photo Courtesy of | scanrail

Data Governance for a Networked Organization

I was asked to chair a data governance committee of a network of several healthcare organizations. The goal was to build data governance policies and procedures for building a centralized data analytics infrastructure. To achieve this goal we established a committee with leadership representation from all the functional areas as stated in the previous section. We embraced a framework that focused on four key areas: governance, stewardship, management and compliance.

We diligently worked on various policies and procedures that not only addressed the needs of the present day but also considered emerging opportunities such as data on social determinants of health, mental health and substance use.

To establish transparency in data flow between networked organizations, we implemented strict data access monitoring and reporting policies and procedures. We also defined oversight of data stewardship as a key role of the committee. Committee members were given the responsibility of developing data for a stewardship program within their own organization as well as within the centralized data organization that assumed the responsibility for the data analytics program. The committee oversaw specifications of data extraction, transformation and load specifications, along with adequate data security and privacy measures. The upfront time spent in crafting the governance process enabled this complex network of organizations to develop a data analytics infrastructure with confidence and transparency.

Data governance is a fundamental building block for successful and sustainable data analytics programs for organizations of any size and complexity. It is not a glamorous job. It does not create press cycles. It is also not very well understood by executives responsible for data infrastructure. However, technology for data analytics has now become a commodity with many vendors striving to earn enterprise business. Once the fundamental building block of data governance is in place, the data team of any organization can feel confident that they have established a sustainable and effective analytics program that will eventually garner kudos in the boardrooms.

Rajib Ghosh ( is an independent consultant and business advisor with 20 years of technology experience in various industry verticals where he had senior-level management roles in software engineering, program management, product management and business and strategy development. Ghosh spent a decade in the U.S. healthcare industry as part of a global ecosystem of medical device manufacturers, medical software companies and telehealth and telemedicine solution providers. He’s held senior positions at Hill-Rom, Solta Medical and Bosch Healthcare. His recent work interest includes public health and the field of IT-enabled sustainable healthcare delivery in the United States as well as emerging nations.

Related Posts

  • 66
    The 2016 election is a watershed moment for the U.S. healthcare industry. Any presidential election and change of guards come with changes in policies. It happened in 2008 when President Obama was sworn into the office. That led to the establishment of the Affordable Care Act (ACA) or Obamacare. To…
    Tags: healthcare, data, analytics, management
  • 65
    January/February Cybersecurity: new threats, new solutions The IOT and related, hidden security risks Can analytics save U.S. healthcare system? March/April Supply chain advances and solutions Software survey: vehicle routing Capitalizing on AI & machine learning May/June Social media, marketing & analytics Real-time customer personalization Next generation revenue management July/August Software…
    Tags: analytics, data, management, healthcare
  • 64
    Features Eyes on the road, not dashboards How automated analytics help detect significant business incidents; each anomaly creates an opportunity to save or earn money. By Patrick Vernon Basic Sales Analysis So much data, so little insight. Twelve ideas for anyone assigned the task of analyzing a firm’s sales data.…
    Tags: data, healthcare, analytics, governance
  • 64
    Basing decisions on reliable, well-understood data has become table stakes in many industries. Furthermore, the advanced use of analytics to derive hidden insights from information has quickly become the new frontier for creating competitive advantage. Data-driven decision-making has completely transformed a variety of industries, beginning several decades ago. Healthcare, on…
    Tags: data, analytics, healthcare, governance
  • 64
    Features Forum: Anxiety over AI Now is the time to address misunderstandings regarding artificial intelligence to alleviate fears, before it’s too late. By Joseph Byrum The rise of self-service analytics As SSA gains momentum, the need for data governance increases in order to drive true business value going forward. By…
    Tags: analytics, data, healthcare, governance


Using machine learning and optimization to improve refugee integration

Andrew C. Trapp, a professor at the Foisie Business School at Worcester Polytechnic Institute (WPI), received a $320,000 National Science Foundation (NSF) grant to develop a computational tool to help humanitarian aid organizations significantly improve refugees’ chances of successfully resettling and integrating into a new country. Built upon ongoing work with an international team of computer scientists and economists, the tool integrates machine learning and optimization algorithms, along with complex computation of data, to match refugees to communities where they will find appropriate resources, including employment opportunities. Read more →

Gartner releases Healthcare Supply Chain Top 25 rankings

Gartner, Inc. has released its 10th annual Healthcare Supply Chain Top 25 ranking. The rankings recognize organizations across the healthcare value chain that demonstrate leadership in improving human life at sustainable costs. “Healthcare supply chains today face a multitude of challenges: increasing cost pressures and patient expectations, as well as the need to keep up with rapid technology advancement, to name just a few,” says Stephen Meyer, senior director at Gartner. Read more →

Meet CIMON, the first AI-powered astronaut assistant

CIMON, the world’s first artificial intelligence-enabled astronaut assistant, made its debut aboard the International Space Station. The ISS’s newest crew member, developed and built in Germany, was called into action on Nov. 15 with the command, “Wake up, CIMON!,” by German ESA astronaut Alexander Gerst, who has been living and working on the ISS since June 8. Read more →



INFORMS Computing Society Conference
Jan. 6-8, 2019; Knoxville, Tenn.

INFORMS Conference on Business Analytics & Operations Research
April 14-16, 2019; Austin, Texas

INFORMS International Conference
June 9-12, 2019; Cancun, Mexico

INFORMS Marketing Science Conference
June 20-22; Rome, Italy

INFORMS Applied Probability Conference
July 2-4, 2019; Brisbane, Australia

INFORMS Healthcare Conference
July 27-29, 2019; Boston, Mass.

2019 INFORMS Annual Meeting
Oct. 20-23, 2019; Seattle, Wash.

Winter Simulation Conference
Dec. 8-11, 2019: National Harbor, Md.


Advancing the Analytics-Driven Organization
Jan. 28–31, 2019, 1 p.m.– 5 p.m. (live online)


CAP® Exam computer-based testing sites are available in 700 locations worldwide. Take the exam close to home and on your schedule:

For more information, go to