Share with your friends


Analytics Magazine

Executive Edge: It pays to modernize your data architecture

September/October 2015

business analytics news and articles

Arvind PurushothamanBy Arvind Purushothaman

In today’s world where data is collected at every interaction, be it over the phone, mobile, PC, sensors, with or without us knowing, it becomes important to have a strategy around data. Traditionally, data has been seen as something to “run the business,” but, in today’s context, it can actually “be the business” if monetized well. An example of Internet of Things (IoT) data in a customer context is the wristband one wears at amusement parks that provides real-time data about customer interaction at all times, and this data can be processed in near real time to push out relevant offers and alerts to enhance the customer experience. The question is: How do organizations prepare themselves to take advantage of data?

The key lies in building a modern data architecture that is open, flexible and scalable, something that can accommodate your existing data assets as well as potential new ones. Before we talk about specific steps to modernize data architecture, let’s look at typical challenges:

  1. Many applications within the organization have been around for 20 or more years. While the usage for some of them is known, it is still not clear who is leveraging the data in each application and for what purpose. How do we find out?
  2. To meet their reporting needs, organizations have built multiple data assets including data warehouses and data marts. Additionally, they have power users collating data from multiple sources and creating reports using MS Excel. Numbers are inconsistent and vary based on who is preparing them and the intended purpose.
  3. Organizations have multiple applications and data assets starting with mainframe-based ones, client-server, Web applications and some newer cloud-based applications, all co-existing. They struggle to find the right people to support the applications, especially the older ones.
  4. Organizations are aware of the new developments in the big data space including NoSQL databases and the Hadoop ecosystem, and have typically embarked on some initiatives to get started on this. The main challenge is around integrating this with the traditional data warehouse technologies.
  5. People, and by extension, their skills, are the biggest assets of any organization. CIOs are concerned about having to find an army of programmers for populating Hadoop-based data repositories. The other big concern is how to leverage existing SQL skills, which people have acquired over the years.

These are valid concerns, and some are more applicable than others based on the context. Nonetheless, given the inevitable need to be able to better monetize data and modernize technology platforms, it is important to have a strategy. I recommend the
following approach:

  1. Data asset inventory: Create a complete list of data assets – legacy, data warehouses, data marts, data islands. Identify the data flows between these assets and the usage patterns. It might be particularly hard for some legacy systems, but this serves as the starting point for any consolidation and modernization.
  2. Data asset rationalization: Based on the list of data assets and the usage, it is important to rationalize them. What this means is to identify if the same data is coming from multiple applications, and if so, which is the authoritative source, and which ones can be retired. This is a very important exercise and can help consolidate the number of data assets to a manageable few. In this context, master data management is critical to ensure you have good quality data.
  3. Data lineage: Undertaking a data lineage exercise to identify data flows – creating detailed documentation especially for the legacy applications – is a must. This greatly reduces the risk of dependency on key personnel and also makes it easier to migrate to a future state architecture.
  4. Data infrastructure: Have a big data and cloud strategy in place to bring in newer technologies in a pilot mode. Start with a non-legacy application to understand the technology, and move applications over in conjunction with data asset rationalization. The “data on cloud” is going to be an important component of modern architecture especially when dealing with IoT data.
  5. Data technology: It pays to understand the different options available in a very crowded and rapidly evolving marketplace, and to select the right technologies that fit into your architecture from a technology standpoint as well as a people standpoint. For example, using a data integration tool with big data connectors will eliminate the need for people who can write MapReduce code.

Creating a holistic data strategy in light of changes in the business, and taking a structured approach, will definitely help lay a solid foundation that will be the basis for monetizing data.

Arvind Purushothaman is the practice head and senior director of Information Management & Analytics at Virtusa’s Chennai ATC of Virtusa, an information technology services provider with global reach. He has 19 years of industry experience, with a focus on planning and executing data management and analytics initiatives. Prior to taking on this role, he was involved in architecting and designing centers of excellence, as well as service-delivery functions focused on information management encompassing traditional data warehousing, master data management and analytical reporting. He holds an MBA from Georgia State University.

Related Posts

  • 63
    Imagine a sensor inside an offshore drilling rig. The sensor checks for damage to a critical valve. To do so, the sensor regulates pressure in the oil well 7,000 feet below the ocean’s surface. This sensor generates data that might have gone unnoticed half a decade ago. Back then, the…
    Tags: data, big, things, internet
  • 56
    In today’s world where data is collected at every interaction, be it over the phone, mobile, PC, sensors, with or without us knowing, it becomes important to have a strategy around data. Traditionally, data has been seen as something to “run the business,” but, in today’s context, it can actually…
    Tags: data, architecture, internet, things
  • 53
    FEATURES Fulfilling the promise of analytics By Chris Mazzei Strategy, leadership and consumption: The keys to getting the most from big data and analytics focus on the human element. How to get the most out of data lakes By Sean Martin A handful of requisite business skills that facilitate self-service…
    Tags: data, big, things, internet
  • 52
    IoT (Internet of Things) devices have become increasingly popular in recent years. They are all around us – from fitness trackers on our wrists to smart thermostats in our homes – and adoption will only continue to grow in the coming years. In fact, Gartner, Inc. reported that 5.5 million…
    Tags: data, things, internet
  • 50
    Through advances such as big data and the Internet of Things (IoT), the field of analytics has been growing by leaps and bounds. However, much of the focus, particularly in the United States, has been in the consumer (i.e., business-to-consumer, or B2C) market. Indeed, much of the innovation coming out…
    Tags: data, things, internet


Using machine learning and optimization to improve refugee integration

Andrew C. Trapp, a professor at the Foisie Business School at Worcester Polytechnic Institute (WPI), received a $320,000 National Science Foundation (NSF) grant to develop a computational tool to help humanitarian aid organizations significantly improve refugees’ chances of successfully resettling and integrating into a new country. Built upon ongoing work with an international team of computer scientists and economists, the tool integrates machine learning and optimization algorithms, along with complex computation of data, to match refugees to communities where they will find appropriate resources, including employment opportunities. Read more →

Gartner releases Healthcare Supply Chain Top 25 rankings

Gartner, Inc. has released its 10th annual Healthcare Supply Chain Top 25 ranking. The rankings recognize organizations across the healthcare value chain that demonstrate leadership in improving human life at sustainable costs. “Healthcare supply chains today face a multitude of challenges: increasing cost pressures and patient expectations, as well as the need to keep up with rapid technology advancement, to name just a few,” says Stephen Meyer, senior director at Gartner. Read more →

Meet CIMON, the first AI-powered astronaut assistant

CIMON, the world’s first artificial intelligence-enabled astronaut assistant, made its debut aboard the International Space Station. The ISS’s newest crew member, developed and built in Germany, was called into action on Nov. 15 with the command, “Wake up, CIMON!,” by German ESA astronaut Alexander Gerst, who has been living and working on the ISS since June 8. Read more →



INFORMS Computing Society Conference
Jan. 6-8, 2019; Knoxville, Tenn.

INFORMS Conference on Business Analytics & Operations Research
April 14-16, 2019; Austin, Texas

INFORMS International Conference
June 9-12, 2019; Cancun, Mexico

INFORMS Marketing Science Conference
June 20-22; Rome, Italy

INFORMS Applied Probability Conference
July 2-4, 2019; Brisbane, Australia

INFORMS Healthcare Conference
July 27-29, 2019; Boston, Mass.

2019 INFORMS Annual Meeting
Oct. 20-23, 2019; Seattle, Wash.

Winter Simulation Conference
Dec. 8-11, 2019: National Harbor, Md.


Advancing the Analytics-Driven Organization
Jan. 28–31, 2019, 1 p.m.– 5 p.m. (live online)


CAP® Exam computer-based testing sites are available in 700 locations worldwide. Take the exam close to home and on your schedule:

For more information, go to