November 24, 2006

Bottoms Up or Top Down? The Eternal Business Intelligence Question

There are two kinds of client prospects that a business intelligence systems consultant is likely to encounter: those who already know something about business intelligence systems and data warehousing, and those who don’t. Of the former, practically all of them have read at least one book by Ralph Kimball, or Bill Inmon, or both, and invariably ask: which methodology do you, the consultant, recommend?

Well, this consultant is likely to say both, or neither, or it depends. Neither approach is perfect for all clients, but both contain elements that are useful in almost any context. The issues have been discussed in many other blogs and forums, and I’m sure I have nothing revolutionary to say about it. Â

But this is my blog, and I want to communicate with my prospective clients, so I’m going to lay out my take on it.

The majority of projects I have been involved in started with a scope limited to a particular business process, and I suspect this is true for the bulk of enterprises tackling their first business intelligence project. They are set up as “pilot” projects or “proofs of concept”, sponsored by an executive or department head with a specific objective in mind. The limitation of scope (and of time and resources) forces the adoption of a focused methodology that will produce a useful result in the shortest possible time. That limits the options to the “bottom up” dimensional modeling approach promoted by Ralph Kimball.

Dimensional modeling is a bottom-up approach to data warehousing because what it produces is not a data warehouse but a data mart. Â When applied to multiple business processes, it results in a collection of data marts that is referred to as the data warehouse

Occasionally an enterprise looks to overhaul the way it does business entirely. It does one of two things: a complete business process re-engineering, or the implementation of an enterprise resource planning system (ERP). These amount to the same thing, but in the latter, the ERP forces change on the business processes according to its own model, while in the former the enterprise remakes itself in response to an examination of its goals, objectives, and the requirements of the market place. The business then drives the selection or development of the information systems, rather than the other way around.

In these cases, the development of a data warehouse can, and should, proceed from a top-down perspective, in parallel with the rest of the process. The resulting data warehouse is a truly integrated operational data store, based on the reengineered business model, from which application data marts can be extracted virtually on demand.

The big drawback to the top-down approach is the protracted lead time from project inception to the first usable result, though subsequent applications follow very quickly. The drawback to the bottom-up approach is that application data marts may not integrate with each other very well.Â

Consider, for example, in a hypothetical apparel business, a data mart designed for advertising versus one designed for inventory management. Inventory management needs to know every unique item at every location at every point in time. That data mart will use a product dimension based on product UPC, which encompasses style, material, color and size. Advertising, on the other hand, really only cares about the style, maybe with pictures in different colors, but not unique identifying codes. And time is not generally a consideration, though locations might be.

If the inventory management data mart had been created first, the more summary data needed by Advertising could be extracted from that. But if the advertising data mart is created first, the inventory management data mart has to be built from scratch because the advertising data is not sufficiently granular. Subsequently, the processes that produce the advertising data mart should be reengineered to work with the inventory management data.

For this reason, even when a first business intelligence project is for a specific department or business process, a top-down analysis should be done at the outset in anticipation of future requirements. The goal of the analysis is to produce what Kimball refers to as the data warehouse bus architecture matrix, in which the business processes of the organization are listed, and related to the dimensions of their information. The granularity of each dimension needs to be carefully considered during this process.

(Educational sidebar: All data is made up of “attributes”. Attributes used for organizing and finding data are “key” attributes, and can be used as “dimensions” of the data. Non-key numerical attributes on which you can operate mathematically to analyze the business are called “facts”.)

The dimensions in the matrix then should be related to primary data sources. Every data mart must get its data from somewhere – the files and/or databases associated with the enterprise’s legacy applications. Some of this data is “primary”, meaning it is original data, such as transaction inputs. Other data is secondary – derived from primary inputs by some other application. The data warehouse should be populated from primary data sources, to guarantee data consistency across all data marts. It should be able to reproduce secondary data, to demonstrate that the rules of derivation have been accounted for.

The architecture of a data warehouse as defined by the top-down method, is highly integrated.  It looks quite different from the architecture of a bottom-up design, which is not. This type of data warehouse is not organized dimensionally, as is a data mart (technically called a “star schema”). Rather, it is rigorously structured over many tables, with a broad selection of data attributes from which to choose the dimensions and facts of a particular application. Such a database is not easily queried by non-expert users, and even available query-and-report tools may not be much help, as they generally work best with a star schema. If the database is large, as it would be for an enterprise data warehouse, the performance (in terms of response time) may be very slow.

Readers familiar with the SAP Business Warehouse product know that this is an exemplary implementation of Inmon’s top-down concepts. It relies heavily on extensive metadata and a dense layer of middleware to make it work. (Not that there’s anything wrong with that!). Thousands of development hours, and millions of investment dollars (or Deutschmarks), made this possible, and if your company buys the SAP Enterprise Resource Planning solution, you get it all for free. Really.Â

But if you haven’t gone that route, a true top-down enterprise data warehouse is probably not a cost-effective solution. Even if you don’t limit the scope of the project to a single business process, you will get results more quickly by following a methodology closer to Kimball’s.

The Worcester Group, Inc. approach is to create the data warehouse bus architecture matrix first, then identify and prioritize data mart projects. We then design a dimensioned operational data store, based on the bus matrix, that will deliver the data to satisfy the highest priority projects. The data marts for individual applications can then easily be extracted from this. An additional benefit is that many projects that were not high priority will be accommodated by this approach, their development time will be significantly shortened, and therefore their return on investment will be improved.

As the business grows, there may be new business processes added to the matrix. Most of these will involve the same dimensions previously encountered. If new dimensions need to be added, this will have no effect on the previously identified processes. The bus matrix thus insures the data warehouse will be scalable and robust. It serves as the cornerstone of the data warehouse.

November 14, 2006

The Price of “Free”

Eric Schmidt, of Google, says my cell phone should be free, its cost subsidized by advertising. Bad idea. Advertising is intrusive enough. We need less, not more. In the age of Business Intelligence, intrusive advertising is no longer intelligent business.

Half of every publication I buy is taken up with advertising. Just trying to find the table of contents is an effort for which I could profitably employ Google. In some magazines, there are twenty pages of advertising right up front, and the table of contents is spread over two half pages, which are shared with (and not easily distinguishable from) still more advertising. Once I have found the table of contents, and know the page number of the cover story that caught my attention in the first place, finding that page number becomes yet another arduous search effort, because at least half the pages are not numbered at all. And if there is a “special advertising supplement”, cleverly disguised as an article, with twenty or so pages inserted between pages 50 (which is numbered) and 51 (which is not), I give up the search.

Twenty minutes out of every hour on TV is taken up with advertising. Unless I have recorded the material (or have a TIVO), these intrusions on my limited, linear time cannot be avoided. I may take advantage of the interruption to go to the bathroom or the kitchen, but they are still an unwelcome attribute of “free” TV. (Which, by the way, is not free in my neck of the woods, because you can’t get anything if you don’t have cable.)

When you buy a magazine or a newspaper, most of what you pay is not for the content. That is supported by advertising. You are paying for distribution costs. Somebody had to lug all that paper from the printer to the distributor to the news stand. So you are really paying for the advertising. What you pay for with basic cable is access to a distribution channel that is largely supported by advertising. Again, you are paying mainly for the advertising.

This advertising model is sooo last century.  Before the Internet, the only way to get the advertising message out to a receptive audience was to bombard everyone in listening range with a blitzkrieg of repetitive visuals, slogans and jingles in the hope that you could get someone’s attention.

Marketers were aware that certain things could create or enhance responsiveness in the marketplace. Price promotions were one potent weapon. Leveraging consumer impulses was another (for example, the placement of gum and candy by the checkout register). Coat-tail effects were also exploited (like placing a coupon for spagetti sauce on a pasta carton).Â

Amazon.com applied these ideas on its website, taking advantage of the Internet’s interactivity to track customer preferences and target promotions to customers who were likely to be receptive, based on what was in their “shopping cart” and on their account history. Now, most commercial sites on the Internet do the same thing.

This is effective, and non-intrusive, advertising. I call it intelligent business, fueled by business intelligence; a marked contrast to the ambient message traffic in which we are constantly immersed that is now just so much black noise. We need less of the latter, rather than more.

My cell phone is a tool. It contains all my contacts, my calendar, a notepad. I give the number out to my customers so they can reach me when they need me. Every few years I pay for a new one that does a better job, or to replace one that is wearing out. I don’t have time to voluntarily listen to or view advertising. I highly resent the time I spend cleaning junk e-mail or voice mail from my in-boxes, and I think anyone but a pre-teen with her first cell phone would, as well. If my cell phone were advertising supported, on the other hand, the advertisers would know that they have to place their ads as obstacles to my use of the phone, or they will never be seen.

Thanks, but no thanks. I’ll pay for my phone to keep it advertising-free.

Welcome to the Worcester Group, Inc. Blog

Welcome to my blog!

My name is Eric Weiss, and I am the founder and principal consultant of The Worcester Group, Inc. In the next few weeks, we will have completed the development of our web site. But in the meantime, I plan to use this blog to introduce myself and my company to the marketplace.

We are Business Intelligence Systems consultants. “Business Intelligence” is information that a business decision maker can use to make choices that will benefit the company’s growth and profitability. It includes internal and external subject matter, collected from the business’s own information systems and from outside sources. At The Worcester Group, Inc., our mission is to assist small and midsized companies to compete effectively by enhancing the quality and accessibility of Business Intelligence.

Among the services we offer are: Â

  • data strategy and consulting
  • data warehouse consulting
  • decision support system consulting
  • sales business intelligence

We provide both technical and non-technical assistance in:

  • identifying our clients’ information needs
  • designing, developing and implementing systems to meet those needs
  • analyzing and integrating data from internal and external sources
  • evaluating and selecting the best software products for the client.

We have worked with companies in a variety of industries, from grocery products to pharmaceuticals to entertainment to luxury fashion, developing systems for sales management, competitive analysis, and logistics management.
In future posts, I plan to talk about these and related subjects, and maybe have some dialog with my readers on topics of interest. Â If you found this page while looking for information on data warehousing and business intelligence, and would like to know more about our services, you can call us at 201-222-0908, or drop us a line at wcstrgrp@aol.com until our new site is up and running.