Few of us would doubt that a data warehouse strategy should include the hardware/software platforms that are part of the architecture. These platforms can include multi-subject data warehouses, subject-specific data marts, master data management repositories, data warehouse appliances, enterprise information integration (EII) federated databases, operational data stores, and direct access to data in operational systems. However, when you're creating your data warehouse strategy, there are many other factors to consider as well.
Cloud Verses On-Premise Hosting
Open Source Verses Proprietary Software
The choice between open source and proprietary commercial software is also not an either/or decision, and most organizations are likely to deploy both. Major decision criteria include functional requirements and vendor (or community) support as well as total cost of ownership.
Data Delivery Vehicles
Although one of the early goals in the design of a data warehouse architecture was to ensure the delivery of the right information to the right people at the right time, it should now be extended to include "regardless of where the recipient happens to be." We live in a mobile society and the ability to run an analysis and access information should not be limited to desktop PCs or laptops (or, for that matter, printed reports!).
Many organizations have standards relating to approved smartphones, tablets, and laptop computers, but most users do not wish to carry two devices -- one for personal use and one for work. Some users will try to connect their own personal devices to their organization's network, and you must educate them on the possible risks. Better yet, accommodate their needs with appropriate solutions.
In general, an organization needs to protect its data (and the privacy of its constituents). Access from mobile devices that can be easily lost or stolen makes that task more difficult. Techniques such as encryption and data wiping (after a specified number of password failures or when a device is reported missing) must be a part of the design, not a feature rushed into place following a major (and frequently costly) data breach. With that in mind, organizations might consider providing -- or subsidizing the cost of -- user-owned mobile devices under the condition that the organization can install appropriate software to remotely disable the device in the event it is lost or stolen.
Data Sources and Data Consistency
One of the major advantages of data warehousing is the ability to extract data from multiple heterogeneous operational systems and third-party data providers and transform it to match organizational data standards including formats, value lists, and units of measure when loaded into the data warehouse. Furthermore, it is desirable to load data into an enterprise data warehouse first and then distribute subsets to other data warehouse platforms. However, complete consistency is often unlikely because the data warehouse environment may include independent data mart platforms and/or allow analyses directly against operational systems whose data elements don't conform to organization standards that were established long after the application was initially deployed.
It is, therefore, extremely important that users understand the limitations of the data on individual data warehouse platforms. At a minimum, use a metadata repository (a master metadata manager?) that describes what data resides where along with any associated limitations. Furthermore, the same queries run against two platforms should produce consistent results, although they might have dramatically different response times. For example, if the data under analysis resides in both an enterprise data warehouse and a data warehouse appliance, the results should be the same (although the appliance might generate them faster).
Unstructured Data
With the growing recognition of the value of data contained in social media sources, an organization's data warehouse strategy should include the ability to accommodate unstructured data.
Data Security
Some data is simply too sensitive or too private to allow unrestricted access to everyone in your organization. For example, someone analyzing salary trends might only be allowed to see yearly average salary values by job code rather than individual salaries. Delivery of the right information to the right people does not imply delivery to everyone. Protective measures should be established to ensure that access is restricted as appropriate.
Although many factors influence the creation of an organization's data warehouse strategy, one of the most important characteristics of a good data warehouse strategy is its ability to respond to the changing needs of the organization. Make sure your data warehousing strategy is flexible enough to accommodate changes.