Gilbane Report logoContent Management Technologies, Trends & Advice

Gilbane San Francisco and Boston banner
Gilbane Reports

The Gilbane Report: Volume 9, Number 4

Open Source Content Management: A Parallel Universe?

May 2001

Download a PDF version of this article

Read the news for this issue.

Untitled Document
Open Source Content Management: A Parallel Universe?

It is typical that "enterprise" software applications originate as highly customized solutions designed and built in-house, often with assistance from integrators or consultants. As word gets out about the benefits similar applications are built, and eventually software vendors build commercial products. As vendor competition increases, more functionality is built into products, prices come down, and some features creep into infrastructure software, further reducing prices and forcing high-end vendors to expand their solutions to maintain business models that require large sales. Enterprise content management will progress through this same cycle. To ensure successful content management strategies it is critical you consider how this cycle will affect your existing and planned implementations.

Open Source software adds another dimension to this complex cycle. Open Source software consists of much more than Linux and Apache. There are all kinds of Open Source software tools available, including Open Source content management systems and databases and application servers that can be used to build content management systems. This month we welcome back Sebastian Holst, who provides advice on how to think about Open Source software in the context of the evolution of enterprise content management software.

Open Source Content Management: A Parallel Universe?

This report will examine the likely impact of Open Source (OS) software on the content management market and introduce some of the early OS-Content Management Systems (OS-CMS).

OS-CMSs are providing viable alternatives to a small but growing number of organizations and their influence is on the rise. Understanding if and when an OS-CMS is appropriate has the potential to save an organization a significant amount of money. Having said that, the rapidly expanding scope of "content management" and the caveats that come with open source software may still temper your enthusiasm.

The open source software approach to development has won legions of supporters, generated millions of lines of code and been the subject of numerous white papers and analyst reports. Content management systems are firmly established as essential building blocks of e-business and other information intensive operations. Yet, the final chapter of these two stories is far from having been written. Interestingly, the interaction between these two phenomena sheds light on the essential characteristics of both.

The Essential Characteristics of Open Source Software

Open Source refers to software distributed under a license that permits free distribution and requires open availability of the source code. Examples of Open Source licenses include: the Gnu Public License (GPL), the BSD license, the MIT X License, the Artistic License, and the IBM Public License. Open Source licenses should meet the Open Source Definition, which is described at http://opensource.org/osd.html. The core requirements include:

Free Redistribution

Good News: This requirement is critical to maintaining a focus on long-term quality by removing the incentive to derive works for immediate sale. But: Developers with high value and proprietary intellectual property are not likely to incorporate their work in this forum.

Source Code License

Good News: Programs cannot easily evolve unless the source code can be enhanced. This ensures that all aspects of an open source product can be enhanced on an as needed basis. But: You have licensed source code.

Derived Works

Good News: You are free to modify, experiment with and enhance your licensed product. But: You might be required to modify, experiment with and enhance your licensed product.

Integrity of The Author's Source Code

Good News: Authors and maintainers can develop solid reputations and set clear expectations as to the scope of their responsibilities. But: Who is qualified to evaluate the quality and reputations of the authors and maintainers?

No Discrimination Against Persons or Groups

Good News: This is intended to be a meritocracy.

No Discrimination Against Fields of Endeavor

Good News: This is designed to ensure that no commercial use can be excluded. The intent is to include, not quarantine commercial developers. There are a number of other clauses that protect against other types of license traps as well.

License Must Not Contaminate Other Software

Good news: Distributors of open-source software have the right to make their own choices about their own software.

Open Source licenses guarantee everyone has the right to read, redistribute, modify, and use the software freely. Under many Open Source licenses modifications of existing software must be distributed under the same license as the original software and all source code to any changes or improvements must be made available to the public as well.

The terms of the Open Source license are crafted to encourage development by loosely affiliated developers rather than a single software vendor. Typically, "project maintainers" who address technical or end-user requirements guide Open Source software development.

Commercial opportunities include the packaging, distribution and support of Open Source software. For example, Linux vendors such as Red Hat, Caldera, SuSE, and others package existing Open Source software and provide a more convenient way for their customers to access and install the software.

Examples of widely used Open Source software

Many of the leading Internet and networking software programs are Open Source software:

  • Linux, the operating system
  • Apache, a Web server
  • Sendmail, a Internet mail server
  • BIND, a Domain Name Server (DNS) platform
  • Samba, a cross-platform networking server
  • Perl, a scripting language
  • Mozilla, the browser from Netscape/AOL
  • MySQL, a database

The important trait that all of these OS programs share is that they are VERY general. There are no wildly successful Open Source molecular modeling software packages or financial derivative trading packages. For the Open Source development method to thrive, large numbers of relatively dedicated developers need to come together over relatively long periods of time. This requirement self selects out specialized or highly complex technologies where the development communities are relatively small. This will prove to be a very important consideration when investigating the growing interest in Open Source content management.

The Essential Characteristics of Content Management

It is important to distinguish between content management systems and content management solutions. A solution combines a system (traditionally this has been synonymous with a content management product) with customizations, the underlying infrastructure and integration with other content-centric systems such as authoring tools, application and syndication servers, and other delivery facilities.

Figure 1 illustrates the relationship between a content management system and a complete content management solution. The software that comprises a content management solution can be placed in one of four categories:

  • The underlying infrastructure is broadly adopted, low cost (on a per user and per transaction basis) and forms an almost invisible platform upon which the rest of the solution stack is based.
  • The content management system: Commercial software that adds significant value to the underlying platform justifying a relatively high cost (on a per user and per transaction basis).
  • Integration services: Custom development, integration and configuration of premium products to bridge the gap between an individual site's requirements and the generic capabilities of the underlying commercial software.
  • Creation and distribution facilities: These are other segments of the content lifecycle that include the creation, distribution and syndication of content outside of the content management solution stack.

Figure 1 also illustrates the natural product feature creep that occurs over time as features migrate from custom extension to premium product and finally to an underlying infrastructure service. For example, the degree to which a DBMS cannibalizes today's CMS functionality and the pace that CMS vendors incorporate the work of professional services into their CMS products can be modeled within this framework.

Product managers inside commercial CMS vendors know that in order to justify a license premium, their offerings must consistently add significant value to the underlying platform (DBMS, application servers etc.). CMS vendors take great care to incorporate the right features at the right pace to assure that significant value-add is preserved even as the underlying infrastructure suppliers cannibalize the features of many of yesterday's CMS offerings.

This is the underlying dynamic that has driven the expansion of the content management systems and it is not unique. Email, Internet chat software, and file management software all debuted as custom software, have been licensed as relatively expensive applications and are now fairly standard components of today's operating systems. The vendors that initially licensed these applications either moved on to other applications or, like the applications themselves, were devoured (typically this can be read as acquired).

As we will see, applications developed under an Open Source Software license live outside of this traditionally closed ecosystem. At the very least, OSS CMSs add a significant new dynamic and are very likely to dramatically impact the behavior of the traditional CMS vendor community.

Feature Creep of Content Management Infrastructures

There are numerous underlying infrastructure components that support a content management system, but the one that most actively cannibalizes the CMS is the DBMS.

Figure 2 charts the evolution of the DBMS as it has expanded its information modeling functionality and its information management functionality. This is a reasonable comparison as the management functions such as security, transaction management, schema evolution, etc. get increasingly difficult as the information model is made more robust. Therefore, DBMS architects often make conscious decisions to simplify one facet of their system in order to extend or optimize the other. It is a useful comparison because it also shows that the market overwhelmingly prefers trusted management functionality over enhanced modeling capabilities. The early relational products were ridiculed by the then dominant network (CODASYL) DBMS vendors (you couldn't even model an organization chart let alone a parts assembly with the relational model!). The buying public decided that elegant modeling could reduce coding efforts and improve design, but management facilities were essential to running a business. Many years later, the early Object-Oriented (OO) DBMS vendors had visions of conquering the then multibillion-dollar DBMS market due their revolutionary information model. Again, the market spoke unambiguously in favor of predictable and scalable platforms.

Of course, consumers don't like to choose, they want it all and the RDBMS vendors have moved to incrementally enhance their modeling capabilities by (you guessed it) cannibalizing many of the most generalizable features of content management. First came enhanced datatypes including data blades and data cartridges from Informix and Oracle respectively. Next came numerous interfaces, filters and data processors that provided persistent storage and various levels of validation of OO-based and XML content. Most recently, the tight bundling of application servers and file system and hierarchical storage management support have been bundled into "the DBMS." These developments have seriously cut into the functionality of the basic CMS system. As such, one can see CMS vendors responding by accelerating enhancements to minimize the importance of these commoditized (low value) capabilities.

Feature Creep of Content Management Systems

Feature creep of premium applications expresses itself as specialization or expansion. Specialization focuses on increasingly complex applications that have traditionally required significant integration services, examples include document workflow, portal management, and customer relationship management. Expansion focuses on moving "up" or "down" the information supply chain incorporating creation, publishing and syndication capabilities as part of a more comprehensive platform.

Figure 3 maps a variety of specialized solutions that are derivative or at least highly dependent upon content management functionality. As leading DBMS systems have incorporated increasing amounts of basic CMS functionality, the premium application vendors have either specialized their content management functionality, e.g. knowledge management or expanded their functionality, e.g. CRM.

The specifics on how these content management variations differ from one another is beyond the scope of this paper. However, what this high level graphic should convey is that content management is a dynamic category where innovations, value propositions and user requirements are evolving in multiple directions simultaneously and at a rapid pace.
In order to succeed, Open Source CMS projects must find a sustainable niche within this very competitive software market segment.

Open Source CMS Projects

Today's Open Source CMS projects are growing in the same "parallel universe" as their other OS project peers. They do not need to buy into the rat race of feverish functional growth to drive market expansion. Rather, they tend to focus on a growing community of users who have relatively stable and well-defined content management requirements. The following are just three good examples of OS-CMSs that are achieving growing acceptance in the marketplace today.

ACS Content Management System

Excerpted from www.arsdigita.com/products/cms
ACS Content Management System (CMS) is a component of the ArsDigita Community System that provides functionality for managing Web site content. ACS CMS allows establishment of clearly defined roles and documented workflow for all forms of web content.

ACS CMS provides tools that facilitate the design, authoring, review, approval, aggregation and deployment of web content, the maintenance, monitoring, revising, and eventual archiving of published content, and reporting and analysis. ACS CMS offers a set of web site content services, including:

  • Type definition
  • Item creation and editing
  • Common storage
  • Associations
  • Versioning
  • Categorization
  • Searching
  • Reporting

The ArsDigita web site lists 60 web sites that have deployed using the open source without paying for support or service and 20 sites that have chosen to pay fees for service and/or support.

Cofax

Excerpted from www.cofax.org
Cofax is a Web-based text and multimedia publication system. It was designed to simplify the presentation of newspapers on the Web and to expedite real-time Web publication. At Knight Ridder, it is serving newspaper content for several of the papers, including the largest papers, the Philadelphia Inquirer and Philadelphia Daily News and smaller papers like the Wilkes-Barre Times Leader.

Cofax CMS is a web based Content Management System used to edit and manage the Online Editions of 16+ Newspapers. Xmultra is a flexible data ingest application that integrates with legacy news publication systems & classifieds systems at the newspapers.

Cofax is a framework of software modules and tools to manage and deliver news content from different sources. It was developed using Java, database, and XML technologies.

The Cofax web site identifies 13 newspaper web sites and 2 other web sites that are powered by Cofax.

Midgard

Excerpted from www.midgard-project.com
Midgard is an Open Source (OS) content management system based on Apache and MySQL. The system uses PHP as its scripting language. While Midgard will always implement an OS development to publishing solution, future releases will include APIs for implementing commercial applications.

Midgard addresses the publishing challenge by providing tools that are based on content organization which creates a content tree that's similar to a Yahoo style directory. Layout is separated from content by a style system that enables HTML developers to focus on site design while writers focus on articles. The SiteGroup (SG) tool separates the single MySQL database into multiple virtual databases and manages access control. When many developers work on one site, each one is assigned membership to a SiteGroup.

Code developers can directly embed Midgard objects; content, users and styles into PHP applications with the Application Programming Interface (API) which makes calls to the Midgard library. Redundant coding is controlled with CodeSnippets which are shareable objects.

The Midgard site identifies 35 sites that are powered by Midgard.

OS-CMS and Content Management Solutions

The previous OS-CMS examples are representative of the kinds of CMS systems that are getting some traction in the marketplace today. Understanding where they fit into the commercial content management landscape is an important step in predicting where they are likely to succeed and the kinds of organizations that might be able to consider an OS-CMS today or in the near term future.

Figure 4 illustrates the impact of the Open Source model on enterprise solution ecosystem. The major changes are:

  • The feature lists highlighted are not as specialized or advanced as those currently being touted by commercial software vendors.
  • The underlying infrastructure is not cannibalizing the CMS layer nor is the CMS layer cannibalizing the integration layer, rather new OS projects build upon - but are distinct from - earlier OS projects. This is a difference that can be directly attributed to the Open Source methodology. The lack of ownership removes the imperative to stretch the market category beyond its natural boundaries.
  • Integration/customization is no longer distinct from the application/system layer - again, a direct result of OS methodology. Customizations and derived works are also open and available to the broader community.
    Implications of the new Open Source "ecosystem"
    The following changes in the nature of content management solutions are likely to flow from the Open Source development approach.
  • The modular components that result from cooperative OS projects as opposed to competitive product development may ultimately result in a solutions architecture that is functionally organized rather than being defined by ownership and licensing terms of particular product suites.
  • Clearer evidence of applicability and fit to specific applications will be generally available due to the distribution of extensions and customizations associated with each deployment. This is contrasted with the divergent terms and conditions of ownership between infrastructure, systems and professional services.
  • While more logically organized and distributed, Open Source CMS systems are likely to lack cutting edge features that may prove highly valuable to many organizations. Whether these are associated with advanced search algorithms, multimedia support, cutting edge encryption, digital rights management or 3D projection onto plasma paper (a future future), organizations that see content management as a competitive advantage will likely require the advanced functionality of the commercially licensed content management system.

Is Your Organization a Good Candidate for an Open Source Solution?

The following list is a minimum set of questions that every organization considering an Open Source CMS should consider.

  • Content management functionality required: this is a fairly straightforward criterion to evaluate. After all, OS solutions are available for inspection and evaluation.
  • Support required: Is there a vendor willing to sell a support contract? Are the terms consistent with the other support contracts your IT organization already has in place? What is their financial viability?
  • Technical skill of in-house staff: Does your organization have individuals in-house or under contract that can evaluate and enhance the areas of the CMS that will require enhancements?
  • Mission criticality: Ask yourself "what if?" What other organizations are "powered by" and do they have the same level of dependence?
  • Anticipated life span of deployment: Most deployments of software solutions do not last more than a couple of years. Is this project an exception? The community that is behind the OS-CMS is not likely to evaporate completely in 12-24 months, but how confident are you that there will be such a community in 60 months?

If you can step through these questions and feel comfortable, then there is certainly no immediate or obvious reason that OS-CMS alternatives should not be considered alongside the traditional CMS vendor list. There is certainly a potential that you may simplify your deployment and save some significant licensing fees.

Open Source Content Management Systems will prove to be a lasting option and a viable one for a small but growing community of users. However, these systems will not prove to be a serious threat to the "old guard". The only vendors that will be threatened are those that fail to acknowledge this viable alternative.


Sebastian Holst
sebastian.holst@artesiatech.com

Subscribe to NewsShark
Content technology industry news without the hype

Email Address:*
First Name:*
Last name*
* = Required Field

RSS/XML Newsfeeds
Industry News
Analyst Blog
Enterprise Search Blog
Content Globalization Blog
XML Technologies & Strategies
Press Releases & Announcements


The Gilbane Report is published by Gilbane Group, Inc. © 1993 - 2005 The Gilbane Report. All Rights Reserved.
Contact | Privacy Polic