Main

March 29, 2008

ePublishing Best Practices

As part of the review I was doing of the eBookWise-1150, I played some with their publishing tools. The device maker, eBook Technologies, Inc. (ETI), has some tools for publishers, and I tried both a batch processing tool and an interactive one. I say "played" with them because I only tried a few things, and there were many features, especially to the interactive tool. The tools looked very solid. I have also played around some with the Kindle Digital Text Platform. I do this to learn the tools, but also to keep myself honest. We advise clients on these devices and also the workflow surrounding eBook creation. Our clients don't expect us to know every bell and whistle, but they do expect us to understand what is possible and not possible.

The more eBooks become attractive options for publishers, the more issues of publishing to multiple formats and platforms become important for publishers. Our experience so far has been that the most typical requirement for publishers is the need to produce eBooks in many different formats and not just one (this despite sensible solutions like IDPF's EPUB format). And they need to do this efficiently. This is a practical reality of the marketplace today as no one eBook format has won the format war, no one channel is dominating sales, and indeed no one channel is typically worth doing on its own. The revenues simply are not there yet. (Indeed, even if you decide that you will only do, say, PDF-based eBooks, the similarities from one channel to the next end with the PDF extension, necessitating technologies like codeMantra's Universal PDF).

Adobe is one of the vendors supporting EPUB, and their Digital Editions developer site has some good resources. They just added an EPUB Best Practices Guide (in, not surprisingly, EPUB format, so you can download Digital Editions if you want to get right to reading it).

March 22, 2008

Apropos of Nothing...

Or at least of nothing about publishing in the strict sense, but the new movie and television site Hulu.com is very impressive. Among other things, it tells me that Flash video may carry the day when all is said and done.

July 31, 2007

DITA and Dynamic Content Delivery

Note: This entry also appeared on the Gilbane CTO Blog.

Have you ever waded through a massive technical manual, desperately searching for the section that actually applied to you? Or have you found yourself performing one search after another, collecting one-by-one the pieces of the answer you need from a mass of documents and web pages? These are all examples of the limitations of static publishing; that is, the limitations of publishing to a wide audience when people’s needs and wants are not all the same. Unfortunately, this classic “one size fits all” approach can end up fitting no one at all.

In the days when print publishing was our only option, and we thought only in terms of producing books, we really had no choice but to mass-distribute information and hope it met most people’s needs. But today, with Web-based technology and new XML standards like DITA, we have other choices.

DITA (Darwin Information Typing Architecture) is the hottest thing to have hit the technical publishing world in a long time. With its topic-based approach to authoring, DITA frees us from the need to think in terms of “books”, and lets us focus on the underlying information. With DITA’s modular, reusable information elements, we can not only publish across different formats and media – but also flexibly recombine information in almost any way we like.

Initial DITA implementations have focused primarily on publishing to pre-defined PDF, HTML and Help formats – that is, on static publishing. But the real promise of DITA lies in supporting dynamic, personalized content delivery. This alternative publishing model – which I’ll call dynamic content delivery – involves “pulling” rather than “pushing” content, based on the needs of each individual user.

In this self-service approach to publishing, end users can assemble their own “books” using two kinds of interfaces (or a hybrid of the two):

o Information Shopping Cart – in which the user browses or searches to choose the content (DITA Topics) that she considers relevant, and then places this information in a shopping cart. When done “shopping”, she can organize her document’s table of contents, select a stylesheet, and automatically publish the result to HTML or PDF.

This approach is appropriate when users are relatively knowledgeable about the content, and where the structure of their output documents can be safely left up to them. Examples include engineering research, e-learning systems, and customer self-service applications.

o Personalization Wizard – in which the user answers a number of pre-set questions in a wizard-like interface, and the appropriate content is automatically extracted to produce a final document in HTML or PDF.

This approach is appropriate for applications that need to produce a personalized but highly standard manual, such as a product installation guide or regulated policy manual. In this scenario, the document structure and stylesheet are typically preset.

In a hybrid interface, we could use a personalization wizard to dynamically assemble required material in a fixed table of contents – but then use the information shopping cart approach to allow the user to add supplementary material. Or, depending on the application, we might do the same thing but assemble the initial table of contents as a suggestion or starting point only. The first method might be appropriate for a user manual; the second might be better for custom textbooks.

Dynamic content delivery is made possible by the kind of topic-based authoring embraced by DITA. A topic is a piece of content that covers a specific subject, has an identifiable purpose, and can stand on its own (i.e., does not require a specific context in order to make sense). Topics don’t start with “as stated above” or end with “as further described below,” and they don’t implicitly refer to other information that isn’t contained within them. In a word, topics are fully reusable, in the sense that they can be used in any context where the information provided by the topic is needed.

The extraction and assembly of relevant topics is made possible by another relatively new standard called XQuery, which is able to both find the right information based on user profiles, filter the results accordingly, and automatically transform results into output formats like HTML or PDF. Of course, this approach is only feasible if the XQuery engine is extremely fast – which led us to build our own dynamic content delivery solution offering around Mark Logic, an XQuery-based content delivery platform optimized for real-time search and transformation.

The dynamic content delivery approach is an answer to the hunger for relevant, personalized information that pervades today’s organizations. Avoiding the pitfalls of the classic “one size fits all” publishing of the past, it instead allows a highly personalized and relevant interaction with “an audience of one.” I invite you to read more about this in a whitepaper I wrote that is available on our website (www.FlatironsSolutions.com).

June 29, 2007

Free or Ambient??

It has been a week since the O’Reilly Tools of Change conference adjourned.
The topics and presentations were provocative and there are sure to be some lively debates continuing on for months to come...

Several of the keynotes centered on the theme that “information wants to be free”.
Chris Anderson, Editor in Chief of Wired and author of the best seller, “The Long Tail”, was one of the early keynoters. He started the debate by announcing that the title of his next book would be “Free” and would focus upon making a case for providing content to consumers for no charge.

Towards the end of the first day, Jimmy Wales, President of the Wikimedia foundation, spoke about his new company called Wikia. His goal is for Wikia to do for the rest of the library what Wikipedia did for the Encyclopedia section and to make the assembled knowledge of the world available to the masses for free. And by the way, he's going to produce a free Google competitor at the same time. (He certainly doesn’t lack for ambition.)

Erin McKean of the Oxford University Press closed the conference with a vivid discussion of “book-shaped objects” and openly questioned whether books were the best information package for the future. As a lexicographer, she weighed in on the “free” debate by saying that it would be better to state that all “information wants to be ambient”. She indicated that in this sense ambient means readily available for use. (Because I always had associated the word ambient with sound, lighting, or atmosphere, I checked several other dictionaries to clarify this sense of ambient. It did not appear. When I went to OED online, I found that their definition of ambient was neither free nor ambient but available for a mere $29.95/month. Because, Erin has yet to post her presentation on the O’Reilly site, you’ll have to trust my memory for this definition.) Her point is well taken; the word “free” is simply to vague. The American Heritage Dictionary (AHD) lists 17 different definitions or senses ranging from “matters of liberty” to “lack of restraint” to “lack of encumbrance” to “provided without consideration or reward”.

Mr. Anderson’s talk seemed to stress the economic meaning of “free”. . Thus, we must assume that he means that information or content should be available without consideration or reward. The popular justification for this approach to content valuation is that because cost of digital distribution is negligible, it is unfair to charge for content or information that is essentially free. If the distribution cost method were used to price traditional book content, the cost of printing, paper and binding would be the determining factor. Of course, that’s not the case. That approach omits many of the costs of publishing content including: reviewing, editing, formatting, proofreading, publicizing, selling, and marketing. And, of course, most authors like to be paid royalties for their work. It also neglects to consider the notion that many readers prefer traditional book formats and therefore many content elements will eventually be published in several media formats.

Mr. Anderson is first to admit, his book entitled "Free" won't actually be free. Okay, the audio book or e-book may be free to those who purchase the printed book.. But if you want to purchase only the e-book or the audio book, you'll be expected to pay for it. Mr. Anderson is also exploring various forms of online books or printed books that would either be sponsored by corporations or supported by (gasp) advertising. While some of these offerings might be free to the reader, it seems that there will be considerations and rewards built somewhere into this project.

He went on to say that he might well be in favor of making the book free because of the publicity benefits to his consulting practice, speaking fees, and the promotion value to his “personal brand”. However, he felt that his publisher would likely object because they are in the business of generating sales and revenues from selling books. (After the session, I heard several people suggest that Mr. Anderson could self-publish his new work and then he would be free to make it free).

Based upon his work at Wikipedia, Mr. Wales could be seen as a developer of truly free content. The governing organization is a charitable foundation and all of the authors are volunteers. The costs of supporting the staff and technology infrastructure are paid by donations. However, during his keynote, Mr. Wales was quick to point out two major differences between Wikipedia and Wikia. The scope of Wikia is much broader and it has been organized as a for profit entity.

Wikipedia represents the “perfect storm” for a collaborative work or “peering”. According to Wikinomics by Tapscott and Williams, there are three factors necessary to make peering effective: 1) the object of production is information or culture which keeps the cost of participation low for contributors; 2) Tasks can be chunked out into bite-sized pieces that individuals can contribute in small increments and independent of other producers. This makes their overall investment of time and energy minimal in relation to the benefits they receive in return; 3) The costs of integrating those pieces into a finished end product, including the leadership and quality control mechanisms must be low.”

While these criteria would apply to a number of other works found in a library, there are many others (novels, monographs, complex texts, dissertations, etc) that don’t meet these criteria well at all and aren’t likely to be challenged by collaboratative works. The costs associated with “wikiing” the rest of the library would likely be enormous. Mr. Wales seems to be banking on advertising dollars to support this effort. If this were the model, these works might be considered free from a consumer pricing perspective but the advertising revenues and potential profits would have to be deemed consideration in economic terms.

One also wonders whether the volunteer authorship model is extensible. I can see how it would work when the author is writing about a topic that is intensely interesting ( as I am doing at this moment). However, a great deal of content is more the result of craftsmanship than inspiration. It seems unlikely that volunteers could be recruited to write “drudge works”.
And as appealing as it may be to write essays or modules on interesting topics for free, I for one would enjoy it even more if I received some consideration for my hard work. (no wise cracks please). As popular as Wikipedia is today, someday it too may face a challenge from some organization (i.e. Google or Microsoft) that might add new features or devise a revenue sharing model that provides authors with incentives.

I guess my point is that an author’s ideas and created content are products. And like other products they should have a value proposition and a go-to-market strategy. The valuation/pricing options might include: purchase, subscription, sponsorship, syndication, promotion, and free. Depending on the utility, creativity, entertainment value, uniqueness of content, etc of content, one or more of the above models might be appropriate. While ‘free’ is one option, why should it be the preferred choice? Talented people like Mssrs. Anderson and Wales create content that is very good and are entitled to receive proper consideration if they so desire. Jeff Patterson, CEO of Safari Books Online, had some fascinating data on this topic. He studied how their customers for largely technical information value their information products vs. other content some of which are "free". He found that some quality content was indeed free. However, most "free" content was either advertising supported or was offered in exchange for specific information about the content consumer or their work projects. He asked customers about the tradeoffs that they were willing to make to obtain content for free. If I recall correctly, about ¼ of their customers would rather pay for content if the advertising was inappropriate and/or distracting. !/3 of their customers would rather pay for content than reveal any personal information other that name and e-mail address. And 2/3 of their customers would rather pay for content than reveal in depth information about a project that they were working on. (Patterson’s presentation was one of the best—I hope that he posts his slides)

Advocating for universally free content might indeed have the unwanted effect of reducing the amount of excellent content that is created by professional authors who depend on their writing as their livelihood. I think that Bruce Chizen, Adobe’s CEO, summed it up nicely when asked by Tim O’Reilly where he stood in this debate. He said that Adobe makes many large investments of human and financial capital in the inventions and products that they produce. While he is all in favor of providing some of their intellectual property to consumers, standards organizations, and society for free, he reserves the right to make the decision as to what should be free. I’m sure that his stakeholders support that position.

I’ll conclude this entry with some thoughts on the provocative concept of ambient information. For many years, content was created for a single purpose and was closely regulated against peripheral usages. With the advent of the digital era, came significant opportunities for deriving additional value from content. I will forever be grateful to the senior management of Houghton Mifflin Company who saw the wisdom of freeing the content of their dictionary from exclusively “booklike objects” and allowed linguists and software engineers to build spelling correction technology. In fact, most of the English language spelling technology that is used today was derived from their American Heritage Dictionary database. And as I described in an earlier Blog entry, Thomson’s retiring CEO Richard Harrington made information ambience central to their core strategy and judging from their financial statements, they are receiving significant consideration for their efforts!!

Making content accessible to inform, educate, and entertain people is a worthy goal, Making content more ambient will offer content creators and publishers many new opportunities to publicize their work, create goodwill, answer new questions, solve difficult problems, and the option to generate new income streams that are appropriate and commensurate with the value of the content.

June 1, 2007

HP Unveils Print 2.0 - a New Era for Printing

The title of this blog entry is the title of HP's press release on the same topic. On May 30th in New York Hewlett-Packard announced "Print 2.0" at the company’s annual Imaging and Printing Conference.

According to the press material: "Vyomesh Joshi, executive vice president of the company's Imaging and Printing Group, described how HP will seek to capture a significant share of the 53 trillion digital pages estimated to be printed in 2010 alone – an opportunity valued at more than $296 billion.

"Joshi identified three key areas of focus of the Print 2.0 strategy:

● Make it easier to print from websites, such as blogs and travel sites, and bring new printing capabilities to online properties;
● Extend the company's digital content creation and publishing platforms - for example, Snapfish and Logoworks - across customer segments spanning from consumers to enterprises; and
● Deliver a digital printing platform that increases print speeds and lowers the cost of printing for high-volume commercial markets."

There might be something a little different in HP's strategy here. I’m still not sure. There was a ton of video recorded at the event, and it’s easily accessible on HP's site. I didn't start with the videos of the executive presentations, but instead with the presentation by John Battelle. Battelle has had a unique (and eventful) relationship to both print and electronic media, making him a uniquely fascinating speaker on where publishing is headed. The third video, of his Q&A session, is particularly revealing of his unique perspective on the evolving relationship of print to electronic media.

After watching Battelle, I'm willing to spend more time here, even listening to the executive presentations. HP may be onto something.

May 19, 2007

Thomson Learning-- What's next??

Earlier this year, I wrote that the announcement that Thomson Learning was for sale was an indictment of the current fundamentals of most learning market segments. From the perspective of Thomson senior management, the decision was to divest seems clear cut. Consider this comparative financial data:

Thomson Learning All Other Thomson Units
Organic Growth 4.0% 6.0%
Adj Ebitda 24.5% 29.2%
Operating Margin 12.9% 18.9%
Electronic Revenues 36.0% 80.0%
Recurring Revenues 24.0% 82.0%

(Source Thomson 4th Q Investor Presentation)

The percentages of electronic and recurring revenues are particularly at odds with CEO Harrington's goal of integrating Thomson's content with their customer's work flows. After examining this data combined with declining unit volumes, growing price resistance, and increased government regulation, one wonders what motivated the private equity firms to pay the lofty multiples described in Thad McIlroy's excellent post earlier this week.

Perhaps, they see the opportunity to create more new products that will blend content and technology to add value to the student's learning experience. Vivid simulations and multimedia can help bring clarity to the explication of complex topics. Linking the appropriate content to solving problems improves student understanding while saving them lots of time and frustration. Making texts searchable and providing fresh links to appropriate Internet sites brings life and exploration opportunities to static textbook content.

Transitioning from a reliance on the sale of books and specific ancillary items to an intellectual property licensing model that is based upon usage metrics and attributes value to all aspects of course package (including the many package elements currently provided to faculty at no cost) would enable profound changes to the income statement. Revision cycles could be lengthened, sampling and selling costs reduced, and the percentage of recurring revenue increased substantially.

For several years, the potential of such changes have been obvious to industry executives and observers. Why then would the new owners be better able to institute these changes and transitions? The answer is simple, the short term costs of technology investments coupled with the transition to a recurring model would produce some "difficult quarters" for a publicly traded company. The opportunity to retool and restructure while private could create a company that would have excellent recurring revenues and better margins when reintroduced to public markets in a few years.

Should Thomson (and possibly Houghton-Mifflin) adopt this strategy, the impact on the rest of the industry could be profound. However, if these changes were to take place, authors, students, universities, and the publishing companies would eventually all be winners! Here's hoping that this deal lends impetus to this industry transition.

May 15, 2007

A New eCollegey in Higher Ed Publishing??

Pearson made an interesting acquisition yesterday. Their acquisition of eCollege continues their corporate foray into Student Information Systems and Course Management. Last year, Pearson acquired PowerSchool and Chancery Software yielding a very strong position in Student Information Systems for the K-12 market. Clearly, they like these learning infrastructure markets for several good reasons.
1. At present, they seem to be solid businesses with only a few competitors that are poised to grow at rates exceeding their traditional textbook businesses.
2. The acquired customer base brings them many new customers and brings them closer to the students (and parents) who use their instructional products. The information about these students and the ability to reach them with additional product offerings is not to be underestimated in this digital world.
3. As the range of course materials such as content modules, learning software, simulations, educational websites, etc. continues to grow, the value of the course infrastructure technology will increase as well as provide a strategic advantage for integration with their broad range of course materials.

Last week at the Digital Book conference in New York, several speakers agreed that college textbook publishers will look more and more like software publishers over the next ten years. The reasons for this transition will center on using technology to: 1. deliver appropriate content to the student when it is needed to solve homework problems and prepare for tests; 2. integrate traditional material with innovative simulations and learning modules available from communities like MERLOT; 3. add life to static published content by enabling further exploration via web links and domain specific search engines and content repositories.

Pearson is wise to acquire successful software and technology companies to give them the pockets of technical expertise that would take many years to develop within the company. While there may be some culture clashes, this strategy should serve Pearson well and position them to maintain or expand their leadership position in educational publishing.

Thomson Learning Sold for Big Bucks!

Well, Thomson Learning has finally been sold (subject to rote “due diligence”) to private equity firms. Everyone figured it would be private equity firms that would make the purchase, partly because these firms are buying just about everything these days except your old underwear, and also because the higher education textbook market is so concentrated that even George Bush’s “I’ve never seen a merger I didn’t like” administration would have had trouble fobbing this one off. Too many children would have been left behind.

The big surprise was the price. A whopping $7.75 billion, over 3 times the annual sales of the division, and apparently roughly 15 times cash flow (see http://www.canada.com/nationalpost/financialpost/story.html?id=42ad804c-38b3-4017-9a6b-5c5073515cc5&k=78619). The same article points out that “by comparison, the average cash flow multiple paid in leveraged buyouts of $500 million or more last year was around eight times cash flow, with media deals typically in the low-double digits, according to buyout industry statistics.” The price is also some 50% more than company officials originally stated they thought they could fob the division off for.

Would we say there’s a little too much cash out there looking for comfy homes? Or would we wonder why this Thomson division, much maligned by management when the sale was first announced, is suddenly as valuable as DaimlerChrysler? (http://www.nytimes.com/2007/05/15/automobiles/15chrysler-web.html)

I guess we’re stuck with Thomson’s overriding stated view that higher education just wasn’t getting with the program fast enough in an online, electronic sort of way, and so the division had to be jettisoned. (Although Thomson CEO Richard J. Harrington admitted after the sale announcement that the company had no complaints about the educational unit’s financial performance. Textbooks are, by and large, a high-margin product (http://www.nytimes.com/2007/05/12/business/worldbusiness/12deal.html)).

On the other hand, memory serves to remind us that Thomson was previously determined in a fierce way to get the heck out of the news business, and now it’s about to merge with Reuters.
What I’m most cognizant of is that Thomson shares had been languishing in the mid-$30s for years before the announcement of the bold move to get rid of textbooks. Now those shares are in the $40s. A lot of senior Thomson executives have made a whole lot of cash from these recent maneuvers (not to mention the Thomson family). No senior Thomson executive was left behind (as for the the operating staff; it is not polite to ask).

(To glimpse the stock chart: http://thefutureofpublishing.com/index.html)

April 21, 2007

The Negroponte Laptop

There has been much speculation and lots of quick judgments on the Negroponte laptop project. I've found all of the articles and blogs I've seen to be so devoid of factual matter, while so packed with unsubstantiated opinion, that I've found it impossible to form any kind of judgment of my own on the topic.

Obviously addressing the huge problem of literacy in the third-world is essential to a bright future for publishing. Will this be part of the solution?

Read a truly excellent article here:
http://www.spectrum.ieee.org/apr07/4985

It's that rara avis -- an article full of both technical, social and political insight, and extremely balanced in presenting its ideas and information. Fascinating and provocative.

April 10, 2007

A Kodak Moment

Last month,I had the privilege of being a guest lecturer at MIT for Howard Anderson and Peter Kurzina's Course entitled "Managing in Crisis". I prepared a case study about the current status of the College Publishing market. It included concerns about price pressures, used book competition, channel issues, new competitors and new media requirements, pending legislation about pricing practices, and the continued lack in the growth of unit sales.

The class was composed of 40 or so very bright students who did an excellent job analyzing the case. One student really captured the essence of the case when she said that it reminded her of the photography industry as the major players struggle with the transition from traditional film and paper products to digital photography.

While Kodak moments have long been associated with joyful celebrations, the aforementioned transition has been anything but a celebration. In fact, it is likely that this Kodak moment will come to exemplify the struggles of a powerful corporation as they strive and perhaps even fail due their inability to recognize the customer benefits, opportunities, and challenges associated with non-traditional types of photographic media.

Many publishers are struggling with their own "Kodak Moments". And I think that the transition issues are similar to the photography industry. First, Kodak seems to have been focused on the products that people had been buying for years and trying to preserve the advantages that their film and paper technologies provided. Like many of us who have had market leading products, they were arrogant about their technology, processes, market position and quality. While they clearly were aware of digital photography technology, they dismissed digital products because the image quality was significantly poorer. Then they concentrated on turning digital photos into traditional photos. It seems to met hat they missed
the potential in offering less expensive digital images that could easily be posted on the Internet or e-mailed to relatives.

For many years, publishers offerings have been closely related to technological developments in the Software and Printing industries. For example, software has enabled improvements in authoring, composition, and thereby lowered the costs of elegant or complex page designs. Printing Technology has made four color printing much more affordable and had made shorter print runs economical. These changes have been passed along to consumers of information. (While I understand that there are differences between the terms: content, information, and intellectual property, I will use the term information to subsume all three terms for purposes of brevity) In many cases, they have added value to the customer's experience but there are cases where formats were enhanced and color offered because they could be rather than because they were beneficial. In reality, I believe that the net result was that the size, format, and frequency of the traditional economical delivery unit EDU (my term) of information (a book, journal, or magazine) were modified by technology advances but the traditional media form has remained essentially the same for 100+ years.

The Internet has presented publishers with a radical paradigm shift ( I don't like the term either). All types of publishing entities have had to deal with changes in customer expectations that are easily as profound as those experienced by the Photography industry. They don't just want their information to be more timely and less expensive, they also want their information to concisely answer their questions and seamlessly integrate with their work flows or learning styles.

Perhaps the most significant change is the redefinition of the EDU. In the purely print era, there needed to be a certain mass of information to build a product that would be economical to print and sufficiently valuable to consumers to generate a profit. In manycases, it was assumed that relatively few information consumers would use all of the information that was presented in a single EDU. Rather, the scope of the information (or content) had to be broad enough to attract enough customers without being so broad as to make customers feel that they were procuring too much information that wasn't pertinent to their interests. Hence, we have witnessed a generation of books where authors and market researchers work closely together.

In the digital world, authors and publishers are potentially freed from the strictures of printing economies. Therefore, information currently found in textbooks, references, magazines and journals can be rendered as as short information objects or more comprehensive content modules. Or publishers can produce information objects or content modules that are not anticipated to ever take book form. The objects can be delivered in many ways including search engines such as Google. These new EDUs can be purchased or licenses separately or mixed and matched to create a course of instruction or a personal reference work. One benefit of these nimbler EDUs is that they blend nicely with software to offer increased value in the form of better instruction or more productive work flows.

The availability of these more compact EDUs will likely spawn many debates concerning academic traditions and learning methodologies that we have come to hold dear. It has long been the practice for students to read and master significant quantities of information with the expectation that many of the specific facts will fade from memory leaving a general understanding of the topic. And many people are considered well read because they have plowed through many traditional EDUs (Books) The question will be: Could one become well educated or well read by learning to explore topics of interest through smaller EDUs and/or what blend of contextual and specific information delivers the best and most productive intellectual outcome. There will be some interesting face-offs between technology enabled active exploration and discovery of information to allow students to pursue topics that they find interesting vs. the more structured mastery of a set of information presented in book form. Of course, it is not an either/or proposition as they methods must eventually be blended to enable meaningful knowledge acquisition.

The digital world has also created a demand for information that is developed and delivered as rapidly as possible. Where traditional publishers often justified their value by guaranteeing the accuracy and authority of their published information, many of today's information consumers are willing to trade authority for velocity of information and now rely upon other information consumers to tell them what information is the most accurate and useful. Individuals now actively participate in communities that generate and evaluate large quantities of information objects and content modules. The Wikipedia/media organization and the MERLOT community are excellent examples of communities that produce quality information modules.

While many information consumers may still prefer to consume their information in print form, they may now wish to print their own copies or to create and purchase custom versions produced by rapidly improving print on demand technology. To many publishers, the perfectly formatted page has become almost an art form. They consider those pages to have many of the same aesthetic values that Kodak attributed to images produced via their traditional film technologies. Because customers rarely have the choice of formats, it is difficult to gauge the value that they derive from "perfect pages" vs potentially less expensive simpler pages. Chip Pettibone of O'Reilly Publishing reported at the recent Gilbane Conference that when readers of e-books were offered the choice between a simple HTML design and a faithful rendition of the original book page, 50% chose the HTML version and that population seemed to be gaining in numbers. Because books and computer screens represent quite different form factors, the value of the perfect page can actually limit rather that enhance the effective presentation of information in digital formats. Therefore, rather than trying to maintain the integrity of the printed page, modern publishers are designing their content to be presented equally well in a variety of media forms. Publishers that cling to the page metaphor are putting their futures in jeopardy.

This paradigm shift is replete with challenges and opportunities. Many traditional reference products (including Microsoft's Encarta) have been decimated by new products created in the Internet Era. Newspapers and magazines have had to adapt to the challenges or multiple media environments by creating online products as they have seen their traditional readership dwindle. Journal publishers have had to derive new models to serve their subscriber base. Many categories of trade books now include websites with fancy multimedia elements and discussion groups. I think that some of the most exciting and interesting challenges and opportunities will be found in the world of educational publishing. As witnessed by the decisions of major publishing conglomerates to divest their educational publishing operations, the challenges of mastering the Internet paradigm shift are both daunting and expensive. To succeed, new generations of products will need to be built to take advantage of technology as opposed to being web versions of existing products. Business models will need to be revised and channel strategies re-engineered. One important outcome will be increased information accessibility for readers and learners with disabilities.

Over the next few years, the publishing industry will witness many Kodak moments.... Hopefully the majority will be the old fashioned Kodak moments of victory and celebration.

March 24, 2007

Is Adobe Really Going to Mars?

I’m becoming concerned whether Adobe is really serious about Mars. My evidence:

1. The FAQ has not be updated since 27 Oct 2006.

2. At the end of the FAQ it reads:
Q: When will the Mars format be frozen for 1.0?
A: A date for this has not yet been set.

Q: When will the Mars plug-in be available?
A: It is planned to be available before the end of the year.

This all seems very tentative.

As Joe Wilcox observes here: http://www.microsoft-watch.com/content/developer/adobe_goes_to_mars.html?kc=MWRSS02129TX1K0000535

“Adobe’s competitive response to XPS makes sense. PDF's heritage predates the populist Web, and Adobe created the format for the purpose of mimicking paper documents. In the 21st century, however, digital documents are often containers that likely will never be printed. Paper’s relevance — and so the need to mimic — has greatly diminished.”

All of this is, on the surface, true (except perhaps the “makes sense” part). Does tossing high-fidelity page-oriented PDF into an XML container really address this issue?

I think even more significant is Adobe’s clearly stated, and obviously honestly intended, design to make PDF an ISO standard. My cynical blog entry is here: http://gilbane.com/publishing_blog/2007/01/the_pdf_iso_standard.html. But, as Adobe and others point out, subsets of PDF, very useful subsets I’d say, are already ISO standards, including “Several trade-specific subsets of the PDF spec are either ISO approved or in the approval process, including PDF/X for printers, PDF/E for engineers, PDF/A for archivists and PDF/UA for making documents compliant with Section 508 regulations.” (http://www.eweek.com/article2/0,1895,2088277,00.asp).

These aspects of PDF, including, of course, the entire spec, only serve to “encrypt” as standards that which makes PDF uniquely the “Portable Document Format.” PDF as an XPS competitor is not uniquely PDF, in the historic sense of the format, and with the entire PDF spec submitted to ISO, Mars needs to succeed within this process. Do we really think it can?

I think that Adobe is far more interested in Apollo (http://labs.adobe.com/technologies/apollo/). Although this is a very different beast than Mars, I believe Adobe knows that its future lies much moreso with this kind of technology than it does on the not-very-hospitable planet Mars.

March 21, 2007

We Are Smarter Than Me-- Report

Last fall, Martin Clifford-CEO of the web community juggernaut Wis.dm, informed me that I was hopelessly out of date regarding the phenomena of web communities and hinted that due to my advanced years I might never comprehend the impact of many-to-many publishing. It's true that most of my experience is in traditional forms of one-to-many publishing. However, I've always loved a good challenge so I began my exploration of the role of communities in the creation of content.. Early in my explorations, I came across the We Are Smarter Than Me project. This project is the joint effort of Pearson Educational Publishing, Wharton, MIT, and Shared Insights. The goal was to form a community that would write a book about how communities could change and enhance the way that companies do business. I tuned into the "Buzz" to get a sense of the passion of the participants And then, I joined the community and contributed a small section on the importance of word-of-mouth in the marketing of services. As the project progressed, I watched its progress and waited eagerly to see what would happen when the many-to-many model was invoked to produce a traditional business book.

To hear first hand accounts of the project, I travelled to the Community 2.0 conference in Las Vegas. Barry Libert of Shared Insights and Tim Moore of Pearson Educational Publishing presented a fascinating progress report and a conversation with co-founder Jon Spector (soon to be CEO of the Conference Board) filled in some additional information.

The participants are to be congratulated for commissioning the project as a pure experiment. As Mr. Moore said, "I just wanted to see what would happen" As one might imagine, the interaction between web communities and large esteemed institutions presented some interesting challenges. Not surprisingly, the first significant issue arose when Pearson faxed their contract to Shared Insights. While the contract was entirely appropriate for traditional author teams, indemnification clauses took on entirely new meaning when the work of hundreds or thousands of author/contributors would be scrutinized. The prolonged wrangling broke the project's early momentum. It was assumed that the Academic Dream Team of Pearson's business authors and the faculties of Wharton and MIT would produce numerous thoughtfully written content modules. Surprisingly, none of the authors or profs chose to participate in the project. The project team reverted to Plan B by sending participation invitations to a large list of people affiliated with the sponsoring institutions. The response was enthusiastic and the community began to grow. Current membership is approximately 3500 with 650 individual wiki posts.

As the active participation increased, the project team learned another important lesson. Suddenly the community wanted to take over the project leadership and asked the project team to step aside. Even though the project team knew alot about community dynamics, they weren't ready for their own community to be so assertive and found it difficult to relinquish control. When they did step back, the community flourished.

How did the book by community turn out? One speaker reported that the journey was more interesting than the destination meaning that the content created was plentiful but uneven in quality and style. To yield an acceptable business book, it would be necessary to hire an accomplished professional author who would also handle the fact checking process.

That are the open questions and lessons learned from this project.

1. Why didn't the authors and professors participate?
Possible explanations included:
Generation Gap- Authors and profs didn't grow up with MySpace or Facebook. Web Communities are foreign to their professional milieu.
Status Issues- They are used to being the authority and weren't willing to have their writings publicly challenged. And they have already made their reputation so that they have little status to gain.
No Financial Benefit- Their time is very valuable and they expect to be paid for their efforts.
Lack of Passion or Connection with the Project- Community participation is not their avocation nor were they passionate about the topic.
Those that did participate did so out of a passion for the topic and seemed most motivated by the opportunity to build their reputation within the community. For many members, community participation is one of their hobbies. And they seemed not to desire any remuneration for their contributions.
Observation- Just like in the early days of the Internet, there is currently more cache attached to eyeballs and recognition than to traditional financial rewards. However, there are significant costs to forming, hosting and moderating communities. And the work of cummunities can be very valuable to companies of all sorts. New business models are emerging that will manage the costs and reflect the value of the contributions.

2. Given the uneven content and need to bring in a professional author, should anyone even try to write another book by committee?
It depends on the type of Book!! Wikipedia has demonstrated that this model is very effective in creating a comprehensive reference work. ( I suppose that some purists would argue that Wikipedia isn't really a book but rather a collection of content modules), For traditional authored book projects, communities might play a valuable role in helping authors research topics that are outside of their primary expertise and in reviewing the authors work for accuracy and clarity.

3. Will there be instances where community created content modules will compete with traditional published works?

Given the Google world that we live in, consumers of information often seek a terse answer to a specific question. And there is a definite trend towards the integration of content with the information consumers' workflows. For these information consumers, A well structured repository of content modules is potentially more valuable than traditional books.

5, So was the project aiming at the wrong goal?

Perhaps! Old habits die hard and many people in my generation have books to thank for alot of their professional knowledge. Maybe the goal of the project should have been to develop an outstanding repository of content modules and resources that could become an authorative source of information about communities and their role in changing and enhancing the ways that companies do business. In the long run, the mission critical task is creating outstanding intellectual property. Creating multiple media versions of that IP will allow publishers to reach a wider range of customers.

6. Will the many-to-many content model put traditional publishers out of business?

There is much more opportunity than risk for publishers.
Most of us would agree that we already suffer from information overload. Communities have the potential to raise that overload to an even higher level. Information consumers want to know that the content they are reading is accurate and authoritative. This has been the primary domain of publishers for many years. If publishers find new ways to harness the wisdom of crowds in creating new content and improving existing content, their future is bright. If not, someone else will seize the opportunity. And if they trivialize new methods of content creation as being less pure and authoritative than their time-tested editorial processes, they will face serious consequences. If you're not convinced, just ask your favorite encyclopedia publisher!!

That is my report on many-to-many versus one-to-many content creation models. Now I'm trying to figure out whether the few-to-few model refers to custom publishing or to an underperforming web community.

..


March 7, 2007

Major Annoucement Coming Soon

Are you interested in the future of publishing? Stay tuned to this Blog for a major announcement within the next few weeks... This will be a major theme of the Publishing Track at this spring's Gilbane Conference in San Francisco April 10-12.