Curated for content, computing, and digital experience professionals

Category: Publishing & media (Page 38 of 52)

New Enterprise Search Engine Specializes In Engineering Documents

Elmo Solutions, experienced in the extraction and processing of CAD and engineering metadata, announced the official launch of Agni Enterprise Search 2008. Agni Enterprise Search is an enterprise search solution designed to meet the distinct needs of CAD, R&D, Engineering and IT managers, end-users and senior managers. Agni Enterprise Search 2008 allows CAD and non-CAD users to index and retrieve a wide range of document formats, including AutoCAD, Autodesk Inventor, Office, eMail, Acrobat, image formats, SolidWorks, Lotus Notes, Visio, etc. Agni Enterprise Search 2008 was designed specifically to maximize document usability and reusability. Agni Enterprise Search allows not only free-form keyword-based searches, but also structured-form metadata-based searches as well. Agni Enterprise Search 2008 features include– Display of document thumbnails, Stemming, Phonic searches, and a query language. The software ships in English and French, and is available as a free, fully functional, 30-day trial license for download directly from the Elmo Website. http://www.elmosolutions.com/

DITA and Dynamic Content Delivery

Have you ever waded through a massive technical manual, desperately searching for the section that actually applied to you? Or have you found yourself performing one search after another, collecting one-by-one the pieces of the answer you need from a mass of documents and web pages? These are all examples of the limitations of static publishing; that is, the limitations of publishing to a wide audience when people’s needs and wants are not all the same. Unfortunately, this classic “one size fits all” approach can end up fitting no one at all.

In the days when print publishing was our only option, and we thought only in terms of producing books, we really had no choice but to mass-distribute information and hope it met most people’s needs. But today, with Web-based technology and new XML standards like DITA, we have other choices.

DITA (Darwin Information Typing Architecture) is the hottest thing to have hit the technical publishing world in a long time. With its topic-based approach to authoring, DITA frees us from the need to think in terms of “books”, and lets us focus on the underlying information. With DITA’s modular, reusable information elements, we can not only publish across different formats and media – but also flexibly recombine information in almost any way we like.

Initial DITA implementations have focused primarily on publishing to pre-defined PDF, HTML and Help formats – that is, on static publishing. But the real promise of DITA lies in supporting dynamic, personalized content delivery. This alternative publishing model – which I’ll call dynamic content delivery – involves “pulling” rather than “pushing” content, based on the needs of each individual user.
In this self-service approach to publishing, end users can assemble their own “books” using two kinds of interfaces (or a hybrid of the two):

  • Information Shopping Cart – in which the user browses or searches to choose the content (DITA Topics) that she considers relevant, and then places this information in a shopping cart. When done “shopping”, she can organize her document’s table of contents, select a stylesheet, and automatically publish the result to HTML or PDF.
    This approach is appropriate when users are relatively knowledgeable about the content, and where the structure of their output documents can be safely left up to them. Examples include engineering research, e-learning systems, and customer self-service applications.
  • Personalization Wizard – in which the user answers a number of pre-set questions in a wizard-like interface, and the appropriate content is automatically extracted to produce a final document in HTML or PDF. This approach is appropriate for applications that need to produce a personalized but highly standard manual, such as a product installation guide or regulated policy manual. In this scenario, the document structure and stylesheet are typically preset.

In a hybrid interface, we could use a personalization wizard to dynamically assemble required material in a fixed table of contents – but then use the information shopping cart approach to allow the user to add supplementary material. Or, depending on the application, we might do the same thing but assemble the initial table of contents as a suggestion or starting point only. The first method might be appropriate for a user manual; the second might be better for custom textbooks.

Dynamic content delivery is made possible by the kind of topic-based authoring embraced by DITA. A topic is a piece of content that covers a specific subject, has an identifiable purpose, and can stand on its own (i.e., does not require a specific context in order to make sense). Topics don’t start with “as stated above” or end with “as further described below,” and they don’t implicitly refer to other information that isn’t contained within them. In a word, topics are fully reusable, in the sense that they can be used in any context where the information provided by the topic is needed.

The extraction and assembly of relevant topics is made possible by another relatively new standard called XQuery, which is able to both find the right information based on user profiles, filter the results accordingly, and automatically transform results into output formats like HTML or PDF. Of course, this approach is only feasible if the XQuery engine is extremely fast – which led us to build our own dynamic content delivery solution offering around Mark Logic, an XQuery-based content delivery platform optimized for real-time search and transformation.

The dynamic content delivery approach is an answer to the hunger for relevant, personalized information that pervades today’s organizations. Avoiding the pitfalls of the classic “one size fits all” publishing of the past, it instead allows a highly personalized and relevant interaction with “an audience of one.” I invite you to read more about this in a whitepaper I wrote that is available on our website (www.FlatironsSolutions.com).

Atom is Finished

Tim Bray reports Atom 1.0 is complete. Atom is the XML app that is meant to be a better RSS. We have been using both in our newsfeeds for awhile, sometimes switching depending on various newsreader support issues, and have been reluctant to completely switch from RSS. But this will edge us closer.

Adobe Introduces Captivate 3

Adobe Systems Incorporated (Nasdaq:ADBE) announced Adobe Captivate 3 software, an eLearning authoring tool for the delivery of computer-based simulations, scenario-based training, and interactive quizzes. Adobe Captivate 3 offers enhanced features including multi-mode recording, rerecording, new Microsoft Powerpoint import capabilities and support for rich media formats, such as Adobe Flash Player compatible .SWF, MP3, and AVI files. Adobe Captivate 3 allows learning professionals to create software training in a simple screen capture session. The screen capture generates multiple learner modes, including demonstrations with mouse movements and automated text descriptions of each recorded task, practice simulations with instructional automated or customized feedback, and assessments with scored user interactions. The enhanced Microsoft PowerPoint import functionality supports the conversion of slide animations into Flash Player compatible SWF format and allows authors to create interactive presentations incorporating audio and video. Adobe Captivate 3 software automatically generates Adobe Flash Player-compatible content, allowing files to be e-mailed, posted to Web sites, intranet sites, and online help systems. The new XML export and import feature simplifies the localization process of projects by exporting captions to the XML Localization Interchange File Format (XLIFF). Adobe Captivate 3 will be available for Microsoft Windows XP, Windows 2000 and Windows Vista and is expected to ship in August 2007 or later this summer at an estimated price of US$699. Localized versions in French, German, Japanese, Italian and Spanish are expected to be available in September 2007. Users of Macromedia Captivate 1 and Adobe Captivate 2 can upgrade to Adobe Captivate 3 for an estimated price of US$299. http://www.adobe.com/go/captivate

Free or Ambient?

It has been a week since the O’Reilly Tools of Change conference adjourned. The topics and presentations were provocative and there are sure to be some lively debates continuing on for months to come…

Several of the keynotes centered on the theme that “information wants to be free”.

Chris Anderson, Editor in Chief of Wired and author of the best seller, “The Long Tail”, was one of the early keynoters. He started the debate by announcing that the title of his next book would be “Free” and would focus upon making a case for providing content to consumers for no charge.

Towards the end of the first day, Jimmy Wales, President of the Wikimedia foundation, spoke about his new company called Wikia. His goal is for Wikia to do for the rest of the library what Wikipedia did for the Encyclopedia section and to make the assembled knowledge of the world available to the masses for free. And by the way, he’s going to produce a free Google competitor at the same time. (He certainly doesn’t lack for ambition.)

Erin McKean of the Oxford University Press closed the conference with a vivid discussion of “book-shaped objects” and openly questioned whether books were the best information package for the future. As a lexicographer, she weighed in on the “free” debate by saying that it would be better to state that all “information wants to be ambient”. She indicated that in this sense ambient means readily available for use. (Because I always had associated the word ambient with sound, lighting, or atmosphere, I checked several other dictionaries to clarify this sense of ambient. It did not appear. When I went to OED online, I found that their definition of ambient was neither free nor ambient but available for a mere $29.95/month. Because, Erin has yet to post her presentation on the O’Reilly site, you’ll have to trust my memory for this definition.) Her point is well taken; the word “free” is simply to vague. The American Heritage Dictionary (AHD) lists 17 different definitions or senses ranging from “matters of liberty” to “lack of restraint” to “lack of encumbrance” to “provided without consideration or reward”.

Mr. Anderson’s talk seemed to stress the economic meaning of “free”. . Thus, we must assume that he means that information or content should be available without consideration or reward. The popular justification for this approach to content valuation is that because cost of digital distribution is negligible, it is unfair to charge for content or information that is essentially free. If the distribution cost method were used to price traditional book content, the cost of printing, paper and binding would be the determining factor. Of course, that’s not the case. That approach omits many of the costs of publishing content including: reviewing, editing, formatting, proofreading, publicizing, selling, and marketing. And, of course, most authors like to be paid royalties for their work. It also neglects to consider the notion that many readers prefer traditional book formats and therefore many content elements will eventually be published in several media formats.

Mr. Anderson is first to admit, his book entitled “Free” won’t actually be free. Okay, the audio book or e-book may be free to those who purchase the printed book.. But if you want to purchase only the e-book or the audio book, you’ll be expected to pay for it. Mr. Anderson is also exploring various forms of online books or printed books that would either be sponsored by corporations or supported by (gasp) advertising. While some of these offerings might be free to the reader, it seems that there will be considerations and rewards built somewhere into this project.

He went on to say that he might well be in favor of making the book free because of the publicity benefits to his consulting practice, speaking fees, and the promotion value to his “personal brand”. However, he felt that his publisher would likely object because they are in the business of generating sales and revenues from selling books. (After the session, I heard several people suggest that Mr. Anderson could self-publish his new work and then he would be free to make it free).

Based upon his work at Wikipedia, Mr. Wales could be seen as a developer of truly free content. The governing organization is a charitable foundation and all of the authors are volunteers. The costs of supporting the staff and technology infrastructure are paid by donations. However, during his keynote, Mr. Wales was quick to point out two major differences between Wikipedia and Wikia. The scope of Wikia is much broader and it has been organized as a for profit entity.

Wikipedia represents the “perfect storm” for a collaborative work or “peering”. According to Wikinomics by Tapscott and Williams, there are three factors necessary to make peering effective: 1) the object of production is information or culture which keeps the cost of participation low for contributors; 2) Tasks can be chunked out into bite-sized pieces that individuals can contribute in small increments and independent of other producers. This makes their overall investment of time and energy minimal in relation to the benefits they receive in return; 3) The costs of integrating those pieces into a finished end product, including the leadership and quality control mechanisms must be low.”

While these criteria would apply to a number of other works found in a library, there are many others (novels, monographs, complex texts, dissertations, etc) that don’t meet these criteria well at all and aren’t likely to be challenged by collaboratative works. The costs associated with “wikiing” the rest of the library would likely be enormous. Mr. Wales seems to be banking on advertising dollars to support this effort. If this were the model, these works might be considered free from a consumer pricing perspective but the advertising revenues and potential profits would have to be deemed consideration in economic terms.

One also wonders whether the volunteer authorship model is extensible. I can see how it would work when the author is writing about a topic that is intensely interesting ( as I am doing at this moment). However, a great deal of content is more the result of craftsmanship than inspiration. It seems unlikely that volunteers could be recruited to write “drudge works”.

And as appealing as it may be to write essays or modules on interesting topics for free, I for one would enjoy it even more if I received some consideration for my hard work. (no wise cracks please). As popular as Wikipedia is today, someday it too may face a challenge from some organization (i.e. Google or Microsoft) that might add new features or devise a revenue sharing model that provides authors with incentives.

I guess my point is that an author’s ideas and created content are products. And like other products they should have a value proposition and a go-to-market strategy. The valuation/pricing options might include: purchase, subscription, sponsorship, syndication, promotion, and free. Depending on the utility, creativity, entertainment value, uniqueness of content, etc of content, one or more of the above models might be appropriate. While ‘free’ is one option, why should it be the preferred choice? Talented people like Mssrs. Anderson and Wales create content that is very good and are entitled to receive proper consideration if they so desire. Jeff Patterson, CEO of Safari Books Online, had some fascinating data on this topic. He studied how their customers for largely technical information value their information products vs. other content some of which are “free”. He found that some quality content was indeed free. However, most “free” content was either advertising supported or was offered in exchange for specific information about the content consumer or their work projects. He asked customers about the tradeoffs that they were willing to make to obtain content for free. If I recall correctly, about ¼ of their customers would rather pay for content if the advertising was inappropriate and/or distracting. !/3 of their customers would rather pay for content than reveal any personal information other that name and e-mail address. And 2/3 of their customers would rather pay for content than reveal in depth information about a project that they were working on. (Patterson’s presentation was one of the best—I hope that he posts his slides)

Advocating for universally free content might indeed have the unwanted effect of reducing the amount of excellent content that is created by professional authors who depend on their writing as their livelihood. I think that Bruce Chizen, Adobe’s CEO, summed it up nicely when asked by Tim O’Reilly where he stood in this debate. He said that Adobe makes many large investments of human and financial capital in the inventions and products that they produce. While he is all in favor of providing some of their intellectual property to consumers, standards organizations, and society for free, he reserves the right to make the decision as to what should be free. I’m sure that his stakeholders support that position.

I’ll conclude this entry with some thoughts on the provocative concept of ambient information. For many years, content was created for a single purpose and was closely regulated against peripheral usages. With the advent of the digital era, came significant opportunities for deriving additional value from content. I will forever be grateful to the senior management of Houghton Mifflin Company who saw the wisdom of freeing the content of their dictionary from exclusively “booklike objects” and allowed linguists and software engineers to build spelling correction technology. In fact, most of the English language spelling technology that is used today was derived from their American Heritage Dictionary database. And as I described in an earlier Blog entry, Thomson’s retiring CEO Richard Harrington made information ambience central to their core strategy and judging from their financial statements, they are receiving significant consideration for their efforts!!

Making content accessible to inform, educate, and entertain people is a worthy goal, Making content more ambient will offer content creators and publishers many new opportunities to publicize their work, create goodwill, answer new questions, solve difficult problems, and the option to generate new income streams that are appropriate and commensurate with the value of the content.

Adobe Digital Editions

At O’Reilly’s Tools of Change Conference (TOC), Adobe made a very unAdobelike product announcement. Their new Digital Editions is very impressive!! I’ve never been a huge PDF fan because it is so stubbornly page centric in a world where pages are becoming much less important to the display of content. It has often awkward and painful to read PDFs on computer screens and hand held devices.

This new technology is based upon the IDPF EPUB standard that has been developed as the universal distribution format for reflow-centric content. The dynamic layout capability is amazingly agile as it reflows content from large to small screens with excellent speed and seemingly miminal effort. Adobe is currently mum about whether it will be included in the IPhone launch.

Digital Editions has optional DRM capability and will support contextual advertising, subscription and membership based business models. It features the expected compatibility with PDF and InDesign CS3.

The functionality and openness to industry standards are a radical departure from many of Adobe’s traditional practices. Bill McCoy, General Manager- ePublishing Business explains that the MacroMedia acquisition played a major role in making this strategic transition possible. This is more evidence that the Macromedia acquisition was one of the better acquisitions in recent memory.

The relatively small download (under 3MB) can be found at: www.adobe.com/products/digitaleditions.

Webinar: Medtronic, DITA, Single-Sourcing, and Multi-Channel Communications

On Wednesday, June 13 at 1:00 EST, Senior Analyst Bill Trippe will be doing a Webinar with Medtronic and the XMetal folks at JustSystems.
While documentation is a necessary deliverable for all companies, its value and contribution to bottom-line business results is often underestimated and overlooked. For Medtronic, one of the world’s most innovative medical device manufacturers, documentation is much more than a checkbox on a product release timeline – it is a direct link to customer satisfaction and patient well-being. Medtronic’s Rob Kimm will discuss Medtronic’s approach to delivering a better customer experience while also ensuring compliance with regulations that impact technical documentation.
Prior to using DITA, Medtronic had a decentralized, heterogeneous environment that slowed production and resulted in redundant workflows. Seven project deliverables were developed in 5 different tools, and the mutually-exclusive tools allowed for little to no ability to achieve true reuse of common content. They now can reuse common content across deliverable types, which has led to great efficiency, accuracy, and consistency.
To register for the Webinar, please visit here.

« Older posts Newer posts »

© 2024 The Gilbane Advisor

Theme by Anders NorenUp ↑