The World is Curved

The announcement of this new book caught my attention for a number of reasons, many obviously due to the state of the financial markets. More relevant to the Globalization practice, we noted in our Multilingual Communications as a Business Imperative report that:

A common observation made during industry discussion of Internet-driven opportunities is that the proliferation of the worldwide web has made the business world “flat.” In other words, companies of all sizes can compete on a level playing field wherein everyone has the same access to technology and information. While our study respondents acknowledge the “flattening world” as Thomas Friedman has described it, they also recognize that different geographies and cultures have varying and distinct expectations. Thus, generalized information access does not equate to generalized information delivery. From this perspective, a flattening world requires far deeper levels of content relevancy, localization, and personalization than ever before. From this perspective, “one size fits all” is hardly the recipe for success in the global economy.

Risking the wrath of Friedman’ites, we contend that as far as multilingual communications are concerned, the world is most definitely not flat. Giving Friedman his due, David Smick contends that as far as global financial markets are concerned, the world is most definitely curved, where one “can’t see over the horizon and sight lines are limited.” Describing globalization as the great paradox of our time, this review quickly convinced me to put it on the “must read” list.

RSuite CMS Releases Adobe CS3 Connector

RSuite CMS now offers a CS3 Connector for InCopy users. The integration with Adobe’s CS3 enables InCopy users to browse and open XML or InCopy documents in RSuite directly from the Adobe application. The RSuite CS3 Connector allows users to manage their content as XML within RSuite and to create a transformation to and from their own XML content model to the native XML file format of InCopy. This will help publishers who want to manage their content as XML throughout its life-cycle but also want to use the Adobe tools in their editorial and production process. Users can also store and develop workflows around InCopy and InDesign documents in RSuite.

Vignette Launches QuickSite to Speed Web Site Development

Vignette announced the worldwide availability of QuickSite, a new service offering that simplifies the Vignette Content Management implementation process and enables organizations to launch new Web sites faster. QuickSite delivers a consistent infrastructure, helping marketing departments to launch multiple microsites and branded sites without having to recreate Web pages from scratch. The service deployment includes content management processes, templates and business adoption workshops before the customer is asked to determine additional site requirements. QuickSite also includes support for multilingual Web sites, displays of content information through tag libraries and CSS templates to manage the look and feel of a site with limited help from IT. Site Cloning allows organizations to replicate a site within minutes rather than days by reusing the templates.

EPiServer Releases CMS 5 R2

EPiServer announced the introduction of multiple new features for its content management system, EPiServer CMS 5 R2, including solutions for mobility and the iPhone. EPiServer has worked with two partners, Mobiletech A/S and Mobizoft AB, to provide a mobile experience to the visitors of their site, including mobile rendering, video conversion and payments. iPhone support is available as open source templates enabling the system to be viewed from an iPhone. Images can now be prepared directly in EPiServer CMS so that web editors no longer need to work on them in another application before moving them onto the web page. New dynamic content features enable external data which appears in many places on the website, such as financial or legal text, to be updated throughout the site. Page Type Converter makes it easier to merge pages of different types, and change other page types. Five standard reports are now available: non-published pages, published pages, modified pages, expiring/expired pages and an overview of simple addresses. External data such as an archive of articles at a media company can be integrated and displayed in a website using EPiServer CMS. The data will appear as a native EPiServer CMS page. This enables structured data stored on another document management system to be converted to a webpage in EPiServer and viewed. EPiServer CMS now supports Oracle, Windows Server 2003 and 2008, as well as XP and Vista, Visual Studio 2008 and 2000 Express, and ASP.NET 3.5 SP1 or later.

Machine Translation (Finally) Comes of Age

In our Multilingual Communications as a Business Imperative report, we noted the fact that machine translation (MT) has long been the target of “don’t let this happen to you” jokes throughout the globalization industry. Unpredictable results and poor quality allowed humor to become the focus of MT discussions, making widespread adoption risky at best.

On the other hand, we also noted that scientists, researchers, and technologists have been determined to unlock MT potential since the 1950s to solve the same core challenges the industry struggles with today: cost savings, speed, and linguist augmentation. Although the infamous report on Language and Machines from the Automatic Language Processing Advisory Committee (ALPAC) published in 1966 discussed these challenges in some depth (albeit from a U.S. perspective), it sent a resounding message that “there is no emergency in the field of translation.” Research funding suffered; researcher Margaret King described the impact as effectively “killing machine translation research in the States.”

Borrowing from S.E. Hinton, that was then, this is now. Technology advancements and pure computing power have made machine translation not only viable, but also potentially game-changing. A global economy, the volume and velocity of content required to run a global business, and customer expectations are steadily shifting enterprise postures from “not an option” to “help me understand where MT fits.” Case in point: participants in our study identified MT as one of the top three valuable technologies for the future.

There’s lots of game-changing news for our readers to digest.

  • An excellent place to start is with our colleagues at Multilingual Magazine, who dedicated the April-May issue to this very subject. Don Osborn over at the Multidisciplinary Perspectives blog provides an excellent summary, posing the question: “Is there a paradigm shift on machine translation?”
  • Language Weaver predicts a potential $67.5 billion market for digital translation, fueled by MT. CEO Mark Tapling explains why.
  • SYSTRAN, one of the earliest MT software developers, provides research and education here.
  • And finally (for today), there’s no way to deny the Google impact — here’s their FAQ about the beta version of Google Translate. TAUS weighs in on the subject here.

Mary and I will be at Localization World Madison to provide practical advice and best practices for making the enterprise business case for multilingual communications investments as part of a Global Content Value Chain. But we’re also looking forward to the session focused on MT potential, issues, and vendor approaches. The full grid is here. Join us!

CM Pros Summit in Boston

The Content Management Professionals Association (CM Pros) will once again be holding their annual Fall Summit in conjunction with Gilbane Boston in December. There are details over on our Events blog which I won’t duplicate here, or even better, go right to the source at the CM Pros site. If you are a member, we hope to see you, and if you are not, you can find out about joining on the CM Pros site.

Taxonomy, Yes, but for What?

The term taxonomy crept into the search lexicon by stealth and is now firmly entrenched. The very early search engines, circa 1972-73, presented searchers with the retrieval option of selecting content using controlled vocabularies from a standardized thesaurus of terminology in a particular discipline. With no neat graphical navigation tools, searches were crafted on a typewriter-like device, painfully typed in an arcane syntax. A stray hyphen, period or space would render the query un-computable, so after deciphering the error message, the searcher would try again. Each minute and each result cost money, so errors were a real expense.

We entered the Web search era bundling content into a directory structure, like the “Yellow Pages,” or organizing query results into “folders” labeled with broad topics. The controlled vocabulary that represented directory topics or folder labels became known as a taxonomic structure, with the early ones at NorthernLight and Yahoo crafted by experts with knowledge of the rules of controlled vocabulary, thesaurus development and maintenance. Google derailed that search model with its simple “search box” requiring only a word or phrase to grab heaps of results. Today we are in a new era. Some people like searching by typing keywords in a box, while others prefer the suggestions of a directory or tree structure. Building taxonomic structures for more than e-commerce sites is now serious business for searches within enterprises where many employees prefer to navigate through the terminology to browse and discover the full scope of what is there.

Navigation is but one purpose for taxonomies in search. Depending on the application domain, richness of the subject matter, scope and depth of topics, these lists can become quite large and complex. The more cross-references (e.g. cell phones USE wireless phones) are embedded in the list, the more likely the searcher’s preferred term will be present. There is a diminishing return, however; if the user has to navigate to a system’s preferred term too often, the entire process of searching becomes unwieldy and is abandoned. On the other hand, if the system automates the smooth transition from one term to another, the richness and complexity of a taxonomy can be an asset.
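To make the automated transition from one term to another concrete, here is a minimal sketch in Python. It assumes the cross-references are kept as a simple lookup from non-preferred to preferred terms; the table contents (including the "mobile phones" entry) and the function name are hypothetical, chosen only for illustration.

```python
# Hypothetical USE cross-references: non-preferred term -> preferred term.
# Only "cell phones USE wireless phones" comes from the example above;
# the second entry is an assumption added for illustration.
USE_REFERENCES = {
    "cell phones": "wireless phones",
    "mobile phones": "wireless phones",
}


def preferred_term(term, use_references=USE_REFERENCES):
    """Return the taxonomy's preferred term for a searcher's term."""
    key = " ".join(term.lower().split())  # normalize case and spacing
    seen = set()
    # Follow chained references, guarding against accidental cycles.
    while key in use_references and key not in seen:
        seen.add(key)
        key = use_references[key]
    return key
```

A searcher who types "Cell Phones" is silently moved to "wireless phones", while a term with no cross-reference passes through unchanged, so nobody has to navigate to the preferred term by hand.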

In more sophisticated applications of taxonomies, the thesaurus model of relationships becomes a necessity. When a search engine has embedded algorithms that can interpret explicit term relationships, it indexes content according to a taxonomy and all its cross-references. Taxonomy here informs the index engine. It requires substantial maintenance and governance of a much more granular nature than for navigation. To work well, a large corpus of terminology needs to be built to assure that what the content says and means, and what the searcher expects, are a match in results. If a search gives back unsatisfactory results due to a poor taxonomy, trust in the search system fails rapidly and the benefits of whatever effort was put into building a taxonomy are lost.
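As a rough illustration of a taxonomy informing the index engine, the Python sketch below files each document under a preferred term whenever the text mentions that term or one of its cross-referenced variants. The taxonomy contents, document texts, and function name are all hypothetical, and the plain substring matching is a deliberate simplification; it does not represent how any particular search product works.

```python
from collections import defaultdict

# Hypothetical taxonomy: preferred term -> variants that point to it.
TAXONOMY = {
    "wireless phones": ["cell phones", "mobile phones"],
}


def build_index(documents, taxonomy=TAXONOMY):
    """Map each preferred term to the ids of documents that mention it
    or any of its cross-referenced variants."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        lowered = text.lower()
        for preferred, variants in taxonomy.items():
            # Naive substring match; a real engine would tokenize and stem.
            if any(t in lowered for t in [preferred, *variants]):
                index[preferred].add(doc_id)
    return dict(index)


# Made-up sample documents for illustration only.
docs = {
    "a": "Cell phones now outsell landlines.",
    "b": "Quarterly report on wireless phones.",
    "c": "Office furniture catalog.",
}
index = build_index(docs)
```

Documents "a" and "b" both land under "wireless phones" even though they use different wording, which is the match between content and searcher expectations that the taxonomy is meant to secure. Every variant missing from the list is a document the index misses, which is why the maintenance burden is so much heavier here than for navigation.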

I bring this up because defining the intent of any taxonomy is the first step in deciding whether to start building one. Either model is an on-going commitment, but the latter is a much larger investment in sophisticated human resources. The conditions that must be met to have any taxonomy succeed must be articulated in selling the project and value proposition.

Multilingual Communications Report Resonates

We’ve had an overwhelmingly positive response to our Multilingual Communications as a Business Imperative report, for which we’re grateful – and thrilled! I can summarize the response as “peer sharing works!” And not only works, but spurs conversation, new ideas, and without a doubt, more sharing. For the Globalization Practice team, it’s true validation of the people perspective of Web 2.0.

It would be a long list to point out all the countries represented through report downloads and additional conversations we’ve had since July, but here’s just a sample. We’ve heard from content and translation management professionals from all across the USA in addition to:

  • Austria
  • Belgium
  • Canada
  • Chile
  • China
  • Finland
  • France
  • Germany
  • India
  • Indonesia
  • Ireland
  • Israel
  • Japan
  • Korea
  • Netherlands
  • New Zealand
  • Russia
  • Singapore
  • Slovenia
  • South Africa
  • South Korea
  • Spain
  • Sweden
  • Switzerland
  • United Kingdom

What resonates most? Unwaveringly first is the need to look at multilingual communications creation, management, and delivery in a new way: as less a cost center and more an integral part of business value. Next – the inherent connection readers have with our definition of operational champions and the stories told by those that shared challenges and strategies in the report’s Best Practices Profiles section. Of course those links have pros and cons; the former obviously cementing the growing need for community sharing and the latter validating the struggles of educating senior management and making the business case for focused investment.

Those “on the ground floor” clearly want more – and we aim to provide it. As Frank documented in our Events blog on Fall Speaking Gigs, we’re focused on sharing our experiences and more importantly, learning from yours. Particularly exciting for our team is the Content Globalization track we’ve put together for Gilbane Boston, December 2-4. The full conference schedule is here. Join us!

