The Gilbane Advisor

Curated for content, computing, data, information, and digital experience professionals

Page 272 of 939

Search Fundamentals: Why Search Fails Us

When search fails me, the reasons may be hard to discover as a user but once on the inside of an enterprise I can learn a lot about what is going on. After listening to scores of business case studies, personal experiences and reading about rampant dissatisfaction with search it is discouraging to recognize the simple reasons for most negative outcomes.

Consider this scenario. I was attempting to find the address of the office of a major global platform vendor (one of the largest) that sells an entire suite of enterprise search and content management software products. One can usually find business location information from links on the home page of any corporate Web site or at least from the site representing the division one is visiting. But there was no such link for this corporate site. Then using the “search” box and later the “advanced search” option, trying a dozen variations of the division name, town in which the office is located, and product names I struck out on every query. All paths lead to a page with a single corporate address, or a couple of other remote addresses, and links to web pages that contained no address. Even those pages with addresses had no link to directions. I followed up with queries using Google and these got me back to the same dead-ends. Finally, I found the address through various online non-specific business directories.

This experience lead to a couple of conclusions about why my search failed: 1. The content does not exist; there is no such listing of locations. 2. The search engine is not properly tuned or metadata is not supplied with labels such as “locations,” “directions,” “business offices,” etc. The immediate solution for this case is to ensure that someone with practical business sense and usability competency has ownership of the overall web site experience to make sure that essential company data is available and easy to find. Or, if the company has made a conscious decision not to publish that information, at the least they should have a page stating the alternative for potential visitors as to how they can find their destination or to what office they can direct postal mail.

I had to two reasons for needing this information; one was a visit to an individual who was not available to give me the address in time to reach the office, and the second was a personal follow-up letter after someone from the company had been a speaker at an event I chaired. As things stand, I have been left with personal skepticism about the commitment of this company to build, produce and actually use content management or search products that will be truly responsive to needs of their potential buyers. When you don’t or can’t showcase your products, I question “why.” This is not a technology problem; it is a human factors and human resource allocation problem.

This brings me to some search fundamentals:

  • No content – If content that customers or employees expect to find is not included in explicit directives to the search engine for the repositories to be crawled and indexed, it will never be found.
  • No metadata – Any content lacking explicit language likely to be used by a searcher will probably not be found if it also lacks sufficient metadata.
  • Poor indexing or search rule base – If the content being searched is business documents without many unique contextual “hooks,” such as product names, technical terminology or topics of narrow interest, the search engine being used must be “smart” enough to glean the intent of the searcher from the context of query. In my case, I supplied a half a dozen terms to layer the context, tried them in different combinations, with and without quotations around phrases, but nothing worked.

Conclusion, if you really don’t want searchers to find what they want to find, it is not hard at all to compromise findability. I will not arrive at my destination and you won’t get any first class letters from me.

Valuing Social Connections

A team of researchers from International Business Machines Corporation (IBM) and the Massachusetts Institute of Technology (MIT) released a very interesting piece of academic research this week, which presents some findings from a study of “the largest organizational social network ever collected.”  The researchers collected and mined data related to c. 400,000 IBM employees.  The researchers further focused on a subset of that dataset — 2,600 consultants — to draw insights on how connectedness impacts the productivity of employees who generate revenues by logging billable hours.

What makes the study so interesting — in addition to the extraordinarily huge dataset used — is that it is one of the first attempts I’ve seen to assign a currency-based value to social network connections.  In this case, the social network is based in email; it lives in IBM’s internal deployment of Lotus Notes.

The study associates incremental revenue earned by a consultant with both individual and project-level email activity.  For example, the study finds that if an IBM consultant uses email to reach out to a manager that is not his direct supervisor, he produces, on average, an additional $588/month in revenue as compared to a consultant that only interfaces with her direct manager.

This is fascinating stuff, and my head is spinning with the possibilities of how this might be applied to inter-enterprise interactions conducted via emergent social software, rather than through well-institutionalized email.  I just came across this study today and haven’t had time to properly digest it yet, but will do so and comment further.  In the meanwhile, I invite you to read it for yourself and leave observations and  comments here.

SEC Issues Summarized XBRL Guidance

The SEC has posted newly summarized XBRL compliance information on their website (http://www.sec.gov/info/smallbus/secg/interactivedata-secg.htm ). The guidance is directed towards small businesses but contains a concise description of the program for all companies. Information covers: the three year phase-in period, certification requirements, third-party involvement, Modified liability, consequences of non-compliance, web posting, grace periods, due dates, applicable financial statements, required formats, optional early compliance, and other helpful resources.

The information is not meant to replace the rules as published in the EDGAR Filing Manual (Chapter 6, Interactive Data), located here: http://www.sec.gov/rules/final/2009/33-9002.pdf.

The bottom line is that each company will be responsible for the content in their SEC XBRL filings and should become very familiar with all reporting requirements. http://www.sec.gov/info/smallbus/secg/interactivedata-secg.htm is an excellent place for companies large and small to begin to explore the SEC’s XBRL mandate.

March Madness in the Search Industry

In keeping with conventional wisdom, it looks like a number of entrepreneurs are using the economic downturn as opportunity time, judging from the larger than normal number of announcements in the enterprise search sector. The Microsoft acquisition of FAST, Autonomy’s foray into the document/content management market, and Google’s Search Appliance ramping its customer base are old news BUT we have a sweep of changes. Newcomers to the enterprise search marketplace and news of innovative releases of mature products really perked up in March. Here are my favorite announcements and events in chronological order and the reasons why I find them interesting:

Travis, Paul. March 2, 2009 Digital Reef Comes Out of Stealth Mode. 03/02/2009. Byteandswitch.com.

Startup offers content management platform to index unstructured data for use in e-discovery, risk mitigation, and storage optimization. Here is the first evidence that entrepreneurs see opportunity for filling a niche vacuum. In the legal market the options have been limited and pretty costly, especially for small firms. This will be an interesting one to watch. http://www.digitalreefinc.com/

Banking, Finance, and Investment Taxonomy Now Available from the the Taxonomy Experts at WAND. 03/02/2009, PR Web (press release), Ferndale,WA,USA

The taxonomy experts at WAND have made this financial taxonomy available now for integration into any enterprise search software. I have been talking with Ross Lehr, CEO at Wand, for over a year about his suite of vertical market taxonomies and how best to leverage them. I am delighted that Wand is now actively engaged with a number of enterprise search and content management firms, enabling them to better support their customers’ need for navigation. The Wand taxonomies offer a launching point from which organizations can customize and enhance the vocabulary to match their internal or customer interests. http://www.wandinc.com/main/default.aspx

Miller, Mark. Lucid Imagination » Add our Lucene Ecosystem Search Engine to Firefox. 03/02/2009

I predicted back in January that open source search and search appliances were going to spawn a whole new industry of services providers and expert integrators because there are just not enough search experts to staff in-house experts in all the companies that are adopting these two types of search products. Well, it is happening and these guys at Lucid are some of the smartest search technologists around. Here is an announcement that introduces you to a taste of what they can do. Check it out and check them out at http://www.lucidimagination.com/

To see the full article with commentary about: social search at NASA, QueSearch, MaxxCat, Aardvark on social search, Attivio, ConceptSearching, Google user-group, Simplexo, Endeca, Linguamatics, Coveo, dtSearch and ISYS.

Microsharing has benefits for NASA. 03/04/2009.

It has been about 18 months since I wrote on social search and this report reveals a program that takes the concept to a new level, integrating content management, expertise locators and search in a nifty model. To learn more about NASAsphere, read this report written by Celeste Merryman. Findings from the NASAsphere Pilot. Jet Propulsion Laboratory, California Institute of Technology Knowledge Arciteture (sic) and Technology Task [Force]. 08/20/2008. The success of the pilot project is underscored in this report recommendation: the NASAsphere pilot team recommends that NASAsphere be implemented as an “official” employee social networking and communication tool. This project is not about enterprise search per se, it just reflects how leveraging content and human expertise using social networks requires a “findability” component to have a successful outcome. Conversely, social tools play a huge role in improving findability.

March 16, 2009. QueSearch: Unlocking the Value of Structured Data with Universal Search really caught my eye with their claim to “universal search” (yes, another) for large and mid-size organizations.

This offering with a starting price of $19,500, is available immediately, with software and appliance deployment options. I tried to find out more about their founders and origins on their Web site without luck but did track down a Wikipedia article and a neat YouTube interview with the two founders, Steven Yaskin and Paul Tenberg. It explains how they are leveraging Google tools and open source to deliver solutions.

Stronger, Better, Faster — MaxxCat’s New Search Appliance Aspires to Be Google Search Appliance Killer, by Marketwire. 03/11/2009.

This statement explains why the announcement caught my attention: MaxxCat product developers cite “poor performance and intrinsic limitations of Google Mini and Google Search Appliance” as the impetus to develop the device. The enterprise search appliance, EX-5000, is over seven times faster than Google Search Appliance (GSA) and the small business search appliance, the XB-250, is 16 times faster than Google Mini. There is nothing like challenging the leading search appliance company with a statement like that to throw down the gauntlet. OK I’m watching and will be delighted to read or hear from early users.

Just one more take on “social search” as we learn about Aardvark: Answering the Tough Questions, David Hornik on VentureBlog. 03/12/2009

This week the Aardvark team is launching the fruits of that labor at South By Southwest (SXSW). They have built a “social search engine” that lives inside your IM and email. It allows you to ask questions of Aardvark, which then goes about determining who among your friends and friends of friends is most qualified to answer those questions. As the Aardvark team point out in their blog, Social Search is particularly well suited to answer subjective questions where “context” is important. I am not going to quibble now but I think I would have but this under my category of “semantic search” and natural language processing. Until we see it in action, who knows?

A new position at Attivio was announced on March 16th, Attivio Promotes John O’Neil to Chief Scientist, which tells me that they are still expanding at the end of their first official year in business.

Getting to the point, 03/18/2009, KMWorld. http://www.kmworld.com/Articles/ReadArticle.aspx?ArticleID=53070

Several announcements about Concept Searching’s release v. 4 of its flagship product, conceptClassifier for SharePoint highlight the fact that Microsoft’s acquisition of FAST has not slowed the number of enterprise search solution companies that continue to partner with or offer independent solutions for SharePoint. In this case the company offers its own standalone concept search solution applications for other content domains but is continuing to bank on lots of business from the SharePoint user community. This relationship is reflected in these statements: The company says features include a new installer that enables installation in a SharePoint environment in less than 20 minutes, requires no programmatic support and all functionality can be turned on or off using standard Microsoft SharePoint controls. Full integration with Microsoft Content Types and greater support for multiple taxonomies are also included in this release. Once the FAST search server becomes a staple for Microsoft SharePoint shops, there will undoubtedly be fallout for some of these partners.

Being invited to the Google Enterprise Search Summit in Cambridge, MA on March 19, 2009 was an opportunity for me to visit Google’s local offices and meet a bunch of customers.

They were a pretty enthusiastic crowd and are enjoying a lot of attention as this division of Google works to join the ranks of other enterprise application software companies. I suspect that it is a whole new venture for them to be entertaining customers in their offices in a “user-group like” forum but the Google speakers were energetic and clearly love the entrepreneurial aspects of being a newish run-away success within a run-away successful company. New customer announcements continue to flow from Google with SITA (The State Information Technology Agency in South Africa) acquiring GSA to drive an enterprise-wide research project. The solution will also be deployed and implemented by JSE-listed IT solutions and services company Faritec, and RR Donnelly. Several EMC users were represented at the meeting, which made me ask why they aren’t using the search tools being rolled out by the Documentum division…well, don’t ask.

Evans, Steve. Simplexo boosts public sector search options. Computer Business Review – UK. 03/18/2009.

This is interesting as an alternative to the Lucene/solr scene, UK-based open source enterprise search vendor Simplexo has launched a new search platform aimed at the public sector, which aims to enable central and local government departments to simultaneously search multiple disparate data sources across the organisation on demand. I have wondered when we would see some other open source offerings.

And all of the preceding is about just the startups (plus EMC at Google) and lesser known company activity. This was not a slow month. I don’t want all my contacts in the “established” search market to think that I am not paying attention because I am. I’ve exchanged communications with or been briefed by these known companies with news about new releases, advancing market share, or new executive teams. In no particular order these were the highlights of the month:

Endeca announced three new platforms on Mar 23, 2009: Endeca Announces the Endeca Publishing Suite, Giving Editors Unprecedented Control Over the Online Experience; Endeca Announces the Endeca Commerce Suite, Giving Retailers Continuous Targeted Merchandizing; and Endeca Unveils McKinley Release of the Information Access Platform, Allowing for Faster and Easier Deployment of Search Applications

Linguamatics Agile Text Mining Platform to Be Used by Novo Nordisk. 03/26/2009

I had a fine briefing by Coveo’s CEO Laurent Simoneau and Michel Besmer new VP of Global Marketing and see them making great strides capturing market share across numerous verticals where rapid deployment and implementation are a big selling point. They also just announced: Bell Mobility and Coveo Partner to Create Enterprise Search from Bell, an Exclusive Enterprise-Grade Mobile Search Solution.

A new Version 7.6 of a mainstay, plug-and-play search solution for SMBs since 1991, dtSearch, was just released. 3/24/2009

And finally, ISYS is having a great growth path with a new technology release, ISYS File Readers, new executives and a new project … completed in conjunction with ArnoldIT.com. Steve Arnold, industry expert and author of the Beyond Search blog, compiled more than a decade of Google patent documents. To offer a more powerful method for analyzing and mining this content, we produced the Google Patent Search Demonstration Site, powered by our ISYS: web application.

Weatherwise, March, 2009 is out like a lamb but hot, hot, hot when it comes to search.

Quark Teams with IBM Enterprise Content Management to Bring XML and DITA to the Masses

Quark announced that it has teamed with IBM Enterprise Content Management (ECM) to enable the broad adoption of XML across the enterprise by integrating Quark XML Author with IBM FileNet Content Manager. Quark makes it possible for any IBM FileNet Content Manager user working in Microsoft Word to author intelligent content that can be reused and delivered to multiple channels or formats. The ability to author, manage, and reuse structured content enables critical business needs, such as managing intellectual property, complying with regulatory mandates, and automating business processes. A simple and streamlined process for XML authoring also helps organizations to enable enterprise-wide adoption of XML and DITA. Quark XML Author for Microsoft Word is an XML authoring tool that allows users to create XML content in a familiar word processing environment. Quark XML Author enhances Microsoft Word’s native XML support by allowing users to create narrative XML documents directly, without seeing tags, being constrained to boxes, or being aware of the technical complexities associated with XML. http://www.quark.com/

Second Life Gets an International Life: An Interview with Danica Brinton of Linden Lab

At the recent Worldware Conference in Santa Clara, California, I was delighted to learn about how a high-tech company was achieving great success in internationalizing their software through crowdsourcing. The story gets more interesting. This was not back-room software plumbing but an innovative application, none other than Second Life, a virtual world and a social-networking MMORG (Massive Multi-Player Online Role-Playing Game).  Launched by Linden Lab in 2003, Second Life enables its users, called residents, to interoperate with a virtual world  through software called a Second Life Viewer. Residents can socialize, participate in group activities, and create and trade virtual property.  According to Google, there are over 9 million residents currently on Second Life.

I attended the presentation, “Brave New (Virtual) World,” and had an opportunity to catch up with Danica Brinton, Director of International Strategies and Localization at Linden Lab.  Here’s what she had to say.

Kadie:  When did Linden Lab realize the importance of internationalization?

Brinton: Around the middle of 2008, Linden Lab realized some discrepancies between U.S. and international business.  While 60% of the residents and twice the new registrations were from outside the U.S., revenue and retention numbers, while still healthy, indicated a gap in the localized  user experience.

Kadie: What happened when you entered the scene?

Brinton: I joined the company in June.  When I checked things out, I was stunned.  I discovered that we were paying $40,000 per quarter to LSPs.  What were we getting?  The viewer was translated only partially into 3 languages, and was nearly incomprehensible.  The website was translated partially into 2 key languages.  In both cases there were a lot of localization bugs.  On the flip side, hundreds of wiki-based Help pages were translated quite well into 8 languages, which was pretty darn good.  An interesting trend…

Kadie: So what did you do?

Brinton: Although we were a small company, when I showed my management the opportunity they were very supportive…but with limited funding.  So we had to get creative.  We enlisted the help of power users to translate the application and website.  To ensure quality control, we set up a repeatable localization framework, with translation, editing, testing, and end user review.  We established a tier system of resident translators, drawing on our super-users.   We built and acquired localization tools to manage translation memories and the localization process, and installed a locale-based ROI calculator to manage costs.  Finally, we hired 3 in-house linguists.  So you can see, it was a hybrid of crowdsourcing from the Second Life community on the one hand, and our in-house linguists and contracted translation agencies on the other.

Kadie: How did you divide up the work?

Brinton:  Who did what depended on the language tier.  Let’s look at the viewer, for example.  For tier-1 languages, we developed the glossary, did the translation, and collaborated with the Second Life community on the editing, QA, and some of the glossary.  For tier-2 languages, the Second Life community did nearly everything.

Kadie: What kind of results did you achieve?

Brinton: Less than a year later, I can truthfully say that we achieved some dramatic results.  We now translate the viewer and the website into 10 languages, and expect to reach 16 in May.  The active residents from outside the U.S. grew to 64% of the user base, and new registrations are now more than 2.5 times the U.S.  Even better, international revenues have surpassed U.S. domestic revenues.  Between the Viewer, the website, and the knowledge base, we now regularly localize over 150,000 words per language.

Kadie: What’s next for localization at Linden Lab?

Brinton: Strangely enough, past is prologue.  This new localization program is helping to increase customer satisfaction and bolster an affinity group.  You can even say that community-driven translation is building brand advocacy.  Some of the elite power users are evolving into business partners.  Localization is not only supporting our business, it’s helping to grow it.

The Content Globalization practice at the Gilbane Group closely follows and  blogs on the role of multilingual communication in social networking (see interview with Plaxo).

« Older posts Newer posts »

© 2026 The Gilbane Advisor

Theme by Anders NorenUp ↑