Curated for content, computing, and digital experience professionals

Category: Semantic technologies (Page 39 of 72)

Our coverage of semantic technologies goes back to the early 90s when search engines focused on searching structured data in databases were looking to provide support for searching unstructured or semi-structured data. This early Gilbane Report, Document Query Languages – Why is it so Hard to Ask a Simple Question?, analyses the challenge back then.

Semantic technology is a broad topic that includes all natural language processing, as well as the semantic web, linked data processing, and knowledge graphs.


New Research on Enterprise Social Software Use

Finally there is some quantitative research on enterprise use of blogs, wikis, tagging, etc. to complement the very informal surveys we have taken, and the work done at the University of Massachusetts. Reports from Forrester (CIOs Want Suites For Web 2.0) and McKinsey (How businesses are using Web 2.0: A McKinsey Global Survey) published this week provide interesting, though not surprising, data. The McKinsey report is free with registration, and the Forrester report isn’t expensive.

I haven’t read the Forrester report (119 CIOs), but the executive summary focuses on their finding that most CIOs want to buy enterprise social software in suite form from large vendors rather from the smaller specialist software vendors. This fact itself is of course totally predictable, but it raises two interesting issues. First, just what are all the larger vendors, as well as midsize (e.g., content management vendors) doing about all this? (Short answer – all are doing something, but the details are often vague.) Second, what will be lost or gained in the process of force-fitting the “engage and collaborate” functions and culture into the “command and control” (last week’s post) of top-down IT directives?

The McKinsey report (2847 executives, 44% C-level) found “widespread but careful interest” in “Web 2.0 technologies”, and that they are strategic and will be invested in. I think their conclusion might be a little overly conservative given their findings. For example, 77% of retail and 74% of high tech plan to increase investment in these technologies. Note, however that McKinsey includes web services as a “Web 2.0” technology which not everyone would agree with.

See comments on these reports from Nick Carr, who points out where the Forrester and McKinsey findings differ. And see Richard MacManus’ comments on what the Forrester findings mean for the startups in this space.
For a couple of vendor perspectives, Socialtexts’ Ross Mayfield covers these findings here, and FAST’s Hadley Reynolds talks about some similar research they have been working on with the Economist here.
Also (while not commenting on these reports) Andrew McAfee provides some info on how he is seeing enterprises using these technologies.

Trying to Take the High Road

My last blog was in reaction to two recent vendor experiences. One had just briefed me on an enterprise search offering; the other had been ignoring my client’s efforts to get software support, training and respond to bug reports. The second blogged a reaction with a patronizing: “So Lynda should not feel too bad. I know its (sic) frustrating to deal with vendors but not all vendors are the same and she certainly hasn’t tried us all.”

With dozens of vendors offering search tools, it was fair to assume that I haven’t tried them all. However, having used search engines of all types since 1974 both as a researcher and analyst I have a pretty good sense of what’s out there. Having evaluated products for clients, and for embedded use in products I brought to market for over 20 years, it doesn’t take me long with a new product to figure out where the problems are. I also talk to a lot of vendors, search users, and read more reports and evaluations than I can count. The evidence about any one product’s strengths and weaknesses piles up pretty quickly. “Searching” for stuff about search has been my career and I do make it my business to keep score on products.

I’m going to continue to hold my counsel on naming different search tools that I’ve experienced for the time being. Instead, in this blog I’ll focus on keeping buyers informed about search technologies in general. My work as a consultant is about helping specific clients look at the best and most appropriate options for the search problems they are trying to solve and to help guide their selection process. Here is some quick generic guidance on making your first search tool choice:

  • If you have not previously deployed an enterprise search solution in your domain for the corpus of content you plan to search, do not begin with the highest priced licenses. They are often also the most costly and lengthy implementations and it will take many months to know if a solution will work for you over the long haul.
  • Do begin with one or more low cost solutions to learn about search, search product administration, and search engine tuning. This helps you discover what issues and problems are likely to arise, and it will inform you about what to expect (or want) in a more sophisticated solution. You may even discover that a lower cost solution will do just fine for the intended application.
  • Do execute hundreds of searches yourself on a corpus of content with which you are very familiar. You want to learn if you can actually find all the content you know is there, and how the results are returned and displayed.
  • Do have a variety of types of potential searchers test-drive the installed product over a period of time, review the search logs to get a sense of how they approach searching; then debrief them about their experiences, and whether their search expectations were met.

It is highly unlikely that the first enterprise search product you procure will be the best and final choice. Experience will give you a much better handle on the next selection. It is certainly true that not all vendors or products are the same but you need to do serious reality-based evaluations to learn your most important differentiators.

Setting and Meeting Customer Expectations

I had a briefing from a vendor that is a strong contender for a piece of the enterprise search market this week. The offering is impressive, other reviews have given it high technical marks and the pricing model is reasonable. But because I am currently immersed in the deployment of another enterprise search engine with a client, the issue of vendor client relationship is foremost in my focus.

I asked the CEO of this relatively new offering, what are the fundamental assumptions his company makes about customer technology environments (e.g. the mix of software applications, hardware environment) and the competencies required to integrate his software with that environment. His answer was given strictly in terms of what the IT staff needs to know to bring the product online. My question did have several levels of complexity and was probably badly phrased but I was trying to make a point by asking it.

There are three specific elements missing from search vendors:

  • Documentation or explicit models for deployment in environments where there are numerous technological variables to be considered
  • Availability of training that takes into consideration the context for enterprise search in a specific customer’s organization
  • Frank discussions with customers that set expectations about deployment and implementation, potential bottlenecks, and the need for experienced searchers, search analysts and subject matter experts on the team with the IT group

Downloading software and using automatic installers has become routine; with the launch of a menu and a few simple clicks on boxes on an administrative screen, vendors can claim “out-of-the-box” functionality. Never mind that what you find when you first search your targeted domain is nonsensical, the software finds “stuff.”

The IT guys are happy because it was easy to install, met their architecture requirement and, knowing little about the actual corpus of content, they are satisfied that everything works.

I am in a bit of a pickle with the current project, software from another vendor, because:

  • What the documentation says will happen when I make certain choices in the set-up does not, in fact, happen when a search is executed
  • My attempts through email and phone to schedule training have gone unanswered
  • My messages to the support service citing problems also get no response

I’ve only spent two weeks trying to get this software working but three weeks ago, on a holiday, I got a briefing from two executives from this firm because they were “going to be in the area” and wanted face time with a search analyst. Knowing my role as an analyst and as a client you would think they’d answer my phone calls.
What is it that makes the customer experience so easily ignored? All these products look great in demos; what is under the hood is often technologically wonderful but, boy, getting them to work in my environment always seems to be one long nightmare. I wish I could find out what I really need to know. A terrific search engine might help.

SearchBlox Content Search Software Version 4.0 Released

SearchBlox Software, Inc. has released SearchBlox Content Search Software Version 4.0 with replication support and a completely re-designed AJAX-based Admin Console. The new release also supports hit highlighting of search terms in HTML and PDF documents. The replication feature, available in SearchBlox ENTERPRISE Edition, keeps search indexes in multiple instances of SearchBlox synchronized. Using the new replication feature, search applications can be deployed using multiple instances of SearchBlox. The new release also features a new AJAX-based Admin Console which makes the management of SearchBlox Content Search Software more interactive. The new UI hides the complexities of managing the built-in HTTP, File system and RSS crawlers and makes the software accessible to novice users. SearchBlox is 100% Java and uses the open source Apache Lucene search API. SearchBlox supports indexing and searching of content in 37 languages and can be deployed on any platform that supports Java. It been tested on Mac OS X, Windows, Linux, UNIX, IBM AIX, Solaris, HP-UX, and z/OS operating systems. SearchBlox can be deployed as a J2EE Web Component to all major J2EE Application Servers including BEA Weblogic, JBoss, IBM Websphere, Oracle OAS, JRun and Apache Tomcat. It is also available as a standalone server for Windows, Mac OS X and Unix/Linux platforms. The SearchBlox Free Edition is available free of charge and can index up to 1000 documents. The software can be downloaded from http://www.searchblox.com/.

IBM to Make Google Gadgets and Sitemaps Available to Corporate Portal Users

IBM (NYSE: IBM) announced that it is bringing Google Gadgets – or consumer-style web utilities – into commercial portal software. Available at no cost to WebSphere Portal and WebSphere Portal Express Version 6.0 customers, IBM now lets users create, customize and use rich Internet applications with Google Gadgets directly from within WebSphere Portal so they appear as ready-to-use services. Users can choose from nearly 4,000 Google Gadgets such as language translators, package delivery tracking, Podcast searches, Wikipedia information, YouTube postings and more. These features can be offered through a company’s portal with a click of a button. IBM is also announcing its search sitemap utility, based on a new sitemap protocol, agreed on by Google, Microsoft and Yahoo, that will make it possible to optimize publication of portal content for improved search by public search engines. This feature also includes the ability to notify search engines of the update frequency, last modification date, and relative priority of the content that is being published. The end result is an improved content relationship with external search engines so that all of the public content in a portal can be found and crawled efficiently. The IBM Portlet for Google Gadgets will be available in April via IBM WebSphere Portal catalog. WebSphere Portal Version 6.0 customers, including those using WebSphere Portal Express to deploy solutions for Small and Medium Sized businesses and WebSphere Portal Server Version 6.0 are entitled to use Google Gadget at no cost. Enablement for the Sitemap 0.90 protocol will be delivered for WebSphere Portal as a sitemap utility that customers can download from the WebSphere Portal catalog later in 2007. http://catalog.lotus.com/wps/portal/portalhttp://www.sitemaps.org/

Evidence of a Shift Away from Total Enterprise Search

A tough truth about complex and integrated software applications is the lack of expertise and professional depth available to implement and maintain them. This explains a lot about why small business units and project teams often find and deploy their own software tools to get work done.

I am particularly concerned at the lack of will by organizations to fund implementation of applications like enterprise search to aggregate at the retrieval-end the content stored within disparate applications. No rational business planning can justify having workers sift through multiple repositories, each with a separate sign-on, search interface, and search engine protocols just to find a single document. True, organizations need highly competent professionals to meaningfully implement, tune, and administer enterprise search engines. They require the expertise of search analysts, taxonomists, librarians, IT specialists with security, platform, and software development training. However, developing a team of six to twelve “search engineers” to give workers in a thousand person company quick access to relevant content is an ROI no brainer when we know workers waste significant (5 – 15%) amounts of working hours hunting for stuff.

This week’s Information Week article by Nicholas Hoover on Web 2.0 contained a comment about Wells Fargo “…on another Enterprise 2.0 front, integrated search, the company has limited employees’ ability to search across data repositories because of the complex authorization schemes needed to keep people from accessing information they shouldn’t.”

Today’s (Feb. 27th) New York Times headlined with a story about Microsoft buying “a specialized search engine tailored to deliver useful medical information to consumers,” Medstory, Inc. The story goes on to cite comments by Esther Dyson who refers to the technology as “an ontology engine.” This underscores another truth about quality semantic (natural language) search; it depends on the existence of meaningful, topic specific vocabulary and concept maps to work well, a complexity in narrower markets.
Finally, we have seen the recent shift of companies like FAST moving from a strategy of selling solutions directly to enterprises for the purpose of aggregating content through a unified search portal to focusing on niche markets and highly tailored search architectures.

These are just three cases of a shift among search companies to leverage their search technology IP in more lucrative offerings. The losers will be organizations that really do need to deliver content more holistically to workers through a single search engine. Yes, security is a concern, and skilled search technologists must be hired and dedicated to delivering search options that tie directly to business operations.These efforts are not one-off projects but need to be sustained as permanent infrastructure. If you are in a position to influence search procurement solutions make your case for the most suitable software that will really help deliver the best retrieval option company-wide. Be realistic about funding and staffing; then go for it. If Enterprise Search is what you need, make sure that is what you get and deploy.

Which Would You Have? Software as Service or Service with Your Software

I received an unsolicited email from jetBlue yesterday, one of many that I routinely receive from various travel providers. This one was different. I was not one of the thousands stranded by them last week and I have only traveled on jetBlue for one trip. They could have omitted this mea culpa letter to me in hopes that I had not already noticed all the media hype around their operational breakdowns and plans to recover from a faulty infrastructure. However, by calling attention to their lapses in such public ways this week, they have insured that I will include them in future travel planning.

Years ago as the President of a software company, I received a truly disturbing email lashing from a client sent after 6 PM on a Friday. The accusations about my company’s service were vitriolic and uncharacteristic of client reactions. I stayed at the office late gathering all the information I could find from the customer support database to learn what might have precipitated the outburst because I wanted to send a thoughtful, accurate and timely response. Without attacking the client I sent a chronology of inquiries and responses with a copy of a remedy sent to them weeks earlier. Then I went home with hopes that Monday would bring a more constructive dialog between the client and my company. The issues were amicably resolved, the client remained a good client, gave us high marks in referrals, and the matter was never mentioned again.

Unfortunately, personalization of client vendor relationships is missing in too many business relationships. A great amount of marketing copy appears describing how software tools and search interfaces support “personalization.” We know that SaS (software as service) or ASP (application service provider) models have come into their own. We also see the major search software vendors posting record growth and grand projections for even more. What this all adds up to is the convergence of a perfect storm of client disappointment as we experience a total disconnect between what vendors mean by “personalization” and “service,” and what customers want. Customers want software that is intuitively simple to personalize, and service that places the responsibility for software problems squarely with the vendor.

Based on my recent experiences with vendors, I see huge industry problems ahead. These are being exposed at all levels: discussions with sales representatives, exchanges with search company executives, deployment of software issues, documentation and training quality, and exchanges with customer support personnel.
Here is my list of vendor weaknesses:

  • Lack of understanding by company representative how their software works
  • Failure to really understand prospect needs, environments, and requirements
  • Poorly written documentation and training giving no context for how the software might be deployed
  • Technically sophisticated features delivered with no coherent path to deployment
  • Inability to communicate honestly with clients
  • Lack of clarity on what industry standards and terminology mean to clients
  • Failure to use their own products by all employees in vendor organizations
  • Inattention to building quality support infrastructures to service clients

I am not calling for a “customer bill of rights” for the enterprise search software industry. Instead, I am calling for you who procure software to take control of your own experience by doing a lot more than looking under the hood for technical specifications, features and functionality. You need to:

  • Look inside the vendor’s organization to see what kind of personnel it has, what the turnover is, how many people are supporting service functions compared to developers, etc.
  • Listen to what you are being told; do serious validating research, on your own, to discover customers using the software. Talk to as many as you find; look at blogs and chat rooms to discover where the pain points and good experiences lie.
  • Read documentation to understand how much time, effort, and expertise the deployment and maintenance will really require.
  • Test drive products with your own data.

Every search company can’t grow 100% year-over-year for years on end. You will be suffering mightily for a long time if you make a big investment in one of those who ignore the customer experience. There is also a good chance they’ll be sold off to the lowest bidder once they realize their inability to service their clients and remain profitable. Take your destiny in your own hands; take enterprise search on in slow and measured increments so you will know what you are getting into.

The “2.0” Qualifier and A Reality Check

Last week’s FASTforward 07 event, sponsored by FAST Search, was a great opportunity to immerse ourselves in search and the state of our collective efforts to solve the knotty problems associated with finding information. (The escape to San Diego during an East Coast winter freeze was an added bonus.)

Much of the official program covered topics “2.0” — Web 2.0, Enterprise 2.0, Search 2.0, Transformation 2.0, Back Office 2.0. Regular readers know that the Gilbane team generally approaches most things “2.0” with skepticism. In the case of its use as a qualifier for the Web, it’s not that we question the potential value of bringing greater participation to Web-based interactions. Rather, it’s that use of the term causes the needle on our hype-o-meter to zip into the red alert zone. This reaction is further aggravated by the trend towards appending “2.0” to other words, sometimes just to make what’s old seem new again. We note, without comment, that O’Reilly Media’s conference in May has been dubbed Where 2.0.

We listened carefully to the 2.0’s being tossed out like Mardi Gras coins at FASTforward last week. One voice that stood out as a great reality check is that of Andrew McAfee, associate professor at Harvard Business School. In his keynote talk, “Enterprise 2.0: The Next Disrupter,” he presented a definition of Enterprise 2.0:

Enterprise 2.0 is the use of emergent social software platforms within companies, or between companies and their partners or customers.

The important word in McAfee’s definition is emergent, which is not the same as emerging. McAfee also outlined the ground rules for an enterprise that can legitimately lay claim to the use of the 2.0 qualifier. Read the FASTforward entries on his blog for his own eloquent summary.

In addition to his talk on Enterprise 2.0, McAfee also participated in a lunch presentation on research conducted by Economist Intelligence Unit on executive awareness of Web 2.0 and in a limited-seating roundtable on 2.0 topics. Both are briefly described on his blog.
In short, McAfee’s work is recommended reading for anyone interested in separating 2.0 market hype from potential business value.

Another highlight of FASTforward for us was keynoter Chris Anderson on “The Long Tail” and the application of Long Tail theories to search and content life cycles. By pure happenstance, the Gilbane team shared a limo to the airport with Anderson. In his day job as editor-in-chief of Wired magazine, he and his staff are experiencing significant levels of frustration with the publishing process — specifically, getting content out of a leading professional publishing tool and into the web content management system. While we found his Long Tail talk interesting, the conversation in the limo reminded us that solving some basic business communication problems is still a challenge. It was a thought-provoking way to end the week.

For more on FASTforward ’07, check out our enterprise search blog.

« Older posts Newer posts »

© 2025 The Gilbane Advisor

Theme by Anders NorenUp ↑