Yesterday was obviously a big day in the enterprise search space. “Enterprise search”, as opposed to web search news, doesn’t usually make the New York Times, Wall Street Journal Boston Globe etc. We (especially Lynda!) spent a lot of time yesterday just dealing with all the inquiries. Lynda posted her initial thoughts before the analyst call yesterday, as did Steve Arnold. Both will certainly have more to say. In addition to their blogging keep an eye out for the two reports we’ll be publishing this Spring: Enterprise Search Markets and Applications: Capitalizing on Emerging Demand, by Lynda Moulton, and Beyond Search: What to do When You’re Enterprise Search System Doesn’t Work, by Steve Arnold.
Category: Enterprise search & search technology (Page 37 of 61)
Research, analysis, and news about enterprise search and search markets, technologies, practices, and strategies, such as semantic search, intranet collaboration and workplace, ecommerce and other applications.
Before we consolidated our blogs, industry veteran Lynda Moulton authored our popular enterprise search blog. This category includes all her posts and other enterprise search news and analysis. Lynda’s loyal readers can find all of Lynda’s posts collected here.
For older, long form reports, papers, and research on these topics see our Resources page.
Microsoft Corp. (Nasdaq “MSFT”) announced that it will make an offer to acquire Fast Search & Transfer ASA (OSE: “FAST”) through a cash tender offer for 19.00 Norwegian kroner (NOK) per share. This offer represents a 42 percent premium to the closing share price on Jan. 4, 2008 (the last trading day prior to this announcement), and values the fully diluted equity of FAST at 6.6 billion NOK (or approximately $1.2 billion U.S.). FAST’s board of directors has unanimously recommended that its shareholders accept the offer. In addition, shareholders representing in aggregate 37 percent of the outstanding shares, including FAST’s two largest institutional shareholders, Orkla ASA and Hermes Focus Asset Management Europe, have irrevocably undertaken to accept the offer. The transaction is expected to be completed in the second quarter of calendar year 2008. In addition to bolstering Microsoft’s enterprise search efforts, this acquisition increases Microsoft’s research and development presence in Europe, complementing existing research teams in Cambridge, England, and Copenhagen, Denmark, with new capabilities in Norway. http://microsoft.com, http://www.fast.no/
I closed 2007 with some final takeaways from the Gilbane Conference and notes about semantic search. Already we are planning for Gilbane San Francisco and you are invited to participate. There is no question that enterprise search, in all its dimensions, will be a central theme of several sessions at the conference, June 17th through 19th. I will lead with a discussion in which a whole range of search topics, technologies and industry themes will be explored in a session featuring guest Steve Arnold, author of Google Version 2.0, The Calculating Predator. To complement the sessions, numerous search technology vendors will be present in the exhibit hall.
A most important conference component will be a highlight for conference goers, shared-experiences about selecting, implementing and engaging with search tools in the enterprise. Everyone wants to know what everyone else is doing, learning and what they know about enterprise search. You may want to present your experiences or those of your organization. If you are interested, considering presenting, know of a good case study, usability or “lessons learned” from implementing search technology, please raise your hand. You can do this by reaching out through this link to submit a proposal and make reference to the “enterprise search blog call for papers.” You can be sure I’ll follow-up soon to explore the options for you or a colleague to participate. This is a great opportunity to be part of a community of practitioners like you and attend a conference that always has substantive value for participants.
Leave it to Microsoft to end the year with a big announcement and open the next one with an even bigger one. We knew that the world of enterprise search was going to contract in terms of the number of established vendors, even though it is expanding in new and innovative offerings. Microsoft had to make a bold play in an industry where Google has been the biggest player on the WWW stage while reaching deeper into the enterprise, tickling at Microsoft’s decades-old hold on content creation and capture. So, with the acquisition of FAST Search & Transfer, whose technology may not be the best in the enterprise search market but is certainly the most widely deployed at the high-end, Microsoft opens with a direct challenge to its largest competitor.
Boy! Have the emails been flying this morning. At least I know there will be plenty of material to ponder in the next few weeks and months. P.S. Don’t miss the action in San Francisco!
That the Gilbane Group launched its Enterprise Search Practice this year was timely. In 2007 enterprise search become a distinct market force, capped off with Microsoft announcing in November that it has definitively joined the market.
Since Jan. 1, 2007, I have tried to bring attention to those issues that inform buyers and users about search technology. My intent has been to make it easier for those selecting a search tool while helping them to get a highly satisfactory result with minimal surprises. Playing coach and lead champion while clarifying options within enterprise search is a role I embrace. It is fitting then, that I wrap up this year with more insights gained from Gilbane Boston; these were not previously highlighted and relate to semantic search.
The semantic Web is a concept introduced almost ten years ago reflecting a vision of how the Worldwide Web (WWW) would evolve. In the beginning we needed a specific address (URL) to get to individual Web sites. Some of these had their own search engines while others were just pages of content we scrolled through or jumped through from link to link. Internet search engines like Alta Vista and Northern Light searched limited parts of the WWW. Then, Yahoo and Google came to provide much broader coverage of all “free” content. While popular search engines provided various categorizing, taxonomy navigation, keyword and advanced searching options, you had to know the terminology that content pages contained to find what you meant to retrieve. If your terms were not explicitly in the content, pages with synonymous or related meaning were not found. The semantic Web vision was to “understand” your inquiry intent and return meaningful results through its semantic algorithms.
The most recent Gilbane Boston conference featured presentations of commercial applications of various semantic search technologies that are contributing to enterprise search solutions. A few high level points gleaned from speakers on analytic and semantic technologies follow.
- Jordan Frank on blogs and wikis in enterprises articulated how they add context by tying content to people and other information like time. Human commentary is a significant content “contextualizer,” my term, not his.
- Steve Cohen and Matt Kodama co-presented an application using technology (interpretive algorithms integrated with search) to elicit meaning from erratic and linguistically difficult (e.g. Arabic, Chinese) text in the global soup of content.
- Gary Carlson gave us understanding of how subject matter expertise contributes substantively to building terminology frameworks (aka “taxonomies”) that are particularly meaningful within a unique knowledge community.
- Mike Moran helped us see how semantically improved search results can really improve the bottom line in the business sense in both his presentation and later in his blog, a follow-up to a question I posed during the session.
- Colin Britton described the value of semantic search to harvest and correlate data from highly disparate data sources needed to do criminal background checks.
- Kate Noerr explained the use of federating technologies to integrate search results in numerous scenarios, all significant and distinct ways to create semantic order (i.e. meaning) out of search results chaos.
- Bruce Molloy energized the late sessions with his description of how non-techies can create intelligent agents to find and feed colleagues relevant information by searching in the background in ways that go far beyond the typical keyword search.
- Finally, Sean Martin and John Stone co-presented an approach to computational data gathering and integrating the results in an analyzed and insightful format that reveals knowledge about the data, not previously understood.
Points taken are that each example represents a building block of the semantic retrieval framework we will encounter on the Web and within the enterprise. The semantic Web will not magically appear as a finished interface or product but it will become richer in how and what it helps us find. Similar evolutions will happen in the enterprise with a different focus, providing smarter paths for operating within business units.
There is much more to pass along in 2008 and I plan to continue with new topics relating to contextual analysis, the value, use and building of taxonomies, and the variety of applications of enterprise search tools. As for 2007, it’s a wrap.
Following on my last post in which I covered the unique value propositions offered by a variety of enterprise search products, this one takes a look at the evolution of enterprise search. The commentary by search company experts, executives, and analysts indicates some evolutionary technologies and the escalation of certain themes in enterprise search. Furthermore, the pursuit of organizations to strengthen the link between searching technologies and knowledge enablers has never been more prominently featured taking search to a whole new level beyond mere retrieval.
The following paraphrased comments from the Enterprise Search Keynote session are timely and revealing. When I asked, Will Web and Internet Search Technologies Drive the Enterprise (Internal) Search Tool Offerings or Will the Markets Diverge?, these were some thoughts from the panelists.
Matt Brown, Principal Analyst from Forrester Research, commented that enterprise search demands much different and richer content interpretation types of search technologies. What Web-based searching does is create such high visibility for search that enterprises are being primed to adopt it, but only when it comes with enhanced capabilities.
Echoing Matt’s remarks, Oracle search solution manager Bob Bocchino commented on the difficulty of making search operate well within the enterprise because it needs to deal with structured database content and unstructured files, while also applying sophisticated security features that let only authorized viewers see restricted content. Furthermore, security must be deployed in a way that does not degrade performance while supporting continuous updates to content and permissions.
Hadley Reynolds, VP & Director of the Center for Search Innovation at Fast Search & Transfer, noted that the Web isn’t really making a direct impact on enterprise search innovation but many of the social tools found on the Web are being adopted in enterprises to create new kinds of content (e.g. social networks, blogs and wikis) with which enterprise search engines must cope in richer contextual ways.
Don Dodge, Director of Business Development for the Emerging Business Team at Microsoft further noted that the Internet’s biggest problem is scale. That is a much easier problem to solve than in the enterprise where user standards for what qualifies as a good and valuable search results are much higher, therefore making the technology to deliver those results more difficult.
Among the other noteworthy comments in this session was a negative about taxonomies. The gist of it was that they require so much discipline that they might work for a while but can’t really be sustained. If this attitude becomes the norm, many of the semantic search engines which depend on some type of classification and categorization according to industry terminologies or locally maintained lists will be challenged to deliver enhanced search results. This is a subject to be taken up in a later blog entry.
A final conclusion about enterprise search was a remark about the evolution of adoption in the marketplace. Simply put, the marketplace is not monolithic in its requirements. The diversity of demands on search technologies has been a disincentive for vendors to focus on distinct niches and place more effort on areas like e-commerce. This seems to be shifting, especially with all the large software companies now seriously announcing products in the enterprise search market.
IBM (NYSE: IBM) announced new capabilities in its content classification software used to automatically categorize large volumes of enterprise information, making it easier to find, access and use in the context of enterprise content management systems. With its service-oriented architecture-based capabilities, the IBM Classification Module provides connection to the IBM FileNet P8 content management platform to tackle the categorization of unstructured content, especially content stored or arriving in FileNet repositories. It automates the process of determining whether content is important, and how it should be handled. It can also automatically classify previously unmanaged content or reclassify content already under management so it can be leveraged for business purposes such as records management. The IBM Classification Module helps users to determine the right level of automation for their business scenario, providing a balance between automation and oversight through its configurable confidence levels and workflows designed within the classification review interface. This review capability uses the IBM Classification Module’s real-time learning to provide the system with feedback in order to improve accuracy and automatically adapt to changes. IBM Classification Module is currently available from IBM and IBM Business Partners. http://www.ibm.com/software/ecm/classification
It has been a week since the annual Gilbane Boston 2007 Conference closed and I am still searching for the most important message that came out of Enterprise Search and Semantic Web Technology sessions. There were so many interesting case studies that I’ll begin with a search function that illustrates one major enterprise search requirement – aggregation.
Besides illustrating a business case for aggregating disparate content using search, the case studies shared three themes:
- Search is just a starting point for many business processes
- While few very large organizations present all of their organization’s content through a single portal, the technology options to manage such an ideal design are growing and up to supporting entire enterprises
- All systems were implemented and operational for delivering value in less than one year, underscoring the trend toward practical and more out-of-the box solutions
Here is a brief take on what came out of just the first two of seven sessions.
Small-medium solutions:
- Use of ISYS to manipulate search results and function as a back-office data analysis tool for DirectEDGAR, the complete SEC filings, presented by Prof. Burch Kealey of the University of Nebraska. Presentation
- Support for search by serendipity across the shareable content domains of members of a trade association (ARF) by finding results that satisfy the searcher in his pursuit of understanding with Exalead, presented by Alain Heurtebise CEO of Exalead. Presentation
- A knowledge portal enabling rapid and efficient retrieval of the complete technical documentation for field service engineers at Otis Elevator to meet rapid response goals when supporting customers using a customized implementation of dtSearch, presented by project consultant Rob Wiesenberg of Contegra Systems, Inc. Presentation
Large solutions calling for search across multi-million record domains:
- Hosted Vivisimo solution federating over 40 million documents across 22,000 government web sites accessible with search results clustered; it records over a half million page views per day on http://USA.gov and was deployed in 8 weeks, presented by Vivisimo co-founder Jerome Pesenti. Presenation
- Intranet knowledge portal for improving customer services by enabling access to internal knowledge assets (over half a million customer cases with all their associated documents) at USi (an AT&T company) using Endeca, a search product USi had experience deploying and hosting for very large e-commerce catalogs, presented by development leader Toby Ford of USi. With one developer it was running in six months. Presentation
- Within a large law firm (Morrison Foerster) and the legal departments of two multi-national pharmaceutical companies (Pfizer and Novartis), Recommind aggregates and indexes content for numerous internal application repositories, file shares and external content sources for unified search across millions of documents, contributing a direct ROI in saved labor by ensuring that required documents are retrieved in a single search process. Presentation
In each of these cases, content from numerous sources was aggregated through the crawling and indexing algorithms of a particular search engine pointed at a bounded and defined corpus of content, with or without associated metadata to solve a particular business problem. In each case, there were surrounding technologies, human architected design elements, and interfaces to present the search interface and results for a predefined audience. This is what we can expect from search in the coming months and years, deployments to meet specialized enterprise needs, an evolving array of features and tools to leverage search results, and a rapid scaling of capabilities to match the explosion of enterprise content that we all need to find and manipulate to do our jobs.
Next week, I will reconstruct more themes and messages from the conference.
Gilbane Group Inc. announced they have seven research studies underway that will be published over the next few months. The research for some of these studies is already complete, and preliminary results will be discussed at this week’s Gilbane Boston conference at the Westin Copley Place Hotel. The 7 studies are: “Survey on the Web Content Management User Experience” – From our Web Content Management Practice, led by Tony White; “Enterprise Collaboration and Social Computing: A Report on Industry Trends & Best Practices” – From our Social Computing and Collaboration Practice, led by Geoffrey Bock; “Digital Magazine & Newspaper Editions: Growth, Trends, and Best Practices” – From our Cross Media Publishing Strategy & Technology Practice led by Steve Paxhia; “Enterprise Search Markets and Applications: Capitalizing on Emerging Demand” – From our Enterprise Search Practice, led by Lynda Moulton; “Enterprise Digital Rights Management: Business Imperatives and Implementation Readiness” – From our Cross Media Publishing Strategy & Technology Practice, study led by Bill Rosenblatt; “Digital Platforms & Technologies for Book Publishers: Implementations Beyond ‘eBook'” – From our Cross Media Publishing Strategy & Technology Practice led by Steve Paxhia; and “Beyond Search: What to do When you’re Enterprise Search System Doesn’t Work” – A study authored by Steve Arnold, from our Enterprise Search Practice, led by Lynda Moulton. https://gilbane.com