IBM (NYSE: IBM) announced new capabilities in its content classification software used to automatically categorize large volumes of enterprise information, making it easier to find, access and use in the context of enterprise content management systems. With its service-oriented architecture-based capabilities, the IBM Classification Module provides connection to the IBM FileNet P8 content management platform to tackle the categorization of unstructured content, especially content stored or arriving in FileNet repositories. It automates the process of determining whether content is important, and how it should be handled. It can also automatically classify previously unmanaged content or reclassify content already under management so it can be leveraged for business purposes such as records management. The IBM Classification Module helps users to determine the right level of automation for their business scenario, providing a balance between automation and oversight through its configurable confidence levels and workflows designed within the classification review interface. This review capability uses the IBM Classification Module’s real-time learning to provide the system with feedback in order to improve accuracy and automatically adapt to changes. IBM Classification Module is currently available from IBM and IBM Business Partners. http://www.ibm.com/software/ecm/classification
Category: Enterprise search & search technology (Page 36 of 60)
Research, analysis, and news about enterprise search and search markets, technologies, practices, and strategies, such as semantic search, intranet collaboration and workplace, ecommerce and other applications.
Before we consolidated our blogs, industry veteran Lynda Moulton authored our popular enterprise search blog. This category includes all her posts and other enterprise search news and analysis. Lynda’s loyal readers can find all of Lynda’s posts collected here.
For older, long form reports, papers, and research on these topics see our Resources page.
It has been a week since the annual Gilbane Boston 2007 Conference closed and I am still searching for the most important message that came out of Enterprise Search and Semantic Web Technology sessions. There were so many interesting case studies that I’ll begin with a search function that illustrates one major enterprise search requirement – aggregation.
Besides illustrating a business case for aggregating disparate content using search, the case studies shared three themes:
- Search is just a starting point for many business processes
- While few very large organizations present all of their organization’s content through a single portal, the technology options to manage such an ideal design are growing and up to supporting entire enterprises
- All systems were implemented and operational for delivering value in less than one year, underscoring the trend toward practical and more out-of-the box solutions
Here is a brief take on what came out of just the first two of seven sessions.
Small-medium solutions:
- Use of ISYS to manipulate search results and function as a back-office data analysis tool for DirectEDGAR, the complete SEC filings, presented by Prof. Burch Kealey of the University of Nebraska. Presentation
- Support for search by serendipity across the shareable content domains of members of a trade association (ARF) by finding results that satisfy the searcher in his pursuit of understanding with Exalead, presented by Alain Heurtebise CEO of Exalead. Presentation
- A knowledge portal enabling rapid and efficient retrieval of the complete technical documentation for field service engineers at Otis Elevator to meet rapid response goals when supporting customers using a customized implementation of dtSearch, presented by project consultant Rob Wiesenberg of Contegra Systems, Inc. Presentation
Large solutions calling for search across multi-million record domains:
- Hosted Vivisimo solution federating over 40 million documents across 22,000 government web sites accessible with search results clustered; it records over a half million page views per day on http://USA.gov and was deployed in 8 weeks, presented by Vivisimo co-founder Jerome Pesenti. Presenation
- Intranet knowledge portal for improving customer services by enabling access to internal knowledge assets (over half a million customer cases with all their associated documents) at USi (an AT&T company) using Endeca, a search product USi had experience deploying and hosting for very large e-commerce catalogs, presented by development leader Toby Ford of USi. With one developer it was running in six months. Presentation
- Within a large law firm (Morrison Foerster) and the legal departments of two multi-national pharmaceutical companies (Pfizer and Novartis), Recommind aggregates and indexes content for numerous internal application repositories, file shares and external content sources for unified search across millions of documents, contributing a direct ROI in saved labor by ensuring that required documents are retrieved in a single search process. Presentation
In each of these cases, content from numerous sources was aggregated through the crawling and indexing algorithms of a particular search engine pointed at a bounded and defined corpus of content, with or without associated metadata to solve a particular business problem. In each case, there were surrounding technologies, human architected design elements, and interfaces to present the search interface and results for a predefined audience. This is what we can expect from search in the coming months and years, deployments to meet specialized enterprise needs, an evolving array of features and tools to leverage search results, and a rapid scaling of capabilities to match the explosion of enterprise content that we all need to find and manipulate to do our jobs.
Next week, I will reconstruct more themes and messages from the conference.
Gilbane Group Inc. announced they have seven research studies underway that will be published over the next few months. The research for some of these studies is already complete, and preliminary results will be discussed at this week’s Gilbane Boston conference at the Westin Copley Place Hotel. The 7 studies are: “Survey on the Web Content Management User Experience” – From our Web Content Management Practice, led by Tony White; “Enterprise Collaboration and Social Computing: A Report on Industry Trends & Best Practices” – From our Social Computing and Collaboration Practice, led by Geoffrey Bock; “Digital Magazine & Newspaper Editions: Growth, Trends, and Best Practices” – From our Cross Media Publishing Strategy & Technology Practice led by Steve Paxhia; “Enterprise Search Markets and Applications: Capitalizing on Emerging Demand” – From our Enterprise Search Practice, led by Lynda Moulton; “Enterprise Digital Rights Management: Business Imperatives and Implementation Readiness” – From our Cross Media Publishing Strategy & Technology Practice, study led by Bill Rosenblatt; “Digital Platforms & Technologies for Book Publishers: Implementations Beyond ‘eBook'” – From our Cross Media Publishing Strategy & Technology Practice led by Steve Paxhia; and “Beyond Search: What to do When you’re Enterprise Search System Doesn’t Work” – A study authored by Steve Arnold, from our Enterprise Search Practice, led by Lynda Moulton. https://gilbane.com
My theme leading into the Gilbane Boston Conference this week comes straight from the headlines and New Hampshire political ads that manage to spill over the border into our fair Commonwealth of Massachusetts. If you live outside of the zone of early caucus and primary states, you are probably spared the ad nauseam recitations of all the crises that Rudy Giuliani has met and conquered. In thinking about our collective longing for a true leader in the White House, I began to reflect on all the other places I would like to see leadership. My musings brought me straight to a message I try to impart to clients and professional colleagues struggling with issues of leveraging knowledge and technology.
True leadership is very hard because it requires thinking, projecting and anticipating. It requires abstract thinking about possibilities for making improvements in complex areas. It requires the ability to mentally juggle huge numbers of variables, many of which the true leader knows he/she can’t possibly control but may be able to foresee as possible complications. It requires bucking the status quo.
Anyone can react, and many can react with reasonably appropriate actions, actions that work for the immediate crisis. However, sizing up an enterprise in which things are running in a seemingly routine fashion, and taking the initiative to systematically seek out lurking crises, potential problems, and areas for improvement, and then applying thoughtful and incremental change activities to ensure better outcomes may seem boring – but this is true leadership.
Finally, think about all the ways in which our political leaders seem to thrive on talking about only the monumental crises of the country and world. Think about how our news is driven by immediate crises. We seem to be conditioned to only react to what we are being shown and told. True leaders are seekers, self-educators, investigators, learners and thinkers. Our best leaders are those who get to the core of our political and business enterprises and find a better way for the whole to work more smoothly, with an ultimate goal of bringing positive good to the members of the community. They succeed though personal diligence, finding the will to persevere while immersing themselves in the mundane and routine operations of their domains. They observe and they think about what they observe; they also talk to others and reflect mindfully on what they hear before acting.
As I prepare my opening remarks for several sessions on enterprise search and semantic technologies at the conference over the next three days, I am pondering how I can stimulate the audience to take the time to open their minds to think about what speakers and exhibitors introduce. I want them to think, really think, about what they are hearing. I want them to develop new ideas, new ways of innovating, new ways to make the mundane better and take it back to their enterprises with a purpose – not just with information to be used in the event of a direct work challenge, demand or crisis. I want to lead others to lead from a thoughtfully critical point of view. So, take a look at technologies from the perspective of action toward systemic improvements instead of a reaction to solving only the latest crisis in your enterprise.
In a week when the KMWorld and Enterprise Search Summit were running concurrently in San Jose, Microsoft made an enterprise search product announcement that was actually a well-kept secret for Microsoft. There was plenty of other new product news floating about the marketplace, too. Mark Logic, MuseGlobal, Cognos, SchemaLogic, and Brainware all had their own announcements.
Between November 6 and November 13, these five companies had interesting news to share. The announcements all related to leveraging enterprise content in tandem with search engines. This underscores a strong trend in software product deployment, specifically, that much of it is being rolled out in partnerships in highly heterogeneous environments. While Microsoft’s announcement about free Search Server 2008
Express establishes them as the last major software company to adopt search as a platform, the other technology announcements remind us that integration activity is a core operational consideration and even a necessity for gaining value from search.
In order to tie all the bits and pieces of content across the enterprise into a tidy bundle for simple retrieval, or in order for content to really bring value to solving business problems, it needs packaging. It needs to be packaged at the front end so that search engines can grab useful context and metadata for smarter indexing. It also needs to be well packaged at the output end to present results meaningfully for a particular audience or purpose.
Here is a quick look at what these five complementary technologies do for search plus a link to each of their latest announcements:
- Brainware – combines data capture with a content extraction and distillation learning engine for enhancing categorization relevancy in preparation for natural language queries. It will be embedded in search for a leading enterprise library system, Sirsi/Dynix.
- Cognos – a leading Business Intelligence (BI) software company is being acquired by IBM, whose search products are often paired with Cognos.
- Mark Logic – is a company with an XML content server platform for managing or converting content in XML formats. They just announced MarkMail, a community-focused searchable message archive service, which stores emails as XML documents. Expect more from them on this front.
- MuseGlobal – offers solutions that integrate content from multiple search engines. They just announced availability for presenting results in a fully unified and consistent format from multiple search engines in a SharePoint portal interface.
- SchemaLogic – specializes in content and document type modeling, metadata and vocabulary management using SchemaServer. In the past two weeks they have announced integration with SharePoint to manage metadata. A webinar this week described the interplay with Documentum for document production and retrieval using the FAST search engine.
And what do the other enterprise search vendors have to say about the “surprise” Microsoft announcement? Comments ranged from “we knew it was just a matter of time before they announced” to “good for business, enterprise search is officially now a market.” To the first comment I say, “Not so fast.” For several years rumors have been floated about the imminent acquisition of any number of search companies by MS but nothing materialized. Yes, Microsoft was doing something about enterprise search but until last week “what” was still the question. To the latter I say, “We’ve had an enterprise search market for several years, Microsoft just wanted to be sure it was well established before joining the club.” That was smart of them; let others lay the foundation for a growth industry. It also looks like this is a leveling of the field with Google already playing in Microsoft’s backyard in the free office tools area.
Now the positioning really begins.
MuseGlobal announced the adoption of its content integration platform by Microsoft’s SharePoint Server 2007 enterprise suite. With Muse technology, SharePoint customers are now able to present local and licensed content to their enterprise users in a controlled and protected environment. The Microsoft Office SharePoint Server is an enterprise portal platform that makes it easy to build and maintain portal sites. Through consolidated access to existing business applications and content, companies can drive consistent performance of common business tasks, and SharePoint’s integrated Web content management capabilities enable people to publish Web content with a content authoring tool and a built-in approval process. MuseGlobal search integration and management systems enable institutions to build search products and services, unifying a wide range of content sources into custom search solutions. http://www.museglobal.com
I am in a mode of indecision about prioritizing a lot of news in the enterprise search space; it all seems important because we have an agenda focused on search at the upcoming conference on November 27 – 29 in Boston. The following, in no particular order, is not an exhaustive list but representative of happenings in the past month that will surely be the subject of much commentary and discussion by our speakers and panelists:
- Don Dodge of Microsoft, one of our panelists in the search keynote session, is taking on Google’s customer support positioning as an enterprise solutions provider in his blog.
- MondoSoft went from being shut down by its investors to being acquired by possibly two companies (one acquiring MondoSearch and the other acquiring Ontolica) and now both are being acquired by one company. Meanwhile, IntelliSearch is making an offer to MondoSearch clients to “switch and save.”
- Steve Arnold is continuing his drumbeat on Google search patents and their significance.
- Oracle is getting very serious about search as you will hear in this Webinar download and is positioning itself for a holistic approach to managing content in the enterprise.
- Fast and Autonomy are making acquisitions, too, and have begun to act like they are not the only search options for the enterprise by re-focusing their marketing.
- Companies like Connotate, ISYS, Exalead, Recommind, SchemaLogic and Coveo are acquiring good clients and showing their strengths in important niche markets with new enhancements
- Vivisimo’s social search is getting a lot of positive press, inviting a lot of blogging and has set the bar high for competitors to match their offerings.
- Endeca continues to expand its staff, client-base and well-engineered product line; they are also building important alliances with technology and business partners when it makes good sense to do so
- IBM is talking up the potential of semantic search in the enterprise.
Whew!
Most of the above mentioned will be making an appearance or two at GilbaneBoston, as speakers or exhibitors, or both. I am trying to figure out how to make sure the seven sessions on search and semantic Web technologies touch most of the bases but with so much afoot in the search arena, we will be working overtime.
If you are going to be an attendee in any capacity, I hope you’ll blog or make comments when I do. We want to hear what you think and learn about from the experts and users alike. There are sure to be surprises. Your take on the programs will be of interest to many. If you do make the conference, be sure to find me and introduce yourself so we can have a chat.
I sat in on a demonstration for an enterprise search engine this week. There were several things that really impressed me. First it was not canned; although the demonstrators were clearly following a script, and members of the audience members were asking questions throughout the demonstration. The audience was a mix of consultants and prospective customers; they had a lot of questions, asked to be shown features, functions and “what ifs.” The person demonstrating was very soon off script and doing nothing more than answering questions and demonstrating completely ad hoc searches and with excellent results.
The second impressive aspect was that the entire corpus of content was company information for the vendor. It was not everything they have on their intranet for this public view but enough to see that this is a company that actually uses its own technology throughout the enterprise for all of its divisions. Content from Lotus Notes, SharePoint, Documentum, an employee gallery, emails and files were all there and presented in a clear format for the audience to see and understand.
Finally, what impressed me was the extent to which their content reflected how their search engine was being used as a platform for doing business and working collaboratively, internally. This caused me to reflect back to a presentation for an analyst group earlier in this year by another search company. That company was rolling out a number of features in the business intelligence (BI) space. So, I asked them a couple of questions, “How are you using these tools to manage your own enterprise, doing business intelligence? How are you exploiting internal content to understand the dynamics of a rapidly growing company?” There was a long silence but the answer was along the lines of, we aren’t but we should probably consider it and it takes time to think through a strategy for deploying a good BI solution.
In my experience, if everyone in a company with an enterprise search technology product does not find a use for and evolve with its own offerings, then there is a missed opportunity and a flawed business strategy. This missed opportunity is the terrific feedback the developers will get from that internal use and the evangelism that can come from enthusiastic employees. The flawed business strategy is the huge disconnect that develops between customers and suppliers when supplier employees just never have a customer experience themselves. Search and BI are two functions that every company can benefit from using; why on earth wouldn’t every search vendor be aggressively seeking opportunities to apply its own tools.
So, buyers be aware – look for evidence that the purveyors of search actually have their own experiences to talk about and demonstrate. That can tell you a whole lot about how you will be treated as a customer because they are one, too.