Curated for content, computing, and digital experience professionals

Category: Semantic technologies (Page 35 of 72)

Our coverage of semantic technologies goes back to the early 90s when search engines focused on searching structured data in databases were looking to provide support for searching unstructured or semi-structured data. This early Gilbane Report, Document Query Languages – Why is it so Hard to Ask a Simple Question?, analyses the challenge back then.

Semantic technology is a broad topic that includes all natural language processing, as well as the semantic web, linked data processing, and knowledge graphs.


Enterprise Search: Leveraging and Learning from Web Search and Content Tools

Following on my last post in which I covered the unique value propositions offered by a variety of enterprise search products, this one takes a look at the evolution of enterprise search. The commentary by search company experts, executives, and analysts indicates some evolutionary technologies and the escalation of certain themes in enterprise search. Furthermore, the pursuit of organizations to strengthen the link between searching technologies and knowledge enablers has never been more prominently featured taking search to a whole new level beyond mere retrieval.

The following paraphrased comments from the Enterprise Search Keynote session are timely and revealing. When I asked, Will Web and Internet Search Technologies Drive the Enterprise (Internal) Search Tool Offerings or Will the Markets Diverge?, these were some thoughts from the panelists.

Matt Brown, Principal Analyst from Forrester Research, commented that enterprise search demands much different and richer content interpretation types of search technologies. What Web-based searching does is create such high visibility for search that enterprises are being primed to adopt it, but only when it comes with enhanced capabilities.

Echoing Matt’s remarks, Oracle search solution manager Bob Bocchino commented on the difficulty of making search operate well within the enterprise because it needs to deal with structured database content and unstructured files, while also applying sophisticated security features that let only authorized viewers see restricted content. Furthermore, security must be deployed in a way that does not degrade performance while supporting continuous updates to content and permissions.

Hadley Reynolds, VP & Director of the Center for Search Innovation at Fast Search & Transfer, noted that the Web isn’t really making a direct impact on enterprise search innovation but many of the social tools found on the Web are being adopted in enterprises to create new kinds of content (e.g. social networks, blogs and wikis) with which enterprise search engines must cope in richer contextual ways.

Don Dodge, Director of Business Development for the Emerging Business Team at Microsoft further noted that the Internet’s biggest problem is scale. That is a much easier problem to solve than in the enterprise where user standards for what qualifies as a good and valuable search results are much higher, therefore making the technology to deliver those results more difficult.

Among the other noteworthy comments in this session was a negative about taxonomies. The gist of it was that they require so much discipline that they might work for a while but can’t really be sustained. If this attitude becomes the norm, many of the semantic search engines which depend on some type of classification and categorization according to industry terminologies or locally maintained lists will be challenged to deliver enhanced search results. This is a subject to be taken up in a later blog entry.

A final conclusion about enterprise search was a remark about the evolution of adoption in the marketplace. Simply put, the marketplace is not monolithic in its requirements. The diversity of demands on search technologies has been a disincentive for vendors to focus on distinct niches and place more effort on areas like e-commerce. This seems to be shifting, especially with all the large software companies now seriously announcing products in the enterprise search market.

IBM Boosts Content Classification Software to Streamline Enterprise Content Management

IBM (NYSE: IBM) announced new capabilities in its content classification software used to automatically categorize large volumes of enterprise information, making it easier to find, access and use in the context of enterprise content management systems. With its service-oriented architecture-based capabilities, the IBM Classification Module provides connection to the IBM FileNet P8 content management platform to tackle the categorization of unstructured content, especially content stored or arriving in FileNet repositories. It automates the process of determining whether content is important, and how it should be handled. It can also automatically classify previously unmanaged content or reclassify content already under management so it can be leveraged for business purposes such as records management. The IBM Classification Module helps users to determine the right level of automation for their business scenario, providing a balance between automation and oversight through its configurable confidence levels and workflows designed within the classification review interface. This review capability uses the IBM Classification Module’s real-time learning to provide the system with feedback in order to improve accuracy and automatically adapt to changes. IBM Classification Module is currently available from IBM and IBM Business Partners. http://www.ibm.com/software/ecm/classification

Case Studies: Enterprise Search Success Stories

It has been a week since the annual Gilbane Boston 2007 Conference closed and I am still searching for the most important message that came out of Enterprise Search and Semantic Web Technology sessions. There were so many interesting case studies that I’ll begin with a search function that illustrates one major enterprise search requirement – aggregation.

Besides illustrating a business case for aggregating disparate content using search, the case studies shared three themes:

  • Search is just a starting point for many business processes
  • While few very large organizations present all of their organization’s content through a single portal, the technology options to manage such an ideal design are growing and up to supporting entire enterprises
  • All systems were implemented and operational for delivering value in less than one year, underscoring the trend toward practical and more out-of-the box solutions

Here is a brief take on what came out of just the first two of seven sessions.

Small-medium solutions:

  • Use of ISYS to manipulate search results and function as a back-office data analysis tool for DirectEDGAR, the complete SEC filings, presented by Prof. Burch Kealey of the University of Nebraska. Presentation
  • Support for search by serendipity across the shareable content domains of members of a trade association (ARF) by finding results that satisfy the searcher in his pursuit of understanding with Exalead, presented by Alain Heurtebise CEO of Exalead. Presentation
  • A knowledge portal enabling rapid and efficient retrieval of the complete technical documentation for field service engineers at Otis Elevator to meet rapid response goals when supporting customers using a customized implementation of dtSearch, presented by project consultant Rob Wiesenberg of Contegra Systems, Inc. Presentation

Large solutions calling for search across multi-million record domains:

  • Hosted Vivisimo solution federating over 40 million documents across 22,000 government web sites accessible with search results clustered; it records over a half million page views per day on http://USA.gov and was deployed in 8 weeks, presented by Vivisimo co-founder Jerome Pesenti. Presenation
  • Intranet knowledge portal for improving customer services by enabling access to internal knowledge assets (over half a million customer cases with all their associated documents) at USi (an AT&T company) using Endeca, a search product USi had experience deploying and hosting for very large e-commerce catalogs, presented by development leader Toby Ford of USi. With one developer it was running in six months. Presentation
  • Within a large law firm (Morrison Foerster) and the legal departments of two multi-national pharmaceutical companies (Pfizer and Novartis), Recommind aggregates and indexes content for numerous internal application repositories, file shares and external content sources for unified search across millions of documents, contributing a direct ROI in saved labor by ensuring that required documents are retrieved in a single search process. Presentation

In each of these cases, content from numerous sources was aggregated through the crawling and indexing algorithms of a particular search engine pointed at a bounded and defined corpus of content, with or without associated metadata to solve a particular business problem. In each case, there were surrounding technologies, human architected design elements, and interfaces to present the search interface and results for a predefined audience. This is what we can expect from search in the coming months and years, deployments to meet specialized enterprise needs, an evolving array of features and tools to leverage search results, and a rapid scaling of capabilities to match the explosion of enterprise content that we all need to find and manipulate to do our jobs.

Next week, I will reconstruct more themes and messages from the conference.

Gilbane Group Announces Collection of New Research Studies & Reports

Gilbane Group Inc. announced they have seven research studies underway that will be published over the next few months. The research for some of these studies is already complete, and preliminary results will be discussed at this week’s Gilbane Boston conference at the Westin Copley Place Hotel. The 7 studies are: “Survey on the Web Content Management User Experience” – From our Web Content Management Practice, led by Tony White; “Enterprise Collaboration and Social Computing: A Report on Industry Trends & Best Practices” – From our Social Computing and Collaboration Practice, led by Geoffrey Bock; “Digital Magazine & Newspaper Editions: Growth, Trends, and Best Practices” – From our Cross Media Publishing Strategy & Technology Practice led by Steve Paxhia; “Enterprise Search Markets and Applications: Capitalizing on Emerging Demand” – From our Enterprise Search Practice, led by Lynda Moulton; “Enterprise Digital Rights Management: Business Imperatives and Implementation Readiness” – From our Cross Media Publishing Strategy & Technology Practice, study led by Bill Rosenblatt; “Digital Platforms & Technologies for Book Publishers: Implementations Beyond ‘eBook'” – From our Cross Media Publishing Strategy & Technology Practice led by Steve Paxhia; and “Beyond Search: What to do When you’re Enterprise Search System Doesn’t Work” – A study authored by Steve Arnold, from our Enterprise Search Practice, led by Lynda Moulton. https://gilbane.com

Surrounding and Supporting Enterprise Search

In a week when the KMWorld and Enterprise Search Summit were running concurrently in San Jose, Microsoft made an enterprise search product announcement that was actually a well-kept secret for Microsoft. There was plenty of other new product news floating about the marketplace, too. Mark Logic, MuseGlobal, Cognos, SchemaLogic, and Brainware all had their own announcements.

Between November 6 and November 13, these five companies had interesting news to share. The announcements all related to leveraging enterprise content in tandem with search engines. This underscores a strong trend in software product deployment, specifically, that much of it is being rolled out in partnerships in highly heterogeneous environments. While Microsoft’s announcement about free Search Server 2008

Express establishes them as the last major software company to adopt search as a platform, the other technology announcements remind us that integration activity is a core operational consideration and even a necessity for gaining value from search.

In order to tie all the bits and pieces of content across the enterprise into a tidy bundle for simple retrieval, or in order for content to really bring value to solving business problems, it needs packaging. It needs to be packaged at the front end so that search engines can grab useful context and metadata for smarter indexing. It also needs to be well packaged at the output end to present results meaningfully for a particular audience or purpose.

Here is a quick look at what these five complementary technologies do for search plus a link to each of their latest announcements:

  • Brainware – combines data capture with a content extraction and distillation learning engine for enhancing categorization relevancy in preparation for natural language queries. It will be embedded in search for a leading enterprise library system, Sirsi/Dynix.
  • Cognos – a leading Business Intelligence (BI) software company is being acquired by IBM, whose search products are often paired with Cognos.
  • Mark Logic – is a company with an XML content server platform for managing or converting content in XML formats. They just announced MarkMail, a community-focused searchable message archive service, which stores emails as XML documents. Expect more from them on this front.
  • MuseGlobal – offers solutions that integrate content from multiple search engines. They just announced availability for presenting results in a fully unified and consistent format from multiple search engines in a SharePoint portal interface.
  • SchemaLogic – specializes in content and document type modeling, metadata and vocabulary management using SchemaServer. In the past two weeks they have announced integration with SharePoint to manage metadata. A webinar this week described the interplay with Documentum for document production and retrieval using the FAST search engine.

And what do the other enterprise search vendors have to say about the “surprise” Microsoft announcement? Comments ranged from “we knew it was just a matter of time before they announced” to “good for business, enterprise search is officially now a market.” To the first comment I say, “Not so fast.” For several years rumors have been floated about the imminent acquisition of any number of search companies by MS but nothing materialized. Yes, Microsoft was doing something about enterprise search but until last week “what” was still the question. To the latter I say, “We’ve had an enterprise search market for several years, Microsoft just wanted to be sure it was well established before joining the club.” That was smart of them; let others lay the foundation for a growth industry. It also looks like this is a leveling of the field with Google already playing in Microsoft’s backyard in the free office tools area.

Now the positioning really begins.

Gearing up for Gilbane Boston 2007

I am in a mode of indecision about prioritizing a lot of news in the enterprise search space; it all seems important because we have an agenda focused on search at the upcoming conference on November 27 – 29 in Boston. The following, in no particular order, is not an exhaustive list but representative of happenings in the past month that will surely be the subject of much commentary and discussion by our speakers and panelists:

  • Don Dodge of Microsoft, one of our panelists in the search keynote session, is taking on Google’s customer support positioning as an enterprise solutions provider in his blog.
  • MondoSoft went from being shut down by its investors to being acquired by possibly two companies (one acquiring MondoSearch and the other acquiring Ontolica) and now both are being acquired by one company. Meanwhile, IntelliSearch is making an offer to MondoSearch clients to “switch and save.”
  • Steve Arnold is continuing his drumbeat on Google search patents and their significance.
  • Oracle is getting very serious about search as you will hear in this Webinar download and is positioning itself for a holistic approach to managing content in the enterprise.
  • Fast and Autonomy are making acquisitions, too, and have begun to act like they are not the only search options for the enterprise by re-focusing their marketing.
  • Companies like Connotate, ISYS, Exalead, Recommind, SchemaLogic and Coveo are acquiring good clients and showing their strengths in important niche markets with new enhancements
  • Vivisimo’s social search is getting a lot of positive press, inviting a lot of blogging and has set the bar high for competitors to match their offerings.
  • Endeca continues to expand its staff, client-base and well-engineered product line; they are also building important alliances with technology and business partners when it makes good sense to do so
  • IBM is talking up the potential of semantic search in the enterprise.

Whew!

Most of the above mentioned will be making an appearance or two at GilbaneBoston, as speakers or exhibitors, or both. I am trying to figure out how to make sure the seven sessions on search and semantic Web technologies touch most of the bases but with so much afoot in the search arena, we will be working overtime.

If you are going to be an attendee in any capacity, I hope you’ll blog or make comments when I do. We want to hear what you think and learn about from the experts and users alike. There are sure to be surprises. Your take on the programs will be of interest to many. If you do make the conference, be sure to find me and introduce yourself so we can have a chat.

Trust is a Pact between a Vendor and Its Customer

There is nothing more disappointing to a consultant than to learn that a project in which you gave significant guidance to a client is experiencing a project meltdown…except maybe having everything get off to a positive start only to falter due to problems with the technologies being implemented. I have been burned several times lately and that surprises me because, as a former software vendor myself, I have pretty deep skepticism when it comes to overblown claims and can usually spot the companies I wouldn’t want my clients to trust. This was not one of them.

It is hard to deal with situations that you didn’t consider likely. A big one is a broken promise, even if it is implicit, not explicit. For a vendor to deliver a solid CMS product with a buggy search interface to toggle between keyword and metadata search is one thing. My client spent months getting it to work so that users could seek by keyword or on explicit metadata fields. They rolled it out and it was “OK,” if not great. But after much discussion with the vendor about the bugs, my client was pressured into adopting an upgrade to “solve the problem.” Unfortunately, the upgrade was an experience from hell, but worse was the fact that the old search controls no longer worked and there was no way to search metadata any longer. Having predicated the procurement on being able to search metadata… well, you get the picture.

What happened to the old motto of “first do no harm?” In my world that means you never release an “upgrade” that subtracts functionality. In the words of my client, “we consider this a major regression.” I consider it a serious breach of trust between them and the supplier but also between me and my client. Why would they ever trust my guidance about the solidness of a vendor again? Guess I have my work cut out for me to find some recourse for my client.

On a much more positive note, I will be offering commentary on the subject of trust and technology solutions when I participate in my first Gilbane Webinar with Oracle’s Brian Dirking, Wednesday, October 10th. The title is The Trust Factor: Secure Enterprise Search for High-Value Content and it will include some key considerations when considering your path to a successful search implementation. I’m still optimistic and enthusiastic that you can implement an excellent search solution for your organization if you really chose your strategy, your technology and your business partners carefully and I’m teaming with Oracle to reinforce that message.

It is free, so click on the title to sign up, even if you are in the beginning stages of your quest for a search product. I hope you will join us for the discussion.

New Enterprise Search Engine Specializes In Engineering Documents

Elmo Solutions, experienced in the extraction and processing of CAD and engineering metadata, announced the official launch of Agni Enterprise Search 2008. Agni Enterprise Search is an enterprise search solution designed to meet the distinct needs of CAD, R&D, Engineering and IT managers, end-users and senior managers. Agni Enterprise Search 2008 allows CAD and non-CAD users to index and retrieve a wide range of document formats, including AutoCAD, Autodesk Inventor, Office, eMail, Acrobat, image formats, SolidWorks, Lotus Notes, Visio, etc. Agni Enterprise Search 2008 was designed specifically to maximize document usability and reusability. Agni Enterprise Search allows not only free-form keyword-based searches, but also structured-form metadata-based searches as well. Agni Enterprise Search 2008 features include– Display of document thumbnails, Stemming, Phonic searches, and a query language. The software ships in English and French, and is available as a free, fully functional, 30-day trial license for download directly from the Elmo Website. http://www.elmosolutions.com/

« Older posts Newer posts »

© 2024 The Gilbane Advisor

Theme by Anders NorenUp ↑