Definitions: January 2007 Archives

Structured search (noun) was rooted firmly in the enterprise when publishers of print index resources (e.g. Chemical Abstracts, Index Medicus from the National Library of Medicine, GRA&I from the National Technical Information Service) became available on-line in the early 1970s. The Systems Development Corporation launched ORBIT developed by a team lead by Carlos Cuadra. Orbit was a command driven search tool accessible to professional searchers. In those days searchers were usually special librarians in corporations, large public libraries, government agencies and major universities. Using the ORBIT command language through a terminal connected by a phone line to remote large computers, librarians would type search commands to find data in specific structured fields. These remote computers held electronic versions of paper indices. Citations resulting from a query for specific chemical compounds, diseases, or government reports, would contain information needed to retrieve articles, patents or books from library shelves.

Corporations spent hundreds of thousands of dollars each year to access external specialized, and structured indices, and the journals, conference proceeding, patents and government documents to which the indices pointed. Hard copy (paper or microform) was the only practical way to read content. Computer screens were not accessible to most researchers and even if they had been, content could not be rendered on them in easily readable forms. Also, until computer storage technologies became cheap, indexing large amounts of text (full-text, or unstructured content) was not affordable.

Even with the advent of graphical interfaces, searching for non-specialists made only minor advances in the early-1980s when library systems offered index browsing to find citations. Library users still needed to read content in hard copy. It was only in the late 1980s and early 90s that full-text content began to be searchable by large numbers of library users on CD-ROMs. Users would go to a library computer, which held multiple CD-ROMs containing journals and other subscriptions, and use a menu to find content on the CD-ROMs by typing keywords that would look through all the content to find matches. This was the first routine use of full-text searching by library users.

These technologies are just memories for a few of us, and unknown to most, but they do point to the differentiation between structured and unstructured searching. Both have been around for a couple of decades but it has taken Web search engines to put search in the hands of everyone. Only recently is frustration with retrieving buckets of unfiltered content pushing enterprises to reconfirm the added value of structured searching.

Technical and business users are appreciating the value of being able to search for a precise title, all documents contributed to a specific project, or all presentations delivered by the CEO in the past two years. Each of these searches requires a defined set of data points, stored with the content and retrievable with a search interface that can support the “structured” query.

Yes, librarians have been here before but, just now, the rest of the organization is learning how they managed to get such good search results all along. Structured searching is now a lot simpler than it was in the 1970s. It is only one aspect in enterprise search but it is an important requirement for most enterprise users when they need reliable and clearly defined search results. And, by the way, Carlos is still around building systems for enterprises to manage and search their critical proprietary content.

Good Will and Responsibility

user-pic
Vote 0 Votes  

If you signed up for feeds from this site, new posts have been slow coming. Gilbane's announcement of an Enterprise Search Practice has not gone unnoticed. The past two weeks have resulted in more good will than this analyst could easily digest and filter. The good news is that ideas for posting on "enterprise search" are already accumulating faster than they can get written, and the number of enthusiastic well-wishers is encouraging. It looks like we have an audience and community of practice in the making. Thank you to all who have sent their support and good cheer.

Quite a number of responses have come from companies who want to discuss their technology offerings and positioning. At Gilbane we are following up on those requests and beginning to schedule time for discussions and presentations. With the recognition that vendors/suppliers of technologies want ink, and plenty of it, comes a responsibility of which I am acutely aware because I was one of that community for over 20 years. Having founded, in 1980, and lead an integrated library automation firm in the corporate arena, I know how industry press coverage can make or break the fortunes of even the best offerings. While blogs are intended to launch and promote discussions, even play devil's advocate, I don't take this role lightly. Every good intention and hard work by vendors deserves thoughtful and unbiased consideration. It deserves to have analysts who know what they are talking about, and those that would present what they can fairly assess in a useful context. The very definition of analyst (noun) supposes a responsible action, to analyze (verb) the offerings. While my analysis may not focus on what a vendor wants me to consider, it will try to present information that is both helpful and thought-provoking without being mean-spirited or dismissive, and content that helps potential users of the technology focus their own choices and decisions.

Now it's time to get down to business and start making this a more frequent happening. Based on a number of comments, let's begin with clarifying what we mean by enterprise search at Gilbane. While the marketplace often categorizes enterprise search as a specific kind of search product, we at Gilbane don’t. Any technology that serves any type of enterprise by helping it find electronic or physical content through an electronic search interface is fair game. Enterprise search is about looking for content in the organization or for the organization. It may be embedded in a specialized application, may be a platform designed to collectively search and aggregate content from many internal silos, or it may combine search of desktops, enterprise hard drives and the Internet. There is a very big universe of content out there; enterprises need all the search tools they can (afford to) leverage to harvest what they need and when they need it.

Now this analyst’s job is to give you a balance between what the vendors are saying and offering, and what the users really need, and get the two engaging more effectively with each other.

NewsShark

Sign-up for our weekly NewsShark newsletter.
Content technology industry news without the hype:

* Email

* First Name

* Last Name

* = Required Field