<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>Enterprise Search Blog</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/" />
    <link rel="self" type="application/atom+xml" href="http://gilbane.com/search_blog/atom.xml" />
    <id>tag:gilbane.com,2008-12-28:/search_blog//49</id>
    <updated>2010-09-02T12:59:39Z</updated>
    <subtitle>Analysis, opinion, and advice on enterprise search
technologies, applications, and practices
   </subtitle>
    <generator uri="http://www.sixapart.com/movabletype/">Movable Type Pro 4.32-en</generator>

<entry>
    <title>Semantically Focused and Building on a Successful Customer Base</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2010/09/semantically_focused_and_building_on_a_successful_customer_base.html" />
    <id>tag:gilbane.com,2010:/search_blog//49.10713</id>

    <published>2010-09-01T17:12:38Z</published>
    <updated>2010-09-02T12:59:39Z</updated>

    <summary>Now, as confidence in and understanding of the technology ramps up, Linguamatics are getting more complex and sophisticated questions from their customers and prospects. This is the exciting part as they are able to sell I2E&apos;s ability to &quot;synthesize new information from millions of sources in ways that humans cannot.&quot; This is done by using the technology to keep track of and processing the voluminous connections among information resources that exceed human mental limits.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Search Technologies and Products" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="lifesciencesindustries" label="Life sciences industries" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="semanticsoftwareapplications" label="Semantic software applications" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>Dr. Phil Hastings and Dr. David Milward spoke with me in June, 2010, as I was completing the Gilbane report, <em>Semantic Software Technologies: A Landscape of High Value Applications for the Enterprise</em>. My interest in a conversation was stimulated by several months of discussions with customers of numerous semantic software companies. Having heard perspectives from early adopters of Linguamatics' I2E and other semantic software applications, I wanted to get some comments from two key officers of Linguamatics about what I heard from the field. Dr. Milward is a founder and CTO, and Dr. Hastings is the Director of Business Development.</p>

<p>A company with sustained profitability for nearly ten years in the enterprise semantic market space has credibility. Reactions from a maturing company to what users have to say are interesting and carry weight in any industry. My lines of inquiry and the commentary from the Linguamatics officers centered around their own view of the market and adoption experiences.</p>

<p>When asked about growth potential for the company outside of pharmaceuticals where Linguamatics already has high adoption and very enthusiastic users, Drs. Milward and Hastings asserted their ongoing <em>principal focus in life sciences</em>. They see a lot more potential in this market space, largely because of the vast amounts of unstructured content being generated, coupled with the very high-value problems that can be solved by text mining and semantically analyzing the data from those documents. Expanding their business further in the life sciences means that they will continue engaging in research projects with the academic community. It also means that Linguamatics semantic technology will be helping organizations solve problems related to healthcare and homeland security.</p>

<p>The wisdom of a measured and consistent approach comes through strongly when speaking with Linguamatics executives. They are highly focused and cite the pitfalls of trying to "do everything at once," which would be the case if they were to pursue all markets overburdened with tons of unstructured content. While pharmaceutical <em>terminology</em>, a critical component of I2E, is complex and extensive, there are many aids to support it. The language of life sciences is in a constant state of being enriched through refinements to published thesauri and ontologies. However, <u>in other industries with less technical language, Linguamatics can still provide important support to analyze content in the detection of signals and patterns of importance to intelligence and planning</u>.</p>

<p>Much of the remainder of the interview centered on what I refer to as the "team competencies" of individuals who identify the need for any semantic software application; those are the people who select, implement and maintain it. When asked if this presents a challenge for Linguamatics or the market in general, Milward and Hastings acknowledged a learning curve and the need for a larger pool of experts for adoption. This is a professional growth opportunity for <em>informatics</em> and<em> library science</em> people. These professionals are often the first group to identify Linguamatics as a potential solutions provider for semantically challenging problems, leading business stakeholders to the company. They are also good advocates for selling the concept to management and explaining the strong benefits of semantic technology when it is applied to elicit value from otherwise under-leveraged content.</p>

<p>One Linguamatics core operating principal came through clearly when talking about the personnel issues of using I2E, which is the necessity of working closely with their customers. This means making sure that expectations about system requirements are correct, examples of deployments and "what the footprint might look like" are given, and best practices for implementations are shared. They want to be sure that their customers have a sense of being in a community of adopters and are not alone in the use of this pioneering technology. Building and sustaining close customer relationships is very important to Linguamatics, and that means an emphasis on services co-equally with selling licenses.</p>

<p>Linguamatics has come a long way since 2001. Besides a steady effort to improve and enhance their technology through regular product releases of I2E, there have been a lot of "show me" and "prove it" moments to which they have responded. Now, as confidence in and understanding of the technology ramps up, they are getting more complex and sophisticated questions from their customers and prospects. This is the exciting part as they are able to sell I2E's ability to "synthesize new information from millions of sources in ways that humans cannot." This is done by using the technology to keep track of and processing the voluminous connections among information resources that exceed human mental limits.</p>

<p>At this stage of growth, with early successes and excellent customer adoption, it was encouraging to hear the enthusiasm of two executives for the evolution of the industry and their opportunities in it.</p>

<p>The Gilbane report and a deep dive on Linguamatics are available through this <a href="http://www.linguamatics.com/welcome/news/press_releases/Semantic_Software_Technologies_Study.html">Press Release</a> on their Web site.<br />
</p>]]>
        
    </content>
</entry>

<entry>
    <title>Semantic Technology: Sharing a Large Market Space</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2010/08/semantic_technology_sharing_a_large_market_space.html" />
    <id>tag:gilbane.com,2010:/search_blog//49.10711</id>

    <published>2010-08-26T14:28:49Z</published>
    <updated>2010-08-26T14:38:05Z</updated>

    <summary>Along the lines of simplicity, they emphasized the specialized nature of most of the successful semantic software applications, noting that these are not coming from the largest software companies. State-of-the-art tools are being commercialized and deployed for highly refined applications out of companies with a small footprint of experienced experts.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Search Technologies and Products" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="expertsystem" label="Expert System" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="semanticsoftwareapplications" label="Semantic software applications" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>It is always interesting to talk shop with the experts in a new technology arena. My interview with Luca Scagliarini, VP of Strategy and Business Development for Expert System, and Brooke Aker, CEO of Expert System USA was no exception. They had been digesting my research on <em>Semantic Software Technologies</em> and last week we had a discussion about what is in the Gilbane report.</p>

<p>When asked if they were surprised by anything in my coverage of the market, the simple answer was "not really, nothing we did not already know." The longer answer related to the presentation of our research illustrating the scope and depth of the marketplace. These two veterans of the semantic industry admitted that the number of players, applications and breadth of semantic software categories is impressive when viewed in one report. Mr. Scagliarini commented on the huge amount of potential still to be explored by vendors and users.</p>

<p>Our conversation then focused on where we think the industry is headed and they emphasized that this is still an early stage and evolving area. Both acknowledged the need for simplification of products to ease their adoption. It must be straightforward for buyers to understand what they are licensing, the value they can expect for the price they pay; implementation, packaging and complementary services need to be easily understood.</p>

<p>Along the lines of simplicity, they emphasized the specialized nature of most of the successful semantic software applications, noting that these are not coming from the largest software companies. State-of-the-art tools are being commercialized and deployed for highly refined applications out of companies with a small footprint of experienced experts.</p>

<p>Expert System knows about the need for expertise in such areas as ontologies, search, and computational linguistic applications. For years they have been cultivating a team of people for their development and support operations. It has not always been easy to find these competencies, especially right out of academia. Aker and Scagliarini pointed out the need for a lot of pragmatism, coupled with subject expertise, to apply semantic tools for optimal business outcomes. It was hard in the early years for them to find people who could leverage their academic research experiences for a corporate mission.</p>

<p>Human resource barriers have eased in recent years as younger people who have grown up with a variety of computing technologies seem to grasp and understand the potential for semantic software tools more quickly.</p>

<p>Expert System itself is gaining traction in large enterprises that have segmented groups within IT that are dedicated to "learning" applications, and formalized ways of experimenting with, testing and evaluating new technologies. When they become experts in tool use, they are much better at proving value and making the right decisions about how and when to apply the software.</p>

<p>Having made good strides in energy, life sciences, manufacturing and homeland security vertical markets, Expert System is expanding its presence with the Cogito product line in other government agencies and publishing. The executives reminded me that they have semantic nets built out in Italian, Arabic and German, as well as English. This is unique among the community of semantic search companies and will position them for some interesting opportunities where other companies cannot perform.</p>

<p>I enjoyed listening and exchanging commentary about the semantic software technology field. However, Expert System and Gilbane both know that the semantic space is complex and they are sharing a varied landscape with a lot of companies competing for a strong position in a young industry. They have a significant share already.</p>

<p>For more about Expert System and the release of this sponsored research you can view their recent <a href="http://www.expertsystem.net/news.asp?idd=1660">Press Release</a>.<br />
</p>]]>
        
    </content>
</entry>

<entry>
    <title>Data Mining for Energy Independence</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2010/08/data_mining_for_energy_independence.html" />
    <id>tag:gilbane.com,2010:/search_blog//49.10706</id>

    <published>2010-08-17T17:28:40Z</published>
    <updated>2010-08-17T17:38:42Z</updated>

    <summary>We have decades of information and knowledge assets in energy related fields (e.g. chemistry, materials science, geology, and engineering) that semantic technologies can leverage to move us toward a future of energy independence. Finding nuggets of old information in unexpected relationships to content from previously disconnected sources is a role for semantic search that can stimulate new ideas and technical research.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Search Problems/Solved Search Problems" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="contentconversions" label="Content conversions" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="datamining" label="Data mining" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="semanticsearch" label="Semantic search" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="textanalytics" label="Text analytics" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>Mining content for facts and information relationships is a focal point of many semantic technologies. Among the text analytics tools are those for mining content in order to process it for further analysis and understanding, and indexing for semantic search. This will move enterprise search to a new level of research possibilities.</p>

<p>Research for a forthcoming Gilbane report on semantic software technologies turned up numerous applications used in the life sciences and publishing. Neither semantic technologies nor text mining are mentioned in this recent article <a href="http://www.nytimes.com/2010/08/13/health/research/13alzheimer.html">Rare Sharing of Data Leads to Progress on Alzheimer's</a> in the <u>New York Times</u> but I am pretty certain that these technologies had some role in enabling scientists to discover new data relationships and synthesize new ideas about Alzheimer's biomarkers. The sheer volume of data from all the referenced data sources demands computational methods to distill and analyze.</p>

<p>One vertical industry poised for potential growth of semantic technologies is the energy field. It is a special interest of mine because it is a topical area in which I worked as a subject indexer and searcher early in my career. Beginning with the 1st energy crisis, oil embargo of the mid-1970s, I worked in research organizations that involved both fossil fuel exploration and production, and alternative energy development.</p>

<p>A hallmark of technical exploratory and discovery work is the time gaps between breakthroughs; there are often significant plateaus between major developments. This happens if research reaches a point that an enabling technology is not available or commercially viable to move to the next milestone of development. I observed that the starting point in the quest for innovative energy technologies often began with decades-old research that stopped before commercialization.</p>

<p>Building on what we have already discovered, invented or learned is one key to success for many "new" breakthroughs. Looking at old research from a new perspective to lower costs or improve efficiency for such things as photovoltaic materials or electrochemical cells (batteries) is what excellent companies do.</p>

<p>How does this relate to semantic software technologies and data mining? We need to begin with content that was generated by research in the last century; much of this is just now being made electronic. Even so, most of the conversion from paper, or micro formats like fîche, is to image formats. In order to make the full transition to enable data mining, content must be further enhanced through optical character recognition (OCR). This will put it into a form that can be semantically parsed, analyzed and explored for facts and new relationships among data elements.</p>

<p>Processing of old materials is neither easy nor inexpensive. There are government agencies, consortia, associations, and partnerships of various types of institutions that often serve as a springboard for making legacy knowledge assets electronically available. A great first step would be having DOE and some energy industry leaders collaborating on this activity.</p>

<p>A future of potential man-made disasters, even when knowledge exists to prevent them, is not a foregone conclusion. Intellectually, we know that energy independence is prudent, economically and socially mandatory for all types of stability. We have decades of information and knowledge assets in energy related fields (e.g. chemistry, materials science, geology, and engineering) that semantic technologies can leverage to move us toward a future of energy independence. Finding nuggets of old information in unexpected relationships to content from previously disconnected sources is a role for semantic search that can stimulate new ideas and technical research.</p>

<p>A beginning is a serious program of content conversion capped off with use of semantic search tools to aid the process of discovery and development. It is high time to put our knowledge to work with state-of-the-art semantic software tools and by committing human and collaborative resources to the effort. Coupling our knowledge assets of the past with the ingenuity of the present we can achieve energy advances using semantic technologies already embraced by the life sciences.<br />
</p>]]>
        
    </content>
</entry>

<entry>
    <title>Leveraging Language in Enterprise Search Deployments</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2010/07/leveraging_language_in_enterprise_search_deployments.html" />
    <id>tag:gilbane.com,2010:/search_blog//49.10680</id>

    <published>2010-07-23T15:44:52Z</published>
    <updated>2010-07-23T15:53:17Z</updated>

    <summary>Without a cornucopia of software tools to build terminology maps and dictionaries, analyze content linguistically in context to elicit meaning, parse and evaluate unstructured text data sources, and manage vocabularies of ever more complex topical domains, semantic search could not exist.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Search Problems/Solved Search Problems" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="enterprisevocabularymanagement" label="Enterprise vocabulary management" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="semanticsearch" label="Semantic search" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="taxonomyforsearch" label="Taxonomy for search" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>It is not news that <em>enterprise search</em> has been relegated to the long list of failed technologies by some. We are at the point where many analysts and business writers have called for a moratorium on the use of the term. Having worked in a number of markets and functional areas (knowledge management/KM, special libraries, and integrated library software systems) that suffered the death knell, even while continuing to exist, I take these pronouncements as a game of sorts.</p>

<p>Yes, we have seen the demise of vinyl phonograph records, cassette tapes and probably soon musical CD albums, but those are explicit devices and formats. When you can't buy or play them any longer, except in a museum or collector's garage, they are pretty dead in the marketplace. This is not true of search in the enterprise, behind the firewall, or wherever it needs to function for business purposes. People have always needed to find "stuff" to do their work. KM methods and processes, special libraries and integrated library systems still exist, even as they were re-labeled for PR and marketing purposes.</p>

<p>What is happening to <em>search</em> in the enterprise is that it is finding its purpose, or more precisely its hundreds of purposes. It is not a monolithic software product, a one-size-fits-all. It comes in dozens of packages, models, and price ranges. It may be embedded in other software or standalone. It may be procured for a point solution to support retrieval of content for one business unit operating in a very narrow topical range, or it may be selected to give access to a broad range of documents that exist in numerous enterprise domains on many subjects. </p>

<p>Large enterprises typically have numerous search solutions in operation, implementation, and testing, all at the same time. They are discovering how to deploy and leverage search systems and they are refining their use cases based on what they learn incrementally through their many implementations. Teams of search experts are typically involved in selecting, deploying and maintaining these applications based on their subject expertise and growing understanding of what various search engines can do and how they operate.</p>

<p>After years of hearing about "the semantic Web," the long sought after "holy grail" of Web search, there is a serious ramping of technology solutions. Most of these applications can also make search more semantically relevant behind the firewall. These technologies have been evolving for decades beginning with so-called <em>artificial intelligence</em>, and now supported by some categories of computational linguistics such as specific algorithms for parsing content and disambiguating terms. A soon to-be released study featuring some of noteworthy applications reveals just how much is being done in enterprises for specific business purposes.</p>

<p>With this "teaser" on what is about to be published, I leave you with one important thought, meaningful search technologies depend on rich linguistically-based technologies. Without a cornucopia of software tools to build terminology maps and dictionaries, analyze content linguistically in context to elicit meaning, parse and evaluate unstructured text data sources, and manage vocabularies of ever more complex topical domains, semantic search could not exist.</p>

<p>Language complexities are challenging and even vexing. Enterprises will be finding solutions to leverage what they know only when they put human resources into play to work with the <em>lingo</em> of their most valuable domains.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Weighing In On The Search Industry With The Enterprise In Mind</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2010/06/weighing_in_on_the_search_industry_with_the_enterprise_in_mind.html" />
    <id>tag:gilbane.com,2010:/search_blog//49.10614</id>

    <published>2010-06-10T13:12:01Z</published>
    <updated>2010-06-10T13:55:56Z</updated>

    <summary>executives in the search industry....picking up important signals. Industry segments as important as search evolve and its appropriate applications in enterprises are still being discovered and proven.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Search Technologies and Products" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="acquisitions" label="acquisitions" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="industry" label="industry" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="mergers" label="mergers" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="search" label="Search" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>Two excellent postings by executives in the search industry give depth to the importance of Dassault Système's acquisition of Exalead. If this were simply a <em>ho-hum</em> failure in a very crowded marketplace, <a href="http://www.kellblog.com/2010/06/09/quick-take-on-the-dassault-acquisition-of-exalead/">Dave Kellogg of Mark Logic </a>Corporation and <a href="http://www.blogger.com/profile/07143606688857386402">Jean Ferré of Sinequa</a> would not care. Instead they are picking up important signals. Industry segments as important as search evolve and its appropriate applications in enterprises are still being discovered and proven. Search may change, as could the label, but whatever it is called it is still something that will be done in enterprises.</p>

<p>This analyst has praise for the industry players who continue to persevere, working to get the packaging, usability, usefulness and business purposes positioned effectively. Jean Ferré is absolutely correct; the nature of the deal underscores the importance of the industry and the vision of the acquirers.</p>

<p>As we segue from a number of conferences featuring search (<a href="http://www.searchenginemeeting.net/2010/">Search Engines</a>, <a href="http://www.enterprisesearchsummit.com/2010/">Enterprise Search Summit</a>, <a href="http://gilbanesf.com/">Gilbane</a>) to broader enterprise technologies (<a href="http://www.e2conf.com/">Enterprise 2.0</a>) and semantic technologies (<a href="http://semtech2010.semanticuniverse.com/">SemTech</a>), it is important for enterprises to examine the interplay among product offerings. Getting the mix of software tools <strong>just right</strong> is probably more important than any one industry-labeled class of software, or any one product. Everybody's software has to play nice in the sandbox to get us to the next level of adoption and productivity.</p>

<p>Here is one analyst cheering the champions of search and looking for continued growth in the industry...but not so big it fails.<br />
</p>]]>
        
    </content>
</entry>

<entry>
    <title>Search Engines - Architecture Meets Adoption</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2010/05/search_engines_-_aarchitecture_meets_adoption.html" />
    <id>tag:gilbane.com,2010:/search_blog//49.10563</id>

    <published>2010-05-06T23:49:24Z</published>
    <updated>2010-05-07T00:03:47Z</updated>

    <summary>&quot; the ubiquitous search box fails because it does not demand context or mechanisms for resolving ambiguity. Obviously, this breaks down adoption for enterprise search when it is the only option offered...&quot; a thought from Skekhar Pradhan at Search Engine Meeting, Boston, April 27, 2010.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Search Technologies and Products" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="naturallanguageprocessingnlp" label="Natural Language Processing (NLP)" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="searchusability" label="Search usability" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="semanticsearch" label="Semantic search" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>Trying to summarize a technology space as varied as that covered in two days at the <a href="http://www.searchenginemeeting.net/2010/">Search Engines Meeting</a> in Boston, April 26-27, is a challenge and opportunity. Avoiding the challenge of trying to represent the full spectrum, I'll stick with the opportunity. Telling you that search is everywhere, in every technology we use and has a multitude of cousins and affiliated companion technologies is important.</p>

<p>The Gilbane Group focuses on content technologies. In its early history this included Web content management, document management, and CMS systems for publishers and enterprises. We now track related technologies expanding to areas including standards like DITA and XML, adoption of social tools, plus rapid growth in the drive to localize and globalize content; Gilbane has kept up with these trends. </p>

<p>My area, search and more specifically "enterprise search" or search "behind the firewall," was added just over three years ago. It seemed logical to give attention to the principal reason for creating, managing and manipulating content, namely finding it. When I pay attention to search engines, I am also thinking about adjoining content technologies. My recent interest is helping readers learn about how technology on both the search side and content management/manipulation side need better context; that means relating the two.</p>

<p>If one theme ran consistently through all the talks at Enterprise Search Meeting, it was the need to define search in relationship to so many other content technologies. The speakers, for the most part, did a fine job of making these connections.</p>

<p><u>Here are just some snippets</u>:<br />
<strong>Bipin Patel </strong>CIO of <em>ProQuest</em>, shared the technology challenges of maintaining a 24/7 service while driving improvements to the search usability interface. The goal is to deliver command line search precision to users who do not have the expertise to (or patience) to construct elaborate queries. Balancing the tension between expert searchers (usually librarians) with everyone else who seeks content underscores the importance of human factors. My take-away: underlying algorithms and architecture are worth little if usability is neglected.</p>

<p><strong>Martin Baumgartel</strong> spoke on the <a href="http://theseus-programm.de/en-us/theseus-basic-technologies/default.aspx">Theseus project</a> for the semantic search marketplace, a European collaborative initiative. An interesting point for me is their use of <a href="http://www.eclipse.org/smila/">SMILA </a>(SeMantic Information Logistics Architecture) from Eclipse. By following some links on the Eclipse site I found this <a href="http://www.eclipse.org/project-slides/SMILA Release Review 0.7.pdf">interesting presentation</a> from the International Theseus Convention in 2009. The application of this framework model underscores the interdependency of many semantically related technologies to improve search.</p>

<p><strong>Tamas Doszkocs</strong> of the <em>National Library of Medicine</em> told a well-annotated story of the decades of search and content enhancement technologies that are evolving to contribute to semantically richer search experiences. His metaphors in the evolutionary process were fun and spot-on at a very practical level: Libraries as knowledge bases > Librarians as search engines > the Web as the knowledge base > Search engines as librarians > moving toward understanding, content, context, and people to bring us semantic search. A <a href="C:\Documents and Settings\LWMoulton\My Documents\Gilbane\Gilbane-Blog\units.sla.org\division\dpht\meetings\spring2010\PHTD2010-Doszkocs.ppt">similar presentation</a> is posted on the Web.</p>

<p><strong>David Evans</strong> noted that there is currently no rigorous evaluation methodology yet for mobile search but is it very different than what we do with desktop search. One slide that I found most interesting was the Human Language Technologies (HLT) that contribute to a richer mobile search experience, essentially numerous semantic tools. Again, this underscores that the challenges of integrating sophisticated hardware, networking and search engine architectures for mobile search are just a piece of the solution. Adoption will depend on tools that enhance content findability and usability.</p>

<p><strong>Jeff Fried</strong> of <em>Microsoft/Fast</em> talked about "social search" and put forth this important theme: that people like to connect to content through other people. He made me recognize how social tools are teaching us that the richness of this experience is a self-reinforcing mechanism toward "the best way to search." It has lessons for enterprises as they struggle to adopt social tools in mindful ways in tandem with improving search experiences.</p>

<p><strong>Shekhar Pradhan</strong> of <em>Docunexus</em> shared this relevant thought about a failure of interface architecture and that is (to paraphrase): the ubiquitous search box fails because it does not demand context or mechanisms for resolving ambiguity. Obviously, this breaks down adoption for enterprise search when it is the only option offered.</p>

<p>Many more talks from this meeting will get rolled up in future reports and blogs.</p>

<p>I want to learn your experiences and observations about semantic search and semantic technologies, as well. Please note that we have posted a brief survey for a short time at: <a href="http://click.icptrack.com/icp/relay.php?r=11669219&msgid=4329112&act=MP0X&c=18079&destination=http%3A%2F%2Fwww.surveymonkey.com%2Fs%2FXP3Z2R8">Semantic Technology Survey</a>. If you have any involvement with semantic technologies, please take it.<br />
</p>]]>
        
    </content>
</entry>

<entry>
    <title>Prove It! - The POC &amp; Other Types of Evaluation for Enterprise Search</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2010/04/prove_it_-_the_poc_other_types_of_evaluation_for_enterprise_search.html" />
    <id>tag:gilbane.com,2010:/search_blog//49.10519</id>

    <published>2010-04-21T13:48:46Z</published>
    <updated>2010-04-21T19:49:55Z</updated>

    <summary>Judging business impact, scope and scaling enterprise search products is a tricky proposition. You absolutely cannot do it by totaling the number of positive checks a vendor ticks off on a spreadsheet of requirements. While such a device can be useful for narrowing down a field of products to those you might select, it is only a beginning. All too often, this is where the selection process ends.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Product Selection" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="enterprisesearchindustry" label="Enterprise search industry" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="productevaluation" label="Product evaluation" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="producttesting" label="Product testing" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="vendorrelations" label="Vendor relations" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>When a product is described as "the only", "the best", the "most complete", "the fastest," "the leading," etc., does anyone actually read and believe these qualifiers in marketing copy, software or otherwise? Do we believe it when an analyst firm writes reviews, or when product hype appears in industry publications?</p>

<p>Most technology buyers have a level of cynicism about such claims and commentary because we know that in each case there is a bias, with good reasons, for the praise. However, also for good reasons, language containing positive sentiment can have an effect - otherwise, it would not be so widespread. At the very least, sentiment analysis tools that are integrated with search engines will pick up on pervasive tones of praise, and from that create new content streams that compound the positive spin.</p>

<p>Being aware of marketing methods and influences on our psyche should arm us with caution but not to a point of being risk averse or frozen to indecisiveness. Instead, we need to find a way to prove the hype and claims through thoughtful, artful and strategic analytical processes. We need methods for testing claims that are appropriate for the solution sought.</p>

<p>First, we need to establish what is appropriate for our business need. Cost is often the primary qualifying factor when narrowing products that will be considered, but this may be short sighted. <u>Business impact</u> and benefits from applying the right solution need to be directly in our line of sight. If the solution you acquire can be evaluated to demonstrate a significant business benefit, the cost of a higher priced product may also be high-value to your business. Add to business impact the <u>scope</u> for the use of an enterprise search engine (how widely deployed and leveraged) and whether it can <u>scale</u> to include multiple searchable repositories across the organization; these attributes may enhance business impact.</p>

<p>Judging business impact, scope and scaling enterprise search products is a tricky proposition. You absolutely cannot do it by totaling the number of positive checks a vendor ticks off on a spreadsheet of requirements. While such a device can be useful for narrowing down a field of products to those you might select, it is only a beginning. All too often, this is where the selection process ends.</p>

<p>What needs to be done next? I recommend these three steps that can be concurrent:<br />
<ul><br />
	<li>Find customers by using social tools; reading and researching. With so many Web-based social and search tools it should be easy to identify individuals and enterprises that are actually users of products you are considering. Reach out, schedule talk time and have a pointed list of questions ready to investigate their experiences - listen carefully and follow up on any comments that sound a note of caution.</li><br />
	<li>Run proofs-of-concept that include serious testing by key users with content that is relevant to the testers. Develop test cases and define explicitly for the testers what they are searching, and what you want to learn.</li><br />
	<li>Keep careful notes throughout your interactions with vendors, as you seek information, test their products and request answers to technical questions. The same goes for the conversations with their customers, the ones you find on your own, not just the ones vendors steer you to. Your inquiry needs to include information about business relationship issues, responsiveness, ease of use, and how well a vendor can understand and respond to your business needs in these early relationship stages.</p>

<p><br />
If things are not going smoothly, how does a vendor react and respond, and what is their follow-up and follow-through in the pre-purchase stage. Never succumb to the excuse that because they are "going through growing pains," have "so much business demand" they are stretched thin, or that something else is more important to them than your product evaluation. If any of these creep in before you purchase, you have a major symptom conveying clearly that your business is not as important or valuable as someone else's.</li><br />
</ul><br />
Longevity of use of an enterprise search application must be foremost in your mind throughout all of these steps. While many enterprises try to plan for upgrading or replacing legacy software applications to remain competitive and current using newer technologies, actual experiences are rarely ideal. You could be "stuck" with your choice for a decade or longer. Being in a dependent relationship with a vendor or product you are not happy with, will be a miserable experience and no benefit to your enterprise, no matter how popular the product is in the industry press.</p>

<p><br />
The steps for selection will take a little longer than just sending out RFPs and reading responses, but it is really worth it over the long haul relationship you are about to engage.<br />
</p>]]>
        
    </content>
</entry>

<entry>
    <title>Forecasting Software Product Abandonment</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2010/03/forecasting_software_product_abandonment.html" />
    <id>tag:gilbane.com,2010:/search_blog//49.10409</id>

    <published>2010-03-02T14:45:32Z</published>
    <updated>2010-03-02T14:58:48Z</updated>

    <summary>The bottom line is that customers who depend on technology of any kind, for keeping their own businesses running effectively and efficiently, must be aware of what is transpiring with their most valued vendor/suppliers.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Product Selection" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="microsoft" label="Microsoft" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="searchindustry" label="Search industry" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="softwareprocurement" label="Software procurement" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>Given <a href="http://www.pcworld.com/businesscenter/article/188659/microsoft_to_drop_linux_unix_versions_of_enterprise_search.html">the announcement</a> from Microsoft that it would make 2010 releases of Fast on Linux and UNIX the last for these operating systems, a lot of related comments have appeared over the past few weeks. For those of us who listened intently to early commentary on the Fast acquisition by Microsoft about its high level of commitment to dual development tracks, it only confirms what many analysts suspected would happen. Buyers rarely embrace their technology acquisitions solely (or even primarily) for the technology.</p>

<p>While these 2010 releases of Fast ESP on UNIX and Linux will continue to be supported for ten years, and repositories are projected to be indexable on these two platforms by future Fast releases, some customers will opt out of continuing with Fast. As newer and more advanced search technologies support preferred operating systems, they will choose to move. Microsoft probably expects to retain most current customers for the time being - inertia and long evaluation and selection processes by enterprises are on their side.</p>

<p>This recent announcement did include a small aside questioning whether Microsoft would continue to offer a standalone search engine outside of its SharePoint environment where the Fast product has been embedded and leveraged first. It sounds like the short term plan is to continue with standalone ESP, but certainly no long term commitment is made.</p>

<p>So, whatever stasis/constancy pre-Microsoft Fast customers were feeling sanguine about, it is surely being shaken around. Let's take a look at some reasons that vendors abandon their acquisitions. First we need to consider why companies add products through acquisition in the first place. A simple list looks like this:</p>

<ol>
	<li>Flat sales</li>
	<li>Need to penetrate a growth market or industry</li>
	<li>Desire to demonstrate strength to its existing customer base by acquiring a high-end brand name</li>
	<li>Need for technology, IP, and expertise</li>
	<li>Desire to expand the customer base, quickly</li>
</ol>

<p>While item 1 probably was not a contributor to the Microsoft Fast acquisition, 2 and 3 certainly factored into their plan. Fast was "the" brand and had become synonymous in the marketplace with "enterprise search leader." Surely Microsoft considered the technology IP and key employees that they would be acquiring, and having a ready-made customer-base and maintenance revenue stream would be considerations, too.</p>

<p>Customers do have reasons to be nervous in any of these big acquisitions, however. Here is what often get exposed and revealed once the onion is peeled:</p>

<ul>
	<li>Game changing technology is playing havoc in the marketplace; in search there are numerous smaller players with terrific technologies, more nimble and innovative development teams with rigorous code control mentalities, and the experience of having looked at gaps in older search technologies.</li>
	<li>Cost of supporting multiple code bases is enormous, so the effort of developing native support on multiple platforms becomes onerous.</li>
	<li>For any technology, loss of technical gurus (particularly when there has been a culture of loose IP control, poor capture of know-how, and limited documentation) will quickly drive a serious reality check as the acquirer strives to understand what it has bought.</li>
	<li>Brand name customers may not stick around to find out what is going to happen, particularly if the product was on the path to being replaced anyway. Legacy software may be in place because it is irreplaceable or simply due to the neglect of enterprises using it. It may be very hard for the acquiring buyer to determine which situation is the case. A change of product ownership may be just the excuse that some customers need to seek something better. Customers understand the small probability of having a quick and smooth integration of a just-acquired product into the product mix of a large, mature organization.</li>
	<li>A highly diverse customer base, in many vertical markets, with numerous license configuration requirements for hardware and operating system infrastructures will be a nightmare to support for a company that has always standardized on one platform. Providing customer support for a variety of installation, tuning and on-going administration differences is just not sustainable without a lot of advance planning and staffing.</li>
</ul>

<p>The Microsoft/Fast circumstance is just an illustration. You might take a look at what is also going on with SAP after its acquisition of Business Objects (BO) in this <a href="http://analytics.informationweek.com/abstract/7/2275/Enterprise-Software/sap-at-a-crossroads.html">lengthy analysis</a> at <u>Information Week</u>. In this unfortunate upheaval, BO's prior acquisition of Inxight has been a particular loss to those who had embraced that fine analytics and visualization engine.</p>

<p>The bottom line is that customers who depend on technology of any kind, for keeping their own businesses running effectively and efficiently, must be aware of what is transpiring with their most valued vendor/suppliers. When there is any changing of the guardians of best-of-breed software tools, be prepared by becoming knowledgeable about what is really under the skin of the newly harvested onion. Then, make your own plans accordingly.</p>]]>
        
    </content>
</entry>

<entry>
    <title>What is the Price and What is the Cost?</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2010/02/what_is_the_price_and_what_is_the_cost.html" />
    <id>tag:gilbane.com,2010:/search_blog//49.10359</id>

    <published>2010-02-02T19:53:08Z</published>
    <updated>2010-02-02T20:07:49Z</updated>

    <summary>Net sales of any company that is large is a significant determinant of its reputation and potential staying power in its industry. However, when actual sales for a search product line are a tiny fraction of total company revenue, potential buyers of enterprise search need to know that and factor it into their decision-making for these reasons...</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Product Selection" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="enterprisesearchindustry" label="Enterprise search industry" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="enterprisesoftwarepricing" label="Enterprise software pricing" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="softwareprocurement" label="Software procurement" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>Enterprise software pricing runs the gamut from nominal to 100s of thousands of dollars. Unless software for enterprise search reaches a commodity status with a defined baseline of functional specifications, the marketplace will continue to be confused and highly segmented.</p>

<p>What buyers need to do first is to stop limiting their procurement selection choices based primarily on license prices. When enterprises begin their selection by considering prices first, many options are eliminated that may be functionally more appropriate and for which the total cost of ownership may be even less.</p>

<p>Product pricing correlates more to the market domain in which a vendor sells or aims to sell than to actual product value per installed user. Therefore, companies in the small to mid-range are particularly vulnerable to unreasonable licensing. I have <a href="http://gilbane.com/search_blog/2008/10/what_determines_a_leader_in_th.html">written about this before</a> but it bears repeating, the strength of the underlying technology has little to do with the price but can influence the total-cost-of-ownership (TCO) dramatically.</p>

<p>Buyers often believe high license price relates to top product value; in general you still need to add another 60-80% for services and support costs to get that value out. But let's look at the business reality and corporate context for sellers of high-priced enterprise search.</p>

<p>Net sales of any company that is large is a significant determinant of its reputation and potential staying power in its industry. However, when actual sales for a search product line are a tiny fraction of total company revenue, potential buyers of enterprise search need to know that and factor it into their decision-making for these reasons:</p>

<p>•	The largest software companies are heavily vested in subscribing to analyst services that write about the industry. They are diligent in reporting their sales figures to those companies and publications that do annual surveys on various industry segments. The reporting is usually careful to note when revenues for a particular sector ( like search) <u>are not broken out</u>, but this often escapes the notice of buyers who only see that company X has enormous revenues compared to others. This leaves the impression that they are also a standout in the search sector.<br />
•	The fact that a company offers many software products, of which search is only one, has often resulted from acquisition of a lot of products. Search may only be in the mix because it complements other products. The company may or may not have actually retained the technology gurus who originally designed, developed and supported the software. A lot of software quickly becomes <u>stale</u> once acquired by a third-party.<br />
•	When a very large company offers many products, it focuses sales, account management, support and development on those with the largest revenue stream or growth potential. Marketing for marginal products may be sustained for a longer period to bring in "easy" business but unfortunately, for too long, search has been treated as a loss leader to attract revenues for other product lines. Where "search" fits into a mix of products, how well it will be serviced and supported over time may be difficult to discern. <br />
•	The final situation that happens for very large software companies is that competition is an ever-present cause for shifting agendas. The largest software firms will often abandon technologies whose architecture, unique functions and even their customers do not fit their changing market interests. They will abandon products for which they have paid huge sums once the initial value of the procurement has been realized, when a product's technology has been captured for embedding in other product suites, or if the product is no longer viewed as strategic.</p>

<p>In the next blog posting we'll take a look at some other reasons that vendors make and then abandon their acquisitions. But in the meantime, here is a recommendation to buying decision-makers:</p>

<p>When you see a very long list of customer logos on the web sites of major software vendors there is important context that is not provided. Large corporations can and do buy competing products all the time. Some products get into enterprise-wide use and adoption for the long term while others are used briefly or in smaller applications. You can't know whether a product is even in use in the company whose logo is displayed. Because it is almost impossible for an outsider to find the actual buyer/user of a product in a large enterprise; the posted logos tell you little. Inside an enterprise one may discover endless tales of when, why and how competing products were acquired, many as part of package deals or through a subsidiary acquisition. What is also true is that stories of successful implementations or brand loyalty do not abound.</p>

<p>For you who are new to enterprise search, take control of your own destiny by educating yourself using a lower priced product with a good reputation for a niche application. Invest your budget instead in human resources (internal or 3rd party) to craft the solution you really need. Start with a vision of appropriate scale, tackling a small domain of high value content that is currently hard to find in your organization.</p>

<p>Use the experience of implementing and leveraging this search product and engaging with the vendor to bring a deeper understanding of the technology and applications of search. Working with a vendor dedicated exclusively to search will have another cost benefit because of the focused attention you are more likely to receive. Delving deeply into planning and implementation for a targeted result will have a cost that brings multiple benefits moving forward to larger and more complex implementations - even if you move on to another product.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Search Industry in 2010</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2010/01/search_industry_in_2010.html" />
    <id>tag:gilbane.com,2010:/search_blog//49.10232</id>

    <published>2010-01-13T16:55:41Z</published>
    <updated>2010-01-15T12:50:11Z</updated>

    <summary>All of this leads me to think that, since enterprise search has gotten such a bad reputation as a failed technology, the big software houses are going to bury it in point solutions. Personally, I believe that enterprise search is a failed strategy... </summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Search Technologies and Products" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="enterpriseapplications" label="Enterprise applications" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="enterprisesearchindustry" label="Enterprise search industry" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="integratingtechnologies" label="Integrating technologies" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>Just in from <u>Information Week</u> is this article (<a href="http://www.informationweek.com/news/software/systems_management/showArticle.jhtml?articleID=222300627&cid=nl_IW_daily_2010-01-13_h">Exclusive: IBM Reorganizes Software Group </a>) that prompted me to launch 2010 with some thoughts on where we are heading with enterprise search this year. When IBM does something dramatic it impacts the industry because it makes others react.</p>

<p>I don't make forecasts or try to guess whether strategic changes will succeed or fail but a couple of years ago, I blogged on IBM's introduction of Yahoo OmniFind, a free offering and then followed up with <a href="http://gilbane.com/search_blog/2009/09/competition_among_search_vendors.html">these comments</a> just a few months ago. IBM makes their competitors change, try to outsmart, outguess, or copy, just as Microsoft or Google changes cause ripples in the industry.</p>

<p>Meanwhile, OpenText, another large software company with search offerings, is <a href="http://www.cmswatch.com/Trends/1765-Open-Text-Enterprise-Search">not going to offer search outside of its other product suites</a>. [More is likely to come out after the scheduled analyst meetings today but I'm not there and can't brief you on deeper intent.] We have recently seen an <a href="http://blogs.msdn.com/enterprisesearch/archive/2009/10/28/fast-meets-sharepoint-what-s-coming-in-search-for-sharepoint-2010.aspx">announcement</a> about FAST being delivered with new SharePoint offerings, the first major release of FAST announced since Microsoft acquired them almost two years ago. While FAST is still available as a standalone product from MS, it and other search engines may be steadily moving into being embedded in suites by their acquirers. </p>

<p>Certainly IBM has a lot of search components that they have acquired, so continuing to bind with other content offerings is a probable strategy. Oracle and Autonomy may soon come up with similar suite offerings embedding search once again. Oracle SES (Secure Enterprise Search) does not appear to have a lot of traction and it's possible that supporting pure search offerings may be a burden for Autonomy with its stable of many acquired content products.</p>

<p>All of this leads me to think that, since enterprise search has gotten such a bad reputation as a <u>failed technology</u>, the big software houses are going to bury it in point solutions. Personally, I believe that enterprise search is a <u>failed strategy</u> and SMBs can still find search engines that will serve the majority of their enterprise needs for several years to come. The same holds true for divisions or groups within large corporations. </p>

<p>Guidance: select and adopt one or more search solutions that fit your budget for small scale needs, point solutions and enterprise content that everyone in the organization needs to access on a regular basis. Learn how these products work, what they can and cannot deliver, making incremental adjustments as needs change and evolve. Do not install and think you are done because you will never be done. Cultivate a few search experts to stick with the evolving landscape and give them the means to keep up with changes in the search landscape. It is going to keep morphing for a long time to come.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Layering Technologies to Support the Enterprise with Semantic Search</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2009/11/layering_technologies_to_support_the_enterprise_with_semantic_search.html" />
    <id>tag:gilbane.com,2009:/search_blog//49.10190</id>

    <published>2009-11-30T20:53:20Z</published>
    <updated>2009-11-30T23:28:01Z</updated>

    <summary>Teaming by vendors, each with a solution to one dimension of a need, create compound product offerings that are adding up to a very large semantic search marketplace.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Search Technologies and Products" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="productofferings" label="Product offerings" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="searchindustry" label="Search industry" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="semanticsearch" label="Semantic search" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>Semantic search is a composite beast like many enterprise software applications. Most packages are made up of multiple technology components and often from multiple vendors. This raises some interesting thoughts as we prepare for <a href="http://gilbaneboston.com/">Gilbane Boston 2009</a> to be held this week.</p>

<p>As part of a panel on semantic search, moderated by Hadley Reynolds of IDC, with Jeff Fried of Microsoft and Chris Lamb of the OpenCalais Initiative at Thomson Reuters, I wanted to give a high level view of semantic technologies currently in the marketplace. I contacted about a dozen vendors and selected six to highlight for the variety of semantic search offerings and business models. </p>

<p>One case study involves three vendors, each with a piece of the ultimate, customer-facing, product. My research took me to one company that I had reviewed a couple of years ago, and they sent me to their "customer" and to the customer's customer. It took me a couple of conversations and emails to sort out the connections; in the end the relationships made perfect sense.</p>

<p>On one hand we have conglomerate software companies offering "solutions" to every imaginable enterprise business need. On the other, we see very unique, specialized point solutions to universal business problems with multiple dimensions and twists. Teaming by vendors, each with a solution to one dimension of a need, create compound product offerings that are adding up to a very large semantic search marketplace.</p>

<p>Consider an example of data gathering by a professional services firm. Let's assume that my company has tens of thousands of documents collected in the course of research for many clients over many years. Researchers may move on to greater responsibility or other firms, leaving content unorganized except around confidential work for individual clients. We now want to exploit this corpus of content to create new products or services for various vertical markets. To understand what we have, we need to mine the content for themes and concepts. </p>

<p>The product of the mining exercise may have multiple uses: help us create a taxonomy of controlled terms, preparing a navigation scheme for a content portal, providing a feed to some business or text analytics tools that will help us create visual objects reflecting various configurations of content. A text mining vendor may be great at the mining aspect while other firms have better tools for analyzing, organizing and re-shaping the output. </p>

<p>Doing business with two or three vendors, experts in their own niches, may help us reach a conclusion about what to do with our information-rich pile of documents much faster. A multi-faceted approach can be a good way to bring a product or service to market more quickly than if we struggle with generic products from just one company. </p>

<p>When partners each have something of value to contribute, together they offer the benefits of the best of all options. This results in a new problem for businesses looking for the best in each area, namely, vendor relationship management. But it also saves organizations from dealing with huge firms offering many acquired products that have to be managed through a single point of contact, a generalist in everything and a specialist in nothing. Either way, you have to manage the players and how the components are going to work for you.</p>

<p>I really like what I see, semantic technology companies partnering with each other to give good-to-great solutions for all kinds of innovative applications. By the way, at the conference I am doing a quick snapshot on each: <a href="http://www.expertsystem.net/page.asp?id=1521&idd=18">Cogito</a>, <a href="http://www.connotate.com/">Connotate</a> (with <a href="http://www.cormineid.com/">Cormine</a> and <a href="http://www.cormineid.com/worldtech_case_study">WorldTech</a>), <a href="http://www.lexalytics.com/lexalytics-home/">Lexalytics</a>, <a href="http://www.lingumatics.com/">Linguamatics</a>, <a href="http://www.sinequa.com/">Sinequa </a>and <a href="http://www.temis.com/">TEMIS</a>.<br />
</p>]]>
        
    </content>
</entry>

<entry>
    <title>Where and How Can You Look for Good Enterprise Search Interface Design?</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2009/10/where_and_how_can_you_look_for_good_enterprise_search_interface_design.html" />
    <id>tag:gilbane.com,2009:/search_blog//49.10124</id>

    <published>2009-10-29T19:08:31Z</published>
    <updated>2009-10-29T18:32:46Z</updated>

    <summary>Frequently, enterprise stakeholders will reference a commercial web site they like or even search tools within social sites. These are a great starting point for a designer to explore. It makes a lot of sense to visit scores of sites that are publicly accessible or sites where you have an account and navigate around to see how they handle various design elements.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Search Problems/Solved Search Problems" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="enterpriseapplications" label="Enterprise applications" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="searchinterfacedesign" label="Search interface design" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>Designing an enterprise search interface that employees will use on their intranet is challenging in any circumstance. But starting from nothing more than verbal comments or even a written specification is really hard. However, conversations about what is needed and wanted are informative because they can be aggregated to form the basis for the overarching design.</p><p>Frequently, enterprise stakeholders will reference a commercial web site they like or even search tools within social sites. These are a great starting point for a designer to explore. It makes a lot of sense to visit scores of sites that are publicly accessible or sites where you have an account and navigate around to see how they handle various design elements.</p>  <p>To start, look at:</p><ul><li>How easy is it to find a search box?</li><li>Is there an option to do advanced searches (Boolean or parametric searching)?</li><li>Is there a navigation option to traverse a taxonomy of terms?</li><li>Is there a &quot;help&quot; option with relevant examples for doing different kinds of searches?</li><li>What happens when you search for a word that has several spellings or synonyms, a phrase (with or without quotes), a phrase with the word <em>and</em> in it, a numeral, or a date?</li><li>How are results displayed: what information is included, what is the order of the results and can you change them? Can you manipulate results or search within the set?</li><li>Is the interface uncluttered and easily understood?</li></ul><p>The point of this list of questions is that you can use it to build a set of criteria for designing what your enterprise will use and adopt, enthusiastically. But this is only a beginning. By actually visiting many sites outside your enterprise, you will find features that you never thought to include or aggravations that you will surely want to avoid. From these experiences on external sites, you can build up a good list of what is important to include or banish from your design.</p><p>When you find sites that you think are exemplary, ask key stakeholders to visit them and give you their feedback, preferences and dislikes. Particularly, you want to note what confuses them or enthusiastic comments about what excites them.</p>  <p>This post originated because several press notices in the past month brought to my attention Web applications that have sophisticated and very specialized search applications. I think they can provide terrific ideas for the enterprise search design team and also be used to demonstrate to your internal users just what is possible.</p>  <p>Check out these applications and articles: on <a href="http://www.deskeng.com/virtual_desktop/?p=500">KNovel</a>, particularly this <a href="http://www.knovel.com/web/portal/browse">KNovel page</a>;&nbsp;<a href="http://www.thomasnet.com/">ThomasNet</a>; EBSCOHost mentioned in <a href="http://www.allbusiness.com/technology/software-services-applications-search-engines/11745552-1.html">this article</a> about the &quot;deep Web.&quot;. All these applications reveal superior search capabilities, have long track records, and are already used by enterprises every day. Because they are already successful in the enterprise, some by subscription, they are worth a second look as examples of how to approach your enterprise's search interface design.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Meta Tags and Trusted  Resources  in the Enterprise</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2009/10/meta_tags_and_trusted_resources_in_the_enterprise.html" />
    <id>tag:gilbane.com,2009:/search_blog//49.10103</id>

    <published>2009-10-21T13:31:57Z</published>
    <updated>2009-10-21T13:46:16Z</updated>

    <summary>Working with the language of the enterprise audience that relies on finding critical content to do their jobs, a meta tagger will bring out topical language known to be the lingua franca of the dominant searchers as well as the language that will be used by novice employee searchers. The key here is to recognize that in any specific piece of content its &quot;aboutness&quot; may never be explicitly spelled out in terminology by the author.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Search Problems/Solved Search Problems" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="metadata" label="Metadata" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="tagging" label="Tagging" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>A <a href="http://googlewebmastercentral.blogspot.com/2009/09/google-does-not-use-keywords-meta-tag.html">recent article</a> about how Google Internet search <u>does not</u> use <i>meta tags</i> to find relevant content got me thinking about a couple of things.</p>  <p>First it explains why none of the articles I write for this blog about <i>enterprise search</i> appear in Google alerts for &ldquo;enterprise search.&rdquo; Besides being a personal annoyance, easily resolved if I invested in some Internet search optimization, it may explain why meta tagging is a hard sell behind the firewall.</p>  <p>I do know something about getting relevant content to show up in enterprise search systems and it does depend on a layer of what I call &ldquo;value-added metadata&rdquo; by someone who knows the subject matter in target content and the audience. Working with the language of the enterprise audience that relies on finding critical content to do their jobs, a meta tagger will bring out topical language known to be the <i>lingua franca</i> of the dominant searchers as well as the language that will be used by novice employee searchers. The key here is to recognize that in any specific piece of content its &ldquo;aboutness&rdquo; may never be explicitly spelled out in terminology by the author.</p>  <p>In one example, let&rsquo;s consider some fundamental HR information about &ldquo;holiday pay&rdquo; or &ldquo;compensation for holidays&rdquo; or &ldquo;compensation for time-off.&rdquo; The strings in quotes were used throughout documents on the intranet of one organization where I consulted. When some complained about not being able to find this information using the company search system, my review of <i>search logs </i>showed a very large number of searches for &ldquo;vacation pay&rdquo; and almost no searches for &ldquo;compensation&rdquo; or &ldquo;holidays&rdquo; or &ldquo;time off.&rdquo; Thus, there was no way that using the search engine employees would stumble upon the useful information they are seeking &ndash; <b>unless</b>, meta tags make &ldquo;vacation pay&rdquo; a retrievable index pointer to these documents. The tagger would have analyzed the search logs, seen the high number of searches for that phrase and realized that it was needed as a meta tag.</p>  <p>Now, back to Google&rsquo;s position on ignoring meta tags because writers and marketing managers were &ldquo;gaming the system.&rdquo; They were adding tags they thought would be popular to get people to look at content not related but for which they were seeking a huge audience.</p>  <p>I have heard the concern that people within enterprises might also hijack the usefulness of content they were posting in blogs or wikis to get more &ldquo;eyeballs&rdquo; in the organization. This is a foolish concern, in my opinion. First I have never seen evidence that this happens and don&rsquo;t believe that any productive enterprise has people engaging in this obvious foolishness.</p>  <p>More importantly, professional growth and success depends on the perceptions of others, their belief in you and your work, and the value of your ideas. If an employee is so foolish as to misdirect fellow employees to useless or irrelevant content, he is not likely to gain or keep the respect of his peers and superiors. In the long run persistent, misleading or mischievous meta tagging will have just the opposite effect, creating a pathway to the door.</p>  <p>Conversely, the super meta tagger with astute insights into what people are looking for and how they are most likely to look for it, will be the valued expert we all need to care for and spoon feed us our daily content. Trusted resources rise to the top when they are appropriately tagged and become bedrock content when revealed through enterprise search on well-managed intranets.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Competition among Search Vendors</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2009/09/competition_among_search_vendors.html" />
    <id>tag:gilbane.com,2009:/search_blog//49.10031</id>

    <published>2009-09-30T13:26:43Z</published>
    <updated>2009-09-30T13:39:01Z</updated>

    <summary>Once enterprises get immersed in a complex implementation (and search done well does require that) they won&apos;t budge for a long, long time, even if the solution is less than optimal. By the time they are compelled to upgrade they are usually so wedded to their vendor that they will accept any reasonable offer to upgrade that the vendor offers. Seeking competitive options is really difficult for most enterprises to pursue without an overwhelmingly compelling reason.</summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Product Selection" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="competitivepositioning" label="Competitive positioning" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="enterprisesearchindustry" label="Enterprise search industry" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>Is there any real competition when it comes to enterprise search? Articles like this one in <u>ComputerWorld</u> make good points but also foster the idea that this could be a differentiator for buyers: <a href="http://www.computerworld.com/s/article/9136248/Yahoo_deal_puts_IBM_Microsoft_in_enterprise_search_pickle?taxonomyId=11">Yahoo deal puts IBM, Microsoft in enterprise search pickle</a>, by Juan Carlos Perez, August 4, 2009.</p>

<p>I wrote about the <a href="http://gilbane.com/search_blog/2007/01/the_right_message_for_the_smal.html">IBM launch of the OmniFind suite</a> of search products a couple of years ago with positive comments. The reality ended up being quite different as I noted later. Among the negatives were three that stand out in my mind. First, free (as in the <em>IBM OmniFind Yahoo no-charge edition</em>) is rarely attractive to serious enterprises looking for a well-supported product. Second, the substantial computing overhead for the free product was significant enough that some SMBs I know of were turned off; the costs associated with the hardware and support it would require offset "free." Third, my understanding that the search architecture for the free product would provide seamless upgrades to IBM's other OmniFind products was wrong. Each subsequent product adoption would require the same "rip and replace" that Steve Arnold describes in his report, <a href="http://gilbane.com/beyond-search.html">Beyond Search</a>. It is hard to believe that IBM got much traction out of this offering from the enterprise search market at large. Does anyone know if there was really any head-to-head competition between IBM and other search vendors over this product?</p>

<p>On the other hand, does the Microsoft Express Search offering appeal to enterprises other than the traditional Microsoft shop? If Microsoft Express Search went away, it would probably be replaced by some other Microsoft search variation with inconvenience to the customer who needs to <em>rip and replace</em> and left on his own to grumble and gripe. What else is new? The same thing would happen with IBM Yahoo OmniFind users and they would adapt.</p>

<p>I've noticed that free and cheap products may become heavily entrenched in the marketplace but not among organizations likely to upgrade any time soon. Once enterprises get immersed in a complex implementation (and search done well does require that) they won't budge for a long, long time, even if the solution is less than optimal. By the time they are compelled to upgrade they are usually so wedded to their vendor that they will accept any reasonable offer to upgrade that the vendor offers. Seeking competitive options is really difficult for most enterprises to pursue without an overwhelmingly compelling reason.</p>

<p>This additional news item indicates that Microsoft is still trying to get their search strategy straightened out with another new acquisition, <a href="http://news.prnewswire.com/DisplayReleaseContent.aspx?ACCT=104&STORY=/www/story/08-05-2009/0005072156&EDATE=">Applied Discovery Selects Microsoft FAST for Advanced E-Discovery Document Search</a>. E-discovery is a hot market in legal, life sciences and financial verticals but firms like <a href="http://isys-search.com/">ISYS</a>, <a href="http://recommind.com/">Recommind</a>, <a href="http://www.temis.com/">Temis</a>, and <a href="http://www.zylab.com/">ZyLab</a> are already doing well in that arena. It will take a lot of effort to displace those leaders, even if Microsoft is the contender. Enterprises are looking for point solutions to business problems, not just large vendors with a boatload of poorly differentiated products. There is plenty of opportunity for specialized vendors without going toe-to-toe with the big folks.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Convergence of Enterprise Search and Text Analytics is Not New</title>
    <link rel="alternate" type="text/html" href="http://gilbane.com/search_blog/2009/08/convergence_of_enterprise_search_and_text_analytics_is_not_new.html" />
    <id>tag:gilbane.com,2009:/search_blog//49.9920</id>

    <published>2009-08-06T23:53:33Z</published>
    <updated>2009-08-07T00:10:22Z</updated>

    <summary>...extracting meaningful content from database in new and innovative formats requires a level of abstract thinking for which most employees are not well-trained. Putting descriptive data into a database via a screen form, then performing a transaction on the object of that data on another form, and then adding more data about another similar but different object are isolated in the database user&apos;s experience and memory. </summary>
    <author>
        <name>Lynda Moulton</name>
        <uri>http://gilbane.com/blog/mt-cp.cgi?__mode=view&amp;blog_id=49&amp;id=14</uri>
    </author>
    
        <category term="Types of Search" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="commandlanguages" label="Command languages" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="databaseengines" label="Database engines" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="enterpriseapplications" label="Enterprise applications" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="textanalytics" label="Text analytics" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="unstructuredsearch" label="Unstructured search" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://gilbane.com/search_blog/">
        <![CDATA[<p>Prompted by the <a href="http://www.nytimes.com/2009/07/29/technology/companies/29ibm.html">news item</a> about IBM's bid for SPSS and similar acquisitions by Oracle, SAP and Microsoft made me think about the predictions of more business intelligence (BI) capabilities being conjoined with enterprise search. But why now and what is new about pairing search and BI? They have always been complementary, not only for numeric applications but also for text analysis. Another <a href="http://www.kmworld.com/Articles/Editorial/Feature/Text-analytics%E2%80%94Improving-the-use-case-for-unstructured-text--52383.aspx">article by John Harney</a> in KMWorld referred to the "relatively new technology of text analytics" for analyzing unstructured text. The article is a good summary of some newer tools but the technology itself has had a long shelf life, too long for reasons which I'll explore later.</p>

<p>Like other topics in this blog this one requires a readjustment in thinking by technology users. One of the great things about digitizing text was the promise of ways in which it could be parsed, sorted and analyzed. With heavy adoption of databases that specialized in textual, as well as numeric and date data fields for business applications in the 1960s and 70s, it became much easier for non-technical workers to look at all kinds of data in new ways. Early database applications leveraged their data stores using command languages; the better ones featured statistical analysis and publication quality report builders. Three that I was familiar with were DRS from ADM, Inc., BASIS from Battelle Columbus Labs and INQUIRE from IBM.</p>

<p>Tools that accompanied database back-ends had the ability to extract, slice and dice the database content, including very large text fields to report: word counts, phrase counts (breaking on any delimiter), transaction counts, relationships among data elements across associated record types, ability to create relationships on the fly, report expert activity and working documents, and describe distribution of resources. These are just a few examples of how new content assets could be created for export in minutes. In particular, a sort command with DRS had histogram controls that were invaluable to my clients managing corporate document and records collections, news clippings files, photographs, patents, etc. They could evaluate their collections by topic, date ranges, distribution, source, and so on, at any time.</p>

<p>So, there existed years ago the ability to connect data structures and use a command language to formulate new data models that informed and elucidated how information was being used in the organization, or to illustrate where there were holes in topics related to business initiatives. What were the barriers to wide-spread adoption? Upon reflection, I came to realize that extracting meaningful content from database in new and innovative formats requires a level of abstract thinking for which most employees are not well-trained. Putting descriptive data into a database via a screen form, then performing a transaction on the object of that data on another form, and then adding more data about another similar but different object are isolated in the database user's experience and memory. The typical user is not trained to think about how the pieces of data might be connected in the database and therefore is not likely to form new ideas of how it can all be extracted in a report with new information about the content. There is a level of abstraction that eludes most workers whose jobs consist of a lot of compartmentalized tasks.</p>

<p>It was exciting to encounter prospects that really grasped the power of these tools and were excited to push the limits of the command language and reporting applications, but they were scarce. It turned out that our greatest use came in applying text analytics to the extraction of valuable information from our customer support database. A rigorously disciplined staff populated it after every support call with not only demographic information about the nature of the call, linked to a customer record that had been created back at the first contact during the sales process (with appropriate updates along the way in the procurement process) but also a textual description of the entire transaction. Over time this database was linked to a "wish list" database and another "fixes" database and the entire networked structure provided extremely valuable reports that guided both development work and documentation production. We also issued weekly summary reports to the entire staff so everyone was kept informed about product conditions and customer relationships. The reporting tools provided transparency to all staff about company activity and enabled an early version of "social search collaboration."</p>

<p>Current text analytics products have significantly more algorithmic horsepower than the old command languages. But making the most of their potential and transforming them into utilities that any knowledge worker can leverage will remain a challenge for vendors in the face of poor abstract reasoning among much of the work force. The tools have improved but maybe not in all the ways they need to for widespread adoption. Workers should not have to be dependent on IT folks to create that unique analysis report that reveals a pattern or uncovers product flaws described by multiple customers. We expect workers to multitask, have many aptitudes and skills, and be self-servicing in so many aspects of their work, but for them to flourish the tools fall short too often. I'm putting in a big plug for text analytics for the masses, soon, so that enterprise search begins to deliver more than personalized lists of results for one person at a time. Give more reporting power to the user.</p>]]>
        
    </content>
</entry>

</feed>
