Archive for enterprise search

Speaker Spotlight: John Felahi – Making content findable

In another installment of Speaker Spotlight, we posed a couple of our frequently asked questions to speaker John Felahi, Chief Strategy Officer at Content Analyst Company, LLC. We’ve included his answers here. Be sure to see additional Speaker Spotlights from our upcoming conference.

John_Felahi-horiz

Speaker Spotlight: John Felahi

Chief Strategy Officer

Content Analyst Company, LLC

 

What is the best overall strategy for delivering content to web, multiple mobile, and upcoming digital channels? What is the biggest challenge? Development and maintenance cost? Content control? Brand management? Technology expertise?

One of the biggest challenges to delivering content to the web is making it as findable as possible to potential interested viewers.  While traditional, manual tagging and keyword search methods may have gotten us this far, and may be good enough for some use cases, they’re still not without limitations. The good news is, there are far more advanced, sophisticated – and automated – technologies available to remedy the numerous limitations of manual tagging content and keyword-based search. The limitations of manual tagging and keyword-based include:

  • Term creep – New terms constantly emerge, requiring taxonomies to be constantly updated.
  • Polysemy – Take Apple, for example. Is your user searching for the company, the Beatles’ record label, or the fruit?
  • Acronyms – Texting has introduced an entirely new language of acronyms (LOL, TTYL, WDYT).  Manually tagging content requires the editor to consider possible acronyms the users will be searching for.
  • Abbreviations – Tagging content with long, scientific terms, geographies, etc. require editors to factor these in along with the long terms they represent.
  • Misspellings – Thanks to spellcheck and autocorrect, technology has become much more forgiving for those who never made it past the first round eliminations in their sixth grade spelling bee. Content search, unfortunately, needs to be equally accommodating, if you want your users to find your content – which means tagging it with common misspellings.
  • Language – The web has certainly made the world a much smaller place, but that doesn’t mean everyone speaks English.  Making content findable in any language means it has to also be tagged in multiple languages.

On to the good news – there’s technology that’s been used for years in eDiscovery and the US Intelligence Community to overcome these very challenges, but for different reasons. Because the bad guys aren’t tagging their content to make it more findable, the intel community needs a better way to find what they’re looking for. And in eDiscovery, finding relevant content can make a multi-million dollar difference to the outcome of a particular litigation or other regulatory matter. That’s why tens of thousands of legal reviewers and countless analysts in the intel community use a technology known as concept-aware advanced analytics.

How concept-aware advanced analytics differs from manual tagging and keyword search

As its name implies, concept-aware understands the underlying concepts within the content. As such, it can tag content automatically.  On the viewer’s side, content can be found by simply saying, “find more like this.” Categories are defined by taking examples that represent the concepts of a category. The system “learns” what that category is all about, and can then identify conceptually similar content and apply the same category. The process is the same on the search side. The user points to a piece of content and says, “find more like this.” Or as the content publisher, you present the viewer with conceptually similar content, i.e., “you may also be interested in these articles.”

While concept-aware advanced analytics doesn’t necessarily replace manual tagging and keyword search – which work very well in certain situations – the technology clearly overcomes many of the limitations of traditional tagging and search methods.

Catch Up with John at Gilbane

Track E: Content, Collaboration, and the Employee Experience

E7: Strategic Imperatives for Enterprise Search to Succeed
Wednesday, December, 4: 2:00 p.m. – 3:20 p.m.

[button link=”http://gilbaneconference.com/program” variation=”red”]Complete Program[/button] [button link=”http://gilbaneconference.com/schedule” variation=”red”]Conference Schedule[/button] [button link=”http://gilbaneconference.com/registration” variation=”red”]Register Today[/button]

Gilbane Conference program and speakers posted

The Gilbane Conference program and speaker list are now available in addition to the conference schedule and pre-conference workshop schedule and program – there are just a few details to be added. Other changes between now and the conference will be minimal and will be reflected on the site if/as they occur, so check back once in a while.

The schedule for the product labs/case studies presented by sponsors will also be posted shortly.

Speaker proposal update

Thanks all for the speaker proposals!

Next step is a preliminary organization by the program committee to see if we have all the topics covered.

If you have submitted a proposal remember that it may be a few weeks before a decision is made, but we will keep you posted here on our overall progress.

New posts on embedded search and mobile development

Check out two new posts this week on the Bluebill blog, one from Lynda on Embedded Search in the Enterprise, and one from Frank on Time to Re-check Your Mobile Development Strategy.

Justifying Enterprise Search: Mitigating Risk and Getting the Right Fit

Today we highlight Workshop C: Justifying Enterprise Search: Mitigating Risk and Getting the Right Fit taking place at Gilbane Boston, November 29, 9:00am – 12:00pm at the Westin Waterfront.

While enterprise search has been debated, maligned, and challenged as a high value infrastructure application over the past decade, it has a place in every enterprise with valuable content. This presentation highlights how to make the right decisions about enterprise search applications. From embedded search to high-end semantic applications, the options are numerous and the technologies solid. However, the right choice is imperative and basing selection on business priorities requires artful analysis and justification. Illustrating the risks of continuing to operate with a faulty search solution is a good way to focus thinking about the search environment in any organization.

Instructor:

Lynda Moulton, Senior Analyst & Consultant, Outsell Gilbane Services

Register today!

Endeca Now Integrates Hadoop

Endeca Technologies, Inc., an agile information management software company, announced native integration of Endeca Latitude with Apache Hadoop. Endeca Latitude, based on the Endeca MDEX hybrid search-analytical database, is uniquely suited to unlock the power of Apache Hadoop. Apache Hadoop is strong at manipulating semi-structured data, which is a challenge for traditional relational databases. This combination provides flexibility and agility in combining diverse and changing data, and performance in analyzing that data. Enabling Agile BI requires a complete data-driven solution that unites integration, exploration and analysis from source data through end-user access that can adapt to changing data, changing data sources, and changing user needs. Solutions that require extensive pre-knowledge of data models and end-user needs fail to meet the agility requirement. The united Endeca Latitude and Apache Hadoop solution minimizes data modeling, cleansing, and conforming of data prior to unlocking the value of Big Data for end-users. http://www.endeca.com/ http://hadoop.apache.org/

Coveo Announces Enterprise Search (CES) 6.0

Coveo announced the general availability of Coveo Enterprise Search (CES) 6.0. The new Coveo CES 6.0 platform provides a combination of enterprise-grade scalability, deep connectivity and security to data repositories and performance improvements. Some of Coveo Enterprise Search 6.0’s key features include:  Improved scalability with up to 50 million documents per server, faster query performance and  contextual faceted search; Coveo’s unified view‚ Out-of-the-box ability to create unlimited unified interfaces, providing users with a tabbed search interface for optimized access to information; Deep connectivity, which enables companies to leverage existing and legacy IT assets; New and improved connectors and interfaces‚ Microsoft Exchange and Lotus Notes, Salesforce.com, Open Text LiveLink, Confluence, Quest Archive Manager, SiteCore, Symantec Enterprise Vault, etc.; A customizable super-user mode that can be integrated in the search interface for enterprise-wide content discovery across all employee data, and; Extended capabilities for mobile phones‚ Contextual faceted search & filtering, added support for intranet (SharePoint, etc) and CRM (Salesforce.com), document quick view (for a device friendly, low-bandwidth access to documents), conversation tracking and people search. http://www.coveo.com

ISYS Search Software Announces Release of ISYS:sdk 9

ISYS Search Software  announced the arrival of ISYS:sdk 9, the company’s next-generation enterprise search integration kit for original equipment manufacturers (OEM), independent software vendors (ISV) and systems integrators. ISYS:sdk 9 offers customers several major enhancements, all designed to deliver the performance, scalability and accuracy required for empowering third-party applications with search. Most significantly, ISYS has expanded its core engine’s content mining capabilities using deterministic and reliable methods that help customers better understand their content. Through its Intelligent Content Analysis, ISYS notes key characteristics about a content collection, such as metadata patterns and entities, thus enabling OEMs to leverage these facets for improved search and discovery. ISYS’ Intelligent Content Analysis identifies characteristics can that can be exploited by OEMs to bolster their applications with greater content mining capabilities, an increasingly critical requirement in compliance and ediscovery applications. ISYS’ Intelligent Content Analysis manifests itself in the form of several parametric search and refinement options, all of which are callable from a variety of languages, including C++, C#, VB.NET and Java. The core ISYS indexing engine can be configured to note aspects like entities in the full text (e.g., names, locations); commonly recurring metadata values in semi-structured and database formats; location of files; dates and numbers; and position of words. http://www.isys-search.com