Curated for content, computing, and digital experience professionals

Category: Semantic technologies (Page 40 of 72)

Our coverage of semantic technologies goes back to the early 90s when search engines focused on searching structured data in databases were looking to provide support for searching unstructured or semi-structured data. This early Gilbane Report, Document Query Languages – Why is it so Hard to Ask a Simple Question?, analyses the challenge back then.

Semantic technology is a broad topic that includes all natural language processing, as well as the semantic web, linked data processing, and knowledge graphs.


The “2.0” Qualifier and A Reality Check

Last week’s FASTforward 07 event, sponsored by FAST Search, was a great opportunity to immerse ourselves in search and the state of our collective efforts to solve the knotty problems associated with finding information. (The escape to San Diego during an East Coast winter freeze was an added bonus.)

Much of the official program covered topics “2.0” — Web 2.0, Enterprise 2.0, Search 2.0, Transformation 2.0, Back Office 2.0. Regular readers know that the Gilbane team generally approaches most things “2.0” with skepticism. In the case of its use as a qualifier for the Web, it’s not that we question the potential value of bringing greater participation to Web-based interactions. Rather, it’s that use of the term causes the needle on our hype-o-meter to zip into the red alert zone. This reaction is further aggravated by the trend towards appending “2.0” to other words, sometimes just to make what’s old seem new again. We note, without comment, that O’Reilly Media’s conference in May has been dubbed Where 2.0.

We listened carefully to the 2.0’s being tossed out like Mardi Gras coins at FASTforward last week. One voice that stood out as a great reality check is that of Andrew McAfee, associate professor at Harvard Business School. In his keynote talk, “Enterprise 2.0: The Next Disrupter,” he presented a definition of Enterprise 2.0:

Enterprise 2.0 is the use of emergent social software platforms within companies, or between companies and their partners or customers.

The important word in McAfee’s definition is emergent, which is not the same as emerging. McAfee also outlined the ground rules for an enterprise that can legitimately lay claim to the use of the 2.0 qualifier. Read the FASTforward entries on his blog for his own eloquent summary.

In addition to his talk on Enterprise 2.0, McAfee also participated in a lunch presentation on research conducted by Economist Intelligence Unit on executive awareness of Web 2.0 and in a limited-seating roundtable on 2.0 topics. Both are briefly described on his blog.
In short, McAfee’s work is recommended reading for anyone interested in separating 2.0 market hype from potential business value.

Another highlight of FASTforward for us was keynoter Chris Anderson on “The Long Tail” and the application of Long Tail theories to search and content life cycles. By pure happenstance, the Gilbane team shared a limo to the airport with Anderson. In his day job as editor-in-chief of Wired magazine, he and his staff are experiencing significant levels of frustration with the publishing process — specifically, getting content out of a leading professional publishing tool and into the web content management system. While we found his Long Tail talk interesting, the conversation in the limo reminded us that solving some basic business communication problems is still a challenge. It was a thought-provoking way to end the week.

For more on FASTforward ’07, check out our enterprise search blog.

Don’t Skip this Step: Knowing Your Knowledge Assets and How to Find Them

No matter how small your organization or domain, you are going to need tools to find content sooner than you think. Starting with a small amount of content you should already be thinking about what its purpose is, why you would need to find it again and under what circumstances. Trying to retrofit a search strategy and structure to a mountain of disconnected content, is not only very difficult but it is costly. Waiting means that human intelligence, which could have been applied to organizing content well as the supply grew, must be applied later to get it under control. Adding meaningful context around old but valuable content is a very laborious intellectual process.

Growing an organization successfully means tending to not only the products you are creating and selling. It is also about creating an environment in which your growing work force is well supported with a knowledge framework that keeps them centered and confident that content they need to do their jobs can be found quickly, efficiently and accurately.

I am frequently asked by other consultants if I can give advice on how to organize personal files and records. This is hard to answer because my own methods fall short of where I want to be. But in any new project or venture, I try to get a good sense of how content needs to be organized. I do create metadata using a controlled list of terminology. I also have a couple of search tools that I leverage to produce readings for clients on a special topic, or to put my hands on a specific document, article or Web site. Early stage companies need to think about how to safeguard the results of their work and how it will be made accessible to workers on a reliable basis. There are inexpensive search tools that are great for managing small domains.

Invest in tools, invest in someone to manage the tools, and plan to continue to invest in the resulting infrastructure of people and tools as the organization’s content and needs grow. Content management and search are overhead expenditures you must make early to prepare for growth and sustainability.

That reminds me, I keeping wondering how many enterprise search vendors use the technologies they build and sell to support their rapidly growing enterprises. That’s a great question to ask your potential search vendor as you decide what tools to procure for your enterprise. Get them to tell you how they use their tools and the benefits they see in their own enterprise. If they aren’t at least using their own search technology in their customer relationship management and technical support knowledge-base operations, think carefully about what that might mean concerning ease of deployment and utilization.

Search Transitions from Support Function to Marketplace Enhancement

My silence last week had more to do with information overload than lack of interesting things to write about. Be forewarned, the floodgates of my brain are beginning to creak open. I just returned from Fast Search’s FastForward 07 conference in San Diego where their current and future visions for search technologies were front and center. While there seem to be no lack of innovations for how to make search engines smarter, faster, and more adaptable, the innovations being hyped at FastForward 07, and by others with only slightly less hyperbole, are notable. Search is becoming sexy and not just for the amount of money that Google and second-ranked Fast are raking in. In this arena search is the new business frontier, the marketplace-enabler, the marketplace-maker.

Consider this, search technologies have been business necessities for 35 years. For the first 30, search was strictly a support feature to many other kinds of finding mechanisms. In the earliest days search was performed by specialists as a service to other operations in the organization. Attempts to market search technology options to line managers, analysts, attorneys and R&D staff were marginal in their success. This is because search was not used enough for these groups to acquire the skill required for it to be really valuable. Once Web search engines exposed everyone to the possibilities of search in a far simpler modality, the innovation light bulbs popped off.

Suddenly search for use within the enterprise has become search for the enterprise’s marketplace, a major business driver that will put an organization’s products, services, and assets squarely in front of the right buying audience. What this means for those poor souls who still need to find the stuff mounting valuelessly in inaccessible silos remains to be seen. I am excited by what I saw but concerned by what I am witnessing. It is great that I may be able to find that weird audio adapter on the Web to let me connect to the sound system in the skating rink. But it is really awful when an engineering firm can’t put it’s hands on the schematic that shows how a circuit board was modified and delivered three years ago to a top customer.

Back to Search Roots for the Enterprise – Structured Search That Is

Structured search (noun) was rooted firmly in the enterprise when publishers of print index resources (e.g. Chemical Abstracts, Index Medicus from the National Library of Medicine, GRA&I from the National Technical Information Service) became available on-line in the early 1970s. The Systems Development Corporation launched ORBIT developed by a team lead by Carlos Cuadra. Orbit was a command driven search tool accessible to professional searchers. In those days searchers were usually special librarians in corporations, large public libraries, government agencies and major universities. Using the ORBIT command language through a terminal connected by a phone line to remote large computers, librarians would type search commands to find data in specific structured fields. These remote computers held electronic versions of paper indices. Citations resulting from a query for specific chemical compounds, diseases, or government reports, would contain information needed to retrieve articles, patents or books from library shelves.

Corporations spent hundreds of thousands of dollars each year to access external specialized, and structured indices, and the journals, conference proceeding, patents and government documents to which the indices pointed. Hard copy (paper or microform) was the only practical way to read content. Computer screens were not accessible to most researchers and even if they had been, content could not be rendered on them in easily readable forms. Also, until computer storage technologies became cheap, indexing large amounts of text (full-text, or unstructured content) was not affordable.

Even with the advent of graphical interfaces, searching for non-specialists made only minor advances in the early-1980s when library systems offered index browsing to find citations. Library users still needed to read content in hard copy. It was only in the late 1980s and early 90s that full-text content began to be searchable by large numbers of library users on CD-ROMs. Users would go to a library computer, which held multiple CD-ROMs containing journals and other subscriptions, and use a menu to find content on the CD-ROMs by typing keywords that would look through all the content to find matches. This was the first routine use of full-text searching by library users.

These technologies are just memories for a few of us, and unknown to most, but they do point to the differentiation between structured and unstructured searching. Both have been around for a couple of decades but it has taken Web search engines to put search in the hands of everyone. Only recently is frustration with retrieving buckets of unfiltered content pushing enterprises to reconfirm the added value of structured searching.

Technical and business users are appreciating the value of being able to search for a precise title, all documents contributed to a specific project, or all presentations delivered by the CEO in the past two years. Each of these searches requires a defined set of data points, stored with the content and retrievable with a search interface that can support the “structured” query.

Yes, librarians have been here before but, just now, the rest of the organization is learning how they managed to get such good search results all along. Structured searching is now a lot simpler than it was in the 1970s. It is only one aspect in enterprise search but it is an important requirement for most enterprise users when they need reliable and clearly defined search results. And, by the way, Carlos is still around building systems for enterprises to manage and search their critical proprietary content.

The Right Message for the Small-Medium Business Market

IBM just launched a very interesting suite of enterprise search products. I have yet to try it or examine the specifications but the marketing message is the right one for the small and medium business enterprise buyer. What I like in the message:

  • Pricing ranges from free to reasonable to ? (sky is probably the limit).
  • Deployment is simple, intuitive and clean.
  • IBM knows that simple and easy is the right call for IT but also the way to keep costs down.
  • The solution is scaleable from a departmental solution to the entire business domain on the same basic software platform.

For the short term, I am placing OmniFind on the long list of products to consider for enterprise search. Check out the Web site at:
http://www-306.ibm.com/software/data/enterprise-search/omnifind-enterprise/

How smart is it that IBM and Yahoo have chosen to team in this way? It could be a great strategy.
Disclaimer: This is the first product mention in this very new blog. It won’t be the last; I have a lot of interesting products on my list on which to comment. There will be much backfilling in the next few months but I have to start somewhere. The marketing message resonated; I hope OmniFind users will keep us informed by posting case experiences on whether the product delivers on the promises.

Sonic Foundry Unveils Enhanced Multi-Modal Search Capabilities on Mediasite.com

Sonic Foundry Inc. (NASDAQ:SOFO) has enabled advanced search capabilities for Mediasite.com, a searchable Website focused exclusively on offering expert information via rich media presentations with video, audio and graphics. The site enhancements combine phonetic search, optical character recognition, language processing and contextual analysis. Mediasite.com aggregates nearly 13,000 rich media presentations, including over 250,000 slides and almost 9,500 hours of analyzed audio on topics ranging from the treatment of contagious diseases and general health-related issues to robotics, business start-ups, sociology, ethics, financial advice, personal improvement and career advancement. Mediasite.com provides visitors with a directory of these publicly available presentations created by hundreds of experts around the globe. Optical character recognition allows for word and phrase spotting within slide content or visual aids. Advanced phonetic search algorithms coupled with language processing and contextual analysis provide the ability to locate specific spoken words or phrases within a rich media archive, based on algorithms developed by Sonic Foundry. Sonic Foundry customers can now take advantage of these advanced search features for their own Mediasite content through new hosted services offered through Sonic Foundry’s professional services group. http://www.mediasite.com/, http://www.sonicfoundry.com/

ISYS Search Software Supports Microsoft Office 2007 Document Formats

ISYS Search Software announced support for the newly released Microsoft Office 2007 document formats, offering immediate enterprise search functionality across core Office 2007 applications. ISYS also announced full support for Microsoft Windows Vista. Microsoft Office 2007 includes new document types for all main Office applications such as Word, Excel, PowerPoint, Outlook and Access. Microsoft has also included enhancements in the form of XML Paper Specification (XPS), its new alternative to the Adobe Portable Document Format (PDF). Congruent with enhancements in the Office product, Microsoft Windows Vista also comes with built-in support for XPS. To ensure a seamless transition for its customers moving to Office 2007 or Vista, ISYS has incorporated support for these new formats into its ISYS 8 platform. In addition to support for new Microsoft file formats, ISYS 8 also introduced support of .wma and .wmv. formats. http://www.isys-search.com/

Inmagic and WebFeat Partner to Provide Inmagic Presto

Inmagic, Inc.and WebFeat announced a partnership that will enable Inmagic Presto customers to conduct federated searches across virtually unlimited external data sources. Inmagic Presto is a Web-based enterprise application that enables organizations to provide authorized end users with immediate and consolidated access to the right information, even when it appears in varied formats and multiple locations across and outside of the organization. WebFeat users can simultaneously search across unlimited numbers of resources from a single interface. WebFeat’s translator authentication and session management technology enables WebFeat to search virtually any searchable database. WebFeat maintains a library of over 6,000 database translators. http://www.webfeat.org, http://www.inmagic.com

« Older posts Newer posts »

© 2025 The Gilbane Advisor

Theme by Anders NorenUp ↑