A2iA Corporation announced the first-ever automated data capture and keyword spotting software for freeform cursive handwriting, making it possible to quickly scan through the content contained within large volumes of unstructured handwritten letters and loosely structured fields within forms. A2iA DocumentReader enables a computer to mimic a human speed-reader. It locates, segments and processes cursive handwritten information previously excluded from traditional forms processing and records management applications. A2iA DocumentReader converts handwritten words into electronic data, matching the words against language-specific and industry-specific vocabularies to produce a rough transcription. This gives companies the ability to index the information contained in documents that are typically read, archived and retrieved manually in their native format or stored in image archives. A2iA DocumentReader is also designed for keyword spotting on single images and entire image repositories. A2iA DocumentReader is a first-generation application available for integration by A2iA’s official licensees and resellers. http://www.a2ia.com
Category: Content management & strategy
This category includes editorial and news blog posts related to content management and content strategy. For older, long form reports, papers, and research on these topics see our Resources page.
Content management is a broad topic that refers to the management of unstructured or semi-structured content as a standalone system or a component of another system. Varieties of content management systems (CMS) include: web content management (WCM), enterprise content management (ECM), component content management (CCM), and digital asset management (DAM) systems. Content management systems are also now widely marketed as Digital Experience Management (DEM or DXM, DXP), and Customer Experience Management (CEM or CXM) systems or platforms, and may include additional marketing technology functions.
Content strategy topics include information architecture, content and information models, content globalization, and localization.
For some historical perspective see:
https://gilbane.com/gilbane-report-vol-8-num-8-what-is-content-management/
Mondosoft announced the availability of a new Database Retrieval Module (DRM) for its MondoSearch customers. Designed to meet the needs of e-commerce companies, particularly those with high-turnover online catalogues, multiple types of databases, platforms and applications, the new module enables users to simultaneously search and retrieve information from internal databases across the enterprise, and from external sources, such as public web content. Mondosoft’s Database Retrieval Module allows indexing of database content, web pages and other types of content into one unified search database. Users can view their search results as converted HTML layouts or can be redirected to existing web pages. DRM extracts content from the databases using live connections into the database, and can be set up to do incremental extracts of changed records only. The Mondosoft Database Retrieval Module is extensible to most database systems as needed and indexes Lotus Notes, SQL Server, and ODBC databases. DRM also allows for custom setups or table presentations of each individual database. Pricing for the Database Retrieval Module for MondoSearch is $9,000 and includes one day of consulting.
dtSearch Corp. announced Version 7 of its product line. The product line can now instantly search terabytes of text across a desktop, network, Internet or Intranet. The release covers: dtSearch Web with Spider, dtSearch Desktop with Spider, dtSearch Network with Spider, dtSearch Publish and the dtSearch Text Retrieval Engine. Previously, the dtSearch index format could hold from 4-8 gigabytes of text per index. The Version 7 index format can index over a terabyte of text in a single index. Search time with over a terabyte of text is typically less than a second. As with previous versions, a single search can span any number of indexes. The dtSearch Spider embedded in multiple dtSearch products provides integrated searching of remote Web site content, along with locally-available data. In addition to support for file formats, the dtSearch Spider can also index and search dynamically-generated content, such as ASP/ASP.NET, MS CMS, MS SharePoint, etc. Pricing is $199 for dtSearch Desktop with Spider, from $800 for dtSearch Network with Spider, from $999 for dtSearch Web with Spider, and from $2,500 for dtSearch Publish. dtSearch offers a variety of royalty-based and royalty-free pricing options for the dtSearch Text Retrieval Engine. http://www.dtsearch.com
Informative Graphics Corporation (IGC) announced the version 5.2 release of its Brava! Enterprise content visualization server, which now supports PDF interoperability and features enhanced markup and Visual Rights security control. Brava Enterprise is available as a custom integration for enterprise content management systems from Documentum and Open Text, as well as SharePoint Portal Services from Microsoft. It also serves as a core application within IGC’s ProjectDox project data management and ReproCentral planroom and print ordering solutions. PDF interoperability enhances Brava’s ability to function as a key support and enabling application for content management systems in document workflows for regulated industry and governmental processes where PDF output is required or mandated. Files viewed and annotated in Brava can be saved as rich content PDFs into a file system or vault, or sent as email attachments. Markups created in Brava and saved to PDF are retained as PDF comments and can be edited in Acrobat Professional 6.0 and greater. Watermarks and print banners are translated to PDF as well. Custom dialog boxes and export options are programmable. http://www.infograph.com
SchemaLogic released its enterprise metadata and taxonomy management platform, SchemaLogic Enterprise 3.0, with additional interface, integration and customization options. SchemaLogic SchemaServer provides an active metadata repository that drives modeling, mapping and synchronization across disparate enterprise systems. Enriching unstructured information through taxonomic metadata, controlled vocabularies and semantic mapping schema leads to information findability and enterprise-level content integration. SchemaLogic Workshop provides business domain experts and technical data architects the ability to model, rationalize and manage the creation, import/export, and synchronization of metadata models and schemas with other applications and systems. Workshop is the user portal into SchemaServer and SchemaLogic Integrator. SchemaLogic Integrator manages the synchronization of metadata to subscribing systems with adapters. This integration framework provides adapters that communicate directly with subscribing systems, moving reference data, metadata and controlled vocabularies from SchemaServer’s repository. SchemaLogic offers standard adapters for content management, portal, search, categorization, database, XML schema and other enterprise systems. http://www.schemalogic.com
EMC Corporation announced a new generation of its Content Intelligence Services (CIS), EMC Documentum CIS 5.3, an add-on to the EMC Documentum 5.3 platform. EMC Documentum CIS 5.3 offers classification capabilities that can organize content based on information extraction and business rules, and categorize the content for easier navigation. Distributed competence management is a new capability in Documentum CIS 5.3 that enables customers to leverage the competence of individuals in various functions and distribute the categorization responsibilities across departments within an organization to classify content. A built-in role and user interface allows category owners or experts to manually make a classification decision in cases where the automated rules cannot classify the content with sufficient confidence. Documentum CIS 5.3 supports classification of multi-lingual content by enabling content classification in native languages against native taxonomies. Among the newly supported languages are English, German, Italian, French, Spanish, Portuguese, Dutch, Swedish, Norwegian, Danish, and Finnish. Documentum CIS 5.3 now includes automotive, chemicals, drug and disease, energy, engineering, legislation, financial, healthcare, high tech, military, and pharmaceutical industry taxonomies, and CRM, HR, IT, legal, marketing and other functional taxonomies. It also provides content classification for any text-based content type including documents, Web content, XML, OCRed images and other content types. http://www.emc.com
Passing this along from Don Day, Chair of the OASIS DITA Technical Committee:
The OASIS DITA Technical Committee seeks your input on the list of known requirements/enhancements for upcoming DITA TC activity. Your help in ranking this list (or suggesting additional new requirements) will help the TC prioritize the most urgent issues for upcoming DITA 1.1 design work, and beyond. I have posted a list of the issues currently known to the TC at this location: http://www.oasis-open.org/committees/document.php?document_id=12814&wg_abbrev=dita
Please assess what you consider to be your top 5 requirements and submit those Issue numbers to the DITA TC via the comment form:
http://www.oasis-open.org/committees/comments/form.php?wg_abbrev=dita
If you have a new issue or requirement not included in this list, please enter it as a separate comment via the comment form. We still need your “top 5” from this list, so read it carefully–most of the known hot issues are in there in one way or another, possibly including yours. There is no need to include more than 5 items in your list at this time; all of the 48 items are candidates for work, but we need to know which are MOST critical for initial work going into DITA 1.1.
This review period opens on May 23 2005 and closes end of day on June 6 2005 (2 weeks).
OASIS announced that its members have approved the Open Document Format for Office Applications (OpenDocument) v1.0 as an OASIS Standard, a status that signifies the highest level of ratification. OpenDocument provides a royalty-free, XML-based file format with a single XML schema that covers features required by text, spreadsheets, charts, and graphical documents. It makes use of existing standards, such as HTML, SVG, XSL, SMIL, XLink, XForms, MathML, and the Dublin Core, wherever possible. OpenDocument has been designed as a package concept, enabling it to be used as a default file format for office applications with no increase in file size or loss of data integrity. Future plans for the OASIS OpenDocument Technical Committee include extending the standard to encompass additional areas of applications and users, as well as adapting it to incorporate ongoing developments in office applications. All those interested in advancing this work, including governments, open source initiatives, educational institutions, and software providers, are encouraged to participate in the Committee. OASIS hosts an open mailing list for public comment and the opendocument-dev mailing list for exchanging information on implementing the standard. http://www.oasis-open.org