CambridgeDocs Corp. announced the release of Version 2.01 of its xDoc PDF-XML Converter and its integration into the xDoc Converter Desktop and Server products, enhancing their platform for extracting document content to meaningful XML. Once converted to XML, content that was previously locked in PDF can be indexed by search engines, XML repositories and content management systems; for example, it can be stored as chapters, sections, tables or cells within any repository for fast, easy and accurate re-use. The xDoc PDF-XML Converter extracts PDF content to XML and supports conversions that yield: stylistic XML, including format, layout and content information; extraction of financial data; organization of related XML “chunks”, such as financial tables; compatibility with existing target XML schemas or DTDs, such as DocBook or DITA; conversion to HTML/XHTML, with visual information that surpasses even Google’s “view as HTML” functionality; and conversion to simple text. Version 2.01 adds the PDF-XML Converter as a special module in the xDoc Converter 2.01 platform and includes sample conversions of PDF documents into a variety of XML formats, such as DocBook and DITA. The release also adds a new and improved user interface, called the TableDef interface, for extracting financial data using positioning and textual clues. Integrating the PDF-XML Converter into the xDoc Converter gives easy access to its functionality by consolidating the download, installation and licensing processes. It also provides access to xDoc’s Visual Mapping tool and works with xDoc’s Adobe Acrobat plug-in. The PDF-XML Conversion functionality is available for download now at
Curated information technology news for content technology, computing, and digital experience professionals. News items are edited to remove hype, unhelpful jargon, iffy statements, and quotes, to create a short summary — mostly limited to 200 words — of the important facts with a link back to a useful source for more information. News items are published using the date of the original source here and in our weekly email newsletter.
We focus on product news, but also include selected company news such as mergers and acquisitions and meaningful partnerships. All news items are edited by one of our analysts under the NewsShark byline. See our Editorial Policy.
ClearStory Systems, Inc. (BULLETIN BOARD: CSYS) announced a definitive agreement with Datawatch Corporation (NASDAQ:DWCH). Under the terms of an asset purchase agreement, Datawatch Corporation will acquire intellectual property and a portfolio of assets associated with ClearStory’s Radiant Document Solutions, including its Radiant Business Document Server Suite and Radiant MailManager. Radiant Business Document Server (BDS) is a system for high-volume document capture, archiving, and online presentment within financial services, insurance, and healthcare markets. Radiant MailManager is a scalable e-mail active archiving solution that provides lifecycle, compliance, and storage management for the corporate e-mail knowledge base. The boards of directors of both companies have approved the definitive asset purchase agreement. The purchase price is $4.3 million in cash at close, plus an earn-out equal to 30% of the net revenues derived from sales of the products over the next 18 months. The closing of this transaction will allow ClearStory Systems to focus on its core digital media businesses, while enabling Datawatch to immediately increase its presence and customer base in the document solutions market, including its email management and archiving solution. The transaction is contingent upon certain customary closing conditions and regulatory approvals, and is expected to close on April 14, 2006. http://www.datawatch.com
Document Sciences Corporation (NASDAQ:DOCX) announced an agreement with InterDoc Corporation. InterDoc will resell Document Sciences’ xPresso dynamic content publishing solution to organizations in industries that include financial services, pharmaceutical, aerospace, manufacturing, and government.
Kapow Technologies announced the release of its Kapow Adapter 1.2 for the SAP NetWeaver platform, which has achieved “Powered by SAP NetWeaver” certification. The adapter enables customers to access any web-based application as if it were a service, for seamless integration with the SAP NetWeaver Exchange Infrastructure (SAP NetWeaver XI). Via the adapter, the Kapow Web Integration platform connects business processes within SAP NetWeaver-based solutions with web page interactions. These interactions are automatically executed by Web Integration robots that transmit the results of the interactions back to SAP NetWeaver. The Kapow Adapter for SAP NetWeaver is available immediately. http://www.kapowtech.com
Ektron Inc. announced it has attained Gold Certified status in the Microsoft Partner Program with competencies in ISV/Software and Business Process and Integration Solutions. Ektron CMS400.NET was one of the first content management applications to be designated Microsoft .NET Connected, and its technology uses or integrates with other Microsoft technologies, including Windows Servers, Microsoft Office, SharePoint Portal Server, SQL Server, Visual Studio and ASP.NET 2.0. To reach Gold Certified status, Ektron had to declare its Microsoft Competencies. Microsoft Competencies are designed to help differentiate a partner’s capabilities with specific Microsoft technologies to customers looking for a particular type of solution. Each competency has a unique set of requirements and benefits, formulated to accurately represent the specific skills and services that partners bring to the technology industry. http://www.ektron.com
Nuxeo released CPS 3.4.0, a new version of its Enterprise Content Management platform. This major release emphasizes scalability, performance, usability and high-level features. Built on a component architecture, CPS takes an integrated approach that enables organizations to implement global ECM solutions while focusing their investment on specific business needs. CPS 3.4.0 is available for download under an open source license. http://www.nuxeo.com/en/
Day Software (SWX:DAYN) (OTC:DYIHY) announced the general availability of Day Content Repository Extreme (CRX) version 1.1, the latest version of Day’s product that enables the storage, management and exchange of content across large-scale enterprises and implements the Content Repository API for Java Technology (JCR, JSR 170). Day CRX provides an open, standards-based infrastructure for integrating business applications with any structured or unstructured content in an enterprise. New features in Day CRX enable the exchange of content between applications and repositories. Major feature enhancements include: virtualization, integration with Microsoft SQL Server, integration with Oracle RDBMS, improved support for the development and implementation of CRX adapters and connectors, improved resource management capabilities, and overall performance increases. CRX can be downloaded at
The World Wide Web Consortium has released “XForms 1.0 Second Edition” as a W3C Recommendation. The new generation of Web forms, XForms separate presentation and content, minimize round-trips to the server, offer device independence, and reduce the need for scripting. This second edition adds clarifications and corrects errors as reported in the first edition errata. Second edition publications include the following documents: XForms 1.0 Second Edition http://www.w3.org/TR/2006/REC-xforms-20060314/, XForms for HTML Authors Part 2 http://www.w3.org/MarkUp/Forms/2006/xforms-for-html-authors-part2.html, XForms Quick Reference http://www.w3.org/MarkUp/Forms/2006/xforms-qr.html, XHTML to XForms Converter (XSLT) http://www.w3.org/MarkUp/Forms/2006/xforms.xsl, and a Revised XForms Test Suite.