Curated for content, computing, and digital experience professionals

Category: Web technologies & information standards (Page 20 of 58)

Here we include topics related to information exchange standards, markup languages, supporting technologies, and industry applications.

Give Financial Statements an MRI with XBRL

Recent news item:  CLEVELAND (AP) — Indians outfielder Grady Sizemore is feeling better and will have an MRI on his strained left elbow on Monday.

It has become very commonplace for doctors to order an MRI for patients experiencing pain. According to Radiology Info.com, Magnetic resonance imaging (MRI) is a noninvasive medical test that helps physicians diagnose and treat medical conditions.  The website goes on to say, “MR imaging uses a powerful magnetic field, radio frequency pulses and a computer to produce detailed pictures of organs, soft tissues, bone and virtually all other internal body structures. The images can then be examined on a computer monitor, printed or copied to CD.” (see http://www.radiologyinfo.org/en/info.cfm?pg=bodymr)

In a similar fashion, XBRL puts a company’s financial statements under a transformation that exposes detailed pictures of the underlying accounting backing every line item.  This information can then be analyzed and compared by computer software to help determine a company’s financial health.

For example, let’s look at a sample line item from the Marathon Oil SEC filing covering the quarter ending September 30, 2008.  The form 10-Q has a line on financial statement that reads:

Loss on early extinguishment of debt 120 (nine months ending September 30, 2007, in millions)

When you give that line item the XBRL MRI treatment, computers can extract the XBRL label:

us-gaap:GainsLossesOnExtinguishmentOfDebt

the definition of the item:

Amount represents the difference between the fair value of the payments made and the carrying amount of the debt at the time of its extinguishment.
And the authoritative literature that backs up the accounting decisions:

the reference:

Presentation Reference

Name                  Accounting Principles Board Opinion (APB)
Number               26
Paragraph          20, 21
Publisher           AICPA

Inquiring minds will take a close look at APB 26 for more detail.  This examination should yield a much clearer understanding of the basis for reporting the number and therefore yield a better understanding of the financial statement.

Note:  Mr. Sizemore returned to full duty with the Cleveland Indians shortly after his MRI.

When is a Book Not a Book?

I recently wrote a short Gilbane Spotlight article for the EMC XML community site about the state of Iowa going paperless (article can be found here) in regards to its Administrative Code publication. It got me to thinking, “When is a book no longer a book?”

Originally the admin code was produced as a 10,000 page loose-leaf publication service containing all the regulations of the state. For the last 10 years it has also appeared on the Web as PDFs of pages, and more recently, independent data chunks in HTML. And now they have discontinued the commercial printing of the loose-leaf version and only rely on the electronic versions to inform the public. They still produce PDF pages that resemble the printed volumes that are intended for local printing of select sections by public users of the information. But the electronic HTML version is being enhanced to improve reusability of the content, present it in alternative forms and integrated with related materials, etc. Think mashups and improved search capabilities. The content is managed in an XML-based Single Source Publishing system that produces all output forms.

I have migrated many, many printed publications to XML SSP platforms. Most follow the same evolutionary path regarding how the information is delivered to consumers. First they are printed. Then a second electronic copy is produced simultaneously with the print using separate production processes. Then the data is organized in a single database and reformatted to allow editing that can produce both print and electronic. Eventually the data gets enhanced and possibly broken into chunks to better enable reusing the content, but the print is still a viable output format. Later, the print is discontinued as the subscription list falls and the print product is no longer feasible. Or the electronic version is so much better, that people stop buying the print version.
So back to the original question, is it no longer a book? Is it when you stop printing pages? Or when you stop producing the content in page-oriented PDFs? Or does it have to do with how you manage and store the information?

Other changes take place in how the information is edited, formatted, and stored that might influence the answer to the question. For instance, if the content is still managed as a series of flat files, like chapters, and assembled for print, it seems to me that it is still a book, especially if it still contains content that is very book oriented, like tables of contents and other front matter, indexes, and even page numbers. Eventually, the content may be reorganized as logical chunks stored in a database, extracted for one or more output formats and organized appropriately for each delivery version, as in SSP systems. Print artifacts like TOCs may be completely generated and not stored as persistent objects, or they can be created and managed as build lists or maps (like with DITA). As long as one version is still book-like, IMHO it is still a book.

I would posit that once the printed versions are discontinued, and all electronic versions no longer contain print-specific artifacts, then maybe this is no longer a book, but simply content.

Digital Platforms and Technologies for Publishers: Implementations Beyond “eBook”

We are very happy to announce that we’ve published our new report, Digital Platforms and Technologies for Publishers: Implementations Beyond “eBook.” The 142 page report is available at no charge for download here.

From the Introduction:

Much has changed since we decided to write a comprehensive study on the digital book publishing industry. The landscape has changed rapidly during the past months and we have tried to reflect as many of these changes as possible in the final version of our report. For example:

  • Sales of eBooks finally reached their inflection point in late 2008.
  • Customer acceptance of digital reading platforms such including dedicated reading devices like the Kindle and the Sony Reader and mobile devices like the iPhone and the BlackBerry have helped accelerate the market for digital products.
  • The Google settlement, once finally approved by the courts, will substantially increase the supply of titles available in digital formats.
  • New publishing technologies and planning processes are enabling publishers and authors to create digital products that have their own set of features that take full advantage of the digital media and platforms. Embedded context-sensitive search and the incorporation of rich media are two important examples.
  • Readers are self-organizing into reading communities and sharing their critiques and suggestions about which books their fellow readers should consider. This is creating a major new channel for authors and publishers to exploit.
  • Print-on-demand and short-run printing continue to make significant advances in quality and their costs per unit are dropping. These developments are changing the economics of publishing and are enabling publishers to publish books that would have been too risky in the previous economic model.
  • Lower publishing and channel costs are making it possible for publishers to offer their digital titles at lower prices. This represents greater value for readers and fair compensation for all stakeholders in the publishing enterprise.

We are privileged to report such a fine collection of best practices. And we are thankful that so many smart people were willing to share their perspectives and vision with us and our readers. We thank our sponsors for their ardent and patient support and hope that the final product will prove worth the many hours that went into its preparation.

We encourage readers of this report to contact us with their feedback and questions. We will be pleased to respond and try to help you find solutions to your own digital publishing challenges!

Busy Week in XML Content Management Market

Holiday weeks can be sleepy weeks in enterprise software news, but this week has seen one significant press release each day in the XML content management market, or component content management (CCM) market if you prefer.

  • On Monday, SDL announced the acquisition of XyEnterprise, and the creation of a new business unit based on XyEnterprise and Trisoft called SDL-XySoft.
  • On Tuesday, Really Strategies, the makers of the Marklogic-Server-based system RSuite, announced the acquisition of SaaS CCM provider DocZone.
  • Today, Quark and EMC announced an integration of Quark XML Author with Documentum.

First, the necessary disclosures and caveats. Of the six companies mentioned, we’ve worked with all of them, I believe, and I actually worked for XyEnterprise back in the 1980s and early 1990s. That said, each of these announcements is significant.

SDL, through both organic growth and acquistion, has grown into a substantial business that spans globalization technology, globalization services, CCM technology, and WCM technology. My colleagues Mary Laplante and Leonor Ciarlone know them much better as a company, but I believe it is safe to say that SDL is in a unique position spanning essentially four markets, but four markets that make a great deal of sense under a single umbrella. The product support content managed in a CCM technology is the best point of integration for globalization/translation tools. A CCM technology is also an excellent underpinning for a global company’s web presence or web precenses (the latter more likely, especially when one considers the need for localized web sites). And services are an essential piece of this puzzle. It’s the rare company that staffs heavily for localization, and even when they do, very few would staff full time to cover all of their language needs. Is SDL in a position to represent one-stop shopping for large companies with complex product content that needs to be localized into many languages? Again, my colleagues could answer that question more precisely, but it’s not a crazy question to ask.

Mary has more on SDL XySoft over in the globalization blog.

The acquisition also breathes new life into XyEnterprise, a company with highly functional, mature technology and excellent executive leadership. We take it as a very positive sign that XyEnterprise CEO Kevin Duffy will become the CEO of the newly combined business unit, reporting to Mark Lancaster, Chairman and CEO of SDL.

The Really Strategies acquistion of DocZone is on a smaller scale of course, but it is is significant in that these two companies represent two leading trends in the CCM marketplace–management of component content in native XML repositories (MarkLogic Server for RSuite and Documentum Content Store for one version of DocZone) and Software as a Service (SaaS). Count me among those who have been skeptical at times about SaaS for CCM, but DocZone, under Dan Dube’s leadership, has made it work. Really Strategies, in the mean time, has developed an impressive CCM offering on top of Mark Logic Server, and they have quietly built up a strong customer list.  We think the combined companies complement each other, and the new management team is excellent, with Barry Bealer as CEO, co-founder Lisa Bos as CTO, Ann Michael in charge of services, and Dan Dube as VP Sales and Marketing.

Which brings us to Quark and EMC. Both companies have been developing more CCM capabilities. EMC acquired X-Hive, and a lot of XML expertise along with it. They have since added more XML expertise on both the product management and engineering side. As they have integrated X-Hive into the Documentum platform, they have logically looked to build out more capabilities and applications for vertical markets. The integration with Quark XML Author makes perfect sense for them, giving their customers and prospects a ready mechanism for XML authoring in a familiar editorial tool.

For Quark’s part, the move is a logical and very positive next step. They had previously announced this kind of integration with IBM Content Manager, which has a strong presence in the manufacturing space. With EMC, Quark now has a strong partner in the pharma space. Documentum has long dominated pharma, and Quark XML Author, under Michael Boses and previous owner In.Vision, had built up a long list of pharma customers. Boses and his team know the pharma data structures inside and out, and it will be interesting to see the details of how Quark XML Author will integrate with Documentum and its storage mechanisms. (I am sure both EMC and Quark see the potential as more than just the pharma market–government is also a good target here–but the pharma angle will be fruitful I am sure.)

So, what news is on tap for tomorrow?

Oxygen XML Editor version 10.3

Syncro Soft Ltd has announced the availability of version 10.3 of its XML Editor and XML Author. Oxygen combines content author features like the CSS driven Visual XML editor with a fully featured XML development environment. It has ready to use support for the main document frameworks DITA, DocBook, TEI and XHTML and also includes support for all XML Schema languages, XSLT/XQuery Debuggers, WSDL analyzer, XML Databases, XML Diff and Merge, Subversion client and more. Version 10.3 of Oxygen XML Editor improves both the XML Authoring and the XML Development capabilities. As a result of user feedback the Oxygen XML Author API was reorganized and extended with additional functionality. There are various improvements to the existing frameworks (DITA, DocBook, TEI, etc.) like automatic ID generation or DITA aware search and replace. A new XML development feature is the Component Dependencies View that presents a tree of component dependencies starting with a specified component for XSLT, XML Schema, Relax NG and NVDL. The new version also integrates the Saxon SA XQuery Update functionality and updates a number of components to their latest versions. http://www.oxygenxml.com/

Mark Logic Releases MarkLogic Toolkit for Excel

Mark Logic Corporation released the MarkLogic Toolkit for Excel. This new offering provides users a free way to integrate Microsoft Office Excel 2007 with MarkLogic Server. Earlier this year, Mark Logic  delivered a Toolkit for Word and a Connector for SharePoint. Together, these offerings allow users to extend the functionality of Microsoft Office products and build applications leveraging the native document format, Office Open XML (OOXML). Distributed under an open source model, MarkLogic Toolkit for Excel comes with an Excel add-in that allows users to deploy information applications into Excel, comprehensive libraries for managing and manipulating Excel data, and a sample application that leverages best practices. The MarkLogic Toolkit for Excel offers greater search functionality, allowing organizations to search across their Excel files for worksheets, cells, and formulas. Search results can be imported directly into the workbooks that users are actively authoring. Workbooks, worksheets, formulas, and cells can be exported directly from active Excel documents to MarkLogic Server for immediate use by queries and applications. The Toolkit for Excel allows customers to easily create new Excel workbooks from existing XML documents. Users can now manipulate and re-use workbooks stored in the repository with a built-in XQuery library. For instance, a financial services firm can replace the manual process of cutting-and-pasting information from XBRL documents to create reports in Excel with an automated system. Utilizing Toolkit for Excel, this streamlined process extracts relevant sections of XBRL reports, combines them, and saves them as an Excel file. The Toolkit also allows users to add and edit multiple custom metadata documents across workbooks. This improves the ability for users to discover and reuse information contained in Excel spreadsheets. To download MarkLogic Toolkit for Excel, visit the Mark Logic Developer Workshop located at http://developer.marklogic.com/code/, http://www.marklogic.com

VFP XBRL Errors Will Not Carry Over to Mandatory Program

The article “Accuracy Essential to Success of XBRL Financial Filing Program,” by Eileen Z. Taylor and Matt Shipman, NC State News, June 8, 2009 — has been widely talked about recently in XBRL circles.

The key sentence in the news story about the academic paper states:

“The researchers are concerned that, if the upcoming XBRL filings do not represent a significant improvement from the voluntary reports, stakeholders in the financial community will not have any faith in the XBRL program – and it will be rendered relatively ineffective.”

Wrong on at least two counts.  First, to assume that the quality of XBRL submissions in the formal, rule laden, error checking mandatory XBRL program is going to be as error ridden as the sand-box, free for all no rules VFP is flat out wrong.  I suggest the authors of the paper read the Edgar filing manual, chapter 6, which details hundreds of rules that must be followed for an XBRL exhibit will be accepted by the system.  In other words, almost every error found in the VFP by the researchers will rejected by the SEC and require correction.

Second, validation programs can correct some of the accounting errors introduced into XBRL filings, responsible and knowledgeable humans at filing corporations must review submissions prior to filing.  The management team is responsible for the data contained in the XBRL exhibits.  The SEC has specifically stated that they expect corporations to have in place an XBRL preparation process that is documented and tested in a similar fashion to other required internal controls.  An accounting error on any future XBRL exhibit is an indication that the company does not have sufficient internal controls in place.

No, I’m not expecting the startup to be perfect.  However, I do expect XBRL filings to be as accurate or more accurate that existing HTML EDGAR filings.

« Older posts Newer posts »

© 2024 The Gilbane Advisor

Theme by Anders NorenUp ↑