“Content” in this context means unstructured data. The need to manage unstructured data is one of the main reasons big data technologies exist – the other being the need for dealing with scale and speed. This is why it is important for us to cover at our conferences. Not every company needs to build new infrastructures around Hadoop-like technologies… yet. But marketers need to manage the mostly unstructured content that is part of their world, and also process and manage the more structured analytic data that will rapidly become “big” for even small organizations, so big data technologies need to be on marketing organizations’ radar as they continue to increase their expertise and spending on technology. See yesterday’s post on Why marketing is the next big money sector in technology.
Informatica Corporation, the provider of data integration software, announced the immediate availability of Informatica HParser, a data parsing transformation solution for Hadoop environments. Informatica HParser runs on distributions of Apache Hadoop, exploiting the parallelism of the MapReduce framework to efficiently turn unstructured complex data, such as web logs, social media data, call detail records and other data formats, into a structured or semi-structured format in Hadoop. Once transformed into a more structured format, the data can be used and validated to drive business insights and improve operations. Available in a free community edition and commercial editions, Informatica HParser provides organizations with the solution they require to extract the value of complex, unstructured data. http://www.informatica.com
MarkLogic Corporation announced the availability of MarkLogic 5, the latest version of its product designed for Big Data applications across the enterprise. MarkLogic 5 defines Big Data by empowering organizations to build Big Data applications that make information actionable. With MarkLogic 5, organizations analyze structured, unstructured, and semi-structured data in the same application. A key feature is the MarkLogic Connector for Hadoop. www.marklogic.com
Endeca Technologies, Inc., an agile information management software company, announced native integration of Endeca Latitude with Apache Hadoop. Endeca Latitude, based on the Endeca MDEX hybrid search-analytical database, is uniquely suited to unlock the power of Apache Hadoop. Apache Hadoop is strong at manipulating semi-structured data, which is a challenge for traditional relational databases. This combination provides flexibility and agility in combining diverse and changing data, and performance in analyzing that data. Enabling Agile BI requires a complete data-driven solution that unites integration, exploration and analysis from source data through end-user access that can adapt to changing data, changing data sources, and changing user needs. Solutions that require extensive pre-knowledge of data models and end-user needs fail to meet the agility requirement. The united Endeca Latitude and Apache Hadoop solution minimizes data modeling, cleansing, and conforming of data prior to unlocking the value of Big Data for end-users. http://www.endeca.com/ http://hadoop.apache.org/