Curated for content, computing, and digital experience professionals

Year: 2022 (Page 12 of 27)

IBM Research open-sources toolkit for Deep Search

IBM released an open-sourced part of the IBM Deep Search Experience in a new toolkit, Deep Search for Scientific Discovery (DS4SD), for scientific research and businesses with the goal of spurring on the rate of scientific discovery.

To help achieve this goal, we’re now publicly releasing a key component of the Deep Search Experience, our automatic document conversion service. It allows users to upload documents in an interactive fashion to inspect a document’s conversion quality. DS4SD has a simple drag-and-drop interface, making it very easy for non-experts to use. We’re also releasing deepsearch-toolkit, a Python package, where users can programmatically upload and convert documents in bulk.

Deep Search uses AI to collect, convert, curate, and ultimately search huge document collections for information that is too specific for common search tools to handle. It collects data from public, private, structured, and unstructured sources and leverages state-of-the-art AI methods to convert PDF documents into easily decipherable JSON format with a uniform schema that is ideal for today’s data scientists. It then applies dedicated natural language processing and computer vision machine-learning algorithms on these documents and ultimately creates searchable knowledge graphs.

https://research.ibm.com/blog/deep-search-toolkit

BigCommerce expands in Europe and South America

BigCommerce, an ecommerce platform for fast-growing and established B2C and B2B brands, announced it has grown its European presence into Austria, Denmark, Norway and Sweden, and its Latin American presence into Peru. With its latest expansions and fully localized tech and agency partner ecosystem, BigCommerce will enable merchants of all sizes and complexities to build and scale their online businesses to deliver seamless shopping experiences to local and cross-border customers. Merchants can:

  • Create versatile shopping experiences. In addition to a fully translated control panel and native-language support, retailers can use multi-language, multi-currency, secure payment capabilities and create local storefronts or expand their customer reach in new markets.
  • Deliver an omnichannel shopping experience and surface new revenue streams. BigCommerce’s open SaaS approach enables direct integrations with global channels like Facebook Shopping, Instagram Shopping and TikTok.
  • Tap into B2B functionality to build B2B storefronts. BigCommerce’s suite of native functionality and ecosystem of integration partner applications make it easier for B2B merchants to offer B2C-like experiences.
  • Leverage headless technology for future-oriented storefronts. As a member of the MACH Alliance, BigCommerce’s integrations with top front-end frameworks and best-of-breed technology ecosystem enable businesses to evolve from monolithic architectures to future-proof tech stacks.

https://www.bigcommerce.com

Ontotext releases GraphDB 10

GraphDB 10.0 is the first major release since GraphDB 9.0 was released in September 2019. It implements next generation, simpler and more reliable cluster architecture to deliver better resilience with reduced infrastructure costs. GraphDB 10 lowers the complexity of operations with better automation interfaces and a self-organized cluster for automated recovery. Deployment and packaging optimizations allow for effortless upgrades across the different editions of the engine, all the way from GraphDB Free to the Enterprise Edition. The improved full-text search (FTS) connectors of GraphDB 10 enable more comprehensive filtering as well as easier downstream data replication. Finally, parallelization of the path search algorithms brings massive improvement in graph analytics workloads through better exploitation of multi-core hardware.

Unlike previous versions, GraphDB 10 is packaged as a single distribution that can run in Free, Standard or Enterprise Edition modes depending on the currently set license. It requires zero development effort to pass from one edition to another. It is also possible to export a repository with an expired license so users are never locked out of their own data. Two major areas of improvement coming in 10.1 will be query performance optimization and availability on some of the major cloud platforms.

https://www.ontotext.com

DataStax’s Astra Streaming now supports for Kafka and RabbitMQ

DataStax announced the general availability of Astra Streaming, a managed messaging and event streaming service built on Apache Pulsar. Now featuring built-in API-level support for Kafka, RabbitMQ and Java Message Service (JMS), Astra Streaming makes it easier for enterprises to get real-time value from their data-in-motion. Capabilities include:

  • Mobilizes all data-in-motion An enterprise’s data-in-motion encompasses all data in platforms that provide streaming, queuing and pub/sub capabilities, Astra Streaming can address these use cases at the scale enterprises need.
  • Modernizes event-driven architectures: Seamlessly leverage existing messaging/pub sub apps and turn them into streaming apps with a drop-in replacement; easily modernize Kafka applications with zero rewrites
  • Runs across an entire IT estate: multi-cloud + on prem: Supports a unified event fabric that stretches across an enterprise’s data-in-motion spread across their entire data estate: on premises, in the cloud and at the edge.
  • Powers a real-time data ecosystem: Through a wide range of connectors, Astra Streaming is connected to an enterprise’s data ecosystem, enabling real-time data to flow instantly from data sources and applications to streaming analytics and machine learning systems. It’s also integrated with Astra DB, powering its CDC capabilities.

https://www.datastax.com/press-release/datastax-s-astra-streaming-goes-ga-with-new-built-in-support-for-kafka-and-rabbitmq

Gilbane Advisor 6-29-22 — invisible XML, invisible misinformation

This week we feature articles from Kurt Cagle, and Alberto Romero.

Additional reading comes from Elizabeth Gibney, Jason Cohen, and Melissa Heikkilä & Will Douglas Heaven.

News comes from Tellius, Sinequa, TransPerfect & Acrolinx, and the Content Authenticity Initiative.

Note: We’re taking next week off, so our next issue will be on July 13.

Reminder: If you’ve missed any recent issues you can see them here.


Opinion / Analysis

The second coming of XML

When XML was first introduced, the W3C XML Working Group took a very unusual step: They created a language for transformations. This effort is now leading to a re-emergence of XML as the need for mapping between data representations becomes more and more pressing.

Kurt Cagle explains why this is the case. You don’t have to be a developer to understand the code samples, but even if you skip over them you should still see why he is right. (8 min, or 6 min if you skip over the code).

https://www.datasciencecentral.com/the-second-coming-of-xml/

AI has an invisible misinformation problem

Perhaps obvious, but Alberto Romero highly recommends reading until the end, and you should. (7 min).

https://thealgorithmicbridge.substack.com/p/ai-has-an-invisible-misinformation

All Gilbane Advisor issues

More Reading…


Content technology news

Tellius and Databricks partner to democratize data analysis

Customers get the ability to run Tellius natural language search queries and automated insights directly on Databricks’ Lakehouse Platform.
https://gilbane.com/2022/06/tellius-and-databricks-partner-to-democratize-data-analysis/

Sinequa adds neural search to Search Cloud

As an optional capability of Sinequa’s Enterpise Search Cloud platform, Neural Search uses four pre-trained deep learning language models.
https://gilbane.com/2022/06/sinequa-adds-neural-search-to-search-cloud/

TransPerfect GlobalLink CCMS upgrades Acrolinx Connector for Astoria

Astoria is an SaaS platform for building, managing, and publishing XML content, and Acrolinx is a platform for content quality optimization.
https://gilbane.com/2022/06/transperfect-globallink-ccms-upgrades-acrolinx-connector-for-astoria/

CAI releases digital content provenance tools

Enables developers to integrate digital content provenance across web, desktop, or mobile projects with the C2PA technical specification.
https://gilbane.com/2022/06/cai-releases-digital-content-provenance-tools/

All content technology news


The Gilbane Advisor is curated by Frank Gilbane for content technology, computing, and digital experience professionals. The focus is on strategic technologies. We publish recommended articles and content technology news weekly. We do not sell or share personal data.

Subscribe | Feed | View online | Editorial policy | Privacy policy | Contact

Acquia adds data subject deletion requests to Acquia CDP

Acquia announced new regulatory compliance features that help organizations using Acquia Customer Data Platform (CDP) to comply with data subject requests and privacy laws in general. Using a new self-service interface, organizations can rapidly process “Right to Erasure” (otherwise known as “Right to be Forgotten”) requests associated with regulations such as GDPR, CCPA, and more from their customers. The feature for legal and compliance workflows is to make it simple for organizations using Acquia CDP to process deletion requests from their own customers, ensuring that these requests are handled quickly.

Other recent self-service updates include secure credentials management for Acquia CDP out-of-the-box connectors. Organizations can now generate and manage their own credentials for pre-built connectors to external services such as Facebook or Google. In addition, they can set up new credentials for their own custom connectors. Both self-service credentials management and compliance features are meant to accelerate workflows within Acquia CDP, without having to wait for assistance from an Acquia customer support team member.

https://www.acquia.com

W3C to become a public-interest non-profit organization

From the W3C…

The World Wide Web Consortium is set to pursue 501(c)(3) non-profit status. The launch as a new legal entity in January 2023 preserves the core mission of the Consortium to shepherd the web by developing open standards with contributions from W3C Members, staff, and the international community.

At the operational level, which is not changing, W3C Members are bound together for our technical work, united around the W3C’s mission to lead the web to its full potential by creating open standards that ensure that the web remains open, accessible, internationalized, secure, and interoperable for everyone around the globe.

We need a structure where we meet at a faster pace the demands of new web capabilities and address the urgent problems of the web. The W3C Team is small, bounded in size, and the Hosted model hinders rapid development and acquisition of skills in new fields.

We need to put governance at the center of the new organization to achieve clearer reporting, accountability, greater diversity and strategic direction, better global coordination. A Board of Directors will be elected with W3C Member majority. It will include seats that reflect the multi-stakeholder goals of the Web Consortium. We anticipate to continue joint work with today’s Hosts in a mutually beneficial partnership.

As important as all these points are, they only represent a change to the shell around W3C. The proven standards development process must and will be preserved.

W3C processes promote fairness, enable progress. Our standards work will still be accomplished in the open, under the W3C Process Document and royalty-free W3C Patent Policy, with input from the broader community. Decisions will still be taken by consensus. Technical direction and Recommendations will continue to require review by W3C Members – large and small. The Advisory Board will still guide the community-driven Process Document enhancement. The Technical Architecture Group will continue as the highest authority on technical matters.

Our transition to launch the legal entity includes concrete stages – adoption of Bylaws: filing for 501(c)(3) non-profit status; election and seating of a Board of Directors – all to transfer staff, Member contracts, and operations to the new structure.

https://www.w3.org/2022/06/pressrelease-w3c-le.html.en

Tellius and Databricks partner to democratize data analysis

Tellius announced a partnership with Databricks to give joint customers the ability to run Tellius natural language search queries and automated insights directly on the Databricks Lakehouse Platform, powered by Delta Lake, without the need to move any data.

With Tellius, organizations can search and analyze their data to identify what is happening with natural language queries, understand why metrics are changing via AI-powered Insights, and determine next best actions with deep insights and AutoML. Connecting to Delta Lake on Databricks only takes a few clicks, and then users can perform a natural language search of their unaggregated structured and unstructured data to answer their own questions. They can drill down to get granular insights, leverage single-click AI analysis to uncover trends, key drivers, and anomalies in their data, and create predictive models via AutoML in Tellius. Answers and insights can be utilized to write back to source applications to operationalize insights. Faster data collaboration helps democratize data access across analytics teams with less worrying about performance or IT maintenance.

https://www.tellius.com/tellius-and-databricks-partner-to-deliver-ai-powered-decision-intelligence-for-the-data-lakehouse/

« Older posts Newer posts »

© 2024 The Gilbane Advisor

Theme by Anders NorenUp ↑