The Gilbane Advisor

Curated for content, computing, and digital experience professionals


Google announces BigLake, to unify data lakes and data warehouses across clouds

From the Google Cloud blog…

The volume of valuable data that organizations have to manage and analyze is growing at an incredible rate. This data is increasingly distributed across many locations, including data warehouses, data lakes, and NoSQL stores.

Today, we’re excited to announce BigLake, a storage engine that allows you to unify data warehouses and lakes. BigLake gives teams the power to analyze data without worrying about the underlying storage format or system, and eliminates the need to duplicate or move data, reducing cost and inefficiencies. With BigLake, users gain fine-grained access controls, along with performance acceleration across BigQuery and multicloud data lakes on AWS and Azure. BigLake also makes that data uniformly accessible across Google Cloud and open source engines with consistent security. BigLake enables you to:

  • Extend BigQuery to multicloud data lakes and open formats such as Parquet and ORC with fine-grained security controls, without needing to set up new infrastructure.
  • Keep a single copy of data and enforce consistent access controls across analytics engines of your choice, including Google Cloud and open-source technologies such as Spark, Presto, Trino, and TensorFlow.
  • Achieve unified governance and management at scale through seamless integration with Dataplex.

https://cloud.google.com/blog/products/data-analytics/unifying-data-lakes-and-data-warehouses-across-clouds-with-biglake

CYTRIO adds GDPR support to data privacy rights management solution

CYTRIO, a data privacy compliance company, added support for the European Union’s General Data Protection Regulation (GDPR) to its all-in-one, cloud-native data privacy rights management automation solution. GDPR impacts any company, regardless of location, that collects, uses, shares, or stores personal information from European Union citizens. In November, the company released its solution to help companies of all sizes comply with the growing list of data privacy regulations in the U.S., including CCPA, CPRA, VCDPA, and CPA. GDPR gives EU citizens certain rights over the personal information that a company collects or uses, including the Right to Access and the Right to Erasure (Delete).

CYTRIO offers out-of-the-box workflows and automated data discovery to help companies reduce the time and cost of responding to a data subject access request (DSAR). The secure consumer-facing portal provides a 2-click process for the consumer to submit a DSAR. CYTRIO also provides Article 30 Record of Processing Activity (ROPA) reports to meet audit requirements.

CYTRIO helps automate GDPR and CCPA DSAR compliance management with all-inclusive, simple usage-based pricing. Companies pay based on the number of DSARs processed by CYTRIO. The first six DSARs are free. Any company can sign up with no upfront cost and connect all in-scope, supported data sources.

https://cytrio.com/cytrio-adds-gdpr-support-to-all-in-one-data-privacy-rights-management-solution/

Airbyte Cloud now available in U.S.

Airbyte, creators of an open-source data integration platform, made its cloud service for data movement and unifying data integration pipelines available in the U.S. Airbyte Cloud’s pricing model is based on compute time, which can be less expensive than the industry-norm volume-based pricing, which becomes cost-prohibitive when replicating high volumes of data.

Airbyte’s open-source data integrations focus on solving two problems. First, companies have to build and maintain data connectors on their own because most of the less popular “long tail” data connectors are not supported by closed-source ELT technologies. Second, data teams often have to do custom work around pre-built connectors to make them work within their unique data infrastructure. In addition to providing hosting and management, Airbyte Cloud enables companies to have multiple workspaces and provides access management for their teams.

The company also announced cooperation with open-source maintainers within its user community. Airbyte will provide compensation for helping deliver new features and bug fixes for the continuously growing list of data connectors. Contributors can earn money for work on data connectors, for finding software bugs, and for bug fixes.

https://airbyte.com

Gilbane Advisor 3-30-22 — neurosymbolic hybrid, content understanding

This week we feature articles from Gary Marcus and Daniel Tunkelang.

News comes from Access Innovations, Brightspot, Super.AI, and Indico Data.

We’ll be off next week, back the week after.


Opinion / Analysis

Deep learning is hitting a wall

There has always been a healthy skepticism that deep learning would take us to artificial general intelligence (AGI). But the genuinely amazing things that can be accomplished with the combination of deep learning, big data, and computing power led to a lazy optimism. Fortunately, that is changing, and it is increasingly clear that deep learning alone is insufficient to accomplish much of what is expected. Gary Marcus provides historical and recent context and makes a case for a neurosymbolic hybrid approach. Not too technical.

https://nautil.us/deep-learning-is-hitting-a-wall-14467/

Content similarity

Content classification and annotation offer useful approaches for content understanding, recognizing whether a piece of content is about a particular topic or mentions a particular entity. But most content exists in a space that is too rich to reduce to classification and annotation. A document is more than a category and a bag of entities. For content understanding to be worthy of the name, it needs to embrace the richness of the content it represents… A more granular approach to content understanding focuses on the similarity between documents.

Daniel Tunkelang’s latest post in his series on content understanding techniques. A bit technical.

https://medium.com/content-understanding/content-similarity-d3c7a9cd7d44
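
As a rough illustration of what "similarity between documents" can mean in practice, here is a minimal sketch that compares two texts by the cosine of the angle between their word-count vectors. It is not taken from Tunkelang's post; the function names `term_counts` and `cosine_similarity` and the simple word tokenizer are assumptions made for this example, and production systems more commonly compare learned embeddings than raw counts.

```python
import math
import re
from collections import Counter


def term_counts(text: str) -> Counter:
    """Lowercase the text and count its alphanumeric word tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a: str, b: str) -> float:
    """Cosine of the angle between the two documents' term-count vectors."""
    va, vb = term_counts(a), term_counts(b)
    # Only terms present in both documents contribute to the dot product.
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a document with no tokens is treated as similar to nothing
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    print(cosine_similarity("data lake storage", "data warehouse storage"))
```

Scores range from 0 (no shared terms) to 1 (identical term distributions), so unlike a category label the result expresses degrees of relatedness between any two documents.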



Content technology news

Indico Data updates its Unstructured Data Platform

Indico 5 aims to drive efficiency and accelerate automation and intelligent document processing (IDP) initiatives using unstructured data.
https://gilbane.com/2022/03/indico-data-updates-its-unstructured-data-platform/

Brightspot and ethinking partner to deliver CMS transformation

Brightspot CMS with ethinking’s ‘XP Layer’ headless low-code software provides an interface between data sources and content channels.
https://gilbane.com/2022/03/brightspot-and-ethinking-partner-to-deliver-cms-transformation/

Access Innovations announces Video/Audio to Text to Tagging solution for video transcript search

A “point in time” search allows users to find the precise time within the video/audio where the speaker discusses a specific topic.
https://gilbane.com/2022/03/access-innovations-announces-video-audio-to-text-to-tagging-solution-for-video-transcript-search/

Super.AI updates its Unstructured Data Processing platform

Unifies intelligent document processing (IDP), human-in-the-loop (HITL), redaction, and processing of any data type.
https://gilbane.com/2022/03/super-ai-updates-its-unstructured-data-processing-platform/



The Gilbane Advisor is curated by Frank Gilbane for content technology, computing, and digital experience professionals. The focus is on strategic technologies. We publish recommended articles and content technology news weekly. We do not sell or share personal data.


Indico Data updates its Unstructured Data Platform

Indico Data, the unstructured data company, unveiled Indico 5, a major release of its AI-powered Unstructured Data Platform. Indico 5 addresses the rapidly growing market demand for software solutions that drive efficiency and accelerate automation and intelligent document processing (IDP) initiatives using unstructured data.

The Indico Unstructured Data Platform, through a combination of a proprietary training data corpus, composite AI technology, and a machine teaching application interface, drives AI success, with more than 90% of projects in production.

Indico 5 was purpose-built to streamline some of the toughest unstructured data automation problems in IDP, such as unbundling PDF documents and ensuring that human corrections to models made during the review cycle can be automated in future situations. The addition of linked relationship labeling and a new, more intuitive visual interface empowers organizations to easily automate, analyze, and apply unstructured data, illuminating opportunities, improving efficiency, and reducing risk. New features include Automatic Document Unbundling, Linked Labels, Staggered Loop Training, Universal Document Support, and Workflow Canvas.

https://www.indicodata.ai/indico-5

Access Innovations announces Video/Audio to Text to Tagging solution for video transcript search

Access Innovations, Inc. announced Video/Audio to Text to Tagging (VATT), a solution that translates audio files to time-stamped text transcripts for indexing, classifying, and enriching by Data Harmony Hub. Originally developed to improve search precision on training videos for a large chemical manufacturer, the new tagging capabilities can be used on any video or audio content from lectures, demonstrations, conferences, and more. Once metadata tagging is completed by Data Harmony Hub, a “point in time” search allows users to find the precise time within the video/audio where the speaker or narrator discusses a specific topic, without wasting time browsing and scrolling through the entire video to find the information they need.

Organizations are generating video content and placing it on YouTube and other video aggregation service platforms. If transcripts are not available and searchable, the viewer is disappointed when they attempt to search a library of videos. In most cases, search is only available on the video title, the speaker or performer name, and possibly the date. With the video/audio to text to tagging solution, viewers enjoy a more robust search experience, reduce noise within the search results, and pinpoint topics and concepts of interest.

https://www.accessinn.com

Gilbane Advisor 3-23-22 — hypergrowth myth, operationalizing privacy

This week we feature articles from Jason Cohen and Rachel Dulberg.

Additional reading is from John Glaser & Elizabeth Gardner, Christine Moorman, Mike Melanson, and Thomas Claburn.

News comes from Progress, Liferay, Semantic Web Co & Wand, and Stepes.


Opinion / Analysis

The Elephant in the room: The myth of exponential hypergrowth

Fast-growing startups are frequently described as “exponential,” especially when the product is “viral.” Turns out, this is incorrect, even for Facebook and Slack. If you have an incorrect model, you don’t understand growth, which means you can’t control it, nor predict it. Here is a different model to understand how companies actually grow.

This is a must-read from Jason Cohen, not just for investors and startups but also for business, marketing, and product managers. Well written with lots of great supporting charts. h/t: @dharmesh

https://longform.asmartbear.com/docs/exponential-growth/

The Privacy Program Playbook Part I — How to design a winning privacy roadmap

Operationalising privacy is hard. Here’s how to design, plan and implement a successful privacy program (with your sanity intact).

Rachel Dulberg to the rescue…

https://medium.com/@rachel_d_ai/the-privacy-program-playbook-part-i-8f943fc05760



Content technology news

Progress updates multi-channel digital experience platform

Progress Sitefinity DX adds a layer of composability to help organizations develop and deploy multichannel digital experiences with .NET 6.
https://gilbane.com/2022/03/progress-updates-multi-channel-digital-experiences/

Liferay announces cloud-based DXP-as-a-Service offering

It includes content management, account management, analytics, commerce, personalization, and low-code capabilities, delivered as a service.
https://gilbane.com/2022/03/liferay-announces-cloud-based-dxp-as-a-service-offering/

Semantic Web Company and WAND Inc. announce partnership

Combining WAND taxonomies with Semantic Web Company’s AI search algorithms, knowledge graphs, and taxonomy management system increases algorithm accuracy.
https://gilbane.com/2022/03/semantic-web-company-and-wand-inc-announce-partnership/

Stepes launches continuous terminology management solution

Enterprises can continuously develop product glossaries and manage multilingual terminology for improved linguistic quality and consistency.
https://gilbane.com/2022/03/stepes-launches-continuous-terminology-management-solution/




Super.AI updates its Unstructured Data Processing platform

Super.AI announced the latest version of the company’s Unstructured Data Processing (UDP) Platform, to make it easier for global business services and IT departments to expand the scope and pace of intelligent automation.

Shared services centers typically deploy multiple point solutions for document processing, sensitive information redaction, and processing other forms of unstructured data such as emails, text, images, video, and audio. Super.AI’s UDP Platform unifies intelligent document processing (IDP), human-in-the-loop (HITL), redaction, and processing of any data type — reducing the number of platforms needed for intelligent automation. Enhancements in the latest release include:

  • Next-generation intelligent document processing (IDP) that utilizes artificial intelligence technology to deliver the highest quality results.
  • Efficient and accurate document, image, audio, and video redaction to streamline regulatory compliance and reduce risk.
  • Reimagined human-in-the-loop capabilities for data validation and labeling, allowing organizations to incorporate third-party and in-house experts into automation workflows.
  • 150+ quality control mechanisms built into the platform that guarantee output and ensure service level agreements (SLAs) are met.

https://super.ai


© 2024 The Gilbane Advisor
