Curated for content, computing, data, information, and digital experience professionals

Category: Content creation and design (Page 3 of 75)

Technologies and strategies for authoring and editing, including word processors, structured editors, web and page layout and formatting, content conversion and migration, multichannel content, structured and unstructured  data integration, and metadata creation. 

AI agents can now act directly on WordPress.com sites

WordPress.com, Automattic’s hosted website platform built on the open source WordPress software, announced the launch of new write capabilities for its Model Context Protocol (MCP) server. The update enables AI agents — including Claude, ChatGPT, and Cursor — to create, edit, and manage content on WordPress.com sites directly through natural conversation, on behalf of users.

With today’s addition of write capabilities, WordPress.com has extended the agentic web. AI agents can now actively build and manage websites. Where the original MCP server let AI agents read, the new write and content authoring capabilities let them act: drafting posts, editing pages, and managing content on behalf of users, all with explicit user confirmation at every step.

The feature was designed with safeguards to ensure users remain in control. Updates require explicit user confirmation before any action is taken, and changes to already-published content are clearly flagged as going live immediately. The MCP server is opt-in only, with nothing enabled by default.

The MCP write capabilities are available now. The feature is compatible with any MCP-enabled AI agent WordPress.com plans. The MCP server is available to all paid WordPress.com plan users at no additional cost.

https://wordpress.com/blog/2026/03/20/ai-agent-manage-content/

Aprimo unveils Agentic DAM

Aprimo, a provider of digital asset management and content operations solutions, today announced the launch of Aprimo’s Agentic DAM, the next evolution of its digital asset management designed for enterprises, where AI agents increasingly discover, interpret, and act on content.

Aprimo’s Agentic DAM enables AI agents to become first-class content consumers. These agents create, review, govern, and personalize content, while maintaining human oversight and continuous compliance. 

Aprimo’s agents operate persistently across the entire content operations process, from planning, creation, and enrichment to review, transformation, and distribution. Governance is applied continuously, not only at ingestion but at runtime across upstream creative tools and downstream marketing systems.

Aprimo organizes its Agentic DAM into specialized AI agent categories, including Planning Agents, Librarian Agents, Critic Agents, Compliance Agents, and Production Agents. Together, these agents automate repeatable tasks, enrich metadata, validate claims, generate variants, and personalize content dynamically.

Aprimo’s Agentic DAM not only governs content within the DAM repository but extends intelligence across the broader marketing and creative stack. Agents can operate upstream in creative tools to review work in progress and downstream in CMS and campaign systems to validate assets at the point of deployment.

https://www.aprimo.com/platform/digital-asset-management

Adobe and NVIDIA announce strategic partnership

Adobe and NVIDIA today announced a strategic partnership that will bring together Adobe’s creative and marketing workflows, models and technology and NVIDIA’s open models, libraries, research and accelerated computing to deliver the next generation of foundational Adobe Firefly models and creative, marketing and agentic workflows.

Firefly models will be built on NVIDIA’s computing technology and tap into NVIDIA CUDA-X, NVIDIA NeMo libraries, NVIDIA Cosmos open models, and NVIDIA Agent Toolkit software to enable interactive, high-quality creation.

Adobe and NVIDIA will also work together on NVIDIA NemoClaw— an open source stack that simplifies running OpenClaw always-on assistants more safely.

With NVIDIA, Adobe is launching a cloud-native, brand identity-preserving 3D digital twin solution (public beta). The solution creates virtual replicas of physical products that act as permanent digital identities for marketing and commerce experiences. Integrating NVIDIA Omniverse libraries into Adobe technologies, the collaboration expands support for 3D digital twin workflows built on OpenUSD for marketing content automation.

Adobe will also harness NVIDIA AI infrastructure, AI libraries, services and models to optimize its AI-powered tools across creativity, productivity and customer experience orchestration.

Adobe and NVIDIA Announce Strategic Partnership to Deliver the Next Generation of Firefly Models and Creative, Marketing and Agentic Workflows

Flux voice AI platform now supports on-the-fly configurations

Deepgram announced Flux “on-the-fly configuration” for its voice AI platform, which lets developers dynamically update speech recognition settings — such as keyterms and end-of-turn detection — during a live voice conversation without disconnecting or restarting the audio stream.

A support call moves from identity verification to troubleshooting to scheduling a follow-up. A healthcare call shifts from intake questions to medication names to billing. Each phase has different intents, different critical phrases.

Today, teams configure their ASR (automatic speech recognition) once at connection time and live with it for the entire call. They load every keyterm they might need upfront, diluting biasing effectiveness across the board, or they keep the list minimal and accept lower accuracy on critical phrases. When the conversation shifts enough that the configuration truly doesn’t fit, the options are disconnecting and reconnecting mid-call or managing multiple concurrent streams and swapping between them.

Now your ASR configuration can shift with the conversation. No more choosing between loading every keyterm upfront or accepting lower accuracy. No more static configuration that’s “good enough” for the whole call. One connection that adapts as the call unfolds.

On-the-fly configuration is available now in the Flux v2 WebSocket API.

https://deepgram.com/learn/flux-on-the-fly-configuration

Siteimprove expands its agentic content intelligence platform

Siteimprove released its latest AI agent capabilities. The updates include conversational analytics enabling non-technical users to get answers, generate reports, and dashboards using natural language. Customers also gain new content accessibility coverage for PDF and Images, and keyword intelligence for Search in the world of “Answer Engine Optimization (AEO)”.

These capabilities help customers meet digital accessibility regulations such as Americans with Disabilities Act (ADA) and European Accessibility Act (EAA) while helping brands improve discoverability across answer engines and generative engines. Capabilities include:

  • Conversational Analytics Agent: Ask questions in natural language and instantly get answers to understand what matters across analytics data – democratizing insights across teams. Teams can quickly task the agent to generate answers on campaign performance, funnel diagnostics, and recommended targets for course correction.
  • PDF and Image Accessibility Agent: PDF Validate and Contextual Image Analysis agent surfaces accessibility issues before content goes live, helping teams reduce risk earlier in the content lifecycle. This helps customers increase accessibility coverage across more content types.
  • Keyword Intelligence Agent: Expanded keyword and topic intelligence agent uncovers competitive and topical gaps, giving teams deeper insight into growth opportunities for both traditional and AI-driven search in the world of AEO.

https://www.siteimprove.com/press/siteimprove-expands-its-agentic-content-intelligence-platform

Krisp launches real-time Voice Translation SDK

Krisp announced the launch of its Voice Translation SDK, enabling CX platform developers to embed real-time multilingual voice-to-voice translation into live customer conversations. The technology has been live in production CX environments since 2025 as part of Krisp’s Call Center AI platform, operating in customer conversations globally before its SDK release.

Real-time voice translation must operate on continuous audio streams where latency, accuracy and conversational flow are tightly linked. Systems must recognize diverse accents, perform reliably in noisy environments and preserve natural turn-taking.

Krisp’s Voice Translation SDK is engineered to balance these competing constraints in live, two-way conversations. It supports any combination of over 60 languages and is optimized for synchronous interactions where clarity and conversational continuity are critical. This enables multilingual interactions within live conversations without requiring human interpreters.

The SDK is available for Windows, macOS and Web developers, allowing integration into both native and browser-based applications. To improve performance in real-world conditions, Krisp applies local Noise Cancellation before audio is processed in the cloud, isolating the primary speaker and improving recognition accuracy. The SDK also supports custom vocabulary and domain-specific dictionaries, enabling teams to enforce terminology and maintain consistency across professional environments.

https://krisp.ai/blog/real-time-voice-translation-sdk/

Dataiku launches 575 Lab, its new open source initiative for responsible AI

As AI moves from pilots to business-critical deployment, the issue is no longer access. It’s trust. Open source tools support that trust by keeping core components inspectable and standardizable, enabling stronger oversight across modern AI systems. Today, Dataiku announced the launch of the 575 Lab, Dataiku’s Open Source Office. The 575 Lab will release two new open-source toolkits designed to help enterprises make AI systems more transparent, governable, and fit for real-world use.

The 575 Lab will focus on delivering deployable tools that strengthen explainability, privacy, and governance across modern AI and agentic systems. The two initial open-source projects will be: 

  • Agent Explainability Tools that will help teams trace and understand decision-making across multi-step agent workflows, making agent decisions transparent for data scientists, compliance teams, and end users.
  • Privacy-Preserving Proxies that will enable safer use of closed-source models by protecting sensitive data end-to-end, and that teams will be able to run locally.

Both projects will be designed to support responsible enterprise AI, with a focus on reliability, security, transparency, and explainability.

The 575 Lab is now available to the community of AI specialists, data scientists, and developers responsible for creating, deploying, and scaling AI agents and applications.

https://www.dataiku.com/press-releases/dataiku-launches-575-lab/

DeepL launches voice API for real-time speech transcription and translation for instant multilingual communication

DeepL, a global AI product and research company, announced the general availability of DeepL Voice API. Developers can now integrate real-time voice transcription and translation capabilities into their applications, enhancing multilingual support for businesses.

The DeepL Voice API allows businesses to stream audio and receive transcriptions in the source language, along with translations into up to five target languages. The API provides a seamless experience, so language barriers do not hinder effective communication.

The DeepL API enables: 

  • Hire for expertise, not language coverage DeepL Voice API lets contact centers staff agents who understand the customer issue and the business context, even when they do not speak the customer’s language.
  • Expand talent pools while managing costs By reducing the need for language specific staffing, teams can centralize or distribute support more flexibly, which can lower operating costs and improve coverage planning.
  • Provide reliable coverage in urgent moments Real time translation helps teams maintain service levels during nights, weekends, and holidays, when fewer specialized language agents are available.
  • Two way understanding, not just text on screen Agents can follow the conversation through live translated audio, alongside on screen transcription and translation, so they can respond naturally and confidently in the moment.

https://www.deepl.com/en/press-release/deepl_launches_voice_api_for_real_time_speech_transcription_and_translation

« Older posts Newer posts »

© 2026 The Gilbane Advisor

Theme by Anders NorenUp ↑