Curated for content, computing, and digital experience professionals

Author: Frank Gilbane (Page 6 of 70)

Gilbane Advisor 4-12-23 — Future data stacks, digital twin KGs

This week we feature content from Barr Moses & Shane Murray, and Peter Lawrence.

Additional reading comes from Jay Graber, Kingsley Uyi Idehen, Stanford, and the National Science Foundation.

News comes from Expert[.]ai & Reveal Group, Databricks, Algolia, and Stilo.

Note that news items now link to the original source of the news rather than our 200 word summaries, which are always available here.

All previous issues are available at https://gilbane.com/gilbane-advisor-index/


Opinion / Analysis

Zero-ETL, ChatGPT, and the future of data engineering

“The post-modern data stack is coming. Are we ready?”.

Barr Moses & Shane Murray aim to help you prepare with this very readable post on promising components of future data stacks. (9 min)

https://towardsdatascience.com/zero-etl-chatgpt-and-the-future-of-data-engineering-71849642ad9c

Knowledge Graphs + Large Language Models = The ability for users to ask their own questions?

One area the article recommended above doesn’t address is the incorporation of knowledge graphs in data stacks. This is an area we are keenly interested in, especially in combination with LLMs. Peter Lawrence examines what may seem a less-than-obvious use-case (digital twins) for such a combination, and describes how a knowledge graph can be used to train an LLM to be significantly more valuable. He also illustrates why it is not easy. (12 min)

https://medium.com/@peter.lawrence_47665/knowledge-graphs-large-language-models-the-ability-for-users-to-ask-their-own-questions-e4afc348fa72

More Reading

All Gilbane Advisor issues


Content technology news

Expert[.]ai and Reveal Group partner to combine NLP and RPA

Expert[.]ai and Reveal Group partner to combine NLP and RPA — To help organizations extend the value in intelligent automation programs with natural language processing and understanding (NLP/NLU).
https://www.expert.ai/ ■ https://revealgroup.com/

Databricks announces Lakehouse for Manufacturing

Lakehouse for Manufacturing offers pre-built solutions, partner-designed Brickbuilder offerings and integrated AI capabilities.
https://www.databricks.com/solutions/industries/manufacturing-industry-solutions

Algolia introduces developer-friendly plan 

The AI Search and Discovery platform’s two new plans include a “Build” plan that is free and a “Grow” plan that offers low-priced scalability.
https://www.algolia.com

Stilo launches Migrate 5.0 

Migrate 5 provides automated content conversion from legacy content to structured DITA XML.
https://www.stilo.com/

All content technology news


The Gilbane Advisor is authored by Frank Gilbane and is ad-free, cost-free, and curated for content, computing, web, data, and digital experience technology and information professionals. We publish recommended articles and content technology news weekly. We do not sell or share personal data.

Subscribe | View online | Editorial policy | Privacy policy | Contact

Gilbane Advisor 3-29-23 — ChatGPT plugins, knowledge graph tricks

This week we feature content from OpenAI and Kurt Cagle.

Additional reading comes from Alex Hollender, Dean Allemang, and Tom Warren.

News comes from Adobe, Ontotext, Kobai, and Syncro Soft.

Note that news items now link to the original source of the news rather than our 200 word summaries, which are always available here.

The next issue will be published April 12. All previous issues are available at https://gilbane.com/gilbane-advisor-index/


Opinion / Analysis

ChatGPT plugins

Many, perhaps most, of the problems with large language models like GPT are due to data limitations (availability, age, quality). The ability to complement GPT data with external knowledge and data sources you own or trust, or are specific to your industry or business, increases model quality, context, and utility. LlamaIndex APIs can help you accomplish this. Now ChatGPT plugins take integration possibilities up a level. This post by the OpenAI team is the best place to get up-to-speed on the first batch of plugins, and what they’re planning. (12 min)

https://openai.com/blog/chatgpt-plugins

Nine ChatGPT tricks for knowledge graph workers

A helpful show-and-tell from Kurt Cagle… 

“Where things get interesting is in the realm of coding and semantics… most of these samples made use of ChatGPT+ for experimentation and the DaVince OpenAI Playground for generating more complex output.” (22 min)

https://thecaglereport.com/2023/03/16/nine-chatgpt-tricks-for-knowledge-graph-workers/

More Reading

All Gilbane Advisor issues


Content technology news

Adobe announces multiple product updates

Adobe announcements focused on generative AI services, personalization, content management, product analytics, and content supply chain.
https://news.adobe.com/home/default.aspx

Ontotext releases Metadata Studio 3.2

Enables rapid text mining development based on an organization’s knowledge graph with non-technical contributors in the loop.
https://www.ontotext.com

Kobai launches Saturn Knowledge Graph 

Kobai Saturn platform users to query data at Lakehouse scale and integrate every use case and function into a single semantic layer.
https://www.kobai.io/products/kobai-saturn

Syncro Soft releases Oxygen XML suite 25.1 

Updates to Oxygen XML Editor, Author, Developer, Web Author, Publishing Engine, WebHelp, PDF Chemistry, and Scripting, and Oxygen Feedback.
https://www.sync.ro

All content technology news


The Gilbane Advisor is authored by Frank Gilbane and is ad-free, cost-free, and curated for content, computing, web, data, and digital experience technology and information professionals. We publish recommended articles and content technology news weekly. We do not sell or share personal data.

Subscribe | View online | Editorial policy | Privacy policy | Contact

Gilbane Advisor 3-15-23 — Building ChatGPT, Data-Centric AI

This week we feature content from Rama Ramakrishnan, and a course by, Anish Athalye, Curtis Northcutt, Jonas Mueller, Cody Coleman, Alexandra Zytek, & Sharon Zhou.

Additional reading comes from Benjamin Marie, Sarah Gooding, and Dmitry Kan.

News comes from Ontotext, Acquia, Lucid Software, and Databricks.

👉 No issue next week — we’ll be back March 29th.

Note that news items now link to the original source of the news rather than our 200 word summaries, which are always available here.

All previous issues are available at https://gilbane.com/gilbane-advisor-index/


Opinion / Analysis

The Road to ChatGPT — An informal explainer on how ChatGPT was built

Rama Ramakrishnan…
“I have written an informal “explainer” on how ChatGPT was built. I have tried to focus on the key ideas and have kept technical details to the bare minimum. Please view in full-screen mode. I hope you find it useful.”

This is excellent. You truly don’t need to be technical to follow it. Don’t let the 104 slides scare you off; you can fly through most of them. 

https://www.linkedin.com/feed/update/urn:li:activity:7038334518004482048/

Introduction to Data-Centric AI

“In real-world applications, data is messy and improving models is not the only way to get better performance. You can also improve the dataset itself rather than treating it as fixed. Data-Centric AI (DCAI) is an emerging science that studies techniques to improve datasets, which is often the best way to improve performance in practical ML applications.”

MIT has made this course freely available to all. Lectures and materials are online. This is a valuable resource for professional practitioners as well as students. (1 min for the course description – the rest is up to you).

https://dcai.csail.mit.edu

More Reading

All Gilbane Advisor issues


Content technology news

Ontotext releases GraphDB 10.2

Model Serving provides fully managed production machine learning (ML) capabilities natively within the Databricks Lakehouse Platform.
https://www.ontotext.com/

Acquia adds integrations to Digital Asset Management (DAM) System

The new integrations give customers more control over brand consistency, extend the value of content and data created in other systems.
https://www.acquia.com/

Lucid Software announces new integrations to enhance collaboration

Integrations streamline processes and workflows within a company’s tech stack and creates a foundation for effective and efficient collaboration.
https://lucid.co/

Databricks launches Databricks Model Serving

Model Serving provides fully managed production machine learning (ML) capabilities natively within the Databricks Lakehouse Platform.
https://www.databricks.com/

All content technology news


The Gilbane Advisor is authored by Frank Gilbane and is ad-free, cost-free, and curated for content, computing, web, data, and digital experience technology and information professionals. We publish recommended articles and content technology news weekly. We do not sell or share personal data.

Subscribe | View online | Editorial policy | Privacy policy | Contact

Gilbane Advisor 3-8-23 — LLMs, KBs & LlamaIndex, BigQuery + Search Console

This week we feature articles by Purvanshi Mehta, and Daniel Waisberg, Gaal Yahas, & Haim Daniel.

Additional reading from Jeremy Perdue, Rachel Gordon, and Nandan Grover.

News comes from Quark, TigerGraph, Slang Labs, and DeltaXML.

Note that news items now link to the original source of the news rather than our 200 word summaries, which are always available here.

All previous issues are available at https://gilbane.com/gilbane-advisor-index/


Opinion / Analysis

Indexing and Querying external KBs through GPT — GPT Index (LlamaIndex)

Purvanshi Mehta provides an overview of how you can use the GPT Index (now LlamaIndex) to customize large language models by connecting with external knowledge bases relevant to your application domain. She helpfully includes examples for different types of data sources. (6 min).

https://medium.datadriveninvestor.com/querying-external-kbs-through-gpt-gpt-index-llamaindex-fd8cbad2a4c

Bulk data export: a new and powerful way to access your Search Console data

From the Google Search Console Team…

You can configure an export in Search Console to get a daily data dump into your BigQuery project. The data includes all your performance data, apart from anonymized queries, which are filtered out for privacy reasons … This means you can explore your data to its maximum potential, joining it with other sources of data and using advanced analysis and visualization techniques. … This data export could be particularly helpful for large websites with tens of thousands of pages, or those receiving traffic from tens of thousands of queries a day (or both!). (3 min).

https://developers.google.com/search/blog/2023/02/bulk-data-export

More Reading

All Gilbane Advisor issues


Content technology news

Quark releases Quark Publishing Platform NextGen v3.0

The content automation platform is designed to simplify the complexities associated with enterprise content lifecycle management.
https://www.quark.com ■ https://developer.quark.com

TigerGraph expands cloud capabilities

Provides a comprehensive, streamlined approach to deploy and maintain multiple graph database solutions with visual analytics and machine learning tools.
https://www.tigergraph.com/

Slang Labs launches CONVA

CONVA is a full-stack solution that provides smart and highly accurate multilingual voice search capabilities inside e-commerce apps.
https://www.slanglabs.in/media

DeltaXML eases HTML table comparison with XML Compare 14

The finer-grained detail for cells and rows are easier to display and provide greater understanding when reviewed.
https://www.deltaxml.com

All content technology news


The Gilbane Advisor is authored by Frank Gilbane and is ad-free, cost-free, and curated for content, computing, web, data, and digital experience technology and information professionals. We publish recommended articles and content technology news weekly. We do not sell or share personal data.

Subscribe | View online | Editorial policy | Privacy policy | Contact

Gilbane Advisor 3-1-23 — Emergent properties in ML, RDF modeling

This week we feature articles by Jacob Steinhardt, and Dean Allemang.

Additional reading from Rocío Txabarriaga, Eric Broda, and Tony Seale.

News comes from MadCap Software & IXIASOFT, BetterCommerce, Wondershare, and Contentstack.

Note that news items now link to the original source of the news rather than our 200 word summaries, which are always available here.

All previous issues are available at https://gilbane.com/gilbane-advisor-index/


Opinion / Analysis

Emergent deception and emergent optimization

Emergent properties are in common in nature, and are often surprising. They are also found in machine learning. Jacob Steinhardt has a series of posts on emergence in machine learning worth checking out, but you can start with his most recent, and timely, piece. (17 min).

I’ve previously argued that machine learning systems often exhibit emergent capabilities, and that these capabilities could lead to unintended negative consequences. But how can we reason concretely about these consequences? … I’ll describe two specific emergent capabilities that I’m particularly worried about: deception (fooling human supervisors rather than doing the intended task), and optimization (choosing from a diverse space of actions based on their long-term consequences).

https://bounded-regret.ghost.io/emergent-deception-optimization/

Why I’m not excited about RDF-Star

Well, the title is a bit clickbaity. But Dean Allemang’s article illustrates an important point about RDF modeling in general. And if like me, you weren’t aware of RDF-Star, an added benefit is you’ll learn enough to consider how you might use it when the W3C standard becomes a recommendation. (10 min).

https://medium.com/@dallemang/why-im-not-excited-about-rdf-star-5f1993fd0ead

More Reading

All Gilbane Advisor issues


Content technology news

MadCap Software acquires IXIASOFT

Adds enterprise DITA CCMS to support content strategies for creating, translating, and delivering consistent, up-to-date content tailored to roles.
https://www.madcapsoftware.comhttps://www.ixiasoft.com

Contentstack announces Contentstack Launch

Extends Contentstack’s product suite, providing a composable, automated, digital experience stack from the front-end to the back-end.
https://www.contentstack.com/

BetterCommerce adds headless CMS functionality to its commerce stack

The headless, composable CMS functionality joins existing modules in the commerce stack including PIM, eCommerce, OMS, Analytics and Engage.
https://www.bettercommerce.io

Wondershare releases EdrawMind 10.5

Features new collaborative mind mapping and brainstorming tools to design solutions collaboratively and respond to trends and changes.
https://www.edrawsoft.com/edrawmind/

All content technology news


The Gilbane Advisor is authored by Frank Gilbane and is ad-free, cost-free, and curated for content, computing, web, data, and digital experience technology and information professionals. We publish recommended articles and content technology news weekly. We do not sell or share personal data.

Subscribe | View online | Editorial policy | Privacy policy | Contact

Gilbane Advisor 2-22-23 — ChatGPT, semantic grammar, hashtags

This week we feature articles by Stephen Wolfram, and Mark Wyner.

Additional reading from Steve Nouri, James Vincent, and Steve Nadis.

News comes from TerminusDB, Acquia, PayloadCMS, and Stilo.

Note that news items now link to the original source of the news rather than our 200 word summaries, which are always available here.

All previous issues are available at https://gilbane.com/gilbane-advisor-index/


Opinion / Analysis

What is ChatGPT doing … and why does it work?

Last week’s featured article by Ted Chiang used a familiar analogy to somewhat demystify why ChatGPT appears to work well while sometimes going off the rails. This week Stephen Wolfram explains what is actually going on under the covers. This is a long (75 min) read, but fascinating, well-written, and guaranteed to make you smarter about large language models in general. 

https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-doing-and-why-does-it-work/

Hashtag accessibility, by everyone for everyone

Every hashtag on every post on every platform should always be pascal case (a.k.a. camel case). There are a number of reasons why this is important, the most notable being that screen readers have a hard time with conjoined words.

Mark Wyner illustrates why this is a no-brainer (4 min).

https://markwyner.medium.com/hashtag-accessibility-by-everyone-for-everyone-298667b2d891

More Reading

All Gilbane Advisor issues


Content technology news

TerminusDB launches TerminusCMS

The open-source, headless, developer-focused content management system (CMS) is built on an RDF graph database that connects JSON documents into a graph.
https://terminusdb.com/blog/category/content-knowledge/

Acquia announces new CDP features, pricing tiers, and delivery options

Acquia CDP supports composable customer experience and data strategies through integrations with Acquia DXP and third-party marketing products.
https://www.acquia.com

Payload releases CMS version 1.6.0

In addition to optimizing the TypeScript interface of the Local API, the entire API has gotten a significant overhaul.
https://payloadcms.com

Stilo management buys out Stilo Corporation

Acquisition transfers all IP, trademarks, and customer contracts and XML software products: OmniMark, Migrate, OptimizeR, and Analyzer.
https://www.stilo.com/

All content technology news


The Gilbane Advisor is authored by Frank Gilbane and is ad-free, cost-free, and curated for content, computing, web, data, and digital experience technology and information professionals. We publish recommended articles and content technology news weekly. We do not sell or share personal data.

Subscribe | View online | Editorial policy | Privacy policy | Contact

Gilbane Advisor 2-15-23 — Blurry JPEGS, Meta pixels, ontologies

This week we feature articles by Ted Chiang, and Maria Puertas & Simon Fondrie-Teitler.

Additional reading from Heather Hedden, Benj Edwards, and Tom Warren.

News comes from Weaviate, Expert[.]ai, MadCap Software, and Open Applications Group.

All previous issues are available at https://gilbane.com/gilbane-advisor-index/


Opinion / Analysis

ChatGPT is a blurry JPEG of the Web

This analogy to lossy compression is not just a way to understand ChatGPT’s facility at repackaging information found on the Web by using different words. It’s also a way to understand the “hallucinations,” or nonsensical answers to factual questions, to which large language models such as ChatGPT are all too prone.

Ted Chiang does an excellent job of communicating to a non-technical audience why caution is called for in the use of large language models. (13 min).

https://www.newyorker.com/tech/annals-of-technology/chatgpt-is-a-blurry-jpeg-of-the-web

How to fix your organization’s Meta pixel problem

Do you know whether, or what, information you are tracking and sending to Meta / Facebook? Does your company’s privacy policy makes claims about protecting customer data? If so, you may want to verify those claims are, in fact, supported across the organization. This may not be easy due to old code, turnover, size, and the number of websites and applications you have. The Markup’s Maria Puertas and Simon Fondrie-Teitler show you how to use their free research tool get started. (9 min).

https://themarkup.org/levelup/2023/01/31/in-2023-resolve-to-fix-your-organizations-meta-pixel-problem

More Reading

All Gilbane Advisor issues


Content technology news

Weaviate releases generative search module

Combines language abilities like ChatGPT’s with a vector database that is relevant, secure, real time, and less prone to hallucination.
https://gilbane.com/2023/02/weaviate-releases-generative-search-module/

Expert[.]ai announces new features to hybrid natural language platform

New features include expanded on-premise deployment options, enhanced taxonomy management via 3rd-party knowledge sources, and library integrations.
https://gilbane.com/2023/02/expert-ai-announces-new-features-to-hybrid-natural-language-platform/

Madcap Software adds cloud-based authoring to MadCap Central

You can now create and edit files, and maintain projects uploaded to MadCap Central independent of MadCap Flare.
https://gilbane.com/2023/02/madcap-software-adds-cloud-based-authoring-to-madcap-central/

Open Applications Group releases IOF Ontology Version 202301

Includes IOF Core in the Released status and the Supply Chain and the Maintenance Reference Ontologies in the Provisional Status.
https://gilbane.com/2023/02/oagi-releases-iof-ontology-version-202301/

All content technology news


The Gilbane Advisor is authored by Frank Gilbane and is ad-free, cost-free, and curated for content, computing, web, data, and digital experience technology and information professionals. We publish recommended articles and content technology news weekly. We do not sell or share personal data.

Subscribe | View online | Editorial policy | Privacy policy | Contact

Gilbane Advisor 2-8-23 — machine translation, contextual computing

This week we feature articles by Alan Morrison, and Rocío Txabarriaga, Yifan Wang, Zewei Sun, Shanbo Cheng, Weiguo Zheng, & Mingxuan Wang.

Additional reading from Yennie Jun, Dean Allemang, Sean Hollister.

News comes from Netlify & Gatsby, CrafterCMS, W3C, and AesirX.

All previous issues are available at https://gilbane.com/gilbane-advisor-index/


Opinion / Analysis

Enabling contextual computing in today’s enterprise information fabrics

What do get when you combine network effects, decentralized knowledge graphs, statistical machine learning, and blockchains? Alan Morrison: “… a siloless network of networks approach which P2P data networks such as IPFS are enabling will eventually result in…” a new level of connectivity, extended contextual computing, and value. OriginTrail is an instructive example. (5 min).

https://www.datasciencecentral.com/enabling-contextual-computing-in-todays-enterprise-information-fabrics/

Revisiting controlled language for better machine translation quality

Rocío Txabarriaga reports on (with link to) a paper proposing a methodology to more effectively and efficiently leverage style quality to improve controlled language MT results. The authors address limitations of current methods of accounting for language style, and show how their approach reduces the need for continuous model and fine tuning. (summary 2 min, paper 20 min).

https://slator.com/revisiting-controlled-language-better-machine-translation-quality/

More Reading

All Gilbane Advisor issues


Content technology news

Netlify acquires Gatsby

Netlify is a platform for modern web development and the acquisition is aimed at accelerating adoption of composable web architectures.
https://gilbane.com/2023/02/netlify-acquires-gatsby/

W3C re-launched as a public-interest non-profit organization

The new entity preserves member-driven approach, existing worldwide outreach and cooperation while allowing additional partners around the world.
https://gilbane.com/2023/01/w3c-re-launched-as-a-public-interest-non-profit-organization/

CrafterCMS expands its marketplace

The headless CMS and composable DXP vendor expands marketplace with 60+ open source plugins, blueprints, and packaged business capabilities.
https://gilbane.com/2023/02/craftercms-expands-its-marketplace/

AesirX launches headless CMS

The CMS includes marketing automation software, digiatal asset management (DAM), 1st-party analytics, business insights, and Single Sign On.
https://gilbane.com/2023/02/aesirx-launches-headless-cms/

All content technology news


The Gilbane Advisor is authored by Frank Gilbane and is ad-free, cost-free, and curated for content, computing, web, data, and digital experience technology and information professionals. We publish recommended articles and content technology news weekly. We do not sell or share personal data.

Subscribe | View online | Editorial policy | Privacy policy | Contact

« Older posts Newer posts »

© 2024 The Gilbane Advisor

Theme by Anders NorenUp ↑