This week we feature articles from Sayash Kapoor & Arvind Narayanan, and Daniil Gurgurov, Tanja Bäumel & Tatiana Anikina.

Additional reading comes from Sanjeev Mohan, Amber Case, and Michael Andrews.

News comes from RWS, GraphAware & Senzing, Elastic, and Box.

Our next issue will arrive July 17.

All previous issues are available at https://gilbane.com/gilbane-advisor-index


Opinion / Analysis

AI scaling myths

Scaling will run out. The question is when.

Sayash Kapoor and Arvind Narayanan shed light on a number of misconceptions or unstudied assumptions while answering the question.  (8 min)

https://www.aisnakeoil.com/p/ai-scaling-myths

Multilingual large language models and curse of multilinguality

“This paper aims to provide a brief overview of the architectures of the most prominent multilingual LLMs, including details such as their pre-training objective functions, data sources, tokenization schemas, the number of languages supported, and the peculiarities of each individual multilingual LLM. Subsequently, the primary challenge facing multilingual LLMs, known as the “curse of multilinguality” Conneau et al. (2020), and the current attempts to solve it, are discussed.”

This paper by Daniil Gurgurov, Tanja Bäumel, and Tatiana Anikina is a comprehensive and current update on multilingual LLM tools and challenges. (18 min)

Semantic Scholar link: https://www.semanticscholar.org/reader/2235420149c8f839e29d4cf8df1894d7a9f3de37
arXiv HTML and clean PDF link: https://arxiv.org/html/2406.10602v1

More Reading

All Gilbane Advisor issues


Content technology news

RWS launches Trados Studio 2024

The latest version includes advanced AI capabilities and a plethora of new features to boost productivity and increase translation quality.
https://www.trados.com

Elastic introduces Playground to accelerate RAG development with Elasticsearch

The new interface enables developers to iterate and build RAG applications for A/B testing LLMs, tuning prompts and chunking data.
https://www.elastic.co/search-labs/blog/rag-playground-introduction

Graph analytics enhanced by GraphAware & Senzing

The partnership integrates Senzing entity resolution capabilities into GraphAware Hume, GraphAware’s graph-based enterprise intelligence platform.
https://senzing.com/graph-analytics-graphaware/

Box enhances Box AI for intelligent content management

Enterprises to get unlimited access to Box AI in Notes, Documents, and Hubs, GPT-4o Integration in Hubs, new file types, and developer tools.
https://blog.box.com/box-announces-powerful-enhancements-box-ai-intelligent-content-management

All content technology news


The Gilbane Advisor is authored by Frank Gilbane and is ad-free, cost-free, and curated for content, computing, web, data, and digital experience technology and information professionals. We publish recommended articles and content technology news most Wednesdays. We do not sell or share personal data.

Subscribe | View online | Editorial policy | Privacy policy | Contact