This week we feature articles from Sayash Kapoor & Arvind Narayanan, and Daniil Gurgurov, Tanja Bäumel & Tatiana Anikina.
Additional reading comes from Sanjeev Mohan, Amber Case, and Michael Andrews.
News comes from RWS, GraphAware & Senzing, Elastic, and Box.
Our next issue will arrive July 17.
All previous issues are available at https://gilbane.com/gilbane-advisor-index
Opinion / Analysis
AI scaling myths
Scaling will run out. The question is when.
Sayash Kapoor and Arvind Narayanan shed light on a number of misconceptions or unstudied assumptions while answering the question. (8 min)
https://www.aisnakeoil.com/p/ai-scaling-myths
Multilingual large language models and curse of multilinguality
“This paper aims to provide a brief overview of the architectures of the most prominent multilingual LLMs, including details such as their pre-training objective functions, data sources, tokenization schemas, the number of languages supported, and the peculiarities of each individual multilingual LLM. Subsequently, the primary challenge facing multilingual LLMs, known as the ‘curse of multilinguality’ Conneau et al. (2020), and the current attempts to solve it, are discussed.”
This paper by Daniil Gurgurov, Tanja Bäumel, and Tatiana Anikina is a comprehensive and current update on multilingual LLM tools and challenges. (18 min)
Semantic Scholar link: https://www.semanticscholar.org/reader/2235420149c8f839e29d4cf8df1894d7a9f3de37
arXiv HTML and clean PDF link: https://arxiv.org/html/2406.10602v1
More Reading
All Gilbane Advisor issues
Content technology news
RWS launches Trados Studio 2024
The latest version includes advanced AI capabilities and a plethora of new features to boost productivity and increase translation quality.
https://www.trados.com
Elastic introduces Playground to accelerate RAG development with Elasticsearch
The new interface lets developers build and iterate on RAG applications by A/B testing LLMs, tuning prompts, and chunking data.
https://www.elastic.co/search-labs/blog/rag-playground-introduction
Graph analytics enhanced by GraphAware & Senzing
The partnership integrates Senzing entity resolution capabilities into GraphAware Hume, GraphAware’s graph-based enterprise intelligence platform.
https://senzing.com/graph-analytics-graphaware/
Box enhances Box AI for intelligent content management
Enterprises get unlimited access to Box AI in Notes, Documents, and Hubs, along with GPT-4o integration in Hubs, support for new file types, and developer tools.
https://blog.box.com/box-announces-powerful-enhancements-box-ai-intelligent-content-management
All content technology news
The Gilbane Advisor is authored by Frank Gilbane and is ad-free, cost-free, and curated for content, computing, web, data, and digital experience technology and information professionals. We publish recommended articles and content technology news most Wednesdays. We do not sell or share personal data.