Curated for content, computing, and digital experience professionals

Day: January 15, 2021

Google introduces table-to-text generation dataset

Google introduced “ToTTo: A Controlled Table-To-Text Generation Dataset”, an open-domain table-to-text generation dataset created using a novel annotation process (via sentence revision), along with a controlled text generation task that can be used to assess model hallucination. ToTTo (shorthand for “Table-To-Text”) consists of 121,000 training examples, along with 7,500 examples each for development and test. Due to the accuracy of its annotations, the dataset is suitable as a challenging benchmark for research in high-precision text generation. The dataset and code are open-sourced on Google's GitHub repo.

In the last few years, research in natural language generation, used for tasks like text summarization, has made tremendous progress. Yet, despite achieving high levels of fluency, neural systems can still be prone to hallucination (i.e., generating text that is understandable, but not faithful to the source), which can prevent these systems from being used in many applications that require high degrees of accuracy.

While assessing the faithfulness of generated text to the source content can be challenging, it is often easier when the source content is structured (e.g., in tabular format). Moreover, structured data can also test a model’s ability to perform reasoning and numerical inference. However, existing large-scale structured datasets are often noisy (i.e., the reference sentence cannot be fully inferred from the tabular data), making them unreliable for measuring hallucination in model development.

https://ai.googleblog.com/2021/01/totto-controlled-table-to-text.html

Medium acquires social ebook platform Glose

Medium announced the acquisition of Glose, a social digital book platform based in Paris, France. Founded in 2014, Glose reaches more than one million readers in 200 countries. With a range of fiction and nonfiction, Glose carries ebooks and audiobooks from all major publishers, including Penguin Random House, HarperCollins, Macmillan, Hachette, and Simon & Schuster.

Medium supports a wide range of publishing, from short blog posts to in-depth investigative journalism, and everything in between. With the integration of Glose, Medium will have more than 1.5 million books available within its ecosystem. Readers will be able to find more of what they are looking for by an author or on a subject, and writers can have their different content forms all in one place, elevating their voice and perspective. Built as a reading hub, Glose allows readers to access ebooks and audiobooks on any device: laptop, tablet or smartphone, online and offline. On Glose, readers can create booklists, engage in reading groups, share highlights and annotations of the books they read, follow their daily and weekly reading activity, and set personal reading goals.

https://blog.medium.com/medium-acquires-social-ebook-platform-glose-cf2e94485d8, https://ev.medium.com/lets-rethink-the-book-experience-3087baa3cc91

© 2021 The Gilbane Advisor
