Deep learning is a sub-field of machine learning based on a set of algorithms that attempt to model high level abstractions in data by using a deep graph with multiple processing layers, composed of multiple linear and non-linear transformations. Deep learning is part of a broader family of machine learning methods based on learning representations of data. An observation (e.g., an image) can be represented in many ways such as a vector of intensity values per pixel, or in a more abstract way as a set of edges, regions of particular shape, etc.
Category: Computing & data (Page 85 of 95)
Computing and data is a broad category. Our coverage of computing is largely limited to software, and we are mostly focused on unstructured data, semi-structured data, or mixed data that includes structured data.
Topics include computing platforms, analytics, data science, data modeling, database technologies, machine learning / AI, Internet of Things (IoT), blockchain, augmented reality, bots, programming languages, natural language processing applications such as machine translation, and knowledge graphs.
Related categories: Semantic technologies, Web technologies & information standards, and Internet and platforms.
Machine learning (ML) is a discipline of AI that includes basic pattern recognition and deep learning and other techniques to train machines to identify and categorize large numbers of entities and data points. Basic machine learning has been used since the 80s and is responsible for many capabilities such as recommendation engines, spam detection, image recognition, and natural language processing applications such as language translation (machine translation). Advances in neural networks, and computing performance and storage, combined with vast data sets in the 2000s created a whole new level of sophisticated machine learning applications. This type of “AI” is ready for prime time. Yet, as powerful as these new techniques are, they are not AGI. i.e, “human level”.
Less than half of Google searches now result in a click
Some mixed news about Google for publishers and advertisers in the past few weeks. We’ll start with the not-so-good news about clicks, especially as it turns out, for mobile, detailed by Rand Fishkin…
We’ve passed a milestone in Google’s evolution from search engine to walled-garden. In June of 2019, for the first time, a majority of all browser-based searches on Google resulted in zero-clicks. Read More
Google moves to prioritize original reporting in search
Nieman Labs’ Laura Hazard Owen provides some context on the most welcome change Google’s Richard Gingras announced last week. Of course there are questions around what ‘original reporting’ means, for Google and all of us, and we’ll have to see how well Google navigates this fuzziness. Read More
Designing multi-purpose content
The efficiency and effectiveness of multi-purpose content strategies are well known, as are many techniques for successful implementation. What is not so easy is justifying, assembling, and educating a multi-discipline content team. Content strategist Michael Andrews provides a clear explanation and example of the benefits of multi-purpose content designed by a cross-functional team that is accessible for non-specialists. Read More
Face recognition, bad people and bad data
Benedict Evans…
We worry about face recognition just as we worried about databases – we worry what happens if they contain bad data and we worry what bad people might do with them … we worry what happens if it [facial recognition] doesn’t work and we worry what happens if it does work.
This comparison turns out to be a familiar and fertile foundation for exploring what can go wrong and what we should do about it.
The article also serves as a subtle and still necessary reminder that face recognition and other machine learning applications are vastly more limited than what ‘AI’ conjures up for many. Read More
Also…
A few more links in this issue as we catch up from our August vacation.
- It is limited… Should we still be selling responsive web design? via Browser London
- The need for personal control… Getting past broken cookie notices via Doc Searls Weblog
- For open web scholars… Linked research on the decentralised web via Sarven Capadisli
- For (not only) librarians… Creating library linked data with Wikibase via OCLC
- AMP still controversial… Google is tightening its grip on your website via Owen Williams
- Good & bad: changes to nofollow… Google’s robots changes, the web & the law via Joost de Valk
- Good for your thumbs at least… Bottom navigation pattern on mobile web pages: a better alternative? via Smashing Magazine
- Data portability for all… Apple joins Data Transfer Project. Club also includes Google, Microsoft, Twitter, and Facebook. via 9to5mac
The Gilbane Advisor curates content for content management, computing, and digital experience professionals. We focus on strategic technologies. We publish more or less twice a month except for August and December.
Semi-structured data is a term seldom used these days, and has been used in different ways. But in general refers to structured data that does not conform with the formal structure of data models associated with relational databases or other forms of data tables, but nonetheless contains tags or other markers to separate semantic elements and enforce hierarchies of records and fields within the data.
A web style sheet is a form of separation of presentation and content for web design in which the markup of a webpage contains the page’s semantic content and structure, but does not define its visual layout (style). Instead, the style is defined in an external style sheet file using a style sheet language such as CSS or XSLT. This design approach is identified as a “separation” because it largely supersedes the antecedent methodology in which a page’s markup defined both style and structure.
Also see XSL-FO.
Style sheets predate web publishing and were used in proprietary electronic publishing publishing systems in the early 80s.
Unstructured Data (or unstructured information) refers to information that either does not have a predefined data model and/or does not fit well into relational tables, such as narrative text, audio, or visual data.
In the early days of information technology (1950s -1970s), information systems focused on structured data. Until the late 1970s there was little interest in managing unstructured data. In the 1980s computerized publishing systems were built to process unstructured information for creating, formatting, editing, and printing documents. And SGML was created to add structure to document information for computer processing. Electronic publishing and document management systems grew steadily until the early 1990s when the Web produced an explosion of unstructured data.
Unstructured data is also the main ingredient to most of today’s machine learning applications, which involve natural language processing, and image and streaming pattern recognition.
Modern data management strategies need to include a variety of structured and unstructured data types. PostgreSQL, MongoDB, Cassandra, Neo4j, Snowflake, and DataStax are some examples of modern database products. Many current versions of traditional SQL-based database products can also support NoSQL (non-SQL or not-onlySQL) data.
Managing ML in the enterprise
Regulated industries are often among the first to figure out how to implement new technologies in complex, high risk environments. This O’Reilly article looks at how finance (mostly) and health care model risk in the context of machine learning. There are useful and important lessons for enterprises in general. Read More
A contract for the Web
We all know the web has a boatload of challenges coming from a collection of commercial and national sources intent on subverting or replacing it. But organizations and consumers of the web have also been too complacent as these threats have grown. The World Wide Web Foundation’s mission is to “advance the open web as a public good and a basic right.” by changing government and business policies. The foundation has just published a draft “Contract for the Web” and is asking for input from governments, businesses, and citizens. That’s right, they want your opinion. Read More
Is Web3.0 the next lifestyle brand?
Web 3.0 does not, and will likely never have, a canonical definition. Web 3.0 refers to a collection of aspirations, similar to those of the Web Foundations’, and new technologies to support those aspirations and a decentralized web, such as blockchain and crypto. Since these technologies are not widely understood, marketing Web 3.0 etc. is a problem. Jeremy Epstein has some “half-baked” (his words) ideas on relating it to modern intentional lifestyle choices as away to build support. Read More
By running unwitting PR for Jeffrey Epstein, Forbes shows the risks of a news outlet thinking like a tech platform
If journalists want to criticize the anything-goes ethos of Facebook, it’s only fair to note when news organizations’ hunger for scale leads them down the same problematic path. Read More
Also…
- In case you (or a colleague) forgot… HTML is the Web via Pete Lambert’s journal
- Comprehensive… China Internet Report 2019 via South China Morning Post
- Wondering what the dif is?… Cognitive Maps, Mind Maps, and Concept Maps: Definitions via Nielsen Norman Group
- Come back to the GDPR, RTB, RTB… via The Content Advisory
The Gilbane Advisor curates content for content, computing, and digital experience professionals. We focus on strategic technologies. We publish more or less twice a month except for August and December.
Content management, or CM, is the set of processes and technologies that support the collection, managing, and publishing of information in any form or medium. When stored and accessed via computers, this information has come to be referred to as content or digital content. Digital content may take the form of text, multimedia files (such as audio or video files), or any other file type that follows a content lifecycle requiring management. Content management can be found in both dedicated systems and as a component of information technology systems.
The term ‘content management’ became popular when ‘web content management’ systems emerged to differentiate them from ‘document management’ systems which were associated with paper documents. ‘Content management’ then quickly evolved to cover all kinds of unstructured or semi-structured content. Common types of content management systems (CMS) include Enterprise Content Management (ECM), Digital Asset Management (DAM), Multichannel Content Management (MCM), Component Content Management (CCM), and web content management (WCM). The latter are now widely marketed as Digital Experience Management’ (DEM or DXM, DXP), and ‘Customer Experience Management’ (CEM or CXM) systems or platforms, and may include additional marketing technology functions.
For an extensive collection of content management news and blog posts see https://gilbane.com/category/content-management-strategy/
For more historical background see What is Content Management?



