Curated for content, computing, and digital experience professionals

Category: Computing & data (Page 70 of 80)

Computing and data is a broad category. Our coverage of computing is largely limited to software, and we are mostly focused on unstructured data, semi-structured data, or mixed data that includes structured data.

Topics include computing platforms, analytics, data science, data modeling, database technologies, machine learning / AI, Internet of Things (IoT), blockchain, augmented reality, bots, programming languages, natural language processing applications such as machine translation, and knowledge graphs.

Related categories: Semantic technologies, Web technologies & information standards, and Internet and platforms.

Artificial intelligence

Artificial Intelligence (AI) is a branch of computer science that studies intelligent systems (i.e. software, computers, robots, etc.). Alternatively, it may be defined as “the study and design of intelligent agents”, where an intelligent agent is a system that perceives its environment and takes actions that maximize its chances of success. John McCarthy, who coined the term in 1955, defines it as “the science and engineering of making intelligent machines”.

For practical purposes, it is useful to distinguish between two different interpretations of ‘AI’:

  • Artificial General Intelligence (AGI), where McCarthy’s “intelligent machines” have at least human level capabilities. AGI does not currently exist, and when, or if, it will is controversial.
  • Machine learning (ML) is a discipline of AI that includes basic pattern recognition and deep learning and other techniques to train machines to identify and categorize large numbers of entities and data points. Basic machine learning has been used since the 80s and is responsible for many capabilities such as recommendation engines, spam detection, image recognition, and language translation. Advances in neural networks, and computing performance and storage, combined with vast data sets in the 2000s created a whole new level of sophisticated machine learning applications. This type of “AI” is ready for prime time. Yet, as powerful as these new techniques are, they are not AGI. i.e, “human level”.

Deep learning

Deep learning is a sub-field of machine learning based on a set of algorithms that attempt to model high level abstractions in data by using a deep graph with multiple processing layers, composed of multiple linear and non-linear transformations. Deep learning is part of a broader family of machine learning methods based on learning representations of data. An observation (e.g., an image) can be represented in many ways such as a vector of intensity values per pixel, or in a more abstract way as a set of edges, regions of particular shape, etc.

machine learning

Machine learning (ML) is a discipline of AI that includes basic pattern recognition and deep learning and other techniques to train machines to identify and categorize large numbers of entities and data points. Basic machine learning has been used since the 80s and is responsible for many capabilities such as recommendation engines, spam detection, image recognition, and natural language processing applications such as language translation (machine translation). Advances in neural networks, and computing performance and storage, combined with vast data sets in the 2000s created a whole new level of sophisticated machine learning applications. This type of “AI” is ready for prime time. Yet, as powerful as these new techniques are, they are not AGI. i.e, “human level”. 

Gilbane Advisor 9-18-19 — Good/bad Google, multi-purpose content, face recognition & DBs

Less than half of Google searches now result in a click

Some mixed news about Google for publishers and advertisers in the past few weeks. We’ll start with the not-so-good news about clicks, especially as it turns out, for mobile, detailed by Rand Fishkin…

We’ve passed a milestone in Google’s evolution from search engine to walled-garden. In June of 2019, for the first time, a majority of all browser-based searches on Google resulted in zero-clicks. Read More

Google organic click stats

Google moves to prioritize original reporting in search

Nieman Labs’ Laura Hazard Owen provides some context on the most welcome change Google’s Richard Gingras announced last week. Of course there are questions around what ‘original reporting’ means, for Google and all of us, and we’ll have to see how well Google navigates this fuzziness. Read More

Designing multi-purpose content

The efficiency and effectiveness of multi-purpose content strategies are well known, as are many techniques for successful implementation. What is not so easy is justifying, assembling, and educating a multi-discipline content team. Content strategist Michael Andrews provides a clear explanation and example of the benefits of multi-purpose content designed by a cross-functional team that is accessible for non-specialists. Read More

Face recognition, bad people and bad data

Benedict Evans…

We worry about face recognition just as we worried about databases – we worry what happens if they contain bad data and we worry what bad people might do with them … we worry what happens if it [facial recognition] doesn’t work and we worry what happens if it does work.

This comparison turns out to be a familiar and fertile foundation for exploring what can go wrong and what we should do about it.

The article also serves as a subtle and still necessary reminder that face recognition and other machine learning applications are vastly more limited than what ‘AI’ conjures up for many. Read More

Also…

A few more links in this issue as we catch up from our August vacation.

The Gilbane Advisor curates content for content management, computing, and digital experience professionals. We focus on strategic technologies. We publish more or less twice a month except for August and December.

Semi-structured data

Semi-structured data is a term seldom used these days, and has been used in different ways. But in general refers to structured data that does not conform with the formal structure of data models associated with relational databases or other forms of data tables, but nonetheless contains tags or other markers to separate semantic elements and enforce hierarchies of records and fields within the data.

Style sheet

A web style sheet is a form of separation of presentation and content for web design in which the markup of a webpage contains the page’s semantic content and structure, but does not define its visual layout (style). Instead, the style is defined in an external style sheet file using a style sheet language such as CSS or XSLT. This design approach is identified as a “separation” because it largely supersedes the antecedent methodology in which a page’s markup defined both style and structure.

Also see XSL-FO.

Style sheets predate web publishing and were used in proprietary electronic publishing publishing systems in the early 80s.

Unstructured data

Unstructured Data (or unstructured information) refers to information that either does not have a predefined data model and/or does not fit well into relational tables, such as narrative text, audio, or visual data.

In the early days of information technology (1950s -1970s), information systems focused on structured data. Until the late 1970s there was little interest in managing unstructured data. In the 1980s computerized publishing systems were built to process unstructured information for creating, formatting, editing, and printing documents. And SGML was created to add structure to document information for computer processing. Electronic publishing and document management systems grew steadily until the early 1990s when the Web produced an explosion of unstructured data.

Unstructured data is also the main ingredient to most of today’s machine learning applications, which involve natural language processing, and image and streaming pattern recognition.

Modern data management strategies need to include a variety of structured and unstructured data types. PostgreSQL, MongoDB, Cassandra, Neo4j, Snowflake, and DataStax are some examples of modern database products. Many current versions of traditional SQL-based database products can also support NoSQL (non-SQL or not-onlySQL) data.

Gilbane Advisor 7-29-19 — Enterprise ML risk, web contract, web 3.0, news & scale

Managing ML in the enterprise

Regulated industries are often among the first to figure out how to implement new technologies in complex, high risk environments. This O’Reilly article looks at how finance (mostly) and health care model risk in the context of machine learning. There are useful and important lessons for enterprises in general. Read More

 

Model risk management

A contract for the Web

We all know the web has a boatload of challenges coming from a collection of commercial and national sources intent on subverting or replacing it. But organizations and consumers of the web have also been too complacent as these threats have grown. The World Wide Web Foundation’s mission is to “advance the open web as a public good and a basic right.” by changing government and business policies. The foundation has just published a draft “Contract for the Web” and is asking for input from governments, businesses, and citizens. That’s right, they want your opinion. Read More

Is Web3.0 the next lifestyle brand?

Web 3.0 does not, and will likely never have, a canonical definition. Web 3.0 refers to a collection of aspirations, similar to those of the Web Foundations’, and new technologies to support those aspirations and a decentralized web, such as blockchain and crypto. Since these technologies are not widely understood, marketing Web 3.0 etc. is a problem. Jeremy Epstein has some “half-baked” (his words) ideas on relating it to modern intentional lifestyle choices as away to build support. Read More

By running unwitting PR for Jeffrey Epstein, Forbes shows the risks of a news outlet thinking like a tech platform

If journalists want to criticize the anything-goes ethos of Facebook, it’s only fair to note when news organizations’ hunger for scale leads them down the same problematic path. Read More

Also…

The Gilbane Advisor curates content for content, computing, and digital experience professionals. We focus on strategic technologies. We publish more or less twice a month except for August and December.

« Older posts Newer posts »

© 2024 The Gilbane Advisor

Theme by Anders NorenUp ↑