Google has announced that it is providing an option for sites to opt out of having their data used to train AI models. Google’s new tool, Extended, allows sites to use Googlebot while avoiding using their data to train current and future AI models.

The company says Google-Extended will allow sites to “manage whether or not their sites help improve Bard and Vertex AI’s generative APIs.” Sites can use a toggle switch to “control access to content on the site.”

Earlier in July, Google confirmed that it was training its AI chatbot Bard on publicly available data collected from the internet.

Google-Extended is available through robots.txt, also known as a text file, that tells web crawlers if they can access certain sites.

Google notes that “as AI applications expand,” it will continue to explore “additional machine-readable selection and control approaches for web publishers.”

Many sites, including The New York Times, CNN, Reuters, and Medium, have already begun blocking the web crawler that OpenAI uses to collect data and train ChatGPT. However, there were doubts about how to block Google.

However, sites cannot completely close Google’s crawlers, otherwise they will not be indexed in search. This has led some sites, such as The New York Times, to legally block Google instead, updating their terms of service to prevent companies from using content to train AI.

Recall that Google is releasing a new update to ChromeOS, which will have significant novelties, including a new interface, useful elements for charging controls, and adjusting the availability of accent colors.

Commentary