Privacy

Poland opens privacy probe of ChatGPT following GDPR complaint

Comment

OpenAI logo is being displayed on a mobile phone screen in front of computer screen with the logo of ChatGPT
Image Credits: Didem Mente/Anadolu Agency / Getty Images

OpenAI is facing another investigation into whether its generative AI chatbot, ChatGPT, complies with European Union privacy laws.

Last month a complaint was filed against ChatGPT and OpenAI in Poland, accusing the company of a string of breaches of the EU’s General Data Protection Regulation (GDPR). Yesterday the Polish authority took the unusual step of making a public announcement to confirm it has opened an investigation.

“The Office for Personal Data Protection [UODO] is investigating a complaint about ChatGPT, in which the complainant accuses the tool’s creator, OpenAI, of, among other things, processing data in an unlawful, unreliable manner, and the rules under which this is done are opaque,” the UODO wrote in a press release [translated from Polish to English using DeepL].

The authority said it’s anticipating a “difficult” investigation — noting OpenAI is located outside the EU and flagging the novelty of the generative AI chatbot technology whose compliance it will be examining.

“The case concerns the violation of many provisions of the protection of personal data, so we will ask OpenAI to answer a number of questions in order to thoroughly conduct the administrative proceedings,” said Jan Nowak, president of the UODO, in a statement.

Deputy president, Jakub Groszkowski, added a warning to the authority’s press release — writing that new technologies do not operate outside the legal framework and must respect the GDPR. He said the complaint contains allegations that raise doubts about OpenAI’s systemic approach to European data protection principles, adding that the authority would “clarify these doubts, in particular against the background of the fundamental principle of privacy by design contained in the GDPR”.

The complaint, which was filed by local privacy and security researcher Lukasz Olejnik, accuses OpenAI of a string of breaches of the pan-EU regulation — spanning lawful basis, transparency, fairness, data access rights, and privacy by design.

It focuses on OpenAI’s response to a request by Olejnik to correct incorrect personal data in a biography ChatGPT generated about him — but which OpenAI told him it was unable to do. He also accuses the AI giant of failing to properly respond to his subject access request — and of providing evasive, misleading and internally contradictory answers when he sought to exercise his legal rights to data access.

The tech underlying ChatGPT is a so-called large language model (LLM) — a type of generative AI model that’s trained on masses of natural language data so it can both respond in a human like manner. But also, given the general purpose utility of the tool, it’s evidently been trained on all sorts of types of information so it can respond to different questions and asks — including, in many cases, being fed data about living people.

OpenAI’s scraping of the public Internet for training data, without people’s knowledge or consent, is one of the big factors that’s landed ChatGPT in regulatory hot water in the EU. Its apparent inability to articulate exactly how it’s processing personal data; or to correct mistakes when its AI “hallucinates” and produces false information about named individuals are others.

The bloc regulates how personal data is processed, requiring a processor has a lawful basis to collect and use people’s information. Processors must also meet transparency and fairness requirements. Plus a suite of data access rights are afforded to people in the EU — meaning EU individuals have (among other things) a right to ask for incorrect data about them to be rectified.

Olejnik’s complaint tests OpenAI’s GDPR compliance across a number of those dimensions. So any enforcement could be significant in shaping how generative AI develops.

Reacting to the UODO’s confirmation it’s investigating the ChatGPT complaint, Olejnik told TechCrunch: “Focusing on privacy by design/data protection by design is absolutely critical and I expected this to be the main aspect. So this sounds reasonable. It would concern the design and deployment aspects of LLM systems.”

He previously described the experience of trying to get answers from OpenAI about its processing of his information as feeling like Josef K, in Kafka’s book “The Trial.” “If this may be the Josef K. moment for AI/LLM, let’s hope that it may shed light on the processes involved,” he added now.

The relative speed with which the Polish authority is moving in response to the complaint, as well as its openness about the investigation, does look notable.

It adds to growing regulatory issues OpenAI is facing the European Union. The Polish investigation follows an intervention by Italy’s DPA earlier this year — which led to a temporary suspension of ChatGPT in the country. The scrutiny by the Garante continues, also looking into GDPR compliance concerns attached to factors like lawful basis and data access rights.

Elsewhere, Spain’s DPA has opened a probe. While a taskforce set up via the European Data Protection Board earlier this year is looking at how data protection authorities should respond to the AI chatbot tech with the goal of pushing to find some consensus among the bloc’s privacy watchdogs on how to regulate such novel tech.

The taskforce does not supplant investigations by individual authorities. But, in the future, it may lead to some harmonization in how DPAs approach regulating cutting edge AI. That said, divergence is also possible if there are strong and varied views among DPAs. And it remains to be seen what further enforcement actions the bloc’s watchdogs could take on tools like ChatGPT. (Or, indeed, how quickly they may act.)

In the UODO’s press release — which nods to the existence of the taskforce — its president says the authority is taking the ChatGPT investigation “very seriously”. He also notes the complaint’s allegations are not the first doubts vis-a-vis ChatGPT’s compliance with European data protection and privacy rules.

Discussing the authority’s openness and pace, Maciej Gawronski of law firm GP Partners, which is representing Olejnik for the complaint, told TechCrunch: “UODO is becoming more and more vocal about privacy, data protection, technology and human rights. So, I think, our complaint creates an opportunity for [it] to work on reconciling digital and societal progress with individual agency and human rights.

“Mind that Poland is a very advanced country regarding IT. I would expect UODO to be very reasonable in their approach and proceedings. Of course, as long as OpenAI remains open, for discussion.”

Asked if he’s expecting a quick decision on the complaint, Gawronski added: “The authority is monitoring technology advancements pretty closely. I am at UODO’s conference on new technologies at the moment. UODO has already been approached re AI by various actors. However, I do not expect a fast decision. Nor it is my intention to conclude the proceedings prematurely. I would prefer to have an honest and insightful discussion with OpenAI on what, when, how, and how much, regarding ChatGPT’s GDPR compliance, and in particular how to satisfy rights of the data subject.”

OpenAI was contacted for comment on the Polish DPA’s investigation but did not send any response.

The AI giant is not sitting still in response to an increasingly complex regulatory picture in the EU. It recently announced opening an office in Dublin, Ireland — likely with an eye on building towards streamlining its regulatory situation for data protection if it can funnel any GDPR complaints via Ireland.

However, for now, the US company is not considered “main established” in any EU Member State (including Ireland) for GDPR purposes, since decisions affecting local users continue to be taken at its US HQ in California. So far, the Dublin office is just a tiny satellite. This means data protection authorities across the bloc remain competent to investigate concerns about ChatGPT that arise on their patch. So more investigations could follow.

Complaints which predate any future main establishment status change for OpenAI could also still be filed anywhere in the EU.

ChatGPT-maker OpenAI accused of string of data protection breaches in GDPR complaint filed by privacy researcher

Italy gives OpenAI initial to-do list for lifting ChatGPT suspension order

Sam Altman’s big European tour

More TechCrunch

Amazon Web Services (AWS), Amazon’s cloud computing business, has confirmed further details of its European “sovereign cloud” which is designed to enable greater data residency across the region. The company…

AWS confirms European ‘sovereign cloud’ to launch in Germany by 2025, plans €7.8B investment over 15 years

Go Digit, an Indian insurance startup, has raised $141 million from investors including Goldman Sachs, ADIA, and Morgan Stanley as part of its IPO.

Indian insurance startup Go Digit raises $141M from anchor investors ahead of IPO

Peakbridge intends to invest in between 16 and 20 companies, investing around $10 million in each company. It has made eight investments so far.

Food VC Peakbridge has new $187M fund to transform future of food, like lab-made cocoa

For over six decades, the nonprofit has been active in the financial services sector.

Accion’s new $152.5M fund will back financial institutions serving small businesses globally

Meta’s newest social network, Threads is starting its own fact-checking program after piggybacking on Instagram and Facebook’s network for a few months. Instagram head Adam Mosseri noted that the company…

Threads finally starts its own fact-checking program

Looking Glass makes trippy-looking mixed-reality screens that make things look 3D without the need of special glasses. Today, it launches a pair of new displays, including a 16-inch mode that…

Looking Glass launches new 3D displays

Replacing Sutskever is Jakub Pachocki, OpenAI’s director of research.

Ilya Sutskever, OpenAI co-founder and longtime chief scientist, departs

Intuitive Machines made history when it became the first private company to land a spacecraft on the moon, so it makes sense to adapt that tech for Mars.

Intuitive Machines wants to help NASA return samples from Mars

As Google revamps itself for the AI era, offering AI overviews within its search results, the company is introducing a new way to filter for just text-based links. With the…

Google adds ‘Web’ search filter for showing old-school text links as AI rolls out

Blue Origin’s New Shepard rocket will take a crew to suborbital space for the first time in nearly two years later this month, the company announced on Tuesday.  The NS-25…

Blue Origin to resume crewed New Shepard launches on May 19

This will enable developers to use the on-device model to power their own AI features.

Google is building its Gemini Nano AI model into Chrome on the desktop

It ran 110 minutes, but Google managed to reference AI a whopping 121 times during Google I/O 2024 (by its own count). CEO Sundar Pichai referenced the figure to wrap…

Google mentioned ‘AI’ 120+ times during its I/O keynote

Firebase Genkit is an open source framework that enables developers to quickly build AI into new and existing applications.

Google launches Firebase Genkit, a new open source framework for building AI-powered apps

In the coming months, Google says it will open up the Gemini Nano model to more developers.

Patreon and Grammarly are already experimenting with Gemini Nano, says Google

As part of the update, Reddit also launched a dedicated AMA tab within the web post composer.

Reddit introduces new tools for ‘Ask Me Anything,’ its Q&A feature

Here are quick hits of the biggest news from the keynote as they are announced.

Google I/O 2024: Here’s everything Google just announced

LearnLM is already powering features across Google products, including in YouTube, Google’s Gemini apps, Google Search and Google Classroom.

LearnLM is Google’s new family of AI models for education

The official launch comes almost a year after YouTube began experimenting with AI-generated quizzes on its mobile app. 

Google is bringing AI-generated quizzes to academic videos on YouTube

Around 550 employees across autonomous vehicle company Motional have been laid off, according to information taken from WARN notice filings and sources at the company.  Earlier this week, TechCrunch reported…

Motional cut about 550 employees, around 40%, in recent restructuring, sources say

The keynote kicks off at 10 a.m. PT on Tuesday and will offer glimpses into the latest versions of Android, Wear OS and Android TV.

Google I/O 2024: Watch all of the AI, Android reveals

Google Play has a new discovery feature for apps, new ways to acquire users, updates to Play Points, and other enhancements to developer-facing tools.

Google Play preps a new full-screen app discovery feature and adds more developer tools

Soon, Android users will be able to drag and drop AI-generated images directly into their Gmail, Google Messages and other apps.

Gemini on Android becomes more capable and works with Gmail, Messages, YouTube and more

Veo can capture different visual and cinematic styles, including shots of landscapes and timelapses, and make edits and adjustments to already-generated footage.

Google Veo, a serious swing at AI-generated video, debuts at Google I/O 2024

In addition to the body of the emails themselves, the feature will also be able to analyze attachments, like PDFs.

Gemini comes to Gmail to summarize, draft emails, and more

The summaries are created based on Gemini’s analysis of insights from Google Maps’ community of more than 300 million contributors.

Google is bringing Gemini capabilities to Google Maps Platform

Google says that over 100,000 developers already tried the service.

Project IDX, Google’s next-gen IDE, is now in open beta

The system effectively listens for “conversation patterns commonly associated with scams” in-real time. 

Google will use Gemini to detect scams during calls

The standard Gemma models were only available in 2 billion and 7 billion parameter versions, making this quite a step up.

Google announces Gemma 2, a 27B-parameter version of its open model, launching in June

This is a great example of a company using generative AI to open its software to more users.

Google TalkBack will use Gemini to describe images for blind people

Google’s Circle to Search feature will now be able to solve more complex problems across psychics and math word problems. 

Circle to Search is now a better homework helper