
OpenAI steals millions of users? Star big model becomes "data thief"!

author:LinkFocus

"Despite having agreements in place to purchase and use personal information, the defendants took a different approach: stealing." A law firm has taken OpenAI to court with a 157-page lawsuit, accusing the company of stealing vast amounts of personal information to train its AI models in pursuit of profit.


OpenAI's scraping of data was unprecedented in scale, the complaint alleges: the company stole about 300 billion words of content from the internet, including books, articles, websites, and posts, and even personal information collected without consent. The alleged data theft affected millions of people, with potential damages of $3 billion, and violated terms-of-service agreements as well as state and federal privacy and property laws.

"By collecting and misappropriating the previously obscure personal data of millions of people to develop unstable, untested technologies, OpenAI has put everyone at immeasurable risk, without any responsible measures for data protection and use. That is unacceptable," said Timothy K. Giordano, a partner at the law firm.


The plaintiffs therefore asked the court to temporarily freeze commercial access to, and further development of, OpenAI's products until safeguards are in place, including allowing people to opt out of data collection and preventing the products from surpassing human intelligence and harming others. In addition to OpenAI, Microsoft, its main backer, was also named as a defendant.

OpenAI is not the only company harvesting massive amounts of data from the internet to train AI models; Google, Meta, Microsoft, and a growing number of others are doing the same. But a partner at the law firm explained why they went after OpenAI first: last year, ChatGPT spurred larger competitors to launch their own AI products, making OpenAI the natural first target.


As data-driven models proliferate, data security is becoming increasingly important. The focus of the lawsuit is therefore likely to be whether OpenAI lawfully and reasonably collects and uses users' personal information in accordance with its privacy policy, and whether it effectively identifies and excludes personal information "incidentally" contained in its training data sources.

One wave had barely subsided before another rose. According to Reuters, two more authors sued OpenAI in federal court in San Francisco, arguing that OpenAI misused their works to train ChatGPT, mining data from thousands of books without permission and infringing the authors' copyrights.


According to public information, after ChatGPT was found to have accidentally leaked user chat histories, the Italian Data Protection Authority announced at the end of March this year that it would temporarily ban ChatGPT and investigate the tool for allegedly violating privacy rules. Canada is also investigating complaints that OpenAI "collects, uses and discloses personal information without consent."

In April, Reddit officially announced that it would charge companies that call its APIs, because OpenAI, Google, and others had been using data on the platform to train their models. For a time, problems with OpenAI's training data were exposed one after another.


Generative AI products built on large models are an "aesthetics of violence" powered by computing power and data. Data is the threshold, and the massive corpora these models consume carry a high degree of compliance risk. ChatGPT, with its 100 million users and billions of visits, was the first to suffer, because as the Chinese saying goes, a tall tree catches the wind.

However, this is not an isolated problem for OpenAI and ChatGPT. The data security issues it has exposed, such as privacy leakage, storage of sensitive information, and unauthorized access, are problems that any large-model product may face once deployed. Since the release of ChatGPT, Chinese companies have released more than 70 foundation models. How to achieve data compliance in the coming commercialization process has become a question every product must answer.


Summary

The wave of AI will not stop. How to steer the ship forward, finding a balance between enterprise survival and compliant production, has become a defining question of the fourth industrial revolution. For enterprises that have released, or are about to release, foundation models, ensuring data compliance will be one of the issues they must confront.
