We take aggressive, proactive countermeasures to protect our technology and will continue working closely with the U.S. government to protect the most capable models being built here,
We are aware of and reviewing indications that DeepSeek may have inappropriately distilled our models, and will share information as we know more,
I think one of the things you’re going to see over the next few months is our leading AI companies taking steps to try and prevent distillation,
I think that there’s a pretty obvious reason for that choice, which is that they harvested ChatGPT for training data,
If you ask it what model are you, it would say, ‘I’m ChatGPT,’ and the most likely reason for that is that the training data for DeepSeek was harvested from millions of chat interactions with ChatGPT that were just fed directly into DeepSeek’s training data,
Users who are high-risk in relation to mainland China, including human rights activists, members of targeted diaspora populations, and journalists should be particularly sensitive to these risks and avoid inputting anything into the system,
The same risks apply to all AI platforms, including those based in the United States,
Be careful about inputting sensitive personal data, financial details, trade secrets, or information about healthcare. Anything you type could be stored, analyzed, or requested by authorities under China’s data laws,
Anyone who is remotely critical of the administration, is a watchdog of the administration, or is part of a vulnerable or at-risk community, should exercise serious caution before using or inputting any data into what are largely ‘black boxes.’ Remember, as with virtually all social media platforms, users’ data is part of the raw material used to train those systems,
We engage in counter-measures to protect our IP, including a careful process for which frontier capabilities to include in released models, and believe as we go forward that it is critically important that we are working closely with the US government to best protect the most capable models from efforts by adversaries and competitors to take US technology,
There's no such thing as low cost, because the security and privacy costs are extremely high - let alone the perverted prism through which many answers will be presented
Deepseek R1 is one of the most amazing and impressive breakthroughs I´ve ever seen - and as open source, a profound gift to the world.
DeepSeek’s privacy policy openly states that the wide array of user data they collect goes to servers in China,
There is always the risk of cyberattacks,
DeepSeek’s privacy policy, which can be found in English, makes it clear: user data, including conversations and generated responses, is stored on servers in China,
There's substantial evidence that what DeepSeek did here is they distilled the knowledge out of OpenAI's models,
We know PRC based companies – and others – are constantly trying to distil the models of leading US AI companies,
You are prohibited from […] using Output to develop models that compete with OpenAI,
Deepseek's R1 is an impressive model, particularly around what they're able to deliver for the price,
Self-hosting ChatGPT Gov enables agencies to more easily manage their own security, privacy, and compliance requirements,