AI Newsletter 114 - Extracting Real Training Data from ChatGPT

Welcome to Nural's newsletter focusing on how AI is being used to tackle global grand challenges.

Packed inside we have

  • Extracting Training Data from ChatGPT - adversarial attacks
  • Can Generalist Foundation Models Outcompete Special-Purpose Tuning?
  • and Home Monitoring of Asthma Exacerbations in Children and Adults with AI

If you would like to support our continued work from £2/month then click here!

Marcel Hedman


Key Recent Developments


UK to invest £500M more in AI compute capacity, launch five new quantum projects

UK to invest £500M more in AI compute capacity, launch five new quantum projects - SiliconANGLE
UK to invest £500M more in AI compute capacity, launch five new quantum projects - SiliconANGLE

What: The U.K. government will invest £500 million, or $626 million, to provide local researchers and organizations with access to compute capacity for artificial intelligence projects. 


Extracting Training Data from ChatGPT

Extracting Training Data from ChatGPT

Abstract: We have just released a paper that allows us to extract several megabytes of ChatGPT’s training data for about two hundred dollars. (Language models, like ChatGPT, are trained on data taken from the public internet. Our attack shows that, by querying the model, we can actually extract some of the exact data it was trained on.) We estimate that it would be possible to extract ~a gigabyte of ChatGPT’s training dataset from the model by spending more money querying the model.


AI Ethics & 4 Good

🚀 Deep generative modeling of the human proteome reveals over a hundred novel genes involved in rare genetic disorders

🚀 Home Monitoring of Asthma Exacerbations in Children and Adults With Use of an AI-Aided Stethoscope

Other interesting reads

🚀 Dell Inks $150 Million Hardware Deal With AI Startup Imbue

🚀 Interview: Sam Altman on being fired and rehired by OpenAI

🚀 Google AI and robots join forces to build new materials

Papers

🚀Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine [Microsoft]

🚀Researchers at UC Berkeley Introduced RLIF: A Reinforcement Learning Method that Learns from Interventions in a Setting that Closely Resembles Interactive Imitation Learning


Cool companies found this week

Foundation Models

Imbue - Building AI systems that can reason


Best,

Marcel Hedman
Nural Research Founder
www.nural.cc

If this has been interesting, share it with a friend who will find it equally valuable. If you are not already a subscriber, then subscribe here.

If you are enjoying this content and would like to support the work financially then you can amend your plan here from £2/month!