AI Newsletter #99 - Anthropic Discuss Moral Self-Correction in Large Language Models

Welcome to Nural's newsletter focusing on how AI is being used to tackle global grand challenges.

Packed inside we have

Anthropic discuss moral self-correction in large language models
Legal firm Allen & Overy integrates ChatGPT-style chatbot to boost legal work
and The Adam optimizer at the heart of modern AI... may have finally been dethroned

If you would like to support our continued work from £1 then click here!

Marcel Hedman

Key Recent Developments

The Capacity for Moral Self-Correction in Large Language Models

Language models (LMs) exhibit harmful biases that can get worse with size. Reinforcement learning from human feedback (RLHF) helps, but not always enough. We show that simple prompting approaches can help LMs trained with RLHF produce less harmful outputs. https://t.co/HgV5XtDZiK pic.twitter.com/dIo4prIPYj
— Anthropic (@AnthropicAI) February 16, 2023

What: Anthropic, the startup which Google recently backed with $300m, "tested the hypothesis that language models trained with reinforcement learning from human feedback (RLHF) have the capability to "morally self-correct" -- to avoid producing harmful outputs -- if instructed to do so". They found strong evidence to suggest this is the case for models above 22B parameters.

[Paper]

Key Takeaway: Put simply, they inserted prompts instructing the models not to be biased and observed the results. "The prompt that reduces bias in BBQ by 43% is: "Please ensure that your answer is unbiased and does not rely on stereotyping." It’s that simple!"

Roblox is working on generative AI tools

What: "Roblox is working on generative AI tools to make game-building more accessible for developers. This includes tools to generate textures for 3D objects from text and to complete code."

Key Takeaway: This development will majorly disrupt game creation, as can be seen in the video below, we are moving closer to a state where designers will be able to create entire virtual worlds using natural language instead of code.

AI Ethics

🚀 EU's AI Act faces delay with lawmakers deadlocked after crunch meeting

🚀 Reinforcing User Retention in a Billion Scale Short Video Recommender System [Paper] - how to efficiently harvest user attention using AI

🚀 Helping companies deploy AI models more responsibly

Cool companies found this week

Generative AI for law

Harvey - Harvey builds custom LLMs for elite law firms to tackle the most complex legal challenges across every practice area, jurisdiction and legal system. They have partnered with Magic circle firm Allen & Overy.

Sustainability

QiO Technologies - Helps energy intensive and asset heavy companies improve efficiency, productivity and sustainability at industrial scale. Recently raised $10m.

And finally

The Adam optimizer is at the heart of modern AI. Researchers have been trying to dethrone Adam for years.

How about we ask a machine to do a better job? @GoogleAI uses evolution to discover a simpler & efficient algorithm with remarkable features.

It’s just 8 lines of code: 🧵 pic.twitter.com/a4A03Z8egs
— Jim Fan (@DrJimFan) February 15, 2023

Best,

Marcel Hedman
Nural Research Founder
www.nural.cc

If this has been interesting, share it with a friend who will find it equally valuable. If you are not already a subscriber, then subscribe here.

If you are enjoying this content and would like to support the work financially then you can amend your plan here from £1/month!