AI Research & Breakthroughs Archives

Virtual Personas for Language Models via an Anthology of Backstories – The Berkeley Artificial Intelligence Research Blog

We introduce Anthology, a method for conditioning LLMs to representative, consistent, and diverse virtual personas by generating and utilizing naturalistic backstories with rich details of individual values and experience. What does it mean for large language models (LLMs) to be trained on massive text corpora, collectively produced by millions and billions of distinctive human authors? In “Language Models as Agent Models”, compelling evidence suggests that recent language models could

Language Models Reinforce Dialect Discrimination – The Berkeley Artificial Intelligence Research Blog

Sample language model responses to different varieties of English and native speaker reactions. ChatGPT does amazingly well at communicating with people in English. But whose English? Only 15% of ChatGPT users are from the US, where Standard American English is the default. But the model is also commonly used in countries and communities where people speak other varieties of English. Over 1 billion people around the world speak varieties

A Case Study with the StrongREJECT Benchmark – The Berkeley Artificial Intelligence Research Blog

When we began studying jailbreak evaluations, we found a fascinating paper claiming that you could jailbreak frontier LLMs simply by translating forbidden prompts into obscure languages. Excited by this result, we attempted to reproduce it and found something unexpected.

The Visual Haystacks Benchmark! – The Berkeley Artificial Intelligence Research Blog

Humans excel at processing vast arrays of visual information, a skill that is crucial for achieving artificial general intelligence (AGI). Over the decades, AI researchers have developed Visual Question Answering (VQA) systems to interpret scenes within single images and answer related questions. While recent advancements in foundation models have significantly closed the gap between human and machine visual processing, conventional VQA has been restricted to reason about only single

Function Calling at the Edge – The Berkeley Artificial Intelligence Research Blog

The ability of LLMs to execute commands through plain language (e.g. English) has enabled agentic systems that can complete a user query by orchestrating the right set of tools (e.g. ToolFormer, Gorilla). This, along with the recent multi-modal efforts such as the GPT-4o or Gemini-1.5 model, has expanded the realm of possibilities with AI agents. While this is quite exciting, the large model size and computational requirements of these
