

Debugging AI Like Software

Goodfire’s Silico might make AI engineering a bit less magical and more methodical.

San Francisco-based Goodfire has released Silico, its latest tool to help researchers adjust parameters during the training of large language models (LLMs) such as those behind ChatGPT and Gemini. The aim is to give model makers greater control over these complex systems.


The company’s CEO, Eric Ho, believes that building AI should look less like alchemy and more like engineering. ‘We want to remove the trial and error and turn training models into precision engineering,’ he says. Silico lets users zoom in on specific parts of a model, such as individual neurons or groups of neurons, run experiments on them, and adjust parameters during training.


In one case study, Goodfire tweaked an open-source model called Qwen 3 by adjusting a neuron associated with the trolley problem. The change caused the model to frame its outputs as explicit moral dilemmas. Goodfire also found that boosting certain ethical-reasoning circuits could influence how the model responds to commercial risk assessments, making the model more transparent.
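To make the idea of adjusting a single neuron concrete, here is a minimal sketch in plain Python. It is a toy, not Silico’s API or Goodfire’s method: the tiny network, its weights, and the names `forward`, `neuron` and `boost` are all hypothetical, chosen only to show how scaling one hidden unit’s activation can shift a model’s output.

```python
# Toy illustration only: NOT Silico's API or Goodfire's actual technique.
# The network, weights, and parameter names are hypothetical.
import math

# A tiny fixed network: 3 inputs -> 4 hidden neurons -> 2 output classes.
W1 = [
    [0.5, -0.2, 0.1],
    [0.3, 0.8, -0.5],
    [-0.6, 0.4, 0.9],
    [0.2, 0.1, 0.7],
]
W2 = [
    [0.4, -0.3, -0.6, 0.9],
    [-0.5, 0.6, 0.9, -0.8],
]


def relu(value):
    """Standard rectified linear activation."""
    return max(0.0, value)


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def forward(x, neuron=None, boost=1.0):
    """Run the network, optionally scaling one hidden neuron's activation."""
    hidden = [relu(sum(w * xi for w, xi in zip(row, x))) for row in W1]
    if neuron is not None:
        hidden[neuron] *= boost  # the intervention: amplify a single neuron
    logits = [sum(w * h for w, h in zip(row, hidden)) for row in W2]
    return softmax(logits)


x = [1.0, 0.5, 2.0]
baseline = forward(x)
steered = forward(x, neuron=3, boost=3.0)
print("baseline:", baseline)
print("steered: ", steered)
```

In this toy, hidden neuron 3 feeds class 0 with a positive weight and class 1 with a negative one, so amplifying it pushes probability towards class 0. Real interpretability work has to find which units matter first; the sketch only shows the intervention step.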


Silico also takes a distinctive approach to training data, letting developers filter out data points that might otherwise lead to unwanted behaviors. For instance, models can be retrained to avoid using neurons associated with religious texts or code repositories when performing numerical tasks.
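The filtering idea can be sketched in a few lines of Python. This is an assumption-laden toy, not how Silico works: a simple keyword score stands in for whatever internal signal a real tool would read from the model, and the names `feature_score`, `filter_examples`, `FLAGGED_TERMS` and the threshold value are hypothetical.

```python
# Toy illustration only: NOT Silico's API. A keyword score stands in for
# a real model-internal signal; all names and values are hypothetical.

FLAGGED_TERMS = {"def", "return", "import", "psalm", "verse"}


def feature_score(text, flagged_terms):
    """Fraction of whitespace-separated tokens that match a flagged term."""
    tokens = text.lower().split()
    if not tokens:
        return 0.0
    hits = sum(1 for token in tokens if token in flagged_terms)
    return hits / len(tokens)


def filter_examples(examples, flagged_terms, threshold):
    """Keep only examples whose score stays below the threshold."""
    return [
        text for text in examples
        if feature_score(text, flagged_terms) < threshold
    ]


examples = [
    "2 + 2 = 4",
    "def add(a, b): return a + b",
    "Psalm 23 verse 1",
    "7 * 6 = 42",
]
kept = filter_examples(examples, FLAGGED_TERMS, threshold=0.2)
print(kept)
```

Here the code snippet and the scripture reference score above the threshold and are dropped, leaving only the arithmetic examples for training.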


By packaging its in-house techniques into Silico, Goodfire aims to democratise these sophisticated processes, making them available to smaller firms and research teams that want to adapt open-source LLMs. However, critics argue that while Silico might add precision, calling the work engineering oversimplifies the complex nature of AI development.

Original source: https://www.technologyreview.com/2026/04/30/1136721/this-startups-new-mechanistic-interpretability-tool-lets-you-debug-llms/