AI updates from the past week: Anthropic launches Claude 4 models, OpenAI adds new tools to Responses API, and more

OpenAI adds new tools and features to the Responses API

New additions include remote MCP server support, support for the latest image generation model, the ability to use the Code Interpreter tool, and the ability to use the file search tool in OpenAI’s reasoning models.

The company has also added background mode, which allows the model to execute complex reasoning tasks asynchronously; reasoning summaries; and the ability to reuse reasoning items across different API requests.

Mistral launches LLM for coding agents

Devstral is a lightweight open source model designed specifically for agentic coding tasks. According to the SWE-Bench Verified benchmark, Devstral outperforms GPT-4.1-mini and Claude 3.5 Haiku. Its small size allows it to run on a single RTX 4090 or a Mac with 32GB RAM, enabling it to be utilized for local, on-device use.

“While typical LLMs are excellent at atomic coding tasks such as writing standalone functions or code completion, they currently struggle to solve real-world software engineering problems. Real-world development requires contextualising code within a large codebase, identifying relationships between disparate components, and identifying subtle bugs in intricate functions. Devstral is designed to tackle this problem. Devstral is trained to solve real GitHub issues,” Mistral wrote in its announcement.

AI updates from Google I/O

Google I/O was full of updates on AI, including new models such as the new text model Gemini Diffusion and Gemma 3n, a multimodal model designed for running on phones, laptops and tablets, capable of handling audio, text, image, and video.

Google also revealed two new Gemma model variants: MedGemma for health applications and SignGemma for translating sign language into spoken language text.

Gemini Code Assist for individuals and Gemini Code Assist for GitHub are both now generally available as well, and are powered by Gemini 2.5. This tool was first introduced as a preview back in February, and today’s GA release includes several new updates, including chat history and threads, the ability to specify rules to apply to every AI generation in the chat, custom commands, and the ability to review and accept code suggestions in parts, across files, or all together.

The company also announced a reimagined version of Colab, a new tool that generates UI components from wireframes or text prompts called Stitch, and new features in Firebase Studios, such as the ability to translate Figma designs into applications.

AI updates from Microsoft Build

A new coding agent has been added to GitHub Copilot that gets activated when a developer assigns it a GitHub issue or calls it via a prompt in VS Code. It can assist with a number of tasks, including adding features, fixing bugs, extending tests, refactoring code, and improving documentation. All of the agent’s pull requests require human approval before they run, GitHub confirmed.

Microsoft also announced Windows AI Foundry, a platform that supports the AI developer life cycle across training and inference. Developers will be able to manage and run open-source LLMs through Foundry Local or bring proprietary models and convert, fine-tune, and deploy them across clients and cloud.

Support for the Model Context Protocol (MCP) was also added across Microsoft’s platforms and services, including GitHub, Copilot Studio, Dynamics 365, Azure AI Foundry, Semantic Kernel, and Windows 11.

Microsoft also announced a new open source project called NLWeb to help developers create conversational AI interfaces for their websites using any model or data source they’d like. NLWeb endpoints also act as MCP servers, so developers will be able to easily make their content discoverable to AI agents if they’d like.

Shopify releases new developer tools

It is launching a new unified developer platform that integrates the Dev Dashboard and CLI and offers AI-powered code generation. Developers can also now create “dev stores” where they can preview apps in test environments, a feature that was previously only available to Plus plans, and is now available to all developers.

Other new features announced today include declarative custom data definitions, a unified Polaris UI toolkit, and Storefront MCP, which allows developers to build AI agents that will act as shopping assistants for stores.

HeyMarvin launches AI Moderated Interviewer

The AI Moderated Interviewer conducts moderated user interviews with potentially thousands of participants without a human facilitator. It can also analyze the interview responses to surface insights and trends.

“What makes it so powerful is that it enables free-flowing, qualitative, engaging conversations — but on demand and at scale,” said Prayag Narula, CEO and co-founder of HeyMarvin. “We’re talking hundreds, even thousands of people, something that was previously only seen at large scale using a small army of volunteers in moments like presidential elections. Now, even a small team can have that same in-depth dialogue with their customers. It’s not just a better survey, and it’s not replacing traditional user interviews. It’s a whole new way of doing research that simply didn’t exist a few months ago.”

Zencoder announces Autonomous Zen Agents for CI/CD

These agents run directly in CI/CD pipelines and can be triggered by webhooks from issue trackers or code events. They can resolve issues, implement fixes, improve code quality, generate and run tests, and create documentation.

“The next evolution in AI-powered development isn’t just about coding faster – it’s about accelerating the whole software development lifecycle, where coding is just one step,” said Andrew Filev, CEO and founder of Zencoder. “By bringing autonomous agents into CI/CD pipelines, we’re enabling teams to eliminate routine work and accelerate hand-offs, maintaining momentum 24/7, while keeping humans in control of what ultimately ships.”

Read last week’s AI updates here: OpenAI Codex, AWS Transform for .NET, and more — May 16, 2025

Article Tags

anthropic, Google, heymarvin, Microsoft, Mistral, OpenAI, Shopify, zencoder

About Jenna Barron

Jenna Barron is News Editor of SD Times.

View all posts by Jenna Barron

Cookie	Duration	Description
cf_use_ob	past	Cloudflare sets this cookie to improve page load times and to disallow any security restrictions based on the visitor's IP address.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
JSESSIONID	session	The JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application.
PHPSESSID	session	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__atuvc	1 year 1 month	AddThis sets this cookie to ensure that the updated count is seen when one shares a page and returns to it, before the share count cache is updated.
__atuvs	30 minutes	AddThis sets this cookie to ensure that the updated count is seen when one shares a page and returns to it, before the share count cache is updated.
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.

Cookie	Duration	Description
__gads	1 year 24 days	The __gads cookie, set by Google, is stored under DoubleClick domain and tracks the number of times users see an advert, measures the success of the campaign and calculates its revenue. This cookie can only be read from the domain they are set on and will not track any data while browsing through other sites.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_S6PB8V57DG	2 years	This cookie is installed by Google Analytics.
_gat_gtag_UA_846073_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_jsuid	1 year	This cookie contains random number which is generated when a visitor visits the website for the first time. This cookie is used to identify the new visitors to the website.
at-rand	never	AddThis sets this cookie to track page visits, sources of traffic and share counts.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
iutk	5 months 27 days	This cookie is used by Issuu analytic system to gather information regarding visitor activity on Issuu products.
uvc	1 year 1 month	Set by addthis.com to determine the usage of addthis.com service.
vuid	2 years	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos to the website.
WMF-Last-Access	1 month 14 hours 26 minutes	This cookie is used to calculate unique devices accessing the website.

Cookie	Duration	Description
__Host-GAPS	2 years	This cookie allows the website to identify a user and provide enhanced functionality and personalisation.
_pxhd	session	Used by Zoominfo to enhance customer data.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
loc	1 year 1 month	AddThis sets this geolocation cookie to help understand the location of users who share the information.
mc	1 year 1 month	Quantserve sets the mc cookie to anonymously track user behaviour on the website.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
__gpi	1 year 24 days	No description
__Secure-YEC	1 year 1 month	No description
_heatmaps_g2g_100754890	10 minutes	No description
_techvalidate_session	session	No description
cf_7166_id	20 years	No description
cf_7166_person_last_update	session	No description
f5avraaaaaaaaaaaaaaaa_session_	session	No description available.
GoogleAdServingTest	session	No description
Gyazo_cfwoker	7 years 2 months 17 days 7 hours	No description
incap_ses_451_2783402	session	No description
incap_ses_769_2783402	session	No description
loglevel	never	No description available.
m	2 years	No description available.
nlbi_2783402	session	No description
prism_252377639	1 month	No description
TS011605d9	session	No description
ustream-guest	session	No description available.
visid_incap_2783402	1 year	No description
xtc	1 year 1 month	No description

AI

AI and Software Development

Observability

Guide to Observability

CI/CD

A guide to CI/CD

Cloud Native

Cloud Native Content

Data

A Guide to Data

Test

Security Testing

Mobile

Mobile Testing

API

Sponsored by Parasoft

Performance

Load & Performance Testing

DevSecOps

A Guide to DevSecOps

Enterprise Security

A Guide to Security

Supply Chain Security

Supply Chain Security

Dev Manager

Dev Managers Content

Agile

A Guide To Agile

Value Stream

A Guide To Value Stream

Productivity

A Guide To Productivity

DevOps

DevOps Content

API

Gravitee.io

AI

AI and Software Development

Value Stream Management

A Guide To Value Stream

AI updates from the past week: Anthropic launches Claude 4 models, OpenAI adds new tools to Responses API, and more — May 23, 2025

OpenAI adds new tools and features to the Responses API

Mistral launches LLM for coding agents

AI updates from Google I/O

AI updates from Microsoft Build

Shopify releases new developer tools

HeyMarvin launches AI Moderated Interviewer

Zencoder announces Autonomous Zen Agents for CI/CD

Article Tags

Subscribe to SDTimes

About Jenna Barron

Related Articles

Google’s new Opal tool allows users to create mini AI apps with no coding required

This week in AI dev tools: Gemini 2.5 Flash-Lite, GitLab Duo Agent Platform beta, and more (July 25, 2025)

Google adds updated workspace templates in Firebase Studio that leverage new Agent mode

Google launches OSS Rebuild tool to improve trust in open source packages