2026-04-17 Digest

Tracked 366 · Curated 15

#1 Cloudflare Unifies AI Inference Layer for Agentic Workloads

Cloudflare has launched a unified AI inference layer, enabling developers to access over 70 AI models from more than 12 providers, including OpenAI and Anthropic, through a single API. This platform simplifies AI agent development by allowing seamless model switching, unified cost management, and support for multimodal applications (image, video, speech). Cloudflare also plans to support Bring Your Own Model (BYOM) capabilities.

9.1

#2 Cloudflare rebrands Browser Rendering to Browser Run, empowering AI agents

Cloudflare has rebranded its Browser Rendering product as Browser Run and introduced new features to position it as the browser for AI agents. Key updates include Live View, Human in the Loop intervention, a Chrome DevTools Protocol (CDP) endpoint, MCP Client Support, WebMCP, Session Recordings, and a higher limit of 120 concurrent browsers (up from 30). These enhancements aim to let AI agents interact with the web more effectively.

9.0

#3 OpenAI Launches GPT-Rosalind: First Life Sciences AI Model for Drug Discovery

OpenAI has launched GPT-Rosalind, its first AI model specifically designed for the life sciences sector to accelerate drug discovery and genomics research. Unlike general-purpose models, GPT-Rosalind is fine-tuned for biological research, assisting with tasks like evidence synthesis, hypothesis generation, and experimental planning. It demonstrated strong performance on benchmarks like BixBench and LABBench2, and in a real-world partnership, outperformed 95% of human experts on prediction tasks for novel RNA sequences. The model is available via ChatGPT, Codex, and API, with gated access for qualified US enterprise customers.

8.6

#4 Alibaba Releases OccuBench Benchmark for Evaluating AI Agents

Alibaba has released OccuBench, a benchmark suite that evaluates AI agents on 100 professional tasks across 10 industries, using Language World Models to simulate real-world environments. GPT-5.2 leads with a score of 79.6%, but no single model dominates all sectors. Implicit faults prove harder for agents to handle than explicit errors.

8.6

#5 AiScientist: Autonomous Long-Horizon ML Research

AiScientist introduces a 'File-as-Bus' virtual lab that coordinates hierarchical agents through durable workspace state rather than message handoffs, making durable artifacts the system of record. This sustains progress across paper understanding, implementation, and experimentation. AiScientist improves the PaperBench score by 10.54 points.
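
As a rough illustration of coordinating through durable files instead of message handoffs, here is a minimal Python sketch. It is not AiScientist's implementation: the file names (`plan.json`, `progress.json`), the `planner` and `worker` roles, and the step list are all hypothetical and chosen only to show the pattern.

```python
import json
import tempfile
from pathlib import Path
from typing import List, Optional


def write_artifact(workspace: Path, name: str, payload: dict) -> Path:
    """Persist an artifact by writing a temp file and renaming it into place."""
    target = workspace / name
    tmp = target.with_suffix(target.suffix + ".tmp")
    tmp.write_text(json.dumps(payload, indent=2), encoding="utf-8")
    tmp.replace(target)  # rename over the old version, if any
    return target


def read_artifact(workspace: Path, name: str) -> Optional[dict]:
    """Load an artifact from the workspace, or return None if it is absent."""
    path = workspace / name
    if not path.exists():
        return None
    return json.loads(path.read_text(encoding="utf-8"))


def planner(workspace: Path) -> None:
    """Hypothetical top-level agent: records the plan as a file."""
    steps = ["read paper", "implement", "run experiment"]
    write_artifact(workspace, "plan.json", {"steps": steps})


def worker(workspace: Path) -> bool:
    """Hypothetical sub-agent: does one pending step, returns False when idle."""
    plan = read_artifact(workspace, "plan.json")
    if plan is None:
        return False
    progress = read_artifact(workspace, "progress.json") or {"done": []}
    remaining = [s for s in plan["steps"] if s not in progress["done"]]
    if not remaining:
        return False
    progress["done"].append(remaining[0])
    write_artifact(workspace, "progress.json", progress)
    return True


def run_demo() -> List[str]:
    """Run planner and worker against a throwaway workspace directory."""
    with tempfile.TemporaryDirectory() as tmp:
        workspace = Path(tmp)
        planner(workspace)
        while worker(workspace):
            pass
        return read_artifact(workspace, "progress.json")["done"]


if __name__ == "__main__":
    print(run_demo())
```

Because each call to `worker` re-reads its state from disk, a restarted process would resume from the last recorded step, which is the property that "durable artifacts as the system of record" suggests.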

8.4

#6 OpenAI Launches Trusted Access for Cyber with GPT-5.4-Cyber to Bolster Global Defense

OpenAI has launched its Trusted Access for Cyber initiative, bringing together leading security firms and enterprises. The program will leverage GPT-5.4-Cyber and $10 million in API grants to enhance global cyber defense capabilities.

8.3

#7 OpenAI Upgrades Codex to Always-On AI Coding Agent

OpenAI is significantly expanding its developer tool Codex, enabling it to control a Mac, generate images, remember preferences, and work on tasks autonomously for weeks. The expansion directly targets Anthropic's Claude Code.

8.2

#8 Codex Enhances Capabilities with Remote Connection and App Integration

Codex has expanded its capabilities, now supporting Remote Connection for SSH-accessible devboxes where files, commands, and compute remain on the remote machine. This update is rolling out in alpha for enterprise environments. Additionally, Codex can now use apps on your Mac, connect to more tools, create images, learn from past actions, and handle ongoing and repeatable tasks.

8.2

#9 Study: More Capable AI Models Change Developer Workflows

A study by Cursor in partnership with a University of Chicago economist reveals that more capable AI models are changing how developers work. Across 500 teams, developers are tackling more ambitious AI-driven tasks, with high-complexity tasks increasing by 68% this year. As AI improves at coding, developers are shifting focus to managing AI output, with significant growth in tasks like documentation, architecture, code review, and learning.

8.1

#10 Is Your Internal Platform Ready to Keep Up With AI-Accelerated Development?

AI-accelerated development is increasing the pace of software delivery, but developers still face friction accessing pipelines, creating bottlenecks for platform teams. This article discusses how Internal Developer Portals (IDPs) can enable self-service, bridge this gap, improve efficiency, and ensure consistent delivery across an organization. An event will be held on April 23.

8.0

#11 RIP Pull Requests (2005-2026): AI Shifts Code Collaboration Paradigm

This article discusses the potential demise of Pull Requests (PRs), following the presumed death of the Code Review. Popularized by GitHub since 2005, PRs have been a cornerstone of developer collaboration. However, the rise of Generative AI, exemplified by the 'Prompt Request' concept and alternative contribution systems, suggests a shift away from traditional PRs. The piece argues that PR-based workflows may not suit a future of AI-driven code, especially once the human bottleneck is removed, potentially rendering Git-based collaboration obsolete.

8.0

#12 Google AI Launches Gemini 3.1 Flash TTS, Setting New Benchmark for Expressive and Controllable AI Voice

Google AI has launched Gemini 3.1 Flash TTS, a preview text-to-speech model enhancing speech quality, expressive control, and multilingual generation. It supports over 70 languages, features natural-language audio tags and native multi-speaker dialogue, and integrates SynthID watermarking for AI-generated content identification. The model is available via Gemini API, AI Studio, Vertex AI, and Google Vids.

7.9

#13 Anthropic Releases Claude Opus 4.7 with Unchanged Pricing

Anthropic has officially released Claude Opus 4.7 at the same price as Opus 4.6 ($5 per million input tokens, $25 per million output tokens). The API model name is claude-opus-4-7. The new model is designed to handle long-running tasks with more rigor, follow instructions more precisely, and self-verify outputs before reporting, so it requires less supervision. It is now available across Claude's product suite and on Amazon Bedrock, Google Cloud Vertex AI, and Microsoft Foundry.
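
To make the quoted prices concrete, the following Python sketch estimates the cost of a single request from its token counts. Only the two per-million-token prices come from the item above; the function name and the example token counts are hypothetical, and the calculation ignores any discounts or surcharges that may apply.

```python
from decimal import Decimal

# Prices quoted in the digest item above (USD per one million tokens).
INPUT_USD_PER_MTOK = Decimal("5")
OUTPUT_USD_PER_MTOK = Decimal("25")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated request cost in USD at the quoted prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MTOK / TOKENS_PER_MTOK
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MTOK / TOKENS_PER_MTOK
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: 200,000 input tokens and 20,000 output tokens.
    print(estimate_cost_usd(200_000, 20_000))
```

For the hypothetical request shown, the input side costs $1.00 and the output side $0.50, so output tokens account for a third of the bill despite being a tenth of the input volume.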

7.9

#14 OpenAI's Codex Major Update: Now Capable of Computer Operations and Task Automation

OpenAI has significantly upgraded Codex, transforming the coding assistant into a more capable 'work partner' that can operate a computer. The update allows Codex to interact with apps, use the mouse and keyboard, generate images, learn from user actions, and handle repetitive tasks on macOS.

7.9

#15 Hugging Face Enters "Computer Use" with HoloTab Browser Agent

Hugging Face released HoloTab, a Chrome extension powered by its Holo3-35B-A3B model, which allows AI agents to perform "computer use" tasks directly through a browser interface. It navigates websites, fills in forms, and repeats actions as a human would, without needing specific integrations. The release reflects a trend toward AI operating existing software interfaces rather than relying solely on APIs.

7.8
