AI News — Topics

5 model-release · OpenAI · 5 articles · 4 sources · 6h ago

OpenAI launches GPT-5.5 with improved coding, speed, and agentic capabilities

OpenAI published a system card for GPT-5.5, detailing the model's capabilities and safety properties.

Covered by: OpenAI, TechCrunch AI, The Verge AI, One Useful Thing (Ethan Mollick)

4 model-release · OpenAI single-source · 1 article · 2d ago

OpenAI launches GPT-Image-2 with better text rendering

OpenAI launches GPT-Image-2, a new image generation model with improved text rendering and thinking capabilities, now available on API and ChatGPT.

4 policy · Adobe, NVIDIA, WPP · 1 article · 3d ago

Adobe-NVIDIA-WPP partner to scale governed agentic marketing (hedged)

NVIDIA outlines expanded collaborations with Adobe and WPP to deploy enterprise-grade, policy-governed AI agents for marketing and creative workflows, with security and governance features.

4 research · Anthropic single-source · 1 article · 3d ago

Anthropic automates alignment research with autonomous agents

Anthropic demonstrates automated AI agents can conduct alignment research autonomously, outperforming human researchers on weak-to-strong supervision tasks.

4 research · Anthropic single-source · 1 article · 5d ago

Latent Space newsletter covers Claude Opus 4.7 launch, computer-use agents, and open-source agent research developments.

4 model-release · Anthropic single-source · 1 article · 7d ago

Anthropic launched Claude Opus 4.7, claiming improvements across reasoning, coding, vision, and instruction-following with unchanged pricing.

3 tool-launch · OpenAI · 2 articles · 1d ago

OpenAI rolls out workspace agents in ChatGPT for enterprises

OpenAI announced workspace agents in ChatGPT, cloud-based tools designed to automate complex workflows and help teams coordinate work across multiple applications.

Covered by: OpenAI

3 policy · GitHub · 2 articles · 2 sources · 1d ago

GitHub tightens Copilot limits and pauses new signups

GitHub is stopping new registrations for Copilot Pro, Pro+, and student plans, tightening usage quotas for personal plans, and removing the Opus model from Pro. Pro users can upgrade to Pro+ for higher limits, with refunds available if the limits cause issues before 2026-05-20.

Covered by: Simon Willison, GitHub Blog

3 tool-launch · Anthropic single-source · 1 article · 4h ago

Anthropic expands Claude connectors to personal apps

Anthropic announces new Claude connectors enabling direct access to personal apps like Spotify, Uber Eats, and TurboTax within conversations.

3 research · LiteParse single-source · 1 article · 4h ago

LiteParse brings browser-based PDF text extraction with OCR

LiteParse, an open-source project, now has a web-based version that extracts PDF text in the browser using PDF.js and optional OCR (Tesseract.js) with spatial parsing to maintain readable layouts, and includes a UI for text and JSON output.

3 policy · OpenAI; Google; Anthropic; DeepSeek; White House OSTP; Michael Kratsios single-source · 1 article · 4h ago

US plans crackdown on distillation-style AI IP theft; China denies allegations

The US considers China’s distillation attacks on AI IP a threat and is preparing a crackdown, while China calls the accusations slander.

3 business · Sierra; Fragment single-source · 1 article · 5h ago

Sierra acquires YC-backed Fragment (unconfirmed terms)

Sierra announces the acquisition of Fragment, a YC-backed French startup.

3 business · Meta single-source · 1 article · 6h ago

Meta plans to lay off 10 percent of workforce in May

Meta is planning to lay off approximately 8,000 employees in May, according to a memo from the company's chief people officer.

3 policy · Tim Cook / Apple single-source · 1 article · 7h ago

Uncertain leadership at Apple after Tim Cook (report)

Discussion of Tim Cook's potential departure and its implications for Apple.

3 research · OpenAI · 1 article · 7h ago

OpenAI releases GPT-5.5 powering Codex on NVIDIA GB200 systems (hedged)

NVIDIA and OpenAI announce GPT-5.5-powered Codex running on GB200 NVL72 infrastructure with enterprise security features and a decade-long collaboration.

3 business · Anthropic single-source · 1 article · 8h ago

Anthropic's Claude Mythos model reportedly accessed by unauthorized users

Anthropic is investigating unauthorized access to its Claude Mythos model, reportedly available to a small group since the company announced controlled testing.

3 policy · The Decoder single-source · 1 article · 8h ago

Trump adviser alleges China copying US AI at scale (unconfirmed)

The US government accuses Chinese actors of large-scale copying of American frontier AI models and plans actions to counter it.

3 policy · OpenAI, Meta, Microsoft, xAI (as examples of AI companies mentioned) single-source · 1 article · 11h ago

Data-center gas projects may exceed national emissions, WIRED analysis suggests (hedged)

New data-center natural gas projects in the US could emit up to 129 million tons of greenhouse gases annually, potentially rivaling Morocco's 2024 emissions.

3 business · Anthropic single-source · 1 article · 12h ago

Anthropic restricts OpenClaw access, implements paid tier

Anthropic has restricted the viral OpenClaw AI agent tool and introduced paid access requirements to manage system strain and improve profitability.

3 policy single-source · 1 article · 13h ago

Researchers warn US politics repeats ChatGPT mistakes with world models (hedged)

Researchers caution policymakers may underestimate the implications of advancing world models and robotics, noting China’s progress in robotics.

3 policy · Google single-source · 1 article · 13h ago

Google: 75% of new code AI-generated, reviewed by humans (unconfirmed)

Google reports that about 75% of its new code is generated by AI and then reviewed by human developers.

3 tool-launch · Microsoft single-source · 1 article · 15h ago

Microsoft rolls out Agent Mode in Office apps

Microsoft is rolling out Agent Mode, an upgraded Copilot experience, in Word, Excel, and PowerPoint this week.

3 tool-launch · OpenAI · 1 article · 16h ago

OpenAI adds automation for Codex scheduling and triggers

OpenAI announced automation capabilities for Codex enabling scheduled tasks and trigger-based workflows for reports and summaries.

3 tool-launch · OpenAI · 1 article · 16h ago

OpenAI introduces plugins and skills for automation

OpenAI announced plugins and skills features enabling users to connect tools, access data, and automate workflows.

3 tool-launch · OpenAI · 1 article · 1d ago

OpenAI launches GPT-5.5 biosafety bug bounty

OpenAI announced a bug bounty program for GPT-5.5 offering up to $25,000 for identifying universal jailbreaks related to biosafety risks.

3 policy · Orb, Bruno Mars single-source · 1 article · 1d ago

Orb's Bruno Mars partnership claim disputed by artist

WIRED reports that Sam Altman’s Orb company promoted a Bruno Mars partnership that the artist says does not exist.

3 tool-launch · OpenAI single-source · 1 article · 1d ago

OpenAI launches workspace agents for business automation

OpenAI released workspace agents in ChatGPT that can autonomously perform business tasks like gathering feedback and drafting emails for Business, Enterprise, Edu, and Teachers plan users.

3 business · SpaceX single-source · 1 article · 1d ago

SpaceX reportedly offers Cursor $60B path to buyout

SpaceX reportedly offered Cursor a $10 billion collaboration fee and a $60 billion acquisition path, a move said to have halted the startup's $2 billion fundraise.

3 research · MIT CSAIL single-source · 1 article · 1d ago

MIT CSAIL proposes calibration rewards to curb AI overconfidence

MIT researchers introduce RLCR, a training method that calibrates AI confidence alongside answers, reducing calibration error without sacrificing accuracy.

3 tool-launch · Google single-source · 1 article · 1d ago

Google Cloud releases two new TPU chips for performance

Google Cloud launched two new TPU chips offering faster performance and lower costs than previous generations.

3 policy single-source · 1 article · 1d ago

Unconfirmed report on AI model scams and social risks

WIRED AI discusses the use of AI models in scams and the potential social dangers.

3 research · Google single-source · 1 article · 1d ago

Google unveils two TPU-8 variants for agent-era workloads (unconfirmed)

Google introduces TPU8t and TPU8i, claiming improvements for training and inference in an agent-era AI platform.

3 tool-launch · Google · 1 article · 1d ago

Google releases Gemini Embedding 2 via API and Vertex AI

Google announced general availability of Gemini Embedding 2 through the Gemini API and Vertex AI.

3 business · Replit · 1 article · 1d ago

Replit Auto-Protect automates vulnerability patches

Replit announces Auto-Protect to automatically detect, patch, and guide deployment of critical CVE-related fixes for dependencies.

3 research single-source · 1 article · 1d ago

AI-assisted hackers steal up to $12M in three months (unconfirmed)

A hacker group reportedly used AI to develop malware and operate fake websites, stealing about $12 million over three months.

3 tool-launch · OpenAI · 1 article · 1d ago

OpenAI offers free ChatGPT access for U.S. clinicians

OpenAI made ChatGPT free for verified U.S. physicians, nurse practitioners, and pharmacists to support clinical care and documentation.

3 policy · NVIDIA; Google Cloud · 1 article · 1d ago

NVIDIA-Google Cloud expand agentic and physical AI capabilities (hedged)

NVIDIA and Google Cloud announce expanded cooperation to advance agentic and physical AI, including new Vera Rubin-based A5X instances, Gemini on Google Distributed Cloud, confidential computing, and open model tooling.

3 research · Google DeepMind · 1 article · 1d ago

Google DeepMind introduces Decoupled DiLoCo training

Google DeepMind published research on Decoupled DiLoCo, a technique for resilient distributed AI model training.

3 tool-launch · OpenAI · 1 article · 1d ago

OpenAI adds WebSocket support to Responses API for agents

OpenAI introduced WebSocket support in the Responses API to reduce latency and API overhead for agentic workflows.

3 business · Cloudflare single-source · 1 article · 1d ago

Cloudflare Sandboxes and Containers reach GA for AI agents

Cloudflare announces general availability of Sandboxes and Containers, offering persistent isolated Linux environments for AI agent workloads with features like credential injection, PTY support, and snapshot recovery.

3 policy · Cloudflare single-source · 1 article · 1d ago

Cloudflare outlines MCP architecture for enterprises

Cloudflare proposes a reference architecture for scaling Model Context Protocol deployments with centralized governance, remote infrastructure, and cost controls.

3 tool-launch · OpenAI · 1 article · 2d ago

OpenAI releases Privacy Filter for PII detection

OpenAI released an open-weight model designed to detect and redact personally identifiable information in text.

3 policy · US Department of Defense; Defense Autonomous Warfare Group (DAWG) single-source · 1 article · 2d ago

Pentagon seeks $53.6B for drones and counter-drone programs

The US Defense Department requests about $53.6 billion in FY2027 for drone production, operations, logistics, and counter-drone systems under the Defense Autonomous Warfare Group.

3 research · Mozilla; Anthropic; Firefox; Mythos Preview single-source · 1 article · 2d ago

Mozilla cites Mythos Preview finding 271 Firefox 150 vulnerabilities (unconfirmed)

Mozilla reports Mythos Preview identified 271 security vulnerabilities in Firefox 150 during early access, according to a company blog post.

3 business · Anthropic single-source · 1 article · 2d ago

Cybercriminals are increasingly using generative AI tools to scale phishing, deepfakes, malware, and fraud attacks, while AI-powered defenses also improve.

3 tool-launch single-source · 1 article · 2d ago

Multi-agent tools begin shipping from major AI players

Multi-agent AI tools that coordinate multiple agents to complete complex tasks are beginning to ship from major AI companies.

3 research single-source · 1 article · 2d ago

Robotics firms collect movement data to train humanoid robots

Robotics companies are collecting human movement data at scale to train humanoid robots, raising questions about feasibility and labor implications.

3 business single-source · 1 article · 2d ago

Chinese AI labs are gaining market share through open-source model releases, reportedly surpassing US companies in global downloads.

3 research · OpenAI single-source · 1 article · 2d ago

AI companies develop autonomous research systems to assist scientific discovery, though research suggests potential risks to scientific diversity.

3 model-release · OpenAI single-source · 1 article · 2d ago

OpenAI released ChatGPT Images 2.0, a new image generation model that Sam Altman claims represents a leap equivalent to GPT-3 to GPT-5.

3 policy · OpenAI single-source · 1 article · 2d ago

Florida probes OpenAI over ChatGPT in campus shooting (unconfirmed)

Florida is reportedly investigating potential liability of OpenAI regarding ChatGPT's role in a mass shooting, with officials citing chat logs; OpenAI denies responsibility.

3 policy · Replit · 1 article · 2d ago

Replit named Google Cloud AI Tooling Partner of the Year (2026)

Replit is awarded Google Cloud's 2026 AI Tooling Partner of the Year.

3 tool-launch · Replit · 1 article · 2d ago

Replit launches Security Agent for rapid in-house security reviews

Replit unveils Security Agent to perform comprehensive security reviews of apps in under an hour using a hybrid Semgrep/HoundDog.ai approach.

3 research · Google Research · 1 article · 2d ago

ReasoningBank: enabling agents to learn from experience (hedged)

A Google Research blog post discusses ReasoningBank, a system enabling agents to learn from experience.

3 tool-launch · Google · 1 article · 2d ago

Google introduces Deep Research Max autonomous agent

Google announced Deep Research Max, an upgraded version of its autonomous research agent tool.

3 research · Anthropic single-source · 1 article · 2d ago

Anthropic unveils Managed Agents for Claude workflows (hedged)

Anthropic announces Managed Agents, a layer for handling orchestration, sandboxing, and state management of Claude-based agent workflows.

3 tool-launch · OpenAI · 1 article · 2d ago

OpenAI launches ChatGPT Images 2.0 with improved text rendering

OpenAI released ChatGPT Images 2.0, an updated image generation model with improved text rendering, multilingual support, and visual reasoning capabilities.

3 research · Dan Shapiro (author) single-source · 1 article · 2d ago

Neutral overview of 'Trycycle' and related software factories (hedged)

Editorial discusses the emergence of 'dark factories' and introduces Trycycle as a simple skill for planning and execution in AI-assisted software development, alongside related tooling and patterns.

3 research · DoorDash single-source · 1 article · 2d ago

DoorDash explores dynamic, moment-aware personalization with LLMs

DoorDash describes a hybrid approach using LLMs for consumer profiles and blueprints with deep learning for ranking to capture short-lived user intent.

3 policy · Replit · 1 article · 2d ago

Replit outlines defense-in-depth security for Vibe stack

Replit describes its defense-in-depth security architecture, grounded in Zero Trust, for securing customer code from development sandbox to production deployments.

3 model-release · Moonshot single-source · 1 article · 3d ago

Moonshot releases Kimi K2.6, a 1T-parameter open-weight MoE model reportedly competing with Opus and Gemini on coding and agentic tasks.

3 tool-launch · OpenAI · 1 article · 3d ago

OpenAI launches Codex Labs to scale Codex deployment

OpenAI launches Codex Labs and announces enterprise partnerships to scale Codex deployment across software development.

3 research · Git (the Git project) · 1 article · 3d ago

Git 2.54 adds experimental history and config hooks

Git 2.54 adds an experimental git history command for simple rewrites, plus configuration-based hooks and default geometric repacking for maintenance.

3 business · Noetik single-source · 1 article · 3d ago

Noetik signs $50M deal with GSK for AI cancer matching

Noetik signed a $50M deal with GSK to license AI models for cancer patient-treatment matching, aiming to improve clinical trial success rates.

3 research · LinkedIn single-source · 1 article · 3d ago

LinkedIn unveils Cognitive Memory Agent for persistent AI memory

LinkedIn announces Cognitive Memory Agent, a memory infrastructure layer for stateful, context-aware AI with persistent memory across episodic, semantic, and procedural layers.

3 research · Google single-source · 1 article · 3d ago

Gemini CLI adds subagents for task delegation

Google introduces subagents in Gemini CLI to delegate tasks to specialized AI agents alongside a main session.

3 policy · Google single-source · 1 article · 3d ago

Google ADK for Java hits 1.0 with new architecture

Google releases ADK for Java 1.0, adding external tools integration, app/plugin architecture, and context engineering features.

3 research · GRASP (gradient-based planner) single-source · 1 article · 3d ago

GRASP: Gradient-based Planner for Long-Horizon World Models (BAIR)

BAIR introduces GRASP, a gradient-based planner that enables long-horizon planning with learned world models by lifting dynamics into a parallelizable, collocation-based optimization framework.

3 policy · NVIDIA · 1 article · 3d ago

NVIDIA and partners showcase AI-driven manufacturing at Hannover Messe 2026 (hedged)

NVIDIA and partners demonstrate AI-driven manufacturing capabilities and AI infrastructure at Hannover Messe 2026, highlighting AI-powered design, real-time simulation, and autonomous robotics.

3 business · Salesforce single-source · 1 article · 4d ago

Simon Willison discusses how headless APIs are becoming essential for personal AI agents to interact with enterprise services like Salesforce.

3 policy · GitHub · 1 article · 6d ago

GitHub upgrades status page with degraded tier and per-service uptime

GitHub announces new status-page features: a Degraded Performance state, per-service uptime metrics, and a Copilot AI Model Providers component to improve incident transparency.

3 policy · Jacob Andreas; Brett McGuire; Harold E. Edgerton Faculty Achievement Award single-source · 1 article · 6d ago

MIT Edgerton Award honors Jacob Andreas and Brett McGuire (2026)

MIT awards the 2026 Harold E. Edgerton Faculty Achievement Award to Jacob Andreas and Brett McGuire for distinguished teaching, research, and service.

3 business · OpenProtein.AI single-source · 1 article · 6d ago

OpenProtein.AI expands no-code protein design platform

OpenProtein.AI provides a no-code platform and models for protein design, enabling biologists to use AI without deep ML expertise, with Boehringer Ingelheim expanding collaboration.

2 tool-launch · Simon Willison single-source · 1 article · 6h ago

Simon Willison releases llm-openai-via-codex plugin for GPT-5.5

Simon Willison released llm-openai-via-codex, a plugin enabling LLM CLI access to GPT-5.5 via OpenAI's Codex subscription mechanism.

2 policy · OpenAI single-source · 1 article · 7h ago

OpenAI announces GPT-5.5 with autonomous agent capabilities (unconfirmed)

OpenAI announces GPT-5.5, described as an autonomous agent able to switch between tools; details and claims are unverified.

2 policy · Stanford CS 153 event single-source · 1 article · 8h ago

Study session crowds at Stanford draw Silicon Valley figures

A Stanford CS 153 event drew attention online and locally, eliciting mixed reactions.

2 policy · Google single-source · 1 article · 9h ago

Google open-sources DESIGN.md for brand-consistent prompts

Google has open-sourced the DESIGN.md prompt format used by its Stitch AI tool to help AI agents follow brand guidelines.

2 policy · OpenAI and Microsoft single-source · 1 article · 10h ago

OpenAI Trusted Access expands Microsoft cyber-defense capabilities (hedged)

OpenAI and Microsoft are collaborating on a Trusted Access program to leverage capable AI models for cybersecurity defense.

2 business · Era single-source · 1 article · 10h ago

Era raises $11M for AI hardware software platform

Era secured $11M in funding to develop a software platform for diverse AI hardware form factors including glasses, rings, and pendants.

2 research · Claude single-source · 1 article · 12h ago

Survey: Claude users cite new capabilities, speed as top benefits

A Claude user survey finds new capabilities as the top productivity benefit, with speed close behind; creatives feel left behind, and the sample has notable bias.

2 business · Delve single-source · 1 article · 12h ago

Delve linked to Context AI security incident

Delve, a compliance startup, performed security certifications for Context AI before the AI training company disclosed a security incident.

2 business single-source · 1 article · 12h ago

Tech industry misreads AI backlash as marketing problem

A tech journalist argues the AI industry fundamentally misunderstands why people dislike AI, attributing resistance to a worldview gap rather than a marketing failure.

2 policy · OpenAI single-source · 1 article · 12h ago

OpenAI releases open-source model to redact personal data

OpenAI released Privacy Filter, an open-source model that detects and redacts personal data in text.

2 policy · Sony AI single-source · 1 article · 13h ago

Sony AI's table tennis robot reaches expert level (unconfirmed)

Sony claims its table tennis robot Ace has reached expert-level performance in the sport.

2 tool-launch · GeForce NOW · 1 article · 13h ago

GeForce NOW adds in-app labels for Xbox Game Pass and Ubisoft+ libraries

GeForce NOW introduces in-app labels to show which titles are available from connected Xbox Game Pass and Ubisoft+ libraries, and adds six new games and a Thor skin.

2 policy single-source · 1 article · 16h ago

OpenAI touts Clinicians version; claims GPT-5.4 outperforms doctors on tasks

OpenAI claims GPT-5.4-based ChatGPT for Clinicians outperforms doctors on clinical tasks even with unlimited time and web access, as reported by The Decoder.

2 tool-launch · OpenAI · 1 article · 16h ago

OpenAI introduces Codex for task automation and tool integration

OpenAI announced Codex, a tool designed to automate tasks, integrate with external tools, and generate outputs like documents and dashboards.

2 business · Google single-source · 1 article · 23h ago

Google announces TPU v8, enterprise agents gain traction across vendors

Google unveiled TPU v8 chips and Gemini Enterprise Agent Platform at Cloud Next, while OpenAI, Cursor, and others standardize agent harnesses for team workflows.

2 business · Tesla single-source · 1 article · 1d ago

Tesla raises 2026 capex plan to $25 billion

Tesla raised its 2026 capital expenditure plan to $25 billion, three times its historical spending, which the CFO said will result in negative free cash flow for the remainder of the year.

2 tool-launch · Google single-source · 1 article · 1d ago

Google adds AI-powered automation to Workspace

Google introduced new automated functions in Workspace powered by its Workspace Intelligence AI system.

2 tool-launch · X single-source · 1 article · 1d ago

X launches AI-powered custom feeds with Grok curation

X is replacing Communities with AI-powered custom timelines curated by Grok, including new advertising slots.

2 policy · Elizabeth Warren single-source · 1 article · 1d ago

Elizabeth Warren warns AI industry spending risks crisis

Sen. Elizabeth Warren warns that AI companies' massive spending and borrowing practices resemble pre-2008 financial crisis conditions and calls for congressional oversight.

2 business · Shopify single-source · 1 article · 1d ago

Shopify CTO discusses internal AI adoption and tools strategy

Shopify CTO Mikhail Parakhin discusses the company's internal AI initiatives including Tangle, Tangent, and SimGym for ML experimentation and optimization.

2 policy · Anthropic single-source · 1 article · 1d ago

Anthropic tests Claude Code access on Pro plan limits (unconfirmed)

Anthropic briefly removed Claude Code from the Pro plan for a small subset of new signups, later clarifying that it was a limited test.

2 tool-launch · Sony single-source · 1 article · 1d ago

Sony's Ace robot defeats top-ranked ping-pong players

Sony AI developed Ace, a robot that can beat elite table tennis players while following official ITTF rules.

2 tool-launch · Google single-source · 1 article · 1d ago

Google adds Gemini automation to Chrome for enterprise

Google has integrated Gemini-powered auto-browse capabilities into Chrome to help enterprise workers automate research and data entry tasks.

2 research · 1 article · 1d ago

Re-composed photos via AI: angle-based approaches (unclear)

The article title suggests exploration of recomposing photos by changing angles using AI, but the content provided is minimal.

2 tool-launch · Google single-source · 1 article · 1d ago

Google launches Gemini Enterprise Agent Platform for technical users

Google released Gemini Enterprise Agent Platform, an agent-building tool designed for IT and technical enterprise users.

2 tool-launch · Google single-source · 1 article · 1d ago

Google brings AI Overviews to Gmail workspace

Google is adding AI Overviews to Gmail for work accounts, providing instant summaries across multiple emails.

2 model-release · Alibaba single-source · 1 article · 1d ago

Alibaba releases Qwen3.6-27B dense model for coding tasks

Alibaba's Qwen released Qwen3.6-27B, a 27-billion-parameter dense model claiming flagship-level coding performance in a 55.6GB package.

2 tool-launch · Google · 1 article · 1d ago

Google releases Gemma 4 VLA demo for Jetson Orin Nano Super

Google has released a Gemma 4 Vision Language Agent demo optimized for NVIDIA's Jetson Orin Nano Super edge device.

2 business · OpenAI single-source · 1 article · 1d ago

OpenAI teams up with Infosys to bring AI tools to more businesses

OpenAI and Infosys announced a partnership to integrate AI tools into Infosys's enterprise offerings for software development, modernization, and DevOps.

2 policy · Sam (pseudonym) single-source · 1 article · 1d ago

Indian student nets income with AI-generated persona for online photos (unconfirmed)

An Indian medical student describes earning money by creating and selling AI-generated bikini photos using a Gemini-based tool.

2 research · NVIDIA · 1 article · 1d ago

NVIDIA AI Approaches Sustainability: Five Initiatives (unclear details)

An NVIDIA Blog post outlines five ways NVIDIA AI aims to protect the planet, but no article body is provided.

2 policy single-source · 1 article · 1d ago

Neutral take on LLM reliability and API controls (unconfirmed)

Opinion piece argues that API design and tooling choices by model providers affect LLM reliability, not just the model itself.

2 policy · OpenAI single-source · 1 article · 1d ago

Livestream on Musk v. Altman trial and OpenAI outlook (unclear)

A livestream announced by WIRED AI will address questions about the Musk v. Altman trial and its potential impact on OpenAI.

2 research · Pangram Labs single-source · 1 article · 1d ago

Pangram Labs updates Chrome extension to label AI-generated content

Pangram Labs released an updated Chrome extension that labels AI-generated content as users scroll social feeds.

2 tool-launch · Mozilla single-source · 1 article · 1d ago

Mozilla used an early version of Claude Mythos Preview to identify 271 vulnerabilities fixed in Firefox 150.

2 business · Anthropic single-source · 1 article · 2d ago

Anthropic reportedly tested restricting Claude Code to $100+/month plans before quickly reverting the change after public backlash.

2 research single-source · 1 article · 2d ago

MIT Technology Review explores emerging techniques to make large language models more efficient and capable of solving complex multi-step problems.

2 research single-source · 1 article · 2d ago

Researchers at Google DeepMind, Stanford, and other labs are advancing world models to help AI systems better understand and navigate physical environments.

2 policy single-source · 1 article · 2d ago

Weaponized deepfakes are increasingly used for sexual exploitation, political propaganda, and disinformation, raising concerns about societal trust and election integrity.

2 policy · OpenAI single-source · 1 article · 2d ago

OpenAI updates ChatGPT Images with 2.0 model (unclear)

ChatGPT Images 2.0 improves detail and text rendering but has language limitations.

2 tool-launch · Google · 1 article · 2d ago

Google launches Continued Conversation feature in Gemini for Home

Google has released Continued Conversation, a new feature for Gemini for Home designed to make interactions more natural and efficient.

2 business · Google DeepMind · 1 article · 2d ago

Google DeepMind announces partnerships with global consultancies

Google DeepMind has partnered with global consultancies to help organizations adopt frontier AI technologies.

2 policy · GitHub single-source · 1 article · 2d ago

GitHub cites scaling and architecture as causes of outages (unconfirmed)

GitHub attributes recent outages to rapid growth, architectural coupling, and load-handling limitations.

2 tool-launch · Hugging Face · 1 article · 2d ago

Hugging Face launches QIMMA, an Arabic LLM leaderboard

Hugging Face launches QIMMA, a quality-focused Arabic language model leaderboard.

2 business · Hugging Face · 1 article · 3d ago

Analysis: openness in AI and cybersecurity strategy

Hugging Face publishes commentary on the relationship between open AI development and cybersecurity practices.

2 business · Nathan Lambert single-source · 1 article · 3d ago

Analysis: Open versus closed model performance gaps and benchmarking limitations

Nathan Lambert analyzes why open models lag closed models and argues that benchmark scores increasingly misrepresent real-world AI capabilities.

2 tool-launch · Google · 1 article · 3d ago

Google expands AI Studio access for paid tiers

Google AI Pro and Ultra subscribers now receive increased usage limits and expanded model access in Google AI Studio.

2 research · O'Reilly AI and ML (news-outlet) single-source · 1 article · 3d ago

Scenario planning for AI and the future of work

The piece argues for scenario planning to imagine divergent futures for AI and employment, citing mixed evidence on AI impact and outlining a two-axis framework to guide robust strategies across possible outcomes.

2 tool-launch · Simon Willison single-source · 1 article · 4d ago

Simon Willison shows data fetch from Datasette to Google Sheets with SQL

Simon Willison documents methods to fetch data from Datasette instances directly into Google Sheets using SQL functions.

2 tool-launch · Simon Willison single-source · 1 article · 4d ago

Simon Willison upgrades Claude Token Counter for model comparisons

Simon Willison upgraded his Claude Token Counter tool to compare tokenization across different Claude model versions.

2 business · Hyatt · 1 article · 4d ago

Hyatt rolls out ChatGPT Enterprise globally for productivity

Hyatt deploys ChatGPT Enterprise across its global workforce to improve productivity and guest experiences.

2 research · Anthropic single-source · 1 article · 5d ago

Simon Willison analyzes system prompt changes between Claude Opus 4.6 and 4.7, documenting updates to child safety, tool integrations, and response behavior.

2 research · Simon Willison single-source · 1 article · 5d ago

Simon Willison documents Claude prompts across versions with git timeline

Simon Willison documents Claude system prompt changes across versions using git timeline visualization.

2 tool-launch · GitHub · 1 article · 6d ago

Emoji list generator built with GitHub Copilot CLI (neutral, unconfirmed)

GitHub demonstrates an emoji list generator runnable in the terminal using the Copilot CLI and Copilot SDK, with plans and multi-model workflow in plan mode.

2 tool-launch · Simon Willison single-source · 1 article · 6d ago

Datasette 1.0a28 alpha fixes bugs, adds cleanup

Datasette 1.0a28 alpha release fixes compatibility bugs and adds automatic resource cleanup for database instances.

1 tool-launch · OpenAI · 2 articles · 16h ago

OpenAI publishes Codex setup and workflows guide

OpenAI published documentation on setting up and using Codex workspaces, threads, projects, and file management.

Covered by: OpenAI

1 tool-launch · Noscroll single-source · 1 article · 6h ago