Anthropic’s most dangerous AI model just fell into the wrong hands
Anthropic's Mythos AI model reportedly accessed by unauthorized users through a contractor's compromised credentials.
62 topics · 6 multi-source · newest first
OpenAI released ChatGPT Images 2.0, a new image generation model that Sam Altman claims represents a leap equivalent to GPT-3 to GPT-5.
Anthropic demonstrates automated AI agents can conduct alignment research autonomously, outperforming human researchers on weak-to-strong supervision tasks.
Latent Space newsletter covers Claude Opus 4.7 launch, computer-use agents, and open-source agent research developments.
Anthropic launched Claude Opus 4.7, claiming improvements across reasoning, coding, vision, and instruction-following with unchanged pricing.
OpenAI releases GPT-Rosalind, a reasoning model designed for drug discovery, genomics, and protein analysis.
SpaceX reportedly agrees to either acquire coding platform Cursor for $60 billion or pay a $10 billion fee.
Apple names hardware executive John Ternus as successor to CEO Tim Cook, effective September 1st.
Mozilla used an early version of Claude Mythos Preview to identify 271 vulnerabilities fixed in Firefox 150.
Cybercriminals are increasingly using generative AI tools to scale phishing, deepfakes, malware, and fraud attacks, while AI-powered defenses also improve.
Multi-agent AI tools that coordinate multiple agents to complete complex tasks are beginning to ship from major AI companies.
Robotics companies are collecting human movement data at scale to train humanoid robots, raising questions about feasibility and labor implications.
Chinese AI labs are gaining market share through open-source model releases, reportedly surpassing US companies in global downloads.
AI companies develop autonomous research systems to assist scientific discovery, though research suggests potential risks to scientific diversity.
Global anti-AI movement grows amid concerns over energy costs, job losses, mental health impacts, and military applications.
OpenAI rolls out ChatGPT Images 2.0 with web-search capabilities and improved instruction-following for image generation.
Clarifai deletes 3 million OkCupid photos used for facial recognition training following FTC settlement.
Moonshot releases Kimi K2.6, a 1T-parameter open-weight MoE model reportedly competing with Opus and Gemini on coding and agentic tasks.
OpenAI launches Codex Labs and announces enterprise partnerships to scale Codex deployment across software development.
Noetik signed a $50M deal with GSK to license AI models for cancer patient-treatment matching, aiming to improve clinical trial success rates.
Simon Willison discusses how headless APIs are becoming essential for personal AI agents to interact with enterprise services like Salesforce.
Researchers introduce open-world evaluations, a new methodology for testing frontier AI capabilities in real-world settings, and launch CRUX, a 17-person collaboration conducting such evaluations.
Hugging Face announces training and finetuning capabilities for multimodal embedding and reranker models via Sentence Transformers.
OpenAI launches Trusted Access for Cyber program with GPT-5.4-Cyber and $10M in API grants for security firms and enterprises.
Google DeepMind releases Gemini 3.1 Flash TTS with granular audio tags for expressive speech generation.
Hugging Face publishes research on VAKRA, examining reasoning, tool use, and failure modes in AI agents.
OpenAI updates its Agents SDK with native sandbox execution and model-native harness for secure, long-running agent development.
Datasette 1.0a27 alpha release introduces modern CSRF security headers and table rename event handling for plugins.
YouTube expands its AI deepfake detection feature to celebrities, allowing them to find and request removal of synthetic videos.
GitHub Copilot tightens usage limits, pauses individual plan signups, and shifts to token-based pricing for agentic workflows.
Anthropic reportedly tested restricting Claude Code to $100+/month plans before quickly reverting the change after public backlash.
Meta reportedly developed an internal tool that converts employee mouse movements and clicks into AI training data.
MIT Technology Review explores emerging techniques to make large language models more efficient and capable of solving complex multi-step problems.
Researchers at Google DeepMind, Stanford, and other labs are advancing world models to help AI systems better understand and navigate physical environments.
Weaponized deepfakes are increasingly used for sexual exploitation, political propaganda, and disinformation, raising concerns about societal trust and election integrity.
AI startup NeoCognition raises $40M seed funding to develop domain-adaptive AI agents.
Public concern about AI is rising, but the technology remains a minor focus in election campaigns despite majority support for regulation.
Framework launches external GPU modules for its Laptop 16, enabling users to connect desktop graphics cards via OCuLink.
Latitude launches Voyage, a platform enabling users to create AI-powered role-playing games.
Yelp upgrades its AI chatbot assistant with booking and recommendation features to streamline user interactions.
Hugging Face launches QIMMA, a quality-focused Arabic language model leaderboard.
Hugging Face publishes guidance on grounding Korean AI agents using synthetic personas based on real demographic data.
Google rolls out Gemini integration in Chrome across seven Asia-Pacific countries on desktop and iOS.
Fermi's CEO and CFO depart amid operational challenges at the AI nuclear power startup co-founded by Rick Perry.
Simon Willison upgraded his Claude Token Counter tool to compare tokenization across different Claude model versions.
Hyatt deploys ChatGPT Enterprise across its global workforce to improve productivity and guest experiences.
Palantir publishes statement criticizing inclusivity initiatives and describing them as regressive cultural practices.
Simon Willison analyzes system prompt changes between Claude Opus 4.6 and 4.7, documenting updates to child safety, tool integrations, and response behavior.
Simon Willison documents Claude system prompt changes across versions using git timeline visualization.
llm-anthropic 0.25 is released, adding the Claude Opus 4.7 model with support for extended thinking and new display options.
Simon Willison's pelican-drawing benchmark shows Qwen3.6-35B-A3B outperforming Claude Opus 4.7 on image generation tasks.
OpenAI releases updated Codex app with computer use, browsing, image generation, memory, and plugins for developers.
Hugging Face publishes research on Ecom-RLVE, an adaptive verifiable environment framework for e-commerce conversational agents.
Hugging Face announces a feature enabling users to open pull requests automatically.
Sam Altman criticizes Anthropic's cybersecurity model Mythos as relying on fear-based marketing.
Starbucks' new ChatGPT integration for ordering launched last week but proved cumbersome compared to the standard app.
Bond, a new social media platform, launches with AI designed to encourage users to reduce screen time and engage in offline activities.
AI music startup GRAI argues that fans prefer remixing existing tracks over generating songs from scratch.
Simon Willison releases llm-openrouter 0.6 with a refresh command for updating available models.
TechCrunch analyzes OpenAI's recent acquisitions and their potential to address existential challenges facing the company.
Simon Willison used Claude Code to add beat content support to his blog-to-newsletter tool via a single agentic prompt.
PyCon US 2026 in Long Beach will feature new dedicated AI and Security tracks alongside traditional programming.