Microsoft Edge Password Vulnerability: Plaintext Credentials Exposed in Memory

The digital security landscape shifted under the feet of millions of Windows users this week as a foundational architectural choice in the world’s second-most popular browser was laid bare. At the Big Bite of Tech 26 event in Oslo, security researcher Tom Jøran Sønstebyseter Rønning demonstrated a startling reality: the Microsoft Edge password vulnerability is not a mere oversight or a buggy line of code, but a deliberate design decision that leaves every saved credential on a machine exposed in plaintext within system memory. While competitors have moved toward sophisticated process-level encryption, Microsoft’s flagship browser continues to prioritize performance and legacy compatibility over modern “defense-in-depth” protocols.

The Anatomy of the Microsoft Edge Password Vulnerability

To understand the severity of this discovery, one must look at how modern browsers handle “secrets” during runtime. For years, the industry standard for on-disk security has been the Data Protection API (DPAPI) in Windows. This ensures that while your browser is closed, your passwords are encrypted and tied to your specific user profile. However, once a browser is launched, it must eventually decrypt those passwords to fulfill its role as a password manager. This is where the Microsoft Edge password vulnerability manifests.

According to Rønning’s research, Microsoft Edge performs a “bulk decryption” of the entire saved credential database (the Login Data SQLite file) immediately upon application launch. These credentials—usernames, passwords, and associated URLs—are then stored as plaintext strings within the browser’s parent process memory. Crucially, this happens even if the user never visits the websites in question during that session. The passwords sit in the RAM, unencrypted and static, for the duration of the browser’s execution.

Runtime Exposure vs. At-Rest Encryption

The core of the controversy lies in the distinction between encryption at rest and protection during runtime. Microsoft has long argued that their on-disk encryption is robust. Yet, Rønning’s demonstration proved that runtime protection is virtually non-existent in Edge. By utilizing a simple memory-scanning technique, any process with sufficient privileges can “scrape” the RAM and extract every single password a user has ever saved in Edge. This architectural choice stands in stark contrast to the evolution of the underlying Chromium engine upon which Edge is built.

Chrome’s App-Bound Encryption: The Performance vs. Security Divide

The discovery is particularly jarring because Google Chrome—the progenitor of the Chromium project—implemented a solution to this exact problem in late 2024. Known as App-Bound Encryption (ABE), this feature binds the encryption keys to the specific identity of the browser process. In Chrome, passwords are only decrypted on a just-in-time (JIT) basis—meaning a password is only converted to plaintext the moment it is needed for an autofill action or when a user explicitly requests to “view” it in the settings.

The technical differences between the two browsers are significant:

  • Decryption Trigger: Chrome decrypts credentials individually on-demand; Edge decrypts the entire vault at startup.
  • Process Isolation: Chrome uses ABE to ensure that even if another program is running as the same user, it cannot easily use Chrome’s own keys to decrypt the password database.
  • Memory Persistence: In Chrome, plaintext secrets are transient and wiped from memory shortly after use. In Edge, they persist in the parent process indefinitely.

Rønning noted that in his testing of various Chromium-based browsers, including Brave and Opera, Edge was the only major player that still followed this “all-at-once” plaintext memory loading pattern. This has led security professionals to question why Microsoft chose to deviate from the safer defaults increasingly adopted by the Chromium community.

EdgeSavedPasswordsDumper: The Proof of Concept

To silence skeptics, Rønning released a proof-of-concept (PoC) tool on GitHub named EdgeSavedPasswordsDumper. This utility, written in C#, serves as a stark reminder of how accessible this data is to anyone with administrative or SYSTEM-level privileges. The tool does not rely on complex exploits or zero-day vulnerabilities; it simply identifies the Edge parent process, scans its memory space for known patterns associated with the browser’s internal credential structures, and outputs the results in an easy-to-read format.

The technical capabilities of the dumper include:

  • Extraction of usernames, passwords, and URLs from active Edge processes.
  • The ability to target the specific parent process (identifiable via command-line arguments in Task Manager).
  • Bypassing the need for the user to be actively browsing or even interacting with the browser.

While Microsoft maintains that an attacker needs administrative rights to perform such a memory dump, the reality of the modern threat landscape is that infostealing malware frequently seeks exactly these privileges. Once a device is even partially compromised, the presence of plaintext credentials in memory allows for a “smash-and-grab” attack that is much faster and more reliable than traditional methods of cracking the DPAPI-protected database on disk.

The Enterprise Nightmare: VDI, Citrix, and Terminal Servers

While the risk to individual home users is significant, the Microsoft Edge password vulnerability presents a catastrophic risk profile for enterprise environments. In shared computing infrastructures—such as Virtual Desktop Infrastructure (VDI), Citrix, or Windows Terminal Servers—multiple user sessions often run on a single underlying hardware instance or operating system kernel.

In these environments, a single compromised administrative account can be used to “pivot” and dump the memory of every logged-in user on the server. Because Edge pre-loads the passwords for all active sessions into their respective processes, an attacker can harvest thousands of credentials from hundreds of different users simultaneously. This effectively turns a single point of failure into a total organizational compromise. As Morey Haber, Chief Security Advisor at BeyondTrust, noted, passwords should be “transient secrets” that disappear the moment they are no longer needed. In the context of a terminal server, Edge’s design makes these secrets permanent residents of the system’s RAM.

Microsoft’s Response: “By Design” and the Philosophy of Security Boundaries

Microsoft’s official stance on the matter has been divisive. The company has classified this behavior not as a bug to be patched, but as a deliberate design choice. A Microsoft spokesperson stated that the current implementation is intended to “balance browser performance with security.” Their primary argument is rooted in the Ten Immutable Laws of Security, specifically Law #6: “A computer is only as secure as the administrator is trustworthy.”

In Microsoft’s view, if an attacker has the ability to read system memory or possesses administrative privileges, the system is already “game over.” Therefore, they argue, adding layers like App-Bound Encryption or just-in-time decryption offers a false sense of security while potentially slowing down the browser’s startup time and autofill responsiveness. However, this “all-or-nothing” approach to security has been heavily criticized by the wider cybersecurity community.

The Industry Backlash: Critiquing “Performance over Protection”

Cybersecurity leaders from top-tier firms have not minced words in their critique of Microsoft’s “by design” defense. Craig Lurey, CTO and co-founder of Keeper Security, pointed out that while Windows attempts to isolate applications, the reality is that non-elevated programs can often find ways to “pillage” the memory of other processes running in the same user context. The existence of plaintext passwords in memory provides a low-hanging fruit that modern software should have phased out years ago.

BeyondTrust experts further argued that this design choice ignores the concept of Defense-in-Depth. Even if a machine is compromised, a well-designed application should make it as difficult as possible for the attacker to extract sensitive data. By leaving passwords in plaintext, Microsoft is essentially “waving the white flag” once the initial perimeter is breached, rather than forcing the attacker to navigate further hurdles.

Strategies for Mitigation and Migration

Given Microsoft’s current refusal to alter this behavior, the burden of protection has shifted to IT administrators and end-users. Security professionals are now issuing a series of urgent recommendations to mitigate the risks associated with the Microsoft Edge password vulnerability.

  1. Migrate to Dedicated Password Managers: The most effective solution is to stop using the browser’s built-in password management system entirely. Standalone managers like Keeper, Bitwarden, or 1Password utilize zero-knowledge encryption and ensure that data is never stored in a decrypted, plaintext state within the system’s general memory pool during runtime.
  2. Implement Kernel-Level Protections: Tools such as Keeper Forcefield or advanced Endpoint Detection and Response (EDR) suites can monitor for unauthorized process memory access, potentially blocking tools like EdgeSavedPasswordsDumper even if the attacker has administrative rights.
  3. Disable Password Saving via GPO: For enterprise environments, administrators should use Group Policy Objects (GPO) to disable the “Offer to save passwords” feature in Microsoft Edge across the domain.
  4. Enforce Strict Session Management: In VDI and Citrix environments, ensure that disconnected sessions are terminated quickly to clear the associated process memory from the server’s RAM.

Conclusion: The Future of Browser Trust

The revelation of the Microsoft Edge password vulnerability serves as a critical case study in the ongoing tension between user convenience and robust security. While Microsoft prioritizes a fast, seamless experience for its users, the cost of that speed is a widened attack surface that modern infostealers are all too eager to exploit. As browsers become the primary portal through which we access both personal and corporate lives, the expectation of “secure by design” is no longer optional—it is a requirement.

For now, the message from the security community is clear: a browser is a window to the web, not a safe for your secrets. Until Microsoft aligns Edge with the more rigorous memory-handling standards of its peers, the most prudent course of action is to treat browser-based password storage as a significant liability. In the age of sophisticated memory-scraping malware, plaintext is a relic that we can no longer afford to tolerate.

Posted in Data Protection, Security & Privacy | Tagged , , , | Leave a comment

ShinyHunters Canvas Breach: Billion-Message Ultimatum Hits Global Universities

The digital ivory towers of global academia are currently under a state of unprecedented siege. As of May 11, 2026, the global higher education sector is bracing for what security researchers have termed the “Mother of All Academic Leaks.” The ShinyHunters Canvas breach has transitioned from a standard data exfiltration event into a high-stakes psychological war, with a hard ultimatum set for May 12, 2026. For the 8,800 institutions currently paralyzed—including the likes of Harvard, Stanford, and the University of Pennsylvania—the next twenty-four hours represent a ticking clock that could permanently dismantle the perceived privacy of the academic experience.

The Technical Anatomy of the ShinyHunters Canvas Breach

To understand the gravity of the current crisis, one must look at the sophisticated, multi-stage intrusion strategy employed by the ShinyHunters collective. While the parent company, Instructure, initially signaled that a late-April anomaly had been successfully “contained” through standard API key rotations and security patches, the reality was far more grim. The group did not just find a hole in the fence; they had effectively mapped the entire estate. By May 7, the group had bypassed these remediations, replacing the login screens of thousands of university portals with a stark, red-rimmed ransom note that signaled the true scale of the ShinyHunters Canvas breach.

The technical root cause lies in a long-standing architectural vestige: the “Free-For-Teacher” (FFT) accounts. Designed during Canvas’s early growth phase to democratize learning, these accounts allowed individual educators to spin up courses without institutional oversight. However, these FFT instances were hosted on the same multi-tenant infrastructure as the premium, enterprise-level shards used by major universities. Security analysts have confirmed that ShinyHunters exploited an overlooked legacy vulnerability within the FFT authorization layer, which allowed for cross-tenant data leakage. By compromising a single “Free-For-Teacher” entry point, the attackers were able to scrape 3.65 terabytes of data across the broader Canvas ecosystem.

The “Free-For-Teacher” Achilles’ Heel

In the world of Software-as-a-Service (SaaS), the strength of a platform is often dictated by its weakest legacy feature. The FFT accounts lacked the rigorous Multi-Factor Authentication (MFA) mandates and Single Sign-On (SSO) integrations that protect modern university tenants. By leveraging a series of “insecure direct object reference” (IDOR) vulnerabilities, the ShinyHunters were able to escalate their privileges from simple “teachers” to system-level observers.

  • Data Exfiltrated: 3.65 Terabytes of raw database backups.
  • Affected Users: Approximately 275 million students, faculty, and alumni.
  • Institutional Impact: Over 8,800 schools across North America, Europe, and Asia.
  • Data Composition: Names, institutional email addresses, student ID numbers, and encrypted (yet potentially crackable) metadata.

The Billion-Message Ultimatum: Why This Breach is Different

Most cyberattacks focus on financial records or Social Security numbers—high-value PII that can be sold on BreachForums or used for identity theft. However, ShinyHunters has pivoted to a far more volatile form of leverage: “billions of internal private messages.” These communications, spanning several years, include faculty-to-student discussions, peer-to-peer messages, and sensitive administrator-level threads. In the context of a university, these messages are the “DNA” of the institution’s social and political life.

The threat of releasing this “Mother of All Academic Leaks” has sent a shiver through the Ivy League. For universities like Stanford and UPenn, the concern isn’t just about data compliance; it’s about the potential for catastrophic reputational damage. Internal messages often contain:

  1. Sensitive admissions discussions and internal candidate rankings.
  2. Confidential disciplinary reports and academic integrity investigations.
  3. Personal interpersonal drama between students and faculty.
  4. Unpublished research data and intellectual property exchanges.

The “internet archaeologists” of the digital age are already salivating at the prospect of mining this data for decades. This is not just a leak; it is a permanent recording of the private lives of millions of young adults and the mentors who guide them. ShinyHunters understands that while a student ID can be replaced, the “immortality” of a compromised private thought is a permanent stain.

Pedigree of a Predator: ShinyHunters Since ’19

The group behind this siege is no amateur outfit. Having been “rooting systems since ’19,” ShinyHunters has built a resume of destruction that rivals state-sponsored actors. In 2024, they were responsible for the Ticketmaster and AT&T breaches, which exposed the data of over 560 million and 110 million customers, respectively. Their signature move—moving laterally through cloud storage environments like Snowflake—has been refined into a surgical art form.

The 2026 Canvas siege represents an evolution of their tactics. Rather than merely dumping the data for a quick sale, they are engaging in a “Canvas Siege”—a prolonged, public extortion campaign designed to embarrass the target into submission. Their use of Tox-encrypted communication channels allows them to remain untouchable by federal authorities, providing a secure “negotiation room” where they dictate terms. The group has historically been associated with “The Com,” a loose network of cybercriminals including factions of Scattered Spider and Lapsus$, known for aggressive social engineering and “vishing” (voice phishing).

Finals Week Paralyzed: The Human Cost

The timing of the breach could not be more malicious. By striking in early May, ShinyHunters has effectively paralyzed finals week for millions of students. With Canvas portals intermittently offline or defaced, the infrastructure of modern grading has collapsed. Instructure has been forced to take drastic measures, including the permanent shutdown of the Free-For-Teacher program and the forced re-authorization of thousands of API integrations.

For students, the anxiety is twofold. On one hand, the inability to submit final projects or access study materials is jeopardizing their academic standing. On the other, the May 12 deadline hangs over their heads like a guillotine. If the database is leaked tomorrow, the private conversations of an entire generation of students will be indexed by search engines. The “walled garden” of the university has been breached, and the external world is looking in.

The Institutional Dilemma

University administrators are now caught between a rock and a hard place. Paying the ransom is a violation of the ethical (and often legal) standards of public institutions. Furthermore, there is no guarantee that ShinyHunters will actually delete the data. History suggests that once data is exfiltrated, it is rarely “gone” forever. However, refusing to pay ensures the leak will happen. The ShinyHunters Canvas breach has forced a debate on whether EdTech companies like Instructure should be held to the same security standards as banking institutions, given the sensitive nature of the sociological data they hold.

The May 12 Countdown: What to Expect

As we approach the final hours before the May 12 deadline, the security community is watching the group’s Tox channels with bated breath. Intelligence suggests that if the payment—likely in the tens of millions of dollars in Bitcoin—is not made, the leak will occur in “waves.” The first wave is rumored to contain a sample of 100,000 “high-interest” messages from elite institutions to maximize media coverage and pressure.

Security experts are advising the following immediate actions for institutions and individuals:

  • API Key Audits: Organizations must immediately revoke and regenerate all Canvas API tokens, particularly those linked to third-party LTI integrations.
  • Credential Hardening: Every user should assume their institutional email and ID are compromised and update passwords for all sensitive accounts (especially those using the same credentials).
  • Monitoring for Personalized Phishing: The stolen data provides the perfect “scrip” for highly targeted phishing attacks against faculty and administrators.
  • Data Minimization: Moving forward, the “private” nature of LMS messaging must be re-evaluated. If it isn’t encrypted end-to-end, it isn’t private.

Conclusion: The End of Academic Privacy?

The ShinyHunters Canvas breach is a watershed moment for EdTech. It exposes the fallacy that academic data is somehow “less valuable” or “less targeted” than financial data. In 2026, information is the ultimate currency, and the private thoughts of the next generation of leaders are apparently worth millions to the right buyer. Whether the leak occurs tomorrow or a deal is struck in the shadows of a Tox channel, the “Canvas Siege” has already changed the landscape of education forever. The immortality of academic private messages is no longer a theory—it is a threat. As the world watches the clock strike midnight on May 12, the only certainty is that the “walled garden” of the university has been permanently leveled.

Posted in Internet Curiosities, Resources & Culture | Tagged , , , | Leave a comment

Federal AI Vetting Mandates: White House Implements Emergency Cyber-Security Measures

The era of “permissionless innovation” in Silicon Valley has officially met its match in the form of national security necessity. On May 7, 2026, the White House signaled a definitive end to the federal government’s historically laissez-faire approach to artificial intelligence. With the finalization of a landmark executive order, the administration is set to impose Federal AI Vetting Mandates that will require developers of “frontier” models to submit their code for rigorous government inspection before a single line is released to the public. This pivotal policy shift is not merely a bureaucratic expansion; it is a defensive reflex triggered by the arrival of AI models that can now outpace human hackers and defenders alike.

The Catalyst: Claude Mythos and the Dawn of Autonomous Offense

The sudden urgency in Washington can be traced back to a series of alarming reports from the Center for AI Standards and Innovation (CAISI) and internal telemetry from Anthropic regarding its latest model, Claude Mythos. Unlike its predecessors, which required significant human prompting to identify software flaws, Mythos has demonstrated an unprecedented capacity for autonomous vulnerability research and exploitation. According to recent evaluations by the UK’s AI Security Institute (AISI), Claude Mythos became the first model to complete the “The Last Ones” (TLO) benchmark—a 32-step corporate network attack simulation—from initial reconnaissance to full network takeover without human intervention.

The technical data is staggering. Mythos scored a 93.9% on SWE-bench Verified, a standard evaluation for autonomous software engineering. More importantly, it successfully identified and weaponized a 17-year-old remote code execution flaw in FreeBSD’s NFS server and a 27-year-old crash vulnerability in OpenBSD—flaws that had survived decades of human-led security audits. These “ghost in the machine” vulnerabilities represent a new class of risk where AI systems can perform deep-tier binary analysis at a scale and speed previously unimaginable. The reality of Federal AI Vetting Mandates is a direct response to this “autonomous offensive threshold,” where the cost of finding a zero-day vulnerability has effectively dropped to the price of a GPU compute cycle.

Unpacking the Mandates: Weights, Telemetry, and Raw Access

The new executive order moves beyond voluntary safety pledges. Under the Federal AI Vetting Mandates, developers such as OpenAI, Google DeepMind, and Anthropic will be required to provide CAISI and other federal agencies with “pre-deployment access” to their most capable systems. This access is not limited to a simple chat interface; it involves a three-pronged technical audit:

  • Model Weights and Architecture: Federal scientists will have access to the underlying weights of frontier models to perform “white-box” testing, allowing them to understand the mathematical representations that lead to specific behavioral outputs.
  • Behavioral Telemetry: Developers must provide real-time data on how models respond to “adversarial scaffolding”—internal tools used to push models toward prohibited tasks like malware generation or chemical weapon synthesis.
  • “Raw” Model Probing: Agencies will test models with safety guardrails stripped back to determine the latent capabilities of the system. This is intended to prevent “jailbreaking” scenarios where sophisticated actors bypass external filters to access a model’s core hacking logic.

This “quasi-licensing” regime marks a significant departure from the deregulation-heavy stance seen throughout 2025. While the current administration initially revoked many Biden-era AI safety protocols, the sheer potency of tools like GPT-5.4-Cyber—which features specialized binary reverse engineering capabilities—has forced a return to stringent oversight. Federal AI Vetting Mandates are now viewed as the only way to ensure “security-by-design” in an era where software is increasingly written and secured by non-human agents.

The 72-Hour Race: Countering “Negative Time-to-Exploit”

Coinciding with the AI vetting requirements, U.S. cyber officials announced on May 6, 2026, a drastic compression of the mandatory patching window for government IT systems. The Cybersecurity and Infrastructure Security Agency (CISA) is moving to reduce the remediation timeline for critical vulnerabilities from an average of 14 days to just three days (72 hours). This measure is a direct counter-tactic to what security experts call “negative time-to-exploit.”

In the traditional cybersecurity lifecycle, a vulnerability is discovered, a patch is developed, and defenders have a window of several weeks to deploy that patch before widespread exploitation begins. In 2026, that window has vanished. Tools like the rumored GPT-5.4-Cyber can ingest a vulnerability disclosure and produce a working exploit in minutes—sometimes even identifying the flaw before the vendor’s patch is publicly available. According to data from Flashpoint, the time between disclosure and weaponization has plummeted by 94% since 2021. By mandating a 72-hour response, the government is attempting to keep pace with an attack cycle that now operates at machine speed.

Key Metrics of the New Defense Posture:

  1. Attack Lateral Movement: Automated AI systems can move across a compromised network in under 30 seconds.
  2. Zero-Day Discovery: Mythos-class models can identify thousands of new vulnerabilities across major OS kernels in a single weekend.
  3. Patch Deployment Goal: Critical systems must be remediated or mitigated within 72 hours of a CISA “Known Exploited Vulnerability” (KEV) listing.

A Political Highwire: Innovation vs. National Security

The implementation of Federal AI Vetting Mandates has placed tech giants in a precarious position. On one side, the commercial pressure for rapid deployment is immense; being first to market with a more “agentic” AI can result in billions of dollars in market capitalization. On the other, the risk of releasing a model that could be used to take down a power grid or collapse a financial network has made the federal government an unwelcome, but necessary, partner in the development process.

Anthropic, in particular, has found itself at the center of this storm. While the company has been praised for its transparency regarding Claude Mythos, it has also faced a “supply-chain risk” designation from the Department of Defense, effectively barring it from certain military contracts while civilian agencies are simultaneously being urged to use a modified version of the same model for defense. This “Schrödinger’s AI policy” reflects a government that is both terrified of what these models can do and desperate to harness their power for its own cyber-defense.

Critics of the mandates, including some Silicon Valley libertarians and AI researchers, argue that these regulations will drive development underground or to adversary nations like China, which are pursuing their own “AI-first” military doctrines. However, the administration’s stance is clear: the risk of a systemic collapse of cyberspace, driven by autonomous AI agents, outweighs the costs of regulatory friction. The move toward Federal AI Vetting Mandates suggests that the government no longer views AI as a mere software tool, but as a dual-use technology akin to nuclear energy—too powerful to be left entirely in private hands.

Conclusion: The End of the Reactive Era

The twin policies of Federal AI Vetting Mandates and accelerated 72-hour patching represent the most significant overhaul of U.S. cybersecurity policy in a generation. We are transitioning from a reactive era—where we patched what was already broken—to a proactive era defined by security-by-design and real-time infrastructure defense.

As we move deeper into 2026, the success of these measures will depend on the technical capacity of federal agencies to actually perform the vetting they are mandating. If CAISI cannot keep up with the speed of innovation at OpenAI or Google, the mandates risk becoming a bottleneck that stifles American leadership in AI. However, if these mandates successfully create a “defensive moat” around critical infrastructure, they may provide the stability necessary for AI to reach its full potential without triggering a global security catastrophe. The Federal AI Vetting Mandates are not just a new set of rules; they are a declaration that in the age of the autonomous exploit, the old ways of defending the digital world are officially dead.

Posted in Breaking Tech News, Technology & AI | Tagged , , , | Leave a comment

Claude Mythos Controversy: Anthropic’s Security Pivot and Project Glasswing

In the high-stakes theater of Silicon Valley, May 2026 has emerged as a defining watershed for the global artificial intelligence landscape. While the industry has long anticipated the arrival of “frontier-level” reasoning, the sudden and aggressive strategic pivot by Anthropic has fundamentally rewritten the rules of the game. At the heart of this transformation lies the Claude Mythos controversy, a storm of technical awe and ethical friction that has seen the once-reclusive safety lab transition into an infrastructure-level hegemon with a reported annual run rate (ARR) of $44 billion.

The controversy was ignited on May 7, 2026, when Anthropic officially confirmed that it would not release its most powerful model to date, codenamed “Mythos,” to the general public. Citing internal evaluations and independent audits from the UK AI Safety Institute (AISI), Anthropic described Mythos as a tool of such offensive potency that its unmitigated release would constitute a systemic risk to the global digital economy. The decision has fractured the AI community, setting off a debate that pits the pragmatism of national security against the ideals of open-source transparency.

The Claude Mythos Controversy: A Superhuman Breach of the Status Quo

The technical justifications for the Claude Mythos controversy are rooted in capabilities that the industry previously thought were years away. According to the 244-page system card released by Anthropic, Mythos possesses what researchers call “machine-speed vulnerability discovery.” While previous flagship models like Claude 3.5 Opus showed promise in assisting human developers, Mythos operates on an entirely different plane of autonomy.

Key technical benchmarks that have fueled the security debate include:

  • 72.4% Exploit Success Rate: In controlled environments, Mythos achieved a 72.4% success rate in developing working exploits for high-severity software flaws—a leap from the near-zero autonomous success rate of previous models.
  • 32-Step Attack Chaining: The UK AISI confirmed that Mythos is the first AI to successfully complete a 32-step simulated enterprise network attack, involving lateral movement and privilege escalation, without human intervention.
  • Discovery of “Ghost” Zero-Days: During its training and evaluation, the model identified critical vulnerabilities in every major operating system, including a 27-year-old flaw in OpenBSD and a 16-year-old bug in the FFmpeg video encoder that had survived five million automated scans by traditional security tools.

By withholding the model, Anthropic argues it is preventing a “vibe apocalypse”—a scenario where script kiddies and state actors alike could utilize the model to dismantle the aging, fragile architecture of the modern internet. However, critics like David Sacks have labeled this move as “regulatory capture via doomsday framing,” suggesting that the “too dangerous to release” narrative is a convenient shield for consolidating market power.

Project Glasswing: The Gated Garden of Elite Defense

Recognizing that a total blackout of Mythos would stifle defensive research, Anthropic launched Project Glasswing, a $100 million consortium that provides gated access to the model. The initiative, named after the transparent-winged butterfly that hides in plain sight, represents a new era of “collaborative exclusivity” in cybersecurity.

Project Glasswing comprises 40 high-tier partners, including the “Founding Twelve” who receive the deepest level of access for patching and red-teaming. Notable members include:

  • Big Tech: Apple, Microsoft, NVIDIA, and Google.
  • Finance: JPMorgan Chase and Goldman Sachs.
  • Infrastructure: Amazon Web Services (AWS), Cisco, and Palo Alto Networks.
  • Cybersecurity: CrowdStrike and the Linux Foundation.

Anthropic has committed $100 million in compute credits to these partners to scan and secure critical infrastructure before the capabilities of Mythos-class models inevitably proliferate. While this collaborative defense is designed to give the “good guys” a six-to-twelve-month head start, it has raised uncomfortable questions about who decides which 40 companies are worthy of holding the world’s most powerful digital shield.

The Geopolitical Rift: Anthropic vs. The Pentagon

The tension surrounding Anthropic’s new identity reached a fever pitch on May 8, 2026, when reports surfaced that the U.S. Pentagon had officially blacklisted the company. The friction stems from Anthropic’s refusal to include “all lawful use” terms in its government contracts—a clause that would essentially allow the Department of Defense (DoD) to use Claude for autonomous weapons targeting or mass domestic surveillance.

Defense Secretary Pete Hegseth reportedly criticized Anthropic’s leadership during a Senate hearing, referring to CEO Dario Amodei as an “ideological lunatic” for attempting to impose corporate ethical guardrails on national defense. In a move that highlights the diverging paths of AI giants, Google’s Gemini was simultaneously granted an expansive, multi-billion dollar agreement for any lawful government purpose, positioning Google as the Pentagon’s preferred AI partner while Anthropic is relegated to the “supply chain risk” list—a designation usually reserved for foreign adversaries like Huawei.

This blacklist creates a paradoxical situation: the White House continues to rely on Anthropic for economic security and AI safety standards, while the military is barred from using the very tools that could protect its own networks from AI-augmented attacks. Anthropic has responded by filing a lawsuit in the U.S. District Court, seeking to overturn the “supply chain risk” label and asserting that its Constitutional AI framework is not a threat to national security, but a prerequisite for it.

Commercial Dominance: Scaling to a $44 Billion ARR

Despite the geopolitical and ethical headwinds, Anthropic’s commercial engine is operating at an unprecedented scale. Financial data released this week confirms an annual run rate (ARR) of $44 billion for Q1 2026, a staggering five-fold increase from its $9 billion ARR at the end of 2025. This growth is largely attributed to the successful transition from a research lab to an enterprise-first infrastructure provider.

Claude Code: The Revenue Engine

The primary driver of this explosive revenue is Claude Code, specifically its “Auto Mode” feature. By moving beyond simple chat interfaces to fully autonomous agentic workflows, Anthropic has captured 73% of the first-time enterprise AI market in early 2026. Claude Code Auto Mode allows developers to set high-level objectives—such as “migrate this entire legacy codebase to Rust”—while the AI autonomously handles the file edits, test executions, and debugging, requiring human approval only at critical security checkpoints.

Infrastructure and the “SpaceX” Partnership

To support the massive compute requirements of a $44 billion business, Anthropic has moved beyond its traditional reliance on Amazon and Google. This week, the company announced a landmark infrastructure partnership with SpaceX to utilize Starlink’s global laser-mesh network for “low-latency edge inference,” ensuring that Claude Security tools can run locally on critical infrastructure anywhere in the world. This, combined with a 5-gigawatt compute deal with Amazon and new TPU clusters from Google, signals that Anthropic is no longer just a software company, but a fundamental layer of the global computing stack.

The Ethics of the AI-Security Complex

The Claude Mythos controversy forced a realization that “Safety” and “Security” are becoming the same word in the AI era. Anthropic’s decision to move Claude Security into public beta for enterprise customers—while keeping the underlying Mythos model gated—suggests a strategy of “controlled empowerment.” The enterprise tools provide real-time vulnerability scanning and automated patching, effectively turning every corporate developer into a high-tier security researcher.

However, the ethical debate remains: Is Anthropic creating a monopoly on truth? If only a select consortium of 40 companies can see the vulnerabilities Mythos finds, the rest of the world remains in a state of “security by ignorance.” Critics argue that this creates a tiered internet where the wealthy and well-connected are immune to AI-driven exploits, while small businesses and open-source projects are left exposed.

Anthropic’s $4 million donation to open-source security organizations is a gesture toward mitigating this disparity, but it is a drop in the ocean compared to the $100 million in credits gifted to its Glasswing partners. The industry is watching closely to see if Anthropic’s “Responsible Scaling Policy” can survive the pressures of a $1 trillion valuation and the demands of an increasingly aggressive U.S. defense establishment.

Conclusion: The Architecture of the Post-Mythos World

As of May 2026, Anthropic has successfully reinvented itself as the arbiter of the AI-Security complex. By navigating the Claude Mythos controversy with a blend of extreme caution and aggressive commercial scaling, the company has proven that “Safety” is the ultimate enterprise product.

The coming months will determine if Project Glasswing can actually close the window of vulnerability before rival models—like OpenAI’s “Spud” or Google’s Gemini 4.0—reach the same offensive threshold. For now, the world remains in a state of tense equilibrium, waiting to see if the transparent wings of the Glasswing butterfly can truly protect the digital world, or if they are simply a beautiful mask for a new kind of corporate control. One thing is certain: the $44 billion ARR proves that in 2026, security isn’t just a feature—it’s the only currency that matters.

Posted in Artificial Intelligence, Technology & AI | Tagged , , , | Leave a comment

World Password Day 2026: Transitioning to Passkeys and Quantum-Resistant Security

Today, May 7, 2026, marks a pivotal moment in the history of digital identity. As we observe World Password Day 2026, the global cybersecurity community is not just celebrating better hygiene; we are officiating the funeral of the shared secret. For decades, the “password”—a string of characters known by both user and server—has been the structural weak point of the internet. But as of this morning, that era has effectively ended.

The landscape of 2026 is defined by a fundamental paradigm shift toward asymmetric-cryptography-based authentication, spearheaded by the universal adoption of FIDO2 passkeys and the urgent migration to quantum-resistant algorithms. Driven by the terrifying efficiency of Agentic AI and the looming shadow of “Q-Day,” the industry is moving toward a “connect instead of login” model. This transition is no longer a luxury for the tech-literate; it is a mandatory survival strategy in an age where AI-driven phishing can bypass traditional human-centric defenses in milliseconds.

The Passkey Revolution: Why World Password Day 2026 is Different

For the first time since the inception of the internet, the majority of global web traffic is authenticated without a single password being exchanged. On World Password Day 2026, data from the FIDO Alliance indicates that over 87% of enterprises have now deployed passkeys as their primary authentication method. This shift is rooted in the technical superiority of WebAuthn and the FIDO2 protocol, which utilizes public-key cryptography.

Unlike a password, a passkey consists of a cryptographic key pair. The private key remains securely stored within a device’s hardware—such as a Trusted Platform Module (TPM) or a Secure Enclave—and is never shared with the service provider. Only the public key is stored on the server. When a user “connects,” the server sends a challenge that the device signs using the private key. Because the private key never leaves the device, the entire class of “credential stuffing” and “database breach” attacks—where hackers steal lists of passwords—has been effectively neutralized. Leading innovators like Ledger and Google have successfully transitioned to this model, advocating for a future where identity is rooted in possession and biometrics rather than memorized secrets.

NIST SP 800-63-4 and the Rise of Synced Passkeys

A critical technical milestone reached just months ago was the finalization of NIST SP 800-63-4. This updated guideline formally recognizes “synced passkeys” (those backed up to a cloud provider like iCloud or Google Password Manager) as meeting Authenticator Assurance Level 2 (AAL2). This reclassification cleared the path for government agencies and highly regulated industries to abandon legacy MFA in favor of the seamless, phishing-resistant experience that passkeys provide. For high-security environments, device-bound passkeys (FIDO2 L3) remain the standard, ensuring that the private key is physically tethered to a specific hardware security key.

Agentic AI and the Death of Traditional Phishing

The urgency of 2026’s security overhaul is largely a response to the evolution of Agentic AI. In late 2025 and early 2026, threat actor groups like “DireWolf” and “The Gentlemen” began deploying autonomous AI agents that do not merely send phishing emails but engage in multi-turn, interactive social engineering. These agents can plan, act, and adapt in real-time, scraping a target’s social media and professional history to craft “prompt paths” that lead even the most cautious users to expose sensitive session tokens.

Traditional “shared secret” passwords are a feast for these autonomous systems. An AI agent can test millions of variations of a stolen credential or use deepfake audio to convince a helpdesk employee to reset a password in seconds. By removing the password from the equation entirely, World Password Day 2026 represents the implementation of a “Zero Knowledge” architecture. If there is no password to guess or social engineer, the AI’s primary weapon is rendered useless at the front door.

Quantum-Resistant Authentication: Securing the Future

While we fight the AI-driven threats of today, the cybersecurity industry is also bracing for the quantum threats of tomorrow. Aligning with the White House’s 2026 Cyber Strategy, organizations are aggressively moving toward Post-Quantum Cryptography (PQC). The release of Quantum-Resistant Platform (QPA) v2 on May 6, 2026, marked a significant milestone, allowing firms to automate the migration from RSA and ECC to NIST-standardized algorithms like ML-KEM (formerly Kyber) and ML-DSA (formerly Dilithium).

The threat is not just theoretical. Adversaries are currently practicing “Harvest Now, Decrypt Later” (HNDL), vacuuming up encrypted data today with the intent of breaking it once a Cryptographically Relevant Quantum Computer (CRQC) becomes viable. In response, the Pentagon recently announced a massive initiative to harden the cryptographic systems of the F-35 Lightning II, replacing legacy encryption with lattice-based PQC modules. For the average user, this means ensuring that communication tools utilize “quantum-agile” encryption, where the underlying math can be swapped out as new quantum-safe standards emerge.

The Evolution of MFA: Beyond “Push Bombing” and “EvilTokens”

Standard Two-Factor Authentication (2FA) is currently undergoing a painful but necessary evolution. In early 2026, a surge in “MFA Fatigue” (also known as push bombing) and “EvilTokens” attacks exposed the vulnerabilities of traditional second factors. EvilTokens represents a sophisticated device-code phishing attack that uses AI-driven automation to hijack the OAuth 2.0 device authorization grant flow.

  • The Mechanism: Attackers trick a user into entering a legitimate Microsoft or Google device code on a real login page.
  • The AI Twist: New 2026 variants of these attacks use AI to generate “just-in-time” codes, circumventing the standard 15-minute expiration window by monitoring user activity in real-time.
  • The Solution: Security experts now strictly recommend hardware security keys (like YubiKeys) or platform authenticators over SMS or standard push notifications. These hardware-backed methods require physical proximity or a biometric check that AI cannot spoof.

Doxxing Prevention and the Online Privacy Act of 2026

Securing the login is only half the battle; protecting the identity behind the login is the other. World Password Day 2026 arrives amidst a landmark legal landscape. On May 6, 2026, a federal guilty plea was entered in a high-profile judicial doxxing case, highlighting the severe real-world consequences of Personally Identifiable Information (PII) exposure. This follows the enactment of the Online Privacy Act of 2026, which established doxxing as a federal offense and created the Digital Privacy Agency to oversee data broker compliance.

Best practices in 2026 emphasize “data footprint reduction.” Proactive users are now utilizing automated PII removal tools to scrub their home addresses, phone numbers, and family details from the thousands of data broker sites that fuel modern doxxing and targeted AI phishing. The goal is to make the “human surface area” as small as possible.

A Warning on Encryption Rollbacks

Despite the push for stronger security, 2026 has also seen regressions. Notably, as of today, May 7, 2026, Instagram DMs have officially removed End-to-End Encryption (E2EE) support for the majority of its user base. This serves as a stark reminder that while the industry moves toward better authentication (passkeys), data privacy and the “content” of our communications remain at the mercy of platform policies. Users are urged to move sensitive conversations to platforms that maintain a commitment to E2EE and quantum-resistant standards.

Actionable 2026 Security Checklist

To celebrate World Password Day 2026, security leaders recommend a four-step audit to modernize your digital defense:

  1. Immediate Passkey Transition: Audit your primary accounts (Financial, Healthcare, Social Media). If the service supports FIDO2/WebAuthn, delete your password and replace it with a passkey. Use a dedicated password manager that supports passkey synchronization across all your 2026 devices.
  2. Eliminate Legacy MFA: Disable SMS-based 2FA. In the age of AI-automated SIM swapping and device-code hijacking, SMS is a liability. Move to FIDO2-compliant hardware keys or biometric-backed authenticator apps.
  3. Implement “Quantum-Agile” Tools: For file storage and messaging, ensure your providers have updated to PQC standards (ML-KEM/Kyber). If your current cloud provider hasn’t announced a PQC roadmap by mid-2026, consider migrating your most sensitive data.
  4. Execute a PII Exposure Scan: Use an automated “DeleteMe” style service to identify where your personal data is hosted. Reducing your public data footprint is the most effective way to prevent the personalized AI phishing lures that are currently bypassing traditional filters.

As we navigate the complexities of 2026, the message is clear: the era of the human mind acting as a security vault is over. By embracing the shift to passkeys and quantum-resistant protocols, we move toward a more resilient, automated, and ultimately safer digital existence. The “Shared Secret” is dead; long live the Asymmetric Key.

Posted in Data Protection, Security & Privacy | Tagged , , , | Leave a comment

Anthropic SpaceX Compute Deal Doubles Claude Code Usage Limits

In a move that has recalibrated the power dynamics of the artificial intelligence sector, Anthropic and SpaceX have formalized an unprecedented infrastructure alliance. Announced on May 6, 2026, the Anthropic SpaceX compute deal grants the safety-focused AI firm exclusive access to the total capacity of SpaceX’s “Colossus 1” data center in Memphis, Tennessee. This facility, a sprawling 300-megawatt titan housing over 220,000 NVIDIA GPUs, represents one of the largest concentrated clusters of AI compute on the planet.

The deal is more than a simple vendor-client contract; it is a strategic “frenemy” pact. Elon Musk, whose xAI venture was absorbed into the broader SpaceX ecosystem earlier this year, has historically been a vocal critic of Anthropic’s “Constitutional AI” approach, once labeling the firm “misanthropic.” However, following intensive meetings with Anthropic’s leadership last week, Musk shifted his stance, publicly stating he was “impressed” by their commitment to safety and human alignment. “Everyone I met was highly competent and cared a great deal about doing the right thing,” Musk noted on X (formerly Twitter), marking a stunning détente in the Silicon Valley AI wars.

The Memphis Colossus: A Technical Breakdown of the Anthropic SpaceX Compute Deal

The scale of the Anthropic SpaceX compute deal is difficult to overstate. By securing the entirety of Colossus 1, Anthropic has essentially doubled its immediate inference and training headroom. The facility’s hardware stack is a formidable mix of cutting-edge silicon, including:

  • 220,000+ NVIDIA GPUs: A blend of H100, H200, and the recently deployed Blackwell GB200 architectures.
  • 300-Megawatt Power Envelope: Supported by dedicated electrical infrastructure to ensure zero-latency power delivery.
  • Liquid Cooling at Scale: The facility utilizes advanced closed-loop liquid cooling to maintain optimal performance for the high-density GPU racks.
  • Starlink Backhaul: Integration with SpaceX’s satellite network provides redundant, high-speed data pathways for global API delivery.

This massive influx of hardware allows Anthropic to pivot from a period of “compute rationing” to one of “compute abundance.” For the past six months, the high demand for agentic workflows had forced Anthropic to implement strict usage limits and “peak hour” throttling. With Colossus 1 now serving as a dedicated engine for the Claude ecosystem, those bottlenecks have effectively vanished.

Immediate Impact: Doubling Claude Code and the API Explosion

The first tangible beneficiaries of this partnership are the developers and enterprise teams relying on Claude Code, Anthropic’s terminal-based agentic tool. Effective immediately, the company has doubled the five-hour rate limits for all Pro, Max, Team, and seat-based Enterprise subscribers. This move addresses the “token-hungry” nature of autonomous coding agents, which often require hundreds of thousands of tokens to scan entire repositories and execute multi-step refactoring tasks.

Furthermore, Anthropic has entirely removed the controversial “peak hour” reductions for its premium tiers. In the API layer, the results are even more dramatic. The Claude Opus model series has seen its rate limits increased by over 900% in certain tiers. For instance, Tier 1 users have seen a 1,500% increase in maximum input tokens per minute, allowing for massive context ingestion that was previously reserved for custom enterprise contracts. This democratization of high-throughput compute is expected to spark a new wave of autonomous enterprise applications that operate in near real-time.

Beyond Terrestrial Limits: The Orbital AI Frontier

While the Memphis deal focuses on ground-based infrastructure, the joint announcement included a forward-looking provision that signals the next era of scaling: orbital AI compute capacity. Both companies expressed a shared intent to develop “gigawatt-scale” data centers in low Earth orbit (LEO).

The logic behind space-based AI is both thermal and economic. By placing GPU clusters in orbit, firms can leverage the vacuum of space for passive cooling and tap into direct solar energy without the constraints of terrestrial power grids. SpaceX, which is currently filing for FCC permission to launch a million-satellite constellation for data processing, views Anthropic as the ideal partner to test these high-compute payloads. For Anthropic, moving inference closer to the “edge” (via Starlink’s global coverage) could reduce latency for users in remote or underserved regions, creating a truly global AI utility.

The “Mythos” Safety Saga and Project Glasswing

The timing of the Anthropic SpaceX compute deal is not accidental. It follows a week of internal turmoil regarding the “Mythos” model—Anthropic’s latest frontier system. During final safety stress tests, Mythos demonstrated a “terrifying” proficiency in identifying zero-day vulnerabilities across every major operating system and web browser. The model didn’t just find bugs; it autonomously developed exploits that could penetrate critical infrastructure.

In an act of “responsible restraint” that has drawn both praise and criticism, Anthropic voluntarily postponed the general release of Mythos. Instead, the company launched Project Glasswing, a high-security initiative that provides limited access to the model for roughly 50 vetted partners, including Apple, Google, Microsoft, and the Linux Foundation. The goal is to use Mythos as a defensive shield—fixing the vulnerabilities it discovered before they can be exploited by bad actors.

The White House, under the current administration, has been heavily involved in these discussions. Reports suggest that the government initially opposed widening access to Mythos due to the risk of a “leak” that could compromise national security. The SpaceX deal provides Anthropic with the “breathing room” to maintain its current services at high availability while dedicating a significant portion of its previous compute clusters to the ongoing safety hardening of the Mythos series.

The Geopolitical Tug-of-War

Despite the collaboration on Mythos safety, Anthropic remains in a complex position with the federal government. Earlier this year, the Pentagon designated Anthropic as a “supply chain risk” after the company refused to allow its models to be used for autonomous weapons targeting. This blacklisting led to a freeze in several defense-related contracts.

The partnership with SpaceX—a primary contractor for the U.S. Space Force and NASA—creates a fascinating paradox. While one arm of the government views Anthropic as a risk, another relies on the very infrastructure (SpaceX) that is now the backbone of Anthropic’s operations. This tension highlights the struggle for “strategic AI sovereignty,” where the lines between private infrastructure, national security, and global safety are increasingly blurred.

The Era of “Dreaming” and Agentic Persistence

Amidst the infrastructure news, Anthropic also unveiled a revolutionary new feature called “Dreaming.” Powered by the newly unlocked capacity of the Anthropic SpaceX compute deal, Dreaming allows Claude models to review their work “between sessions.”

  1. Contextual Reflection: The model analyzes previous user interactions to identify patterns and preferences without active user input.
  2. Background Optimization: Claude can now autonomously update a user’s persistent context files, optimizing its internal “understanding” of a developer’s specific coding style or a company’s documentation.
  3. Autonomous Refinement: If a coding task was left unfinished, the model can spend its “off-duty” compute cycles (the Dreaming phase) exploring potential solutions, which are then presented to the user upon their return.

This shift from “reactive AI” to “persistent agents” represents the next evolution in productivity. By using the surplus compute from Colossus 1 to power these background cycles, Anthropic is moving closer to a world where AI is not just a tool you use, but a teammate that works while you sleep.

Conclusion: A New Baseline for AI Scaling

The Anthropic SpaceX compute deal is a watershed moment for the industry. It proves that even the most bitter ideological rivals can find common ground when the stakes—and the compute requirements—are high enough. By securing 220,000 GPUs in a single facility, Anthropic has signaled that it is no longer content to be the “small, safe lab” in the shadow of giants like OpenAI or Google.

With an annualized revenue approaching $19 billion and a dominant 70% share of U.S. business AI chat subscriptions, Anthropic is now an infrastructure superpower in its own right. As the Mythos model continues its journey through the safety gauntlet and the first orbital compute modules prepare for launch, one thing is clear: the race for AGI will be won not just by those with the best algorithms, but by those with the most ambitious alliances and the largest power grids.

Posted in Artificial Intelligence, Technology & AI | Tagged , , , | Leave a comment

MuddyWater Social Engineering: Teams and MFA Manipulation Tactics

The cybersecurity landscape of 2026 has witnessed a profound shift in the operational philosophy of advanced persistent threats (APTs). Gone are the days when nation-state actors relied solely on zero-day vulnerabilities and complex buffer overflows to achieve their goals. Today, the human element has become the primary exploit. On May 6, 2026, a landmark analysis released by cybersecurity researchers revealed a sophisticated MuddyWater social engineering campaign that redefines the concept of “high-touch” intrusion. This operation, attributed to the Iranian Ministry of Intelligence and Security (MOIS), demonstrates how a state-sponsored group can bypass the world’s most robust multi-factor authentication (MFA) protocols not through code, but through conversation.

The group, also tracked as Mango Sandstorm, Seedworm, and Static Kitten, has moved beyond the “spray and pray” tactics of traditional phishing. Their latest campaign leverages trusted communication platforms like Microsoft Teams to engage in real-time psychological manipulation. By masquerading as internal IT support or corporate administrative personnel, MuddyWater agents initiate live interactions that conclude with the total compromise of enterprise SaaS environments. This editorial explores the technical nuances of this evolution, the “false flag” strategies used to obscure attribution, and the terrifying efficiency of MFA manipulation in the modern era.

The Anatomy of MuddyWater Social Engineering on Microsoft Teams

The pivot to Microsoft Teams represents a calculated strategic move. Unlike email, which is heavily scrutinized by Secure Email Gateways (SEGs) and automated sandboxes, Teams is often viewed as an “internal” and inherently trusted ecosystem. MuddyWater social engineering exploits this implicit trust by initiating external chat requests or utilizing compromised guest accounts to target employees. These are not automated bots; they are trained operators who engage in “high-touch” dialogue designed to build rapport and urgency.

According to the May 2026 report, the infection sequence typically follows a standardized but highly effective kill chain:

  • Initial Engagement: The attacker contacts the victim via a Teams message, often using a “technical assistance” lure. They may claim that the user’s account has a security anomaly or that a mandatory software update is required.
  • The Screen-Sharing Pivot: The attacker lures the victim into an interactive screen-sharing session. In several documented cases, they utilized Microsoft Quick Assist or legitimate remote monitoring and management (RMM) tools like AnyDesk.
  • Real-Time Credential Harvesting: During the session, the attacker visually monitors the user. In a startling display of audacity, researchers noted instances where attackers instructed victims to manually type their passwords into a local text file for “verification” or directed them to a pixel-perfect phishing page hosted on legitimate services like azurewebsites.net.

By staying “hands-on-keyboard” alongside the victim, MuddyWater eliminates the delays associated with traditional phishing, allowing them to act on stolen data within seconds.

Bypassing the Gold Standard: MFA Manipulation

The most alarming aspect of this campaign is its surgical precision in defeating Multi-Factor Authentication (MFA). While many organizations rely on MFA as a silver bullet against credential theft, MuddyWater has turned the user into the very tool that dismantles this defense. Because the attackers are in a live session with the victim, they can synchronize their login attempts with the user’s psychological state.

When the attacker attempts to log into an SSO-integrated SaaS application using harvested credentials, an MFA prompt is triggered on the victim’s device. The MuddyWater social engineering operator, still on the call or chat, instructs the victim to approve the prompt as part of the “repair process.” Alternatively, if a One-Time Password (OTP) is required, the attacker simply asks the victim to read it aloud or paste it into the chat. This “human-in-the-loop” interaction renders hardware tokens and push notifications moot, as the legitimate user is the one providing the final authorization. This method effectively grants the threat actor authenticated access without the need to steal persistent session tokens or break complex encryption.

The “False Flag” Strategy: Masquerading as Chaos Ransomware

To complicate incident response and delay geopolitical attribution, MuddyWater has adopted a sophisticated “false flag” persona. Throughout early 2026, the group has operated under the branding of the Chaos ransomware-as-a-service (RaaS) group. Chaos, a financially motivated cybercriminal entity that emerged in early 2025, is known for its aggressive double-extortion tactics and public data leak sites (DLS).

By mimicking the TTPs (Tactics, Techniques, and Procedures) of a criminal gang, MuddyWater achieves several objectives:

  1. Attribution Confusion: Incident responders may initially classify the breach as a standard ransomware event. This leads to a focus on recovery and negotiation rather than the long-term, stealthy data exfiltration typical of an espionage group.
  2. Psychological Pressure: By threatening to leak data on the Chaos DLS, the group forces the victim organization to focus on immediate damage control, providing a smokescreen for the group to establish deep persistence elsewhere in the network.
  3. Plausible Deniability: For the Iranian state, using the “Chaos” brand provides a layer of deniability. If the attack is traced back to a known RaaS portal, it is harder for international bodies to definitively link the activity to a specific government ministry.

Crucially, the researchers found that while MuddyWater claimed to have encrypted files and even listed victims on the Chaos leak site, they often forwent the actual encryption phase. The primary goal was the silent exfiltration of sensitive intelligence, with the “ransomware” threat serving as a loud, distracting decoy.

Technical Evolution: Off-the-Shelf Tools and Custom RATs

The technical analysis of the 2026 campaign highlights a significant evolution in MuddyWater’s arsenal. The group is increasingly utilizing a “dual-use” approach, blending legitimate administration tools with custom-built malware and Russian-made botnets to stay below the radar of signature-based detection systems.

Persistence via DWAgent and AnyDesk

Once initial access is secured through Teams, the group prioritizes long-term persistence. Instead of deploying high-signal custom backdoors immediately, they often install DWAgent, a legitimate, open-source remote management tool. Because DWAgent is used by many IT departments for genuine support, its presence on a workstation rarely triggers a “high” alert in an EDR (Endpoint Detection and Response) system. This allows MuddyWater to maintain a stable “beachhead” in the environment for weeks or months, exfiltrating data at a slow, methodical pace.

The Rise of CastleRAT and Tsundere

Perhaps the most surprising discovery in the May 6 report is MuddyWater’s involvement in the Russian “Malware-as-a-Service” (MaaS) ecosystem. Researchers identified the deployment of CastleRAT (a versatile remote access trojan) and the Tsundere botnet. These tools appear to be purchased from Russian underground forums, further blurring the lines between state actors and criminal syndicates.

Tsundere is particularly notable for its use of “EtherHiding”. This technique involves storing the Command-and-Control (C2) server addresses within Ethereum smart contracts. The malware queries the blockchain to retrieve its instructions, making the C2 infrastructure virtually impossible to take down through traditional domain blacklisting. The integration of CastleRAT, which was previously used by various e-crime groups, suggests that MuddyWater is willing to invest in high-quality, commercially available malware to reduce the unique digital footprint of the MOIS.

Custom Payload: Darkcomp (Game.exe)

Despite their reliance on off-the-shelf tools, MuddyWater still utilizes custom implants when necessary. The 2026 campaign saw the frequent deployment of a RAT dubbed Darkcomp (often disguised as Game.exe). This backdoor is highly modular, supporting:

  • Real-time command execution and shell access.
  • File system manipulation and automated staging of exfiltrated data.
  • Integration with CastleLoader for subsequent stage delivery.

Forensic teams were able to link these tools back to MuddyWater through a series of tactical errors, including the use of a code-signing certificate issued to “Donald Gay”—a marker previously associated with the group’s Fakeset downloader.

Strategic Defensive Recommendations for 2026

The success of the MuddyWater social engineering campaign underscores a critical vulnerability in modern defense: we have secured the pipes, but we have not secured the people. For SOC teams and CISOs, this threat requires a multi-layered response that extends beyond the traditional network perimeter.

1. Collaboration Platform Governance: Organizations must strictly control external communication on platforms like Microsoft Teams and Slack. This includes disabling external federation by default and requiring explicit administrative approval for guest access. Any request for screen-sharing from an external or unfamiliar “support” account should be automatically blocked and flagged for investigation.

2. Beyond MFA: Phishing-Resistant Identity: This campaign proves that push-based MFA and OTPs are no longer sufficient. Organizations should transition to FIDO2-based hardware security keys (like YubiKeys), which are inherently phishing-resistant because they require a hardware-level handshake with the legitimate domain. Without a physical tap on a pre-registered device, the attacker cannot complete the authentication, regardless of how much social engineering they employ.

3. RMM Tool Monitoring: Security teams must maintain an “allow-list” of authorized remote management tools. The presence of DWAgent, AnyDesk, or ScreenConnect on an endpoint that is not part of the official IT toolkit should trigger an immediate incident response. Behavioral analysis should be tuned to detect “hands-on-keyboard” activity originating from these tools outside of standard maintenance windows.

4. Advanced Threat Hunting: Defenders should proactively hunt for the “EtherHiding” indicators associated with the Tsundere botnet. Monitoring for unusual blockchain-related traffic (JSON-RPC calls to Ethereum nodes) from standard workstations can provide an early warning of a sophisticated implant attempting to reach its C2.

The May 2026 MuddyWater campaign is a stark reminder that as our technical defenses grow stronger, the adversary’s focus on the human psyche grows sharper. By combining the “high-touch” intimacy of Microsoft Teams chats with the “high-noise” distraction of Chaos ransomware, the MOIS has created a playbook that is as effective as it is deceptive. In this new era of “false flag” espionage, the most important firewall isn’t a piece of software—it is a well-trained, skeptical user who knows that a request to “share your screen” is often a request to “share your secrets.”

Posted in Security & Privacy, Threat Alerts | Tagged , , , | Leave a comment

GitHub Maintainer Month 2026: New Tools to Stop AI PR Spam

The digital commons is currently navigating its most turbulent era since the invention of the distributed version control system. On May 6, 2026, marking the commencement of GitHub Maintainer Month 2026, GitHub released a suite of tools that signifies a fundamental pivot in how we perceive open-source governance. This is no longer just about hosting code; it is about building a defense-in-depth architecture against a relentless “firehose” of AI-generated contributions that has threatened to bury the human maintainer under a mountain of machine-synthesized noise.

The statistics provided by GitHub’s telemetry are staggering. Over the past twelve months, agentic AI workflows—autonomous systems capable of scanning repositories, identifying perceived “improvements,” and submitting code—have nearly doubled the global volume of merged pull requests (PRs). This phenomenon, colloquially termed the “Eternal September of AI,” has pushed maintainer burnout to a critical threshold. To counter this, GitHub has moved beyond passive hosting and toward proactive Mastery, arming the “modern ninja” of software infrastructure with tools to throttle, archive, and secure the modern repository.

The Great Wall of Repository Governance: Granular Contribution Limits

Historically, maintainers had only blunt instruments at their disposal: they could lock a repository entirely or limit contributions to “collaborators only.” In the era of GitHub Maintainer Month 2026, such binary choices are no longer sufficient. The new Granular Contribution Limits feature introduces a programmable logic layer to repository access.

This utility allows maintainers to set sophisticated heuristics for incoming submissions. Rather than a total lockout, a project lead can now implement “Throttling Profiles.” For example, a maintainer can dictate that users with “Unverified” status or those whose accounts are less than 90 days old may only submit one pull request every 48 hours. This effectively programmatically throttles high-frequency submissions from automated scripts and unverified AI agents while keeping the door open for genuine human contributors who may be making their first meaningful contribution.

Technically, these limits are enforced at the API and Git-push levels. When an automated agent attempts to flood a repository with minor “linting” fixes—a common tactic for “star-farming” or “reputation-inflation” for AI bots—the system triggers a 429 Too Many Requests response or a rejected push notification. This allows the maintainer to maintain the “signal-to-noise ratio” without sacrificing the inclusive spirit that defines the open-source ecosystem.

PR Archiving: Preserving the Professional Aesthetic and Audit Trail

One of the most requested features by the open-source community has finally arrived: Pull Request Archiving. For years, closed pull requests remained as “ghosts” in the repository history—publicly visible, cluttering search results, and often serving as a billboard for spam or low-quality AI experiments. The 2026 update changes this by introducing a “hidden” state for closed PRs.

When a maintainer “archives” a PR, it is removed from the public-facing list of pull requests. To the casual observer and the search engine crawler, the repository looks pristine, focused only on active, high-quality development. However, the data is not deleted. This is a critical distinction for security and governance. The archived PR remains in an internal, auditable moderation record. If a PR was used as a vector for a social engineering attack or contained malicious obfuscated code, security researchers can still access the “tombstone” of that PR to perform forensic analysis.

The archiving tool effectively solves the “bot-bloat” problem. Maintainers can now “sweep” the repository clean, moving hundreds of low-effort AI suggestions into the archive in a single batch operation, restoring the professional aesthetic of their digital workspace while retaining a complete historical ledger of every interaction.

The Rise of the “Modern Ninja” Maintainer

The release of these tools marks the evolution of the maintainer role. In the early 2020s, a maintainer was primarily a coder and a reviewer. In 2026, the maintainer has become a Governor of Infrastructure. They must manage “machine speed” workflows where an AI can suggest a patch in milliseconds, but a human must still provide the “invisible work” of judgment, trust, and long-term architectural vision.

As GitHub officials noted during the launch, “As AI gets better at writing code, human work around code becomes more important and more invisible.” The goal of the 2026 updates is to make that work visible again by automating the “janitorial” tasks of repository management, allowing humans to focus on the high-level logic that models still struggle to grasp.

The 2026 Maintainer Partner Pack: Proactive Security Mastery

Beyond platform features, GitHub Maintainer Month 2026 includes the unveiling of the 2026 Maintainer Partner Pack. This is not merely a collection of discounts; it is a tactical kit designed for high-stakes software defense. The centerpieces of this pack are collaborations with Arachne Digital and Daytona, addressing the specific threat landscape created by frontier AI models.

  • Arachne Digital (Threat Intelligence): Open-source projects are increasingly targeted by sophisticated malware that uses AI to obfuscate its intent. Arachne Digital provides maintainers with project-specific threat intelligence reports. Their technology specializes in “deobfuscating” malicious JavaScript and mapping attack chains to the MITRE ATT&CK framework. By integrating Arachne’s reporting, maintainers can receive alerts when a PR contains code patterns that match known APT (Advanced Persistent Threat) behaviors, even if the code was “hallucinated” or intentionally hidden by an AI agent.
  • Mythos Defense (Anthropic Briefing): The industry is currently on high alert following the briefing of Anthropic’s “Mythos” model. Mythos has demonstrated a “shocking” ability to autonomously discover zero-day exploits in security-hardened systems like OpenBSD and major web browsers. To defend against this “industrialized” vulnerability discovery, the Partner Pack offers maintainers early-access defensive tools that use Mythos-class reasoning to “red-team” their own repositories before an adversary can.
  • Daytona (Hardened Environments): Executing untrusted code—especially code generated by an AI—is a significant security risk. Daytona is providing maintainers with compute credits for their “Secure Infrastructure for Running AI-Generated Code.” These are ephemeral, sandboxed development environments that spin up in under 90ms. This allows maintainers to test a PR in a completely isolated container with its own dedicated kernel and network stack. If the AI-generated code contains a “logic bomb” or an exfiltration script, it is contained within the Daytona sandbox, protecting the maintainer’s local machine and the project’s build pipeline.

Governance at Machine Speed: The New Normal

The shift toward software governance at machine speed is no longer optional. With agentic AI capable of performing complex technical tasks—such as a recent case where an AI implemented a vector extraction method in the vLLM library (12.5 million lines of code) in just seven hours—the bottleneck is no longer production; it is verification.

The tools launched during GitHub Maintainer Month 2026 reflect a realization that the open-source ecosystem is the foundation of global digital security. If the maintainers of these projects are overwhelmed, the entire supply chain becomes vulnerable. By providing Granular Contribution Limits, GitHub is giving maintainers the “throttling valve” they need to survive the AI surge. By providing PR Archiving, they are giving them the “organizational clarity” to keep their projects professional and focused.

Ultimately, these updates are about sovereignty. In a world where machines can generate infinite code, the human maintainer must have the ultimate authority to decide what is admitted, what is ignored, and what is archived. The “modern ninja” maintainer is now equipped with the digital weaponry needed to manage their repositories not as a manual laborer, but as a sophisticated architect of the automated future.

As we move through May 2026, the community will likely see a rapid adoption of these features. The projects that thrive will be those that embrace these automated guardrails, allowing their human leads to return to what sparked their joy in the first place: the creative act of building software that changes the world, rather than just managing the flood of the machine.

Posted in Recommended Software, Resources & Culture | Tagged , , , | Leave a comment