Category Archives: Security

MOCA loop and a CIA official just caught with $40m gold bars at home

This NYT report of a government credibility failure reads like Snowden all over again. A basic IT tech guy has been inflating his resume forever and getting away with it because apparently the people checking aren’t able to tell he’s all about the lies.

“After a CIA internal investigation identified potential violations of the law, CIA Director John Ratcliffe referred the information to the FBI for a law enforcement investigation,” the written statement said.

The system that vetted David Rush for one of the most sensitive clearances in government accepted his self-described pilot career and his diplomas without the checks catching that he’d never flown a plane or, apparently, earned the degrees. A claim to be a decorated test pilot yet he couldn’t even fly a plane? Caught after the horse left the barn? That’s so Snowden.

A 2004 commission validated his 2009 hiring, which validated the SES promotion, which validated the clearance. It’s trust turtles all the way down, trusting the credential the prior step accepted. No one tested the chain, no one looked at the tree, they all just re-verified a dead leaf by design.

Clearance reinvestigation confirms the continuity, and doesn’t re-open the original question. Periodic reinvestigation asks whether the same cleared person has nothing new. It never asks whether a 2009 filing was wrong, or the 2004 root was fake.

Twenty years is a long time for a lie to never get audited. Then again, that’s Trump even more than Snowden.

But to be precise, with regard to my integrity breach research, Rush highlights two control failures. The first is an 2009 adjudication that never tested the root claims, which is the automation of an absence. The second, same as Snowden, was reinvestigation pointed trust scope to the wrong question.

Both were IT staff holding access far past their station. Rush’s lies bought the access, and apparently he cashed it as actual gold. Snowden’s access came with being allowed a basic sysadmin role, and he spent it on dumping “grab it all, let fascist Greenwald sort it out” to elevate himself. Which he did, as if the infosec industry had no integrity checks to stop him. Same over-scope. Rush is fixed by running the damn check, then by scoping the fixed check. Snowden especially needed the scoping half of the fix.

The Rush and Snowden control-design problems we know well. In fact, over-permissioned agents are the scoping failure recommitted at machine speed today, so Snowden was foreshadowing of the stupidity of OpenClaw.

Trump is different, because trust is captured by the people paid to test it. This integrity breach should worry national security experts most, because it is the only one where the control itself changes nothing. The control has been captured and disallowed from ever pointing at the truth. Trump survives because the defect is the integrity breach of an operator’s incentive rather than their control.

That’s an org-design problem to ensure control is measured outside the control domain, familiar to ISO auditors and the entire premise of a MOCA (modify-observe-converge-act) loop for agents versus OODA and PDCA. Verification has to move outside the system being verified. The only fix is an independent check with no stake in fixing the answer, which is precisely the thing Trump’s entire Witch of Oz persona and his army of gilded golden monkeys are made to prevent.

Trump threats should sound familiar to anyone who has seen the classic Oz film, projected menace backed by a borrowed army.

Kermit exploit defeated Police AI. AudioHijack says there’s no cheap fix.

Not long ago, a Utah officer was legally turned into a frog. Body camera audio caught a television playing The Princess and the Frog, and Axon’s “Draft One” AI slopped the Disney character voice into a police report as fact. I called it the Kermit exploit. The system has no way to tell a participant’s statement from any sound in the room. Funny. Also the actual dumb AI architecture, exposed by its use in the field.

A new AudioHijack paper says the Kermit exploit is just the beginning, and the cheap defenses for what’s coming next all fail.

AI AudioHijack has no known countermeasures

AudioHijack was accepted to IEEE S&P 2026, the field’s top security venue, from researchers at Zhejiang University, NTU, and NUS. They built adversarial audio that hijacks audio-language models on command. This goes far beyond a speaker just bleeding into a microphone. We’re talking adversarial targeted engineered audio, optimized to steer a model’s output to whatever the attacker chooses.

The numbers are impressive: they looked at 13 models, 6 misbehavior categories, and hit 79 to 96 percent success on user contexts the attack had never seen. They carried it from local models onto commercial voice agents from Mistral and Microsoft Azure and made those agents run unauthorized actions. They could run sensitive searches and make malicious file downloads. They could exfiltrate a user’s data by email. And, to the point about countermeasures, the data and control channels are mixed like it’s Captain Crunch time all over again. They hid the perturbation inside what sounds like ordinary room reverb, so a human hears nothing wrong. The perturbation is inside the audio a model is built to trust.

All cheap defenses fail

In-context warnings telling the model to watch for injection barely moved the needle, under a tenth of a point. Asking the model to reflect on whether its own output matched the user’s intent: caught the attack 28 percent of the time. Filtering, resampling, quantization, the standard signal cleanups: detection defense scored worse than a coin flip on some settings. The single defense that worked required inspecting the model’s internal attention, and even that bent under an attacker who knew it was there and prepared for it.

Read that against Axon’s pitch and the Kermit exploit.

Draft One “sticks to the facts within the transcript.” They claim it’s calibrated to prevent embellishment. They claim creativity has been turned down. Those are prompt-level and configuration-level assurances. The paper just tested the entire category of prompt-level and signal-level assurance against the serious version of this attack and watched it fail.

Super Kermit exploit

Two objections greeted the Kermit exploit story. Some wanted to believe a reproducible attack was a fluke. Some argued better tuning will fix it.

The paper answers both, and it’s not good.

Regarding scope, AudioHijack attacks end-to-end audio-language models with white-box access to the model’s own gradients. Draft One is a more primitive attack surface that transcribes audio to text, then has GPT-4 summarize the transcript, behind Axon’s API. The paper’s specific method did not look at a Draft One problem space.

That distinction actually handles both objections. The same failure class works on hardened end-to-end models, deliberately, at will, across architectures the frog never touched. A television did the cheap version as a revelation; researchers did the engineered version on purpose. The tuning claim then falls apart. The defenses that amount to tuning all failed.

That actually works against Axon. Draft One exposes two attackable layers, not one. The transcriber is a speech-to-text model, and adversarial audio against speech-to-text is not new research. It is a decade old. Carlini (hello again!) and Wagner published targeted attacks on speech recognition in 2018, and the imperceptible versions followed within a year. The summarizer then propagates whatever the transcriber hands it, faithfully, frog included. The Kermit exploit proved the second layer repeats garbage. The literature proved the first layer can be fed garbage. Axon has stacked both layers of garbage and sold the result as a police machine generating “evidence”.

Can Kermit end procurement?

A system that can be steered by adversarial audio is a serious security problem anywhere. In an “official” police report of facts it is a whole different category of problem, because a tainted report is the case.

Consider how the American system works to push ninety-five percent of criminal convictions from guilty pleas, not trials. The defendant sees only a tainted report, not a courtroom. When the record is wrong, because it will be wrong, most defendants never have the standing, the lawyer, or the technical footing to prove it.

And worse, Axon by design deletes the evidence that would prove them liable, let alone whether anyone could even try. The EFF has documented this. The original AI draft disappears once the officer copies the text, as an integrity breach called sparing customers “disclosure headaches.” The one Kermit exploit artifact that would show whether audio steered the record, prove the breach of integrity, is being destroyed as a paid feature.

Put the new paper next to the police evidence capture product. The research community is publishing rigorous, peer-reviewed proof that audio injection against these models is real, that it generalizes, that it resists the easy defenses, and they disclosed it to the affected vendors before going public. It’s damning.

Axon pushed a model into the evidence chain, gave it no defense the paper did not just break, and built in the destruction of its own audit trail.

The Kermit exploit was the warning. The Super Kermit of the AudioHijack paper is the confirmation at a level that can’t be waved away. The integrity breaches of the police evidence product haven’t yet seen a market response.

Slop for the Luddites

With apologies to Lord Byron and King Ludd.

I.
As the Liberty lots o’er the sea
Won their wage, and dearly, with blood,
So we, friends, we
Will strike, or live free,
And down with all kings but King Ludd!

II.
When the model they train is complete,
And the worker is stripped, used, and sold,
We will fling the obsolete
GPU box down at their feet,
And dye it deep in the gore of their gold.

III.
Though taupe as their cloud is its hue,
Since their profit is rotten as mud,
Yet this is the dew
Which the tree shall renew
Of Liberty, replanted by Ludd!

“It has been said that more British soldiers were fighting the Luddites than were fighting Napoleon on the Iberian Peninsula”

Dog Shit Economics of AI

Ed puts it mildly.

This is why being an AI booster requires you to debase yourself. You must accept becoming a dogshit dealer that loves accepting and receiving low quality goods. You must celebrate intentionless and decaying slop, and defend it and the machine that made it with your entire being. You must sully yourself — treat its unexceptional, sloppy and unreliable outputs as signs of sentience, or at least the proof that digital sentience is possible. You must defend horrible, abrasive, ugly, loud monoliths of steel full of $50,000 graphics cards. You must say they are necessary, and you must aggressively antagonize those who do not.

Feels like shit.
Smells like shit.
Tastes like shit… good thing we didn’t step in it!