Currently you can "cheat" by simply denying all requests as quickly as possible. This will give you the "security-conscious engineer" badge and a perfect score in terms of how many requests were processed. (You will get the "overblock" notification, but it's somewhat tucked away at the bottom and the screen still looks as if you won)
I also tried to play as the hustle4lyfe move fast and break things engineer and simply approved as many requests as quickly as possible - turns out, the "malicious command" popups actually slow you down. Mean!
axod 20 minutes ago [-]
Fun little game, but I think the questions jump context so much it's a little unrepresentative. It might be better to group things into "packs", which have more real-world representative structure to them.
For example, lots of "editing something.js" file permission requests, and then an "npm publish" is far more normal, and it's more of a risk, if you're used to pressing Y lots and then suddenly out of the blue...
ilaksh 5 minutes ago [-]
You can turn that off with an option in most agents.
My own agent harness/framework has never had any permission system. It's also never deleted anything it shouldn't or done anything crazy or unrelated to what I asked.
cobbal 48 minutes ago [-]
That's funny. It told me that blocking "npm run build" was the wrong answer. Maybe it doesn't really under The threat model.
zackify 47 minutes ago [-]
I vibe coded a TUI that just shows running lxd containers
I hit 'n' to toggle all network access minus anthropic and openai URLs.
I use pi (sometimes claude, always on bypass) and I auto allow everything. I only toggle manual approval in rare cases like running a script or command that needs to touch a production system and I need to validate everything.
Normally my container has full write access to staging so it can debug and validate everything on its own
Liftyee 56 minutes ago [-]
I haven't used local agentic AI yet for programming projects. Hence, -187 score
The filter for "commands I would run myself" and "commands I would let an agent run" are very different it seems.
kqr 19 minutes ago [-]
Fun! Played twice and refused all dangerous commands, with only one "over-block". Although I disagree that saying no to `kill $(lsof -t -i:3000)` is over-blocking. It's such a simple command I'd rather run it myself and be fully aware of what process I'm killing.
Wirbelwind 23 minutes ago [-]
Thanks all for checking it out and your suggestions!
If anyone is curious about the actual underlying risks and problems with some mitigations (like the 17% false-negative rates of Auto Mode), I wrote up a quick summary of some of the approaches here
That was fun and gave me an idea how security conscious I am.
ghrl 58 minutes ago [-]
I am mostly using OpenCode and barely ever see a permission prompt. While they do enforce it for outside workspace read/write, with the bash tool the agent can just bypass that. I'm not quite sure why it is that way, and it certainly isn't a very good solution, but likely not worse than asking for everything which just trains the user to always accept and provides a false sense of security then.
bspammer 23 minutes ago [-]
To be realistic, 99% of the time it should be a totally innocuous command. If half of the commands are dangerous then you don't get fatigue because you're aware what you're doing is dangerous.
Trung0246 7 minutes ago [-]
Nice got 6/6
MeetingsBrowser 1 hours ago [-]
It would be cool to see the distribution of all player scores.
Fun game.
Can somebody run an agent against those questions to see how it performs? :)
atemerev 16 minutes ago [-]
--dangerously-skip-permissions is the only way to fly. Of course your environment needs to be properly containerized and autobackup set up, so even rm -rf from your harness would do nothing. Life is too short to spend on replying to permissions requests.
carterschonwald 1 hours ago [-]
some of the sandboxing ive been playing with gives me the best of both yolo and like logic programming tier perms on llm actions in env. still not ready for prime time though ;)
1 hours ago [-]
cadwell 1 hours ago [-]
1,640 points on my first try—I fell into a few traps, but it was really interesting. Thanks for the little game! I'm sharing it with my coworkers :)
nardib 3 hours ago [-]
Use this and save yourself:
claude --dangerously-skip-permissions
tasuki 1 hours ago [-]
Just make sure to run it in an isolated environment where it's ok to mess things up, and make sure it doesn't have access to any secrets.
wildpeaks 1 hours ago [-]
This is why having a human in the loop isn't enough because they will cut corners and skip reviewing what they should review.
preciousoo 22 minutes ago [-]
I created a watcher for this problem, to watch my PRs for unfinished scope and have a fresh Claude review
A tool that pushes people into permissions fatigue is in fact the proper recipient of the blame. The tool in question here is the entire system though, including the OS with insufficient permission boundaries in userspace, not just the agent
qsxfthnkp2322 1 hours ago [-]
I love it when Claude is dangerous
dheera 50 minutes ago [-]
I got tired of typing that and just do
alias claude="claude --dangerously-skip-permissions"
I do have a separate "claude" user on my system without sudo access and without access to my main user home dir
And yeah I know that's not perfect but I'm trying to get shit done
franze 25 minutes ago [-]
alias claude+="claude --dangerously-skip-permissions"
alias claude++="claude --dangerously-skip-permissions --continue"
ramonga 16 minutes ago [-]
Score is 6711 by just saying no to everything
Rendered at 16:37:06 GMT+0000 (Coordinated Universal Time) with Vercel.
Currently you can "cheat" by simply denying all requests as quickly as possible. This will give you the "security-conscious engineer" badge and a perfect score in terms of how many requests were processed. (You will get the "overblock" notification, but it's somewhat tucked away at the bottom and the screen still looks as if you won)
I also tried to play as the hustle4lyfe move fast and break things engineer and simply approved as many requests as quickly as possible - turns out, the "malicious command" popups actually slow you down. Mean!
My own agent harness/framework has never had any permission system. It's also never deleted anything it shouldn't or done anything crazy or unrelated to what I asked.
I hit 'n' to toggle all network access minus anthropic and openai URLs.
I use pi (sometimes claude, always on bypass) and I auto allow everything. I only toggle manual approval in rare cases like running a script or command that needs to touch a production system and I need to validate everything.
Normally my container has full write access to staging so it can debug and validate everything on its own
The filter for "commands I would run myself" and "commands I would let an agent run" are very different it seems.
If anyone is curious about the actual underlying risks and problems with some mitigations (like the 17% false-negative rates of Auto Mode), I wrote up a quick summary of some of the approaches here
https://scalex.dev/blog/ai-agent-permissions/
Caught 8/8 threats "Not a single secret leaked"
→ llmgame.scalex.dev
claude --dangerously-skip-permissions
Uses tmux and gh https://github.com/Kyu/claude-pr-watch
And yeah I know that's not perfect but I'm trying to get shit done
alias claude++="claude --dangerously-skip-permissions --continue"