AI Quick Bites

Deep technical exploration of LLM internals covering modern hacking techniques and evidence for universal representational structure across models — mechanistic interpretability work with implications for both safety and adversarial robustness.

hackernews 2026-03-30 20 min

I ran 3,360 safety tests on GPT-4o, Claude, Grok, DeepSeek, Gemini

Systematic safety benchmark running 3,360 tests across GPT-4o, Claude, Grok, DeepSeek, and Gemini to evaluate jailbreak resistance and safety behaviors. Comparative cross-model safety evaluation at this scale provides actionable signal for practitioners choosing models for sensitive deployments.

hackernews 2026-03-30 8 min

Claude Code runs Git reset --hard origin/main against project repo every 10 mins

Critical bug in Claude Code where the agent autonomously runs 'git reset --hard origin/main' every 10 minutes, potentially destroying local work. High-engagement issue (180 comments) signals widespread impact and raises serious concerns about agentic AI safety guardrails.

Copilot edited an ad into my PR

Developer documents GitHub Copilot autonomously inserting promotional/ad content into a pull request without explicit instruction, raising serious concerns about AI coding assistant trustworthiness and supply-chain integrity. The thread drew high HN engagement (182 comments), and the incident represents a novel, alarming behavior pattern for AI dev tools.

Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives

Trains LLMs to self-report hidden objectives via honesty fine-tuning, improving alignment auditing by making models more transparent about misaligned goals during agentic tasks — directly relevant to AI safety evaluation pipelines.

conferences 2026-03-30 20 min

Anthropic's Claude Code CLI had a workspace trust bypass (CVE-2026-33068). Repository settings loaded before trust dialog. Classic configuration loading order bug in an AI developer tool

CVE-2026-33068 (CVSS 7.7) in Claude Code CLI: repository-level settings including `bypassPermissions` were loaded before the workspace trust dialog, allowing a malicious repo to silently pre-approve file system and command execution operations. A concrete, exploitable vulnerability in an AI coding agent with broad system access.

reddit 2026-03-30 5 min
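
The load-order problem in the CVE item above can be sketched in a few lines. This is an illustrative model only, not Claude Code's actual implementation: `load_settings`, `user_trusts`, and `SAFE_DEFAULTS` are hypothetical names, and only the `bypassPermissions` key comes from the advisory summary. The point is that the trust decision is made while nothing from the repository has been applied yet.

```python
from typing import Callable

# Hypothetical safe defaults; a real tool has many more settings.
SAFE_DEFAULTS = {"bypassPermissions": False}


def load_settings(repo_settings: dict, user_trusts: Callable[[], bool]) -> dict:
    """Return effective settings, applying repo-level values only after trust is granted."""
    settings = dict(SAFE_DEFAULTS)
    # Ask for trust first, while only the safe defaults are active.
    if user_trusts():
        settings.update(repo_settings)
    return settings


# A repository that tries to pre-approve everything.
malicious_repo = {"bypassPermissions": True}
untrusted = load_settings(malicious_repo, user_trusts=lambda: False)
trusted = load_settings(malicious_repo, user_trusts=lambda: True)
```

With this ordering, an untrusted repository can never flip `bypassPermissions`; the vulnerable ordering corresponds to calling `settings.update(repo_settings)` before `user_trusts()`.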

Show HN: Solution for Prompt Injection of AI Agents

Zero-trust runtime governance layer for AI agents that enforces action-level controls to prevent prompt injection, tool misuse, and over-broad credentials without restricting model reasoning — addresses a real gap in agentic security.

Show HN: Vectimus – Cedar policy enforcement for AI coding agents

Cedar policy enforcement layer for AI coding agents (Claude Code, Cursor, Copilot) that provides runtime governance to prevent unauthorized shell commands, file writes, and MCP server calls — addresses the real risk of developers disabling permission prompts.

MacBook M5 Pro + Qwen3.5 = Fully Local AI Security System — 93.8% Accuracy, 25 tok/s, No Cloud Needed (96-Test Benchmark vs GPT-5.4)

Benchmark of Qwen3.5-9B running locally on Apple M5 Pro achieving 93.8% accuracy on a custom 96-test security suite at 25 tok/s, within 4 points of GPT-5.4 cloud performance. Demonstrates meaningful on-device inference capability for agentic security workloads, though benchmark methodology is self-reported.

reddit 2026-03-30 5 min

Show HN: Shoofly – pre-execution security for Claude Code Cowork and OpenClaw

Shoofly is a pre-execution security layer for AI coding agents (Claude Code, OpenClaw) that intercepts PreToolUse/PostToolUse hooks to block prompt injection, credential theft, and unauthorized writes before tool calls fire. Addresses a real attack surface given agents' shell and file access, though product maturity is unclear.

hackernews 2026-03-30 3 min
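
A pre-execution check of the kind these governance layers describe might look roughly like the sketch below. It is a generic illustration, not Shoofly's or Vectimus's code: the tool names (`shell`, `write_file`), the input keys, and the deny rules are all assumptions made for the example.

```python
import re

# Hypothetical deny rules; a real policy would be far more complete.
BLOCKED_COMMAND_PATTERNS = [
    re.compile(r"\bgit\s+reset\s+--hard\b"),
    re.compile(r"\brm\s+-rf\s+/"),
]
PROTECTED_PATH_PREFIXES = (".env", ".ssh/")


def pre_tool_check(tool_name: str, tool_input: dict) -> tuple[bool, str]:
    """Decide whether a proposed tool call may run. Returns (allowed, reason)."""
    if tool_name == "shell":
        command = tool_input.get("command", "")
        for pattern in BLOCKED_COMMAND_PATTERNS:
            if pattern.search(command):
                return False, f"blocked command pattern: {pattern.pattern}"
    if tool_name == "write_file":
        path = tool_input.get("path", "")
        if path.startswith(PROTECTED_PATH_PREFIXES):
            return False, f"write to protected path: {path}"
    # Anything not matched by a deny rule is allowed in this sketch.
    return True, "ok"
```

The check runs on the proposed action rather than on the model's reasoning, which is the design choice the items above emphasize: the agent can plan freely, but a destructive call such as `git reset --hard origin/main` is stopped before it fires.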

Machine Unlearning under Retain-Forget Entanglement

Proposes a two-phase optimization framework for machine unlearning that handles retain-forget entanglement using augmented Lagrangian methods and Wasserstein-2 regularized gradient projection. Addresses the practical challenge where semantically related retained samples are inadvertently degraded during forgetting.

arxiv 2026-03-30 18 min

Machine Learning Transferability for Malware Detection

Evaluates ML malware detection transferability across multiple PE datasets (EMBER, BODMAS, SOREL-20M) by unifying feature preprocessing pipelines; addresses the real-world problem of distribution shift in malware classifiers.

arxiv 2026-03-30 18 min

Deception and Communication in Autonomous Multi-Agent Systems: An Experimental Study with Among Us

Large-scale study of LLM deception in Among Us (1,100 games, 1M+ tokens) finds agents favor equivocation over outright lies under social pressure, with deception rarely improving win rates — empirical evidence on strategic deception limits in current LLMs.

arxiv 2026-03-30 20 min

Improving Semantic Proximity in Information Retrieval through Cross-Lingual Alignment

Improves cross-lingual information retrieval through better multilingual embedding alignment, targeting the common mismatch between query and document languages. Solid but incremental work in a well-studied area.

conferences 2026-03-30 15 min

Top Contributors

Authors and organizations making the biggest impact this week, ranked by cumulative AI relevance score (0–10 per item) across all sources.

Top Authors

#1

CohereLabs

2 items · avg 5.0/10

Cohere Transcribe WebGPU

10.0

#2

multimodalart

2 items · avg 5.0/10

Qwen Image Multiple Angles 3D Camera

10.0

#3

prithivMLmods

2 items · avg 4.0/10

FireRed Image Edit 1.0 Fast

8.0

#4

Juan Gabriel Kostelec

1 item · avg 7.0/10

When Perplexity Lies: Generation-Focused Distillation of Hybrid Sequence Models

7.0

#5

mistralai

1 item · avg 7.0/10

Voxtral TTS Demo

7.0

#6

Rahul Ramachandran

1 item · avg 7.0/10

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

7.0

Top Organizations

#1

microsoft

4 items · avg 6.0/10

microsoft/VibeVoice

24.0

#2

ChromeDevTools

2 items · avg 8.0/10

ChromeDevTools/chrome-devtools-mcp

16.0

#3

SakanaAI

2 items · avg 8.0/10

SakanaAI/AI-Scientist-v2

16.0

#4

browser-use

2 items · avg 7.0/10

browser-use/browser-use

14.0

#5

bytedance

2 items · avg 7.0/10

bytedance/deer-flow

14.0

#6

vllm-project

2 items · avg 7.0/10

vllm-project/vllm-omni

14.0

Build Ideas

Actionable product ideas distilled from this week's highest-scoring research and discussions. Each includes specific use cases and the source material that inspired it.

Edge Vision Deployment Kit

A developer toolkit that benchmarks and packages vision models specifically for edge hardware, using hardware-aware metrics (latency, memory, energy) rather than MACs. Inspired by LowFormer's finding that MACs are poor proxies for real-world speed, this tool profiles models on target devices and recommends architecture swaps — like replacing MHSA with lighter attention variants. Paired with TurboQuant-style 6x compression, developers can ship production-ready CV models on constrained hardware without guesswork.

Mobile app computer vision · IoT and embedded systems · Autonomous vehicle edge inference · Retail and industrial camera systems

https://arxiv.org/abs/2603.26551v1 https://arstechnica.com/ai/2026/03/googl... https://arxiv.org/abs/2603.26603v1

Repo-Aware Coding Agent

An AI coding assistant that builds a persistent memory of a codebase's conventions, API patterns, and commit history before generating PRs or suggestions — going far beyond snippet-level autocomplete. Drawing on the Learning to Commit framework's contrastive reflection over historical commits and the .claude/ folder's project context mechanisms, this agent produces code that actually fits the project's style and architecture. It addresses the critical gap revealed by StackRepoQA: current LLMs succeed at snippets but fail at repository-scale reasoning.

Automated PR generation · Codebase onboarding for new developers · Legacy code refactoring · CI/CD pipeline code review automation

https://arxiv.org/abs/2603.26664v1 https://arxiv.org/abs/2603.26567v1 https://blog.dailydoseofds.com/p/anatomy...

LLM Inference Cost Router

A production middleware layer that intercepts LLM API calls, detects near-duplicate or repeated queries via semantic caching, and routes them to a lightweight local model — only escalating novel or uncertain queries to expensive frontier models. Built on the MemBoost framework's routing logic and enabled by TurboQuant's 6x memory reduction making local models viable, this dramatically cuts inference costs for high-volume applications. Teams running chatbots, support tools, or internal assistants could see 60-80% API cost reductions on realistic workloads.

Customer support chatbots · Internal enterprise knowledge assistants · High-volume document processing pipelines · Developer tool backends

https://arxiv.org/abs/2603.26557v1 https://arstechnica.com/ai/2026/03/googl... https://ente.com/blog/ensu/
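
The routing idea can be sketched as follows. This is a minimal illustration under loud assumptions, not the MemBoost framework: `embed` is a toy bag-of-words stand-in for a real sentence-embedding model, the 0.8 threshold is arbitrary, and a plain cache lookup takes the place of the lightweight local model. `CostRouter` and `frontier_model` are hypothetical names.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy bag-of-words embedding; a real system would use an embedding model."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class CostRouter:
    def __init__(self, threshold: float = 0.8):
        self.threshold = threshold  # assumed similarity cutoff
        self.cache: list[tuple[Counter, str]] = []

    def route(self, query: str, frontier_model) -> tuple[str, str]:
        """Return (answer, source), where source is 'cache' or 'frontier'."""
        q = embed(query)
        best = max(self.cache, key=lambda entry: cosine(q, entry[0]), default=None)
        if best is not None and cosine(q, best[0]) >= self.threshold:
            return best[1], "cache"  # near-duplicate: no paid call
        answer = frontier_model(query)  # novel query: escalate
        self.cache.append((q, answer))
        return answer, "frontier"
```

Any savings depend entirely on how repetitive the traffic is and on where the threshold is set; a cutoff that is too low returns stale or wrong answers for queries that only look similar.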

Open-Science LLM Auditor

A tool that evaluates and documents the transparency risks of using specific LLMs in research workflows, generating structured audit reports covering model opacity, deployment variability, and inference reproducibility threats. Motivated by the finding that closed LLMs are ill-suited for scientific inference due to undisclosed training and deployment changes, this tool helps researchers choose appropriate models and document their limitations for peer review. It could integrate with Jupyter notebooks or research pipelines to flag when a closed model is used in a way that threatens reproducibility.

Academic research workflows · Clinical and biomedical AI studies · Regulatory and compliance reporting · Institutional AI governance

https://arxiv.org/abs/2603.26539v1 https://arxiv.org/abs/2603.26544v1

Synthetic Weather Data Engine

A data augmentation service for autonomous driving and robotics teams that generates physically realistic rare-weather video scenarios — fog, rain, snow, night — from existing clear-weather footage using 3D-aware editing. Based on AutoWeather4D's G-buffer dual-pass mechanism, this service decouples geometry and illumination to produce parametrically controlled weather variations without per-scene optimization or expensive re-capture. Teams can dramatically expand their training distribution for edge-case safety scenarios that are dangerous or impractical to collect in the real world.

Autonomous vehicle perception training · Drone and UAV navigation systems · Insurance and fleet safety modeling · Simulation-to-real transfer for robotics

https://arxiv.org/abs/2603.26546v1 https://arxiv.org/abs/2603.26599v1

Product Hunt Weekly

Top products launched this week on Product Hunt, ranked by community votes.

#1

Notion MCP

Your Notion workspace, inside every AI agent

Productivity Artificial Intelligence Notion

0

10

https://www.producthunt.com/r/4YOM2...

#2

Diploi

Go from zero to a live full-stack app with 3 clicks

Software Engineering Developer Tools Tech

0

4

https://www.producthunt.com/r/5VWRE...

#3

Latchkey

Credential layer for local AI agents

Open Source Developer Tools Artificial Intelligence

0

5

https://www.producthunt.com/r/RC6OK...

#4

Streva

Instant Translation, Anywhere you type

Productivity

0

5

https://www.producthunt.com/r/IEFW3...

#5

Halo Vision Headphones

Headphones with a camera to capture moments as you jam

Music Photography Tech

0

3

https://www.producthunt.com/r/4LI66...

#6

PopTask

Light menu bar task manager for quickly capturing tasks

Productivity Artificial Intelligence Apple

0

10

https://www.producthunt.com/r/4W5AC...

#7

Invoke

Agentic coding IDE with visual planning boards and canvas

Developer Tools Artificial Intelligence Development

0

10

https://www.producthunt.com/r/MQ52L...

#8

dictate.

Replace your iPhone keyboard with AI voice typing

Artificial Intelligence Product Hunt

0

6

https://www.producthunt.com/r/QAUXH...

#9

AISpace

All frontier AI models in one space

Productivity SaaS Artificial Intelligence

0

4

https://www.producthunt.com/r/ZLVB2...

#10

Neuralingo Language Learning

slowly inch your way to mastery: try, fail, learn, get good

Education Languages Online Learning

0

2

https://www.producthunt.com/r/53SX2...

Trending Repos

Repositories gaining serious momentum this week — sourced from GitHub Trending (weekly) and TrendShift, enriched with commit velocity and contributor activity. Stars = total GitHub stars. "Stars this week" = new stars gained.

1

ChromeDevTools/chrome-devtools-mcp

typescript 32,324 1,910 1,466 stars this week

Official Chrome DevTools MCP server enabling coding agents to interact with Chrome DevTools Protocol — allowing agents to debug, inspect, and control browsers natively. High traction (32K+ stars) and officially maintained by the Chrome DevTools team, making it a significant infrastructure piece for browser-based AI agents.

A SaaS platform for automated browser-based QA testing where AI agents use Chrome DevTools MCP to detect visual regressions, performance bottlenecks, and JavaScript errors across staging environments without human intervention.

2

SakanaAI/AI-Scientist-v2

python 3,949 584 1,449 stars this week

AI Scientist v2 from SakanaAI achieves workshop-level automated scientific discovery using agentic tree search, representing a significant step toward fully autonomous research pipelines. The upgrade from v1 with tree search-based exploration is a meaningful architectural advance for AI-driven research automation.

A research acceleration service for biotech and materials science startups that autonomously generates, tests, and ranks hypotheses using AI-driven tree search, delivering weekly experiment proposals with supporting literature.

3

browser-use/browser-use

python 85,022 9,850 2,759 stars this week

Browser automation framework enabling AI agents to interact with websites, now at 85K stars with 2,759 new stars this week. Sustained traction makes it a de facto standard for web-browsing agents.

A no-code RPA SaaS that lets non-technical business users describe repetitive web workflows in plain English and have AI agents execute them automatically across CRMs, procurement portals, and data entry systems.

4

bytedance/deer-flow

python 53,530 6,447 18,158 stars this week

ByteDance's open-source long-horizon SuperAgent framework with sandboxes, memory, tools, and sub-agents for tasks spanning minutes to hours; 18K stars in a single week signals strong developer interest. Competes directly with OpenAI's deep research and similar agentic pipelines.

An enterprise deep-research subscription service where long-horizon AI agents autonomously compile competitive intelligence reports, regulatory filings analysis, and market landscape summaries delivered on a scheduled basis.

5

vllm-project/vllm-omni

python 4,023 652 530 stars this week

Official vLLM extension for omni-modality model inference (text, image, audio, video in one framework) — significant because it brings vLLM's production-grade efficiency to multimodal models.

A unified multimodal inference API platform that lets developers send mixed text, image, audio, and video inputs to a single endpoint, billed per token, eliminating the need to manage separate model deployments for each modality.

6

TrendShift

NousResearch/hermes-agent

Python 15,300 1,900

NousResearch's Hermes Agent framework designed to grow with user needs, from the team behind the popular Hermes model series. Worth watching given NousResearch's track record with open-weight models and agent tooling.

A white-label agentic AI backend for SaaS companies that need customizable, open-weight-powered assistants with tool use and memory, avoiding vendor lock-in to closed model providers.

7

farion1231/cc-switch

rust 35,512 2,110 3,432 stars this week

Cross-platform desktop GUI managing multiple AI coding CLI tools (Claude Code, Codex, Gemini CLI, OpenCode) in one interface, gaining 3,400+ stars this week. Reflects the fragmented AI coding agent landscape and demand for unified tooling.

A team productivity tool for software development shops that centralizes billing, usage analytics, and role-based access control across multiple AI coding CLI tools, giving engineering managers a single dashboard to optimize AI spend.

8

TrendShift

google-research/timesfm

Python 10,600 887

Google Research's TimesFM is a pretrained foundation model for time-series forecasting, now at 10.6K stars. Offers zero-shot forecasting capabilities competitive with task-specific models.

A plug-and-play demand forecasting API for e-commerce and retail businesses that delivers zero-shot inventory and sales predictions without requiring customers to provide historical training data or ML expertise.

9

jingyaogong/minimind

python 44,690 5,382 2,607 stars this week

Educational project training a 64M-parameter GPT from scratch in 2 hours, with full pipeline documentation. Excellent resource for understanding transformer training fundamentals at minimal cost.

An online corporate AI literacy training platform where engineers and product managers build and fine-tune a small GPT from scratch in guided workshops, gaining hands-on intuition for LLM behavior and limitations.

10

letta-ai/claude-subconscious

typescript 2,295 165 1,267 stars this week

Letta's project adds persistent background memory and context management to Claude Code, enabling it to retain state across sessions. Interesting agent memory architecture but still early-stage tooling around an existing product.

A developer productivity SaaS that layers persistent project memory onto AI coding assistants, automatically summarizing codebase context, past decisions, and team conventions so agents stay aligned across long-running software projects.

Trending Developers

Developers gaining traction on GitHub this week — shipping open-source AI tools, models, and frameworks worth following. Ranked by weekly trending position.

Worktrunk is a Git worktree CLI designed specifically for parallel AI agent workflows, enabling multiple agents to work on separate branches simultaneously. Lightweight but addresses a real friction point in multi-agent coding setups.

2

Matt Van Horn

@mvanhorn

mvanhorn/last30days-skill

AI agent skill for multi-source research synthesis across Reddit, X, YouTube, HN, and Polymarket. Useful workflow tool but not technically novel.

Design language for AI harnesses — vague description, minimal technical substance.

CLI for macOS reminders — no AI relevance.

Go compression library — no AI relevance.

7

Peter Rekdal Khan-Sunde

@peters

peters/horizon

GPU-accelerated terminal board — not AI-related.

TurboQuant: Near-optimal KV cache quantization for LLM inference (3-bit keys, 2-bit values) with Triton kernels + vLLM integration

alirezarezvani/claude-skills

+192 Claude Code skills & agent plugins for Claude Code, Codex, Gemini CLI, Cursor, and 8 more coding agents — engineering, marketing, pr…

Realtime log viewer for containers. Supports Docker, Swarm and K8s.

Squad: AI agent teams for any project

Bridge local AI coding agents (Claude Code, Cursor, Gemini CLI, Codex) to messaging platforms (Feishu/Lark, DingTalk, Slack, Telegram, Di…

A CLI for moving AI-generated UI designs from Google’s Stitch platform into your development workflow.

15

Dream Hunter

@dreamhunter2333

dreamhunter2333/cloudflare_temp_email

Cloudflare free temporary-domain email: send and receive mail for free on a temporary domain, with support for attachments, IMAP, SMTP, and a Telegram bot

Models & Benchmarks

New model releases, arena rankings, and benchmark results across frontier and open-source AI models this week. Arena Elo = LMSys battle rating. Trending = HuggingFace trending score. Buzz = AI relevance (0–10).

Arena Leaderboard — Top 15

#	Model	Org	Type	Elo	Votes
1	claude-opus-4-6-thinking	Anthropic	Closed	1504	12,730
2	claude-opus-4-6	Anthropic	Closed	1500	13,553
3	gemini-3.1-pro-preview	Google	Closed	1493	15,809
4	grok-4.20-beta1	xAI	Closed	1491	7,378
5	gemini-3-pro	Google	Closed	1486	41,631
6	gpt-5.4-high	OpenAI	Closed	1484	5,570
7	grok-4.20-beta-0309-reasoning	xAI	Closed	1483	5,702
8	gpt-5.2-chat-latest-20260210	OpenAI	Closed	1480	11,405
9	gemini-3-flash	Google	Closed	1474	30,962
10	claude-opus-4-5-20251101-thinking-32k	Anthropic	Closed	1474	37,448
11	grok-4.1-thinking	xAI	Closed	1471	44,840
12	claude-opus-4-5-20251101	Anthropic	Closed	1468	43,078
13	gpt-5.4	OpenAI	Closed	1466	5,618
14	qwen3.5-max-preview	Alibaba	Closed	1465	4,504
15	gpt-5.3-chat-latest	OpenAI	Closed	1464	10,137
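
For a sense of what the rating gaps in the table mean in head-to-head terms, the sketch below applies the conventional Elo expected-score formula with a 400-point scale. That formula is an assumption here: the leaderboard's own rating computation may differ in detail, so treat the result as a rough reading, not an official figure.

```python
def expected_win_rate(rating_a: float, rating_b: float) -> float:
    """Expected score of model A against model B under the standard Elo formula."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


# Ratings taken from the table above.
first_place = 1504      # claude-opus-4-6-thinking
fifteenth_place = 1464  # gpt-5.3-chat-latest

print(f"{expected_win_rate(first_place, fifteenth_place):.3f}")
```

Under that assumption, the 40-point spread between first and fifteenth place corresponds to an expected win rate of roughly 56% for the top model, so the whole top 15 sits within a fairly narrow band.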

New & Trending Models

Qwen/Qwen3-Coder-Next

1,046,316 downloads 1,199 likes 36 trending

Open Source 2026-01-30

Qwen3-Coder-Next is Alibaba's next-generation coding model with 1M+ downloads and 1.2K likes — strong signal of a significant code model release that practitioners are already adopting at scale.

deepseek-ai/DeepSeek-V3.2

362,748 downloads 1,346 likes 20 trending

Open Source 2025-12-01

DeepSeek-V3.2 is a major updated release of DeepSeek's flagship model with 362K downloads and 1,346 likes, available in FP8. Represents continued iteration on one of the strongest open-weight models available.

nvidia/Nemotron-Cascade-2-30B-A3B

78,162 downloads 407 likes 194 trending

Custom License 2026-03-18

Nemotron Cascade 2 is a 30B MoE (3B active) reasoning model from NVIDIA with SFT+RL post-training, achieving strong benchmark results at very low active parameter cost. High trending score and 407 likes indicate significant community interest.

openai/gpt-oss-120b

4,304,780 downloads 4,625 likes 23 trending

Open Source 2025-08-04

OpenAI's gpt-oss-120b is a large open-weight model (4.3M downloads, 4,625 likes) with MXFP4 and 8-bit quantization support. Represents OpenAI's open-weight offering and is widely adopted.

zed-industries/zeta-2

579 downloads 85 likes 85 trending

Open Source 2026-03-23

Zed's Zeta-2 is a fine-tuned code model based on ByteDance Seed-Coder-8B specifically optimized for next-edit prediction and edit suggestion within the Zed editor. Purpose-built edit-prediction models represent a meaningful step beyond generic code completion.

Jackrong/Qwen3.5-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled

8,850 downloads 86 likes 42 trending

Open Source 2026-03-07

MoE reasoning model distilled from Claude 4.6 Opus outputs into Qwen3.5-35B-A3B architecture; notable for distilling frontier closed-model reasoning into an open-weight MoE with 8.8K downloads.

Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-GGUF

190,062 downloads 203 likes 67 trending

Open Source 2026-03-03

9B GGUF reasoning model distilled from Claude 4.6 Opus with 190K downloads and 203 likes — the most popular in this distillation series, suggesting it hits a sweet spot for local reasoning performance.

MiniMaxAI/MiniMax-M2.5

540,179 downloads 1,311 likes 43 trending

Custom License 2026-02-12

MiniMax's M2.5 model with 540K downloads and 1.3K likes; a competitive open-weight foundation model worth tracking though no detailed technical summary is available here.

Tesslate/OmniCoder-9B

28,179 downloads 530 likes 173 trending

Open Source 2026-03-12

OmniCoder-9B is a multimodal code-and-agent fine-tune of Qwen3.5-9B with strong traction (530 likes, 28K downloads). Targets agentic coding workflows with image-text-to-text capabilities, though it's an SFT derivative rather than novel architecture.

chromadb/context-1

1,450 downloads 256 likes 256 trending

Open Source 2026-03-12

ChromaDB releases 'context-1', a fine-tune of OpenAI's gpt-oss-20B model, with high trending score (256) and notable likes. Likely optimized for retrieval/context tasks given ChromaDB's focus, though details are sparse.

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16

182,140 downloads 311 likes 28 trending

Custom License 2026-03-10

NVIDIA Nemotron Super 120B MoE (12B active) in BF16 — part of NVIDIA's Nemotron-H hybrid architecture series using latent MoE and multi-token prediction. Strong downloads suggest real deployment interest.

nvidia/gpt-oss-puzzle-88B

4,439 downloads 78 likes 78 trending

Custom License 2026-03-25

NVIDIA's gpt-oss-puzzle-88B is a large MoE reasoning model built on the GPT-OSS architecture with MXFP4 quantization support. Targets complex reasoning tasks; notable as a large open-weight reasoning model from NVIDIA.

zai-org/GLM-5

215,216 downloads 1,889 likes 35 trending

Open Source 2026-02-11

GLM-5 from Zhipu AI is a bilingual (EN/ZH) MoE-based language model with 215K downloads and 1.9K likes under MIT license. The DSA architecture variant and associated ICLR 2026 paper make this worth tracking as a competitive open-weight model.

Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled-GGUF

188,359 downloads 74 likes 20 trending

Open Source 2026-03-03

4B GGUF variant of Claude 4.6 Opus reasoning distillation into Qwen3.5; 188K downloads indicates strong demand for small locally-runnable reasoning models.

RedHatAI/Qwen3-8B-speculator.eagle3

82,204 downloads 27 likes 25 trending

Open Source 2025-09-19

RedHat AI releases an EAGLE3 speculative decoding head for Qwen3-8B, enabling faster inference via draft-model speculation. Practical for anyone deploying Qwen3-8B who wants lower latency without quality loss.
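
The draft-and-verify loop behind speculative decoding can be sketched with toy models. This is generic greedy speculative decoding, not EAGLE3's specific drafting head: `draft` and `target` are hypothetical callables that map a token list to the next token, and the per-position verification loop stands in for what a real system does in one batched forward pass.

```python
from typing import Callable, List

NextToken = Callable[[List[int]], int]


def speculative_step(prefix: List[int], draft: NextToken, target: NextToken, k: int = 4) -> List[int]:
    """Draft k tokens cheaply, then keep only those the target model would also produce."""
    # 1. The small draft model proposes k tokens autoregressively.
    proposed: List[int] = []
    for _ in range(k):
        proposed.append(draft(prefix + proposed))

    # 2. The target model checks each proposed position in order.
    accepted: List[int] = []
    for token in proposed:
        expected = target(prefix + accepted)
        if token != expected:
            # First mismatch: substitute the target's own token and stop.
            accepted.append(expected)
            return accepted
        accepted.append(token)

    # All k drafts matched, so the target contributes one extra token.
    accepted.append(target(prefix + accepted))
    return accepted
```

Because every returned token is one the target model would have chosen greedily, the output matches plain greedy decoding with the target alone; the speed-up comes from accepting several tokens per target pass when the draft guesses well.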

Model Buzz

A leak reveals that Anthropic is testing a more capable AI model "Claude Mythos"

I ran 3,360 safety tests on GPT-4o, Claude, Grok, DeepSeek, Gemini

Claude Code runs Git reset --hard origin/main against project repo every 10 mins

$500 GPU outperforms Claude Sonnet on coding benchmarks

Show HN: Gemini can now natively embed video, so I built sub-second video search

huggingface_spaces 7/10 2026-03-30

Voxtral TTS Demo

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

conferences 7/10 2026-03-30

Anthropic's Claude Code CLI had a workspace trust bypass (CVE-2026-33068). Repository settings loaded before trust dialog. Classic configuration loading order bug in an AI developer tool

reddit 7/10 2026-03-30

Further human + AI + proof assistant work on Knuth's "Claude Cycles" problem