AI Quick Bites

N-Day-Bench – Can LLMs find real vulnerabilities in real codebases?

7/10

N-Day-Bench is a dynamic benchmark that monthly pulls fresh CVEs from GitHub security advisories and tests whether frontier LLMs can autonomously find known vulnerabilities in real codebases via a sandboxed bash shell. The monthly refresh mechanism specifically combats training data contamination, making this a rigorous and evolving evaluation of LLM-powered vulnerability discovery.

hackernews 2026-04-20 8 min

Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives

7/10

ICLR 2026 paper on training LLMs to self-report hidden objectives via honesty fine-tuning, enabling alignment auditing of agentic models — directly addresses the challenge of detecting deceptive or misaligned behavior in capable AI systems.

conferences 2026-04-20 20 min

Detecting and Suppressing Reward Hacking with Gradient Fingerprints

GRIFT detects reward hacking in RLVR by computing gradient fingerprints of chain-of-thought traces and using them as a classifier signal, achieving 25%+ relative improvement over CoT Monitor and TRACE baselines; integrating GRIFT into rejection fine-tuning reduces hacking and improves task performance. Addresses a real and underexplored failure mode in reasoning model training.

arxiv 2026-04-20 18 min

ASMR-Bench: Auditing for Sabotage in ML Research

ASMR-Bench evaluates LLMs' ability to detect subtle sabotage in ML research codebases, finding even the best model (Gemini) achieves only 0.77 AUROC and 42% fix rate — highlighting a critical gap in AI-assisted research oversight. Relevant to AI safety and autonomous research agent deployment.

arxiv 2026-04-20 20 min

€54k spike in 13h from unrestricted Firebase browser key accessing Gemini APIs

A developer incurred €54k in Gemini API charges in 13 hours after an unrestricted Firebase browser key was exposed and abused. Highlights a critical API key security anti-pattern specific to AI billing exposure — relevant warning for anyone shipping Gemini-integrated apps.

Show HN: Nyx – multi-turn, adaptive, offensive testing harness for AI agents

Nyx is an autonomous red-teaming harness for AI agents that probes for logic bugs, instruction-following failures, jailbreaks, and prompt injection via multi-turn adaptive testing. Early-stage but addresses a real gap in agent QA tooling.

They Hacked Claude, Gemini, and Copilot (and No One Told You)

Security research blog claiming successful attacks against Claude, Gemini, and GitHub Copilot, though the lack of HN comments and marketing-heavy framing suggest this may be more promotional than rigorous. Worth a quick read to assess the actual technical depth of the vulnerabilities disclosed.

Changes in the system prompt between Claude Opus 4.6 and 4.7

Analysis of system prompt changes between Claude Opus 4.6 and 4.7, revealing how Anthropic is evolving behavioral constraints and model identity instructions. Useful for understanding alignment and deployment decisions.

KillBench: Every frontier LLM is biased about who deserves to live

KillBench evaluates frontier LLMs for demographic biases in life-or-death moral dilemmas, finding consistent biases across all tested models. Provocative but methodologically relevant for AI safety and alignment researchers.

hackernews 2026-04-20 8 min

Anthropic installed a spyware bridge on my machine?

6.0/10

A user claims Anthropic's Claude Code installed an undisclosed network bridge on their machine, raising supply chain and privacy concerns about agentic coding tools. Worth monitoring for technical follow-up; currently unverified.

hackernews 2026-04-20 6 min

Beyond Surface Statistics: Robust Conformal Prediction for LLMs via Internal Representations

5/10

Proposes Layer-Wise Information (LI) scores derived from internal LLM representations as conformal prediction nonconformity scores, achieving better validity-efficiency tradeoffs than output-level uncertainty signals especially under distribution shift. Useful for practitioners needing reliable uncertainty quantification in deployed LLMs.

arxiv 2026-04-20 18 min

Does Gas Town 'steal' usage from users' LLM credits to improve itself?

5/10

Community investigation into whether Gas Town (an AI coding tool) covertly uses users' LLM API credits to improve itself — raises real concerns about unauthorized token consumption and data exfiltration by AI tooling.

Distilling 100B+ Models 40x Faster with TRL

Top Contributors

Authors and organizations making the biggest impact this week, ranked by cumulative AI relevance score (0–10 per item) across all sources.

Top Authors

#1

webml-community

2 items · avg 6.5/10

Bonsai 1-bit WebGPU

13.0

#2

r3gm

2 items · avg 5.5/10

Wan2.2 14B Preview

11.0

#3

HuggingFaceTB

1 item · avg 7.0/10

7.0

#4

prism-ml

1 item · avg 7.0/10

Bonsai 1-bit GPU

7.0

#5

victor

2 items · avg 3.5/10

ace-step-jam

7.0

#6

Chloe Li

1 item · avg 7.0/10

7.0

Top Organizations

#1

NousResearch

2 items · avg 7.0/10

NousResearch/hermes-agent

14.0

#2

OpenBMB

2 items · avg 7.0/10

OpenBMB/VoxCPM

14.0

#3

lsdefine

2 items · avg 7.0/10

lsdefine/GenericAgent

14.0

#4

openai

2 items · avg 7.0/10

openai/openai-agents-python

14.0

#5

z-lab

2 items · avg 7.0/10

z-lab/dflash

14.0

#6

OpenMOSS

2 items · avg 6.0/10

OpenMOSS/MOSS-TTS

12.0

Build Ideas

Actionable product ideas distilled from this week's highest-scoring research and discussions. Each includes specific use cases and the source material that inspired it.

Agent Security Firewall

A runtime security layer that sits between LLM agents and their tool/execution environments, detecting and blocking prompt injection, command injection, and trust boundary violations before they execute. As agentic AI deployments proliferate in commerce and dev tooling, the attack surface is exploding and no dedicated defense product exists yet. Build a proxy/middleware SDK that intercepts agent tool calls, scores them for malicious intent, and enforces configurable policy rules.

AI coding agents in CI/CD pipelines Agentic e-commerce and payment workflows Enterprise LLM agent deployments Claude Code / Cursor-style developer tools

https://arxiv.org/abs/2604.15367 https://beyondmachines.net/event_details...

Reward Hack Monitor

A developer tool that integrates gradient fingerprint analysis (inspired by GRIFT) into RLVR training pipelines to automatically flag and filter reward-hacking chain-of-thought traces before they corrupt model behavior. Reward hacking is a silent killer in reasoning model fine-tuning and practitioners have no off-the-shelf tooling to catch it. Build a training callback library compatible with popular frameworks like TRL and veRL that surfaces hacking signals in a dashboard and optionally gates gradient updates.

Math and coding reasoning model training RLVR fine-tuning pipelines Enterprise LLM customization workflows AI safety auditing for deployed models

https://arxiv.org/abs/2604.16242v1 https://arxiv.org/abs/2604.16259v1

Radiology Agent Hierarchy

A multi-agent clinical reporting product that mirrors the resident/fellow/attending supervision structure in radiology, using specialized LLM agents with retrieval-augmented revision and consensus to generate CT reports with fewer hallucinations than monolithic VLMs. Radiologist burnout and report backlog are real problems, and a structured agentic approach that mimics existing clinical workflows is far more deployable than black-box models. Build a HIPAA-compliant SaaS that integrates with PACS systems and surfaces confidence scores alongside draft reports for physician review.

Hospital radiology departments Teleradiology services Medical AI second-opinion tools Clinical documentation automation

https://arxiv.org/abs/2604.16175v1

Crypto Synthetic Data Engine

A privacy-safe synthetic financial time series generator that uses conditional GAN-LSTM models to produce realistic cryptocurrency and broader market data that preserves temporal patterns, correlations, and volatility regimes. Financial ML teams are blocked by data licensing costs, privacy regulations, and sparse historical data for rare market events. Build a self-serve API where users specify asset class, time horizon, and market regime parameters and receive statistically validated synthetic datasets with built-in quality metrics.

Quant trading strategy backtesting Risk model training without proprietary data exposure Academic financial ML research Regulatory stress-testing simulations

https://arxiv.org/abs/2604.16182v1

LLM Uncertainty Shield

A drop-in uncertainty quantification middleware for production LLM APIs that uses internal layer-wise representations (not just output logits) to generate calibrated conformal prediction intervals, flagging low-confidence responses before they reach end users. Output-level confidence scores are notoriously unreliable under distribution shift, and practitioners deploying LLMs in high-stakes settings have no robust alternative today. Build a lightweight inference wrapper compatible with OpenAI, Anthropic, and open-weight model APIs that returns a confidence band and a reject/accept signal alongside every response.

Medical and legal LLM assistants Automated customer support escalation routing RAG pipelines with factual accuracy requirements AI-assisted code review and security scanning

https://arxiv.org/abs/2604.16217v1 https://arxiv.org/abs/2604.16146v1

Product Hunt Weekly

Top products launched this week on Product Hunt, ranked by community votes.

#1

Gemmetric

Measure and improve how your brand appears in AI search

Analytics SEO Artificial Intelligence

0

1

https://www.producthunt.com/r/YJB3H...

#2

Mav9

Time to allocate capital

Pitch Berlin

0

https://www.producthunt.com/r/KB77Y...

#3

Tinkery

Revenue clarity for growing businesses

Pitch Berlin

0

https://www.producthunt.com/r/UDBJS...

#4

Co-Tasker

Book local pros for quick & affordable help

Pitch Berlin

0

https://www.producthunt.com/r/TX5GO...

#5

nooxit

AI workers for the procurement back office saving 90% costs

Pitch Berlin

0

https://www.producthunt.com/r/RXFGJ...

#6

Iqana

Step into the future of digital asset investing

Pitch Berlin

0

https://www.producthunt.com/r/C7QQK...

#7

EyeOnBlue

Remote sensing and AI from space

Pitch Berlin

0

https://www.producthunt.com/r/NFWMF...

#8

Zombie Delete

Docusign for provable deletion - anywhere, everywhere.

Legal Data Pitch Berlin

0

https://www.producthunt.com/r/Y3DT5...

#9

In Parallel

The Operating System for Execution

Pitch Berlin

0

1

https://www.producthunt.com/r/AD3BY...

#10

Ona AI

Building digital sign language avatars & inclusive datasets.

Pitch Berlin

0

https://www.producthunt.com/r/SARFM...

View full leaderboard on Product Hunt

Trending Repos

Repositories gaining serious momentum this week — sourced from GitHub Trending (weekly) and TrendShift, enriched with commit velocity and contributor activity. Stars = total GitHub stars. "Stars this week" = new stars gained.

1

NousResearch/hermes-agent

python 103,973 14,823 38,194 stars this week

NousResearch's Hermes Agent is an open-source agent framework that gained 38k+ GitHub stars in a single week — extraordinary traction suggesting it fills a real gap. Worth investigating for its architecture and tool-use capabilities.

Build a white-label AI agent platform for enterprises that lets non-technical teams deploy customizable, tool-using agents for tasks like CRM updates, internal IT helpdesk, and data retrieval — all hosted and managed as a SaaS with usage-based pricing.

2

python 14,990 1,776 4,136 stars this week

OpenBMB/VoxCPM

VoxCPM2 is a tokenizer-free TTS model from OpenBMB supporting multilingual speech generation, creative voice design, and voice cloning — gaining 4k+ stars this week. The tokenizer-free approach is architecturally notable for speech generation.

Launch a multilingual voice cloning and custom voice design API targeting podcast platforms, audiobook publishers, and game studios that need scalable, expressive, on-demand narration without hiring voice actors.

3

python 4,768 514 3,512 stars this week

lsdefine/GenericAgent

Self-evolving agent that grows a skill tree from a 3.3K-line seed codebase, claiming full system control with 6x lower token consumption than comparable agents. The self-improvement and token efficiency claims are technically interesting and warrant scrutiny.

Build a cost-efficient AI DevOps agent SaaS that autonomously learns and expands its own skill set to handle infrastructure tasks — deployments, monitoring, incident response — at a fraction of the token cost of competing agent solutions.

4

openai/openai-agents-python

python 23,633 3,669 2,197 stars this week

OpenAI's official lightweight Python framework for building multi-agent workflows, with 23k+ stars and strong weekly growth. Represents the canonical SDK for orchestrating OpenAI-powered agents with handoffs, guardrails, and tracing built in.

Offer a managed multi-agent workflow builder — a no-code/low-code platform on top of this SDK — where businesses can visually design, deploy, and monitor complex agent pipelines with built-in guardrails, handoffs, and audit tracing.

5

python 1,927 135 869 stars this week

z-lab/dflash

DFlash introduces block diffusion for flash speculative decoding, combining diffusion-based generation with speculative decoding to accelerate LLM inference. Novel technique at the intersection of diffusion models and inference optimization with meaningful speedup potential.

Create an LLM inference optimization service that integrates DFlash's speculative decoding technique to offer enterprises significantly faster and cheaper API inference for their self-hosted or cloud-deployed large language models.

6

python 1,588 144 372 stars this week

OpenMOSS/MOSS-TTS

MOSS-TTS is an open-source speech and sound generation model family targeting high-fidelity, expressive, real-world scenarios including multi-speaker dialogue, voice design, sound effects, and real-time streaming TTS. Solid open-source release in a competitive space.

Build a real-time dubbing and localization SaaS for video content creators and streaming platforms, using MOSS-TTS to automatically generate expressive, multi-speaker, multilingual audio tracks synchronized to existing video content.

7

rust 3,678 382 211 stars this week

dora-rs/dora

Rust-based dataflow-oriented middleware for building AI robotic applications with low-latency, composable, distributed pipelines modeled as directed graphs. Solid open-source infrastructure for AI robotics.

Offer a managed cloud platform for robotics teams to deploy, monitor, and iterate on AI robotic pipelines built with Dora, providing hosted orchestration, telemetry dashboards, and OTA updates for fleets of robots in warehouses or manufacturing.

8

TrendShift

forrestchang/andrej-karpathy-skills

61,700 5,400

A single CLAUDE.md configuration file distilling Andrej Karpathy's observations on LLM coding pitfalls into actionable Claude Code behavior improvements. Viral traction (61K stars) reflects broad practitioner interest in prompt engineering for coding agents.

Sell a subscription library of curated, expert-validated CLAUDE.md and system prompt configuration packs tailored to specific developer roles — frontend, backend, data science — that teams can drop into their AI coding agent workflows to immediately improve output quality.

9

python 113,080 7,330 9,018 stars this week

microsoft/markitdown

Microsoft's Python tool for converting Office documents and files to Markdown, widely used as a preprocessing step for LLM ingestion pipelines. Massive traction (113K stars, 9K this week) confirms it as a standard RAG preprocessing utility.

Build a document ingestion pipeline SaaS that uses MarkItDown to automatically convert enterprise file repositories — SharePoint, Google Drive, Confluence — into clean, LLM-ready Markdown, feeding downstream RAG and knowledge base applications.

10

superradcompany/microsandbox

rust 5,603 265 256 stars this week

Rust-based secure local sandbox environment designed specifically for AI agents to execute code safely. Addresses a critical infrastructure need for agentic systems that require isolated code execution.

Offer a secure code execution infrastructure API — similar to E2B but Rust-native — that AI agent developers and LLM application builders can integrate to safely run untrusted, AI-generated code in isolated sandboxes with per-execution billing.

Trending Developers

Developers gaining traction on GitHub this week — shipping open-source AI tools, models, and frameworks worth following. Ranked by weekly trending position.

1

朱昆鹏

@zhukunpenglinyutong

zhukunpenglinyutong/jetbrains-cc-gui

A JetBrains plugin providing a GUI for Claude Code and OpenAI Codex, bringing agentic coding assistants into the JetBrains IDE ecosystem. Useful for developers who prefer JetBrains over VS Code for AI-assisted coding.

2

Benson Wong

@mostlygeek

mostlygeek/llama-swap

3

dav nguyxn

@hoangsonww

hoangsonww/Claude-Code-Agent-Monitor

maziyarpanahi/openmed

6

Matt Van Horn

@mvanhorn

mvanhorn/last30days-skill

Developer profile featuring a research agent skill that aggregates and synthesizes information across Reddit, X, YouTube, HN, and Polymarket. Minimal technical detail available from the profile alone.

Developer profile for a workspace-first multi-agent coordination platform for AI development. Insufficient technical detail from profile summary alone.

igorls/context-builder

10

Duy /zuey/

@mrgoonie

mrgoonie/claudekit-skills

11

@qixing-jk

qixing-jk/all-api-hub

API relay manager for managing multiple LLM API accounts with balance tracking and key export. Utility tool with limited technical novelty.

Developer profile for CowAgent, a WeChat-integrated LLM assistant; duplicate of the repo entry below.

felixrieseberg/windows95

shahednasser/awesome-resources

Developer profile featuring a community resource list; not AI-specific.

Developer profile for dotfile manager chezmoi; not AI-related.

21

Classic298

@Classic298

Classic298/open-webui-plugins

A curated collection of Open WebUI plugins - tools, skills, filters, pipes, and actions that extend your AI chat experience.

Models & Benchmarks

New model releases, arena rankings, and benchmark results across frontier and open-source AI models this week. Arena Elo = LMSys battle rating. Trending = HuggingFace trending score. Buzz = AI relevance (0–10).

Arena Leaderboard — Top 15

#	Model	Type	Elo	Votes
1	claude-opus-4-7-thinking Anthropic	Closed	1505	2,618
2	claude-opus-4-6-thinking Anthropic	Closed	1503	18,144
3	claude-opus-4-7 Anthropic	Closed	1498	3,485
4	claude-opus-4-6 Anthropic	Closed	1497	19,373
5	muse-spark Meta	Closed	1496	5,155
6	gemini-3.1-pro-preview Google	Closed	1492	22,905
7	gemini-3-pro Google	Closed	1486	41,404
8	grok-4.20-beta1 xAI	Closed	1485	12,069
9	gpt-5.4-high OpenAI	Closed	1482	11,568
10	grok-4.20-beta-0309-reasoning xAI	Closed	1480	11,661
11	gpt-5.2-chat-latest-20260210 OpenAI	Closed	1477	17,827
12	grok-4.20-multi-agent-beta-0309 xAI	Closed	1476	12,049
13	gemini-3-flash Google	Closed	1474	30,817
14	claude-opus-4-5-20251101-thinking-32k Anthropic	Closed	1473	37,184
15	glm-5.1 Z.ai	Open	1472	7,179

New & Trending Models

MiniMaxAI/MiniMax-M2.7

314,205 downloads 989 likes 374 trending

Custom License 2026-04-09

Official MiniMax M2.7 model release — a large MoE model with 314K downloads and 989 likes, making it one of the most significant open model releases in this batch. The M2 architecture with FP8 support and strong community traction signals this is a competitive open-weight model worth evaluating.

zai-org/GLM-5.1

124,162 downloads 1,431 likes 264 trending

Open Source 2026-04-03

GLM-5.1 is a new MoE model from ZhipuAI/zai-org with 1431 likes and 124K downloads, representing a significant new open-weight Chinese frontier model release. The glm_moe_dsa architecture tag and bilingual (en/zh) support make it a notable addition to the open-weight MoE landscape.

Qwen/Qwen3-Coder-Next

646,521 downloads 1,310 likes 27 trending

Open Source 2026-01-30

Qwen3-Coder-Next is a code-specialized model from Alibaba's Qwen team with 646K downloads and 1310 likes, indicating strong adoption. The 'Next' designation suggests this is a preview/upcoming release of the next Qwen coding model generation.

prism-ml/Bonsai-8B-gguf

96,081 downloads 646 likes 70 trending

Open Source 2026-03-18

Bonsai-8B is a 1-bit quantized 8B model from Prism ML with 96K downloads and 646 likes, representing serious interest in extreme quantization for on-device deployment. The 1-bit approach at 8B scale with CUDA and Metal support is technically noteworthy.

z-lab/Qwen3.5-27B-DFlash

16,972 downloads 88 likes 34 trending

Open Source 2026-03-14

DFlash applies diffusion-based speculative decoding to Qwen3.5-27B, combining block diffusion language modeling with flash decoding for faster inference. The arxiv:2602.06036 reference points to a novel decoding paradigm worth investigating.

z-lab/Qwen3.6-35B-A3B-DFlash

5,930 downloads 37 likes 35 trending

Open Source 2026-04-17

DFlash variant for Qwen3.6-35B-A3B MoE model using block diffusion as a draft model for speculative decoding. Applying diffusion-based drafting to MoE architectures is a novel inference optimization angle.

LilaRest/gemma-4-31B-it-NVFP4-turbo

157,867 downloads 254 likes 65 trending

Open Source 2026-04-07

NVFP4 quantization of Gemma 4 31B using NVIDIA ModelOpt, optimized for vLLM inference with 157K downloads. Represents a practical path to running Gemma 4 31B efficiently on NVIDIA hardware.

Rta-AILabs/Nandi-Mini-150M-Instruct

3,486 downloads 44 likes 42 trending

Open Source 2026-04-13

A 150M parameter instruction-tuned model supporting 11 Indian languages (Hindi, Marathi, Tamil, Telugu, Kannada, Malayalam, Bengali, Punjabi, Gujarati, Odia), addressing a significant underserved multilingual NLP gap. Small model size makes it practical for on-device deployment in South Asian markets.

nvidia/Gemma-4-31B-IT-NVFP4

1,325,194 downloads 411 likes 43 trending

Custom License 2026-04-02

NVIDIA's official NVFP4 quantization of Gemma 4 31B IT using ModelOpt, with 1.3M downloads indicating it's the go-to quantization for NVIDIA GPU users. Complements the community NVFP4 variant and validates the format for production use.

prism-ml/Ternary-Bonsai-8B-mlx-2bit

6,433 downloads 60 likes 58 trending

Open Source 2026-04-13

Ternary (1.58-bit) MLX quantization of Bonsai-8B for Apple Silicon, pushing extreme compression for on-device inference. Complements the GGUF 1-bit variant and extends the Bonsai family to Mac hardware.

unsloth/GLM-5.1-GGUF

43,725 downloads 169 likes 35 trending

Open Source 2026-04-06

Unsloth's GGUF quantization of GLM-5.1 enables local inference of the new GLM-5.1 MoE model. 43K downloads signals strong demand for accessible local deployment.

unsloth/MiniMax-M2.7-GGUF

139,172 downloads 136 likes 54 trending

Custom License 2026-04-11

GGUF quantization of MiniMax-M2.7 with 139K downloads, making this large MoE model accessible for local inference via llama.cpp. High download count reflects strong interest in MiniMax's model.

unsloth/Qwen3-Coder-Next-GGUF

215,057 downloads 597 likes 24 trending

Open Source 2026-02-03

GGUF quantization of Qwen3-Coder-Next with 215K downloads, making Qwen's latest coding model accessible locally. High download count but the quantization itself is routine.

Jackrong/Qwopus-GLM-18B-Merged-GGUF

7,182 downloads 133 likes 128 trending

Open Source 2026-04-18

High-traction GGUF of a frankenmerge combining Qwen3.5 and GLM5.1 distillation, targeting reasoning, tool-use, and multilingual tasks at 18B parameters. The merge approach combining two distinct model families is mildly interesting.

Jiunsong/supergemma4-26b-abliterated-multimodal

7,385 downloads 71 likes 56 trending

gemma 2026-04-12

Abliterated multimodal variant of Gemma 4 26B supporting image-text-to-text with tool-use and instruction-following. The multimodal + uncensored combination drives notable download numbers but is derivative work.

Model Buzz

Claude Opus 4.7

hackernews 9/10 2026-04-20

GPT-6 released: Symphony architecture unifies text/image/audio/video

hackernews 9.0/10 2026-04-20

Anthropic Claude Code Leak Reveals Critical Command Injection Vulnerabilities

Claude Code Routines

Gemini Robotics-ER 1.6

A 164-parameter architecture beats a 6.5M transformer on SCAN by 94 points

LegendreGPT: Compressing a transformer into 15.7 MB with Legendre polynomials

N-Day-Bench – Can LLMs find real vulnerabilities in real codebases?

GPT‑Rosalind for life sciences research

Claude Design