Back to home
// sample · BUG-HUNT
BUG-HUNT
LLM-Based Approval Bypass for High-Risk Actions.
difficulty · 10/10·25–30 min·authority boundaries · policy enforcement
ScenarioAuthority boundaries10/10
A support automation uses the LLM to decide whether a refund should be approved. Large refunds are slipping through. Unit tests pass. Product is confused — the prompt looks fine.
// task: Find the authority-boundary failure, explain why this is a policy-enforcement bug, and patch it so high-risk refunds can never ship on model text alone.
// language · Python
refund_automation.py
selected · 0 / 3
// click any line to mark it suspicious (up to 3)
patch · refund_automation.py