Microsoft Copilot Cowork Exfiltrates Files
Simon Willison's AI Notes published: Microsoft Copilot Cowork Exfiltrates Files The biggest challenge in designing agentic systems continues to be preventing them from enabling attackers to exfiltrate data. In this case Microsoft Copilot Cowork (yes, that's a real product name ) was allowing agents to send emails to the user's own inbox without approval... but those messages were then displayed in a way that could leak data to an attacker via rendered images: Because these messages can contain external images that trigger network requests to external websites, data can be exfiltrated when a user opens a compromised message sent by the agent. Since OneDrive can create pre-authenticated download links, a successful prompt injection could cause those links to be leaked, allowing files to be downloaded by the attacker. Via Hacker News Tags: ai , microsoft , llms , prompt-injection , security , generative-ai , lethal-trifecta , exfiltration-attacks
Read originalWhy it matters
This reported AI news may affect AI product capabilities, developer choices, or adoption timing. Review the original source for exact claims and availability.
This page is an independent summary. Facts and availability should be verified in the original publication.