Longterm Wiki

Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges

paper

Authors

Shrestha Datta·Shahriar Kabir Nahin·Anshuman Chhabra·Prasant Mohapatra

Credibility Rating

Good (3/5)

Good quality. Reputable source with community review or editorial standards, but less rigorous than peer-reviewed venues.

Rating inherited from publication venue: arXiv

A 2025 survey paper providing a structured overview of security risks in agentic LLM systems; useful reference for researchers and practitioners working on safe deployment of autonomous AI agents.

Metadata

Importance: 62/100 · arXiv preprint · analysis

Abstract

Agentic AI systems, powered by large language models (LLMs) and endowed with planning, tool use, memory, and autonomy, are emerging as powerful, flexible platforms for automation. Their ability to autonomously execute tasks across web, software, and physical environments creates new and amplified security risks, distinct from both traditional AI safety and conventional software security. This survey outlines a taxonomy of threats specific to agentic AI, reviews recent benchmarks and evaluation methodologies, and discusses defense strategies from both technical and governance perspectives. We synthesize current research and highlight open challenges, aiming to support the development of secure-by-design agent systems.

Summary

A comprehensive survey of security threats unique to agentic AI systems—LLM-powered autonomous agents with planning, tool use, and memory—presenting a threat taxonomy, reviewing evaluation benchmarks, and discussing technical and governance defense strategies. The paper distinguishes agentic AI risks from both traditional AI safety and conventional software security, synthesizing current research and open challenges to support secure-by-design agent development.

Key Points

  • Proposes a taxonomy of security threats specific to agentic AI, including prompt injection, tool misuse, memory poisoning, and multi-agent attack vectors.
  • Reviews evaluation benchmarks and methodologies for assessing vulnerabilities in autonomous agents operating across web, software, and physical environments.
  • Discusses defense strategies from both technical (input sanitization, sandboxing, monitoring) and governance (policy, auditing) perspectives.
  • Highlights that agentic AI risks are distinct from traditional AI safety and software security, requiring new frameworks and threat models.
  • Identifies open challenges in securing agentic systems, including the difficulty of evaluating emergent multi-step attack chains.
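
The technical defenses listed above (input sanitization and tool-use gating) can be sketched minimally. The code below is an illustrative assumption, not the paper's implementation: the tool allowlist, the injection patterns, and the function names are all hypothetical simplifications of the general techniques the survey discusses.

```python
import re

# Hypothetical allowlist: tools this agent is permitted to invoke.
ALLOWED_TOOLS = {"search", "calculator"}

# Hypothetical patterns for common prompt-injection phrasing; real
# sanitizers are far more sophisticated (and still imperfect).
INJECTION_PATTERNS = [
    re.compile(r"ignore (all )?previous instructions", re.IGNORECASE),
    re.compile(r"reveal (the )?system prompt", re.IGNORECASE),
]

def sanitize_untrusted_text(text: str) -> str:
    """Neutralize known injection phrases in retrieved/untrusted content."""
    for pattern in INJECTION_PATTERNS:
        text = pattern.sub("[REDACTED]", text)
    return text

def gate_tool_call(tool_name: str) -> bool:
    """Permit a tool call only if the tool is on the allowlist."""
    return tool_name in ALLOWED_TOOLS
```

Allowlisting fails closed: any tool not explicitly permitted (e.g. a shell) is rejected, which matches the sandboxing mindset the survey describes, whereas pattern-based sanitization is only a heuristic first layer.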

Cited by 2 pages

| Page | Type | Quality |
| --- | --- | --- |
| Agentic AI | Capability | 68.0 |
| Tool-Use Restrictions | Approach | 91.0 |

Cached Content Preview

HTTP 200 · Fetched Mar 20, 2026 · 98 KB
[License: arXiv.org perpetual non-exclusive license](https://info.arxiv.org/help/license/index.html#licenses-available)

arXiv:2510.23883v1 [cs.AI] 27 Oct 2025

# Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges


Shrestha Datta
Shahriar Kabir Nahin
Anshuman Chhabra (Corresponding Author)
Prasant Mohapatra



## 1 Introduction


Artificial Intelligence (AI) has become one of the most transformative technologies of the twenty-first century \[ [1](https://arxiv.org/html/2510.23883v1#bib.bib1 "")\]. From early rule-based expert systems \[ [2](https://arxiv.org/html/2510.23883v1#bib.bib2 "")\] to modern deep learning architectures \[ [3](https://arxiv.org/html/2510.23883v1#bib.bib3 "")\], AI has steadily expanded in both capability and scope. Over the past decade, AI has excelled primarily at narrow, task-specific applications such as image classification, speech recognition, recommendation systems, and predictive analytics \[ [4](https://arxiv.org/html/2510.23883v1#bib.bib4 ""), [3](https://arxiv.org/html/2510.23883v1#bib.bib3 "")\]. These systems typically operate within well-defined boundaries and are optimized for performance on constrained datasets, but they lack the ability to flexibly adapt beyond their original input/output designs.


Recently, the advent of large language models (LLMs), such as OpenAI’s GPT \[ [5](https://arxiv.org/html/2510.23883v1#bib.bib5 ""), [6](https://arxiv.org/html/2510.23883v1#bib.bib6 "")\] and Meta’s LLaMA \[ [7](https://arxiv.org/html/2510.23883v1#bib.bib7 "")\], has marked a paradigm shift for AI models. Trained on vast corpora of text (and now, even multimodal data), these models exhibit impressive generalization abilities and can generate coherent, contextually relevant responses across a wide range of domains \[ [8](https://arxiv.org/html/2510.23883v1#bib.bib8 ""), [9](https://arxiv.org/html/2510.23883v1#bib.bib9 "")\]. LLMs have enabled breakthroughs in conversational agents, code generation, content summarization, and multimodal reasoning \[

... (truncated, 98 KB total)