RIFT-Bench: Dynamic Red-teaming For Agentic AI Systems
arXiv:2606.23927v1 Announce Type: new Abstract: Agentic AI systems powered by large language models (LLMs) are rapidly evolving into autonomous decision-making systems, exposing attack vectors beyond those of traditional LLM vulnerabilities. Existing security evaluations are often tied to specific implementations or domains, limiting unified comparison across heterogeneous systems. To address this gap, we introduce RIFT-Bench, a graph representation-driven methodology for dynamic red-teaming tha
延伸阅读
- Ensemble Feature Selection and Harris Hawks Optimization for Explainable Mental Health Risk Prediction in Female Sex Workers
- Breaking the Filter Bubble: A Semantic Pareto-DQN Framework for Multi-Objective Recommendation
- Can Language Model Agents be Helpful Circuit Explainers in Mechanistic Interpretability?
相关资讯
Ensemble Feature Selection and Harris Hawks Optimization for Explainable Mental Health Risk Prediction in Female Sex Workers
今天Breaking the Filter Bubble: A Semantic Pareto-DQN Framework for Multi-Objective Recommendation
今天Can Language Model Agents be Helpful Circuit Explainers in Mechanistic Interpretability?
今天Reinforcement Learning Towards Broadly and Persistently Beneficial Models
今天