AI Agent

  • Large model security: introduction and analysis of the open-source Guardrails security framework

    OpenGuardrails is the first complete open-source, enterprise-grade security guardrail platform for large models, supporting 119 languages, a unified LLM architecture, configurable sensitivity policies, and multi-cloud deployment. This report analyzes its core technical innovations, application scenarios, deployment models, performance benchmarks, and future development, and offers security and compliance guidance for AI applications in regulated industries such as finance, healthcare, and law. By examining OpenGuardrails' configurable policies, efficient model design, and production-grade infrastructure, it outlines the direction in which next-generation AI security guardrails are developing.

    January 6, 2026
  • CSO: A Chief Security Officer's Guide to End-to-End Security for Artificial Intelligence Data

    Chief Security Officers (CSOs) are facing an unprecedented challenge: AI systems are both amplifying existing data risks and introducing entirely new threats such as data poisoning, model reverse engineering, and supply chain contamination. This guide builds on the NIST AI Risk Management Framework (AI RMF), the Google Secure AI Framework (SAIF), and industry practices to provide CSOs with an actionable data security governance system.

    December 31, 2025
  • AI Agent Security: GitHub Actions Prompt Injection (PromptPwnd) Vulnerability

    PromptPwnd is a new class of vulnerability discovered by the Aikido Security research team that poses a serious threat to GitHub Actions and GitLab CI/CD pipelines that integrate AI agents. The attack uses prompt injection to feed malicious instructions to an AI model, causing it to perform high-privilege operations that can lead to key leakage, workflow manipulation, and supply chain compromise. At least five Fortune 500 companies have been affected, and several high-profile projects, including the Google Gemini CLI, have been confirmed to be vulnerable.

    December 27, 2025
  • AI Hacking: An Analysis of Automated Penetration Testing with AI Agents

    Strix represents a paradigm shift in cybersecurity testing: an evolution from manual penetration testing to a collaborative, multi-agent automation model. The tool covers the full vulnerability lifecycle (reconnaissance, exploitation, validation) through LLM-driven autonomous agents, and shows significant advantages over traditional manual penetration testing and passive scanning tools in both cost (a reduction of 70% or more) and time (test cycles shortened from weeks to hours). Its limitations are equally clear: the success rate for zero-day exploitation is only 10-12%, its ability to detect business logic vulnerabilities is seriously inadequate, and the security risks inherent in multi-agent systems (prompt injection, abuse of inter-agent trust) require a structured governance framework.

    December 24, 2025
  • OWASP Release: AI Agent Security OWASP Top 10 2026

    As AI evolves from simple "chatbots" into "agentic AI" with autonomous planning, decision-making, and execution capabilities, the attack surface of applications has fundamentally changed. In contrast to traditional LLM ...

    December 22, 2025