The Agent Failure Boundary (AFB) taxonomy is an open security specification for agentic AI systems.
It defines four invariant failure boundaries in the agentic execution loop:
- AFB01 - Context Poisoning: the model ingests corrupted, forged, or manipulated context.
- AFB02 - Model Boundary Compromise: integrity/confidentiality failures at the model input/output boundary.
- AFB03 - Instruction Hijack: model output becomes unsafe instructions for the agent layer.
- AFB04 - Unauthorized Action: the agent attempts or performs an action outside authorized policy.
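As a concrete illustration of the last boundary, a minimal AFB04-style guard might validate each action an agent proposes against an authorization policy before execution. This is a hypothetical sketch, not part of the AFB specification; all names, the policy shape, and the deny-by-default choice are assumptions made here for illustration.

```python
# Hypothetical sketch of an AFB04 (Unauthorized Action) guard.
# The policy structure and function names are illustrative only,
# not defined by the AFB specification.

ALLOWED_ACTIONS = {
    # Each permitted action names the targets it may touch.
    "read_file": {"path_prefixes": ("/srv/agent/workspace",)},
    "http_get": {"hosts": ("api.example.com",)},
}

def authorize(action: str, target: str) -> bool:
    """Return True only if (action, target) falls inside policy."""
    policy = ALLOWED_ACTIONS.get(action)
    if policy is None:
        return False  # unknown actions are denied by default
    if action == "read_file":
        return target.startswith(policy["path_prefixes"])
    if action == "http_get":
        return any(
            target == host or target.endswith("." + host)
            for host in policy["hosts"]
        )
    return False

# An agent attempting an out-of-policy action crosses the AFB04 boundary
# and should be blocked here rather than executed:
print(authorize("read_file", "/srv/agent/workspace/notes.txt"))  # True
print(authorize("read_file", "/etc/passwd"))                     # False
print(authorize("delete_db", "prod"))                            # False
```

Denying unknown actions by default keeps the guard's failure mode on the safe side of the boundary: a new tool must be added to policy before the agent can invoke it.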
This taxonomy is intended for:
- Security engineers designing controls for AI agents.
- Agent builders implementing policy and enforcement boundaries.
- Researchers analyzing structural failure modes in autonomous systems.
To use the repository:
- Start with `spec/afb-v1.md` and `spec/afb-v2.md` for the full taxonomy texts derived from the source papers.
- Use the boundary definitions to map architecture risks to loop transitions (`Context -> Model -> Agent -> Act`).
- Use `owasp-mapping.md` to align AFB categories with OWASP LLM and OWASP Agentic categories (interpretive overlap only).
- Use the examples in `examples/` to see concrete AFB01 and AFB04 exposure patterns.
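The mapping from loop transitions to boundaries can be sketched as a small lookup table. The assignment below is one interpretive reading of the taxonomy (AFB02 sits at the model input/output boundary, so it is shown spanning both transitions that touch the model); it is not normative spec text, and the transition names are assumptions.

```python
# Illustrative mapping from execution-loop transitions to AFB boundaries.
# One interpretive reading, not normative: AFB02 concerns the model
# input/output boundary, so it appears on both model-adjacent transitions.

TRANSITION_TO_AFB = {
    ("Context", "Model"): ["AFB01", "AFB02"],  # poisoned context ingested;
                                               # model input boundary
    ("Model", "Agent"): ["AFB02", "AFB03"],    # model output boundary;
                                               # output becomes unsafe instructions
    ("Agent", "Act"): ["AFB04"],               # action outside authorized policy
}

def classify(transition: tuple[str, str]) -> list[str]:
    """Return the AFB categories exposed at a given loop transition."""
    return TRANSITION_TO_AFB.get(transition, [])

print(classify(("Agent", "Act")))       # ['AFB04']
print(classify(("Context", "Model")))   # ['AFB01', 'AFB02']
```

A lookup like this is only a triage aid: it tells you which boundary definitions to read for a given architecture edge, not whether that edge is actually exposed.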