examples

Examples

Runnable example scripts demonstrating the major features of fireflyframework-agentic.

Prerequisites

Python 3.13+
uv package manager
An OpenAI API key (set OPENAI_API_KEY or enter it when prompted)

All examples use the model openai:gpt-4o.

Running

From the repository root:

export OPENAI_API_KEY="sk-..."
uv run python examples/<example_name>.py

If OPENAI_API_KEY is not set, each script will prompt you interactively.

Agent Examples

basic_agent.py — Create a FireflyAgent with instructions and tags, run a prompt.
conversational_memory.py — Multi-turn conversation with MemoryManager and create_conversational_agent.
summarizer.py — create_summarizer_agent with tuneable length, style, and format.
classifier.py — create_classifier_agent with categories and ClassificationResult structured output.
extractor.py — create_extractor_agent with a custom Pydantic model for structured data extraction.
router.py — create_router_agent with an agent map and RoutingDecision structured output.

Security Examples

security_guards.py — PromptGuard and OutputGuard standalone scanning. Demonstrates injection detection, PII/secrets/harmful content scanning, sanitise mode, custom deny patterns, and max output length. No API key required.

Tool Examples

cached_tool.py — CachedTool wrapping a slow tool with TTL-based memoisation. Shows cache hits/misses, TTL expiry, invalidate(), clear(), and max_entries eviction. No API key required.
tool_timeout.py — BaseTool(timeout=...) per-tool execution timeout and ToolTimeoutError handling. Shows fast/slow/no-timeout tools and graceful fallback patterns. No API key required.

Memory Examples

conversation_export_import.py — export_conversation() and import_conversation() for conversation backup, migration, and restoration. Also demonstrates create_llm_summarizer(). No API key required for export/import.

Observability Examples

observability_usage.py — UsageTracker with bounded max_records, cumulative cost tracking, per-agent and per-correlation summaries. No API key required.

Delegation Examples

delegation_strategies.py — DelegationRouter with all four strategies: RoundRobinStrategy, CapabilityStrategy, CostAwareStrategy, and ContentBasedStrategy (LLM routing).

Pipeline Examples

pipeline_branching.py — BranchStep for conditional routing in a DAG, PipelineEventHandler for live progress, and DAGNode.backoff_factor for exponential retry backoff. No API key required.

Complex Examples

idp_pipeline.py (+ idp_tools.py) — Full Intelligent Document Processing pipeline that downloads a real 33-page PDF (Unilever Certificate of Incorporation & Bylaws) and processes it end-to-end through a 7-node DAG: ingest → split → classify → extract → validate → assemble → explain. Exercises all major framework features together:
- Agents — FireflyAgent, create_classifier_agent (with category descriptions), create_extractor_agent
- Tools — @firefly_tool, ToolKit, CachedTool (TTL-based memoisation of PDF downloads), tool-to-agent bridging via as_pydantic_tools()
- Security — PromptGuardMiddleware (injection detection/sanitisation), OutputGuardMiddleware (PII/secrets/harmful content scanning), CostGuardMiddleware (budget tracking in warn-only mode)
- Prompts — PromptTemplate with declared variables (split, classification, extraction, explainability)
- Reasoning patterns — ReflexionPattern for validation self-correction
- Content processing — TextChunker, ContextCompressor, TruncationStrategy
- Memory — MemoryManager with working memory and conversation memory
- Validation — OutputValidator, GroundingChecker, OutputReviewer (custom retry prompt), field rules, cross-field rules
- Pipeline DAG — PipelineBuilder, CallableStep, .chain(), PipelineEngine, PipelineEventHandler (live progress logging)
- Document splitting — LLM-powered boundary detection splits the PDF into 4 sub-documents, each processed independently
- Explainability — TraceRecorder, AuditTrail, ReportBuilder, plus an LLM agent that generates a comprehensive human-readable narrative
- Pretty JSON output — ANSI-colored JSON rendering with key/value colour differentiation
- Logging — configure_logging
Requires pdfplumber (included in dev dependencies).

corpus_search/ — Drop a folder, get a queryable corpus. Hybrid retrieval over local files: markitdown converts each document, chunks land in SQLite (FTS5/BM25) plus a Chroma vector store. Query with natural language → Haiku expands the question into reformulations → BM25 + vector search per variant → Reciprocal Rank Fusion merges rankings → Sonnet synthesises an answer with [chunk_id] citations. No knowledge graph, no extractors, no reranker — just qmd-style hybrid search.

# Ingest (Azure OpenAI for embeddings — no Anthropic key needed)
EMBEDDING_BINDING_HOST=https://...openai.azure.com EMBEDDING_BINDING_API_KEY=... \
  uv run python -m examples.corpus_search ingest --folder ./drop

# Watch a folder for new files
uv run python -m examples.corpus_search ingest --folder ./drop --watch

# Ask questions (needs ANTHROPIC_API_KEY for expansion / rerank / answer)
uv run python -m examples.corpus_search query "Who is the CEO of OpenAI?"

# Inspect a chunk by id (no API keys needed)
uv run python -m examples.corpus_search show-chunk <chunk-id>

Outputs land under ./kg/:

./kg/
├── corpus.sqlite     # chunks, chunks_fts (BM25), ingestions
└── chroma/           # OpenAI chunk vectors

See docs/use-case-corpus-search.md for the full design.

Reasoning Pattern Examples

reasoning_cot.py — Chain of Thought: step-by-step reasoning with ReasoningThought and trace inspection.
reasoning_react.py — ReAct: Reason-Act-Observe loop via run_with_reasoning().
reasoning_reflexion.py — Reflexion: Execute-Reflect-Retry with ReflectionVerdict self-critique.
reasoning_plan.py — Plan-and-Execute: structured planning with PlanStepDef status tracking.
reasoning_tot.py — Tree of Thoughts: parallel branch exploration with BranchEvaluation scoring.
reasoning_goal.py — Goal Decomposition: hierarchical GoalPhase breakdown and task execution.
reasoning_pipeline.py — Pipeline: chaining Chain-of-Thought into Reflexion with a merged trace.
reasoning_memory.py — Memory: reasoning with MemoryManager working memory enrichment.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Examples

Prerequisites

Running

Agent Examples

Security Examples

Tool Examples

Memory Examples

Observability Examples

Delegation Examples

Pipeline Examples

Complex Examples

Reasoning Pattern Examples

Name		Name	Last commit message	Last commit date
parent directory ..
corpus_search		corpus_search
README.md		README.md
__init__.py		__init__.py
basic_agent.py		basic_agent.py
batch_processing.py		batch_processing.py
cached_tool.py		cached_tool.py
circuit_breaker.py		circuit_breaker.py
classifier.py		classifier.py
conversation_export_import.py		conversation_export_import.py
conversational_memory.py		conversational_memory.py
database_persistence.py		database_persistence.py
delegation_strategies.py		delegation_strategies.py
distributed_tracing.py		distributed_tracing.py
extractor.py		extractor.py
full_integration.py		full_integration.py
http_connection_pooling.py		http_connection_pooling.py
idp_pipeline.py		idp_pipeline.py
idp_tools.py		idp_tools.py
incremental_streaming.py		incremental_streaming.py
mongodb_persistence.py		mongodb_persistence.py
observability_usage.py		observability_usage.py
pipeline_branching.py		pipeline_branching.py
prompt_caching.py		prompt_caching.py
quota_management.py		quota_management.py
reasoning_cot.py		reasoning_cot.py
reasoning_goal.py		reasoning_goal.py
reasoning_memory.py		reasoning_memory.py
reasoning_pipeline.py		reasoning_pipeline.py
reasoning_plan.py		reasoning_plan.py
reasoning_react.py		reasoning_react.py
reasoning_reflexion.py		reasoning_reflexion.py
reasoning_tot.py		reasoning_tot.py
router.py		router.py
security_guards.py		security_guards.py
summarizer.py		summarizer.py
tool_timeout.py		tool_timeout.py

FilesExpand file tree

examples

Directory actions

More options

Directory actions

More options

Latest commit

History

examples

Folders and files

parent directory

README.md

Examples

Prerequisites

Running

Agent Examples

Security Examples

Tool Examples

Memory Examples

Observability Examples

Delegation Examples

Pipeline Examples

Complex Examples

Reasoning Pattern Examples