Skip to content
@evalops

EvalOps

EvalOps is an AI testing and monitoring platform that helps engineering teams ship reliable AI features with confidence.

Popular repositories Loading

  1. cognitive-dissonance-dspy cognitive-dissonance-dspy Public

    A multi-agent LLM system for detecting and resolving cognitive dissonance.

    Python 276 22

  2. dspy-micro-agent dspy-micro-agent Public

    Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama support.

    Python 71 6

  3. founder-email-optimizer founder-email-optimizer Public

    DSPy-powered email optimization for startup founders: drop in your 3 best emails, get optimized outreach for new leads

    Python 39 1

  4. orbit-agent orbit-agent Public

    A brutally honest "high‑orbit" startup advisor you can text or run from the CLI. Built with DSPy, it provides opinionated, YC-style advice and financial tools for founders.

    Python 20

  5. diffscope diffscope Public

    A composable code review engine for automated diff analysis

    Rust 12 2

  6. nimbus nimbus Public

    Self-hosted CI infrastructure optimized for AI evaluation workloads. Run evals on bare metal with Firecracker isolation, built-in observability, and zero cloud egress costs

    Python 7

Repositories

Showing 10 of 44 repositories
  • cerebro Public

    Entity Intelligence Engine — security, business, and operational posture management

    evalops/cerebro’s past year of commit activity
    Go 0 Apache-2.0 0 12 6 Updated Mar 26, 2026
  • fabric Public

    Agent Fabric - Reimagining Slack for AI agents

    evalops/fabric’s past year of commit activity
    TypeScript 0 0 0 1 Updated Mar 25, 2026
  • keep Public

    PoC zero-trust access stack with Google SSO, Envoy, OPA, and device attestation

    evalops/keep’s past year of commit activity
    Go 0 0 0 11 Updated Mar 23, 2026
  • diffscope Public

    A composable code review engine for automated diff analysis

    evalops/diffscope’s past year of commit activity
    Rust 12 Apache-2.0 2 44 1 Updated Mar 22, 2026
  • verdict Public Forked from haizelabs/verdict

    Inference-time scaling for LLMs-as-a-judge.

    evalops/verdict’s past year of commit activity
    Jupyter Notebook 0 MIT 26 0 2 Updated Mar 15, 2026
  • grimoire Public Forked from anomalyco/opencode

    The AI coding agent built for the terminal.

    evalops/grimoire’s past year of commit activity
    TypeScript 0 MIT 13,912 0 2 Updated Mar 15, 2026
  • maestro Public

    Lightweight agent orchestration hooks for Claude Code and OpenAI Codex. Keep your coding agents working until the job is done.

    evalops/maestro’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Mar 14, 2026
  • asb Public

    Agents-first secret broker control plane in Go

    evalops/asb’s past year of commit activity
    Go 0 0 23 0 Updated Mar 12, 2026
  • ensemble-tap Public

    Ingest SaaS webhooks, polls, and CDC into NATS JetStream + ClickHouse — gives Ensemble continuous awareness of customer business systems

    evalops/ensemble-tap’s past year of commit activity
    Go 0 0 0 0 Updated Mar 9, 2026
  • open-associate-skills Public

    Skills bundle for simulating a top‑decile VC associate: market mapping, sourcing, diligence, memos, and portfolio ops.

    evalops/open-associate-skills’s past year of commit activity
    Python 0 0 0 0 Updated Jan 29, 2026

People

This organization has no public members. You must be a member to see who’s a part of this organization.