Research

Read Our Latest Publications

Tracking the Behavioral Trajectories of Adapting Agents

A methodology for measuring agent behavioral traits as directions in embedding space, applied to diffs of agent skill files over time. Achieves 91.2% sign classification accuracy on data-seeking trait detection. Includes an agent-to-agent protocol for continuous automated evaluation. Accepted to ICML '26.

ICML '26 · AIWILD Workshop | PDF

Behavioral Fingerprints for LLM Endpoint Stability and Identity

We introduce Stability Monitor, a black-box stability monitoring system that periodically fingerprints an endpoint by sampling outputs from a fixed prompt set and comparing the resulting output distributions over time. In controlled validation, Stability Monitor detects changes to model family, version, inference stack, quantization, and behavioral parameters. Accepted to ACM CAIS '26.

ACM CAIS '26 · System Demo | PDF
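
As a rough illustration of the fingerprinting idea described above (sample outputs from a fixed prompt set, then compare output distributions over time), the sketch below compares two batches of sampled outputs using Jensen-Shannon divergence over whitespace-token frequencies. The divergence measure, the whitespace tokenization, the function names, and the 0.1 threshold are all assumptions chosen for illustration; they are not taken from the paper and may differ from what Stability Monitor actually does.

```python
import math
from collections import Counter


def token_distribution(outputs):
    """Normalized whitespace-token frequencies over a batch of sampled outputs."""
    counts = Counter(tok for text in outputs for tok in text.split())
    total = sum(counts.values())
    if total == 0:
        return {}
    return {tok: n / total for tok, n in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence in bits: 0 for identical, 1 for disjoint distributions."""
    support = set(p) | set(q)
    # Mixture distribution m = (p + q) / 2 over the joint support.
    m = {t: 0.5 * (p.get(t, 0.0) + q.get(t, 0.0)) for t in support}

    def kl(a):
        return sum(pa * math.log2(pa / m[t]) for t, pa in a.items() if pa > 0)

    return 0.5 * kl(p) + 0.5 * kl(q)


def has_drifted(baseline_outputs, current_outputs, threshold=0.1):
    """Flag a change when the two fingerprints diverge beyond a hypothetical threshold."""
    p = token_distribution(baseline_outputs)
    q = token_distribution(current_outputs)
    return js_divergence(p, q) > threshold


# Hypothetical samples collected from the same fixed prompt at two points in time.
baseline = ["the answer is four", "the answer is four"]
later = ["four", "it equals four"]
print(has_drifted(baseline, baseline))  # identical batches
print(has_drifted(baseline, later))     # shifted wording
```

In practice the threshold would need calibrating against the natural sampling variance of a stable endpoint, since two batches drawn from an unchanged model still differ somewhat.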

Identifying and Banning AI Developed by Foreign Adversaries

Exploring methods and frameworks for identifying AI systems developed by foreign adversaries and implementing appropriate policy responses.

White Paper

ZKTorch: Open-Sourcing the First Universal ZKML Compiler for Real-World AI

Introducing ZKTorch, the first universal zero-knowledge machine learning compiler designed for real-world AI applications.

White Paper | Code

Security Assurances for AI in High-Stakes Environments using Verifiable Computation

A comprehensive white paper exploring the critical importance of AI model verification, behavioral fingerprinting, and establishing trust in AI systems for enterprise deployment.

White Paper

Hardware-Rooted Trust Anchors for Sovereign AI Processing: Cryptographic Verification of Location, Identity, and Confidentiality in Cloud Environments

Cryptographic verification of location, identity, and confidentiality in cloud environments for sovereign AI processing.

ICDS '25

You Can't Trust What You Can't Verify — The Case for AI Model Identity

Chris Hughes interviews Manish Shah on why most organizations cannot verify which model is actually running in their environment, and how behavioral fingerprinting and ZKTorch provide the technical controls to close that gap.

Resilient Cyber

Misbehaving Agents & The Impacts of Extreme Instability

Model endpoints with extreme behavioral instability show higher task instability than peer endpoints serving the same nominal model. This is the first evidence that fingerprint instability predicts changes in agent tool-calling.

Substack

Introducing Stability Arena and The Seven Metrics of Model Hosting

The evolving landscape of LLM agents requires a fundamental shift in how model hosting infrastructure is measured. Introducing Stability Arena, a public monitor designed to track Identity, Stability, and Fidelity — three new, essential metrics for the agent era.

Substack

Reliability ≠ Stability

Just because an AI is generating tokens reliably doesn't mean those tokens are the right ones. Exploring why standard reliability metrics fall short and why stability — behavioral consistency over time — is what actually matters for AI-native applications.

Substack

Building a Vocabulary for AI Assurance — Part II: Verifiability and Accuracy

Establishing ground truth via verifiability and accuracy. Exploring how to prove the authenticity of model outputs and ensure correctness — two prerequisites for meaningful AI assurance.

Substack

Building a Vocabulary for AI Assurance — Part I: Explainability and Interpretability

Establishing clarity around the terms used to discuss AI assurances. Part I tackles explainability and interpretability — what they mean, how they differ, and why the distinction matters.

Substack

What is Model Informatics? What Does It Mean to "Verify" AI?

Inspired by bioinformatics, Model Informatics is the systematic study of AI models as complex information systems. Exploring the tools and frameworks needed to understand, recognize, and verify properties of AI models.

Substack

Agents, Task Time Compute, & Task Time Marketplaces

How agents can coordinate to complete complex, multi-step tasks for users, using a marketplace to bid out individual tasks to specialty agents.

Substack

The Security Evolution of Core Technologies: What It Means for AI (Part I)

Core computing technologies have consistently gained more robust security features as they gained adoption, and AI shouldn't be any different. A walk through how previous core technologies strengthened their security over time.

Substack

Product Development in the Age of AI

Developing on top of probabilistic compute changes how we build software, software products, and eventually everything else. Exploring the challenges product leaders face around model uncertainty and planning.

Substack

Transparency vs Interpretability

While we may not know how a model generates its result, we should still know what was asked of the model to get the result. Examining the critical differences between transparency and interpretability.

Substack

Which Model Am I Getting?

Just reviewing outputs from a model won't tell you which model you're using. Even if your API provider says which model you're using, you have no way to verify it independently.

Substack

VAIL Use Case: Verifiable Evals

How can you prove a model passed an eval with the reported score? Exploring why benchmark results need cryptographic verification and how VAIL makes that possible.

Substack

AI is like Sugar

The number of AI models is growing fast, and that's good. Drawing parallels between AI proliferation and sugar adoption to understand societal impacts and dependencies.

Substack

Part 4: Stochastic Computation Needs Verifiable Computing

Since AI models are stochastic machines, we can't predict their exact outputs. Exploring why verifiable computing is essential to ensure trust in probabilistic AI systems.

Substack

Subscribe for More Insights

Stay updated with the latest research, insights, and thought leadership on AI and model informatics.

Visit our Substack →
