May 28, 2025
Schaun Wheeler

Beyond Multi-Armed Bandits: Understanding Aampe's Semantic-Associative Agents

May 28, 2025
Schaun Wheeler

Beyond Multi-Armed Bandits: Understanding Aampe's Semantic-Associative Agents

May 28, 2025
Schaun Wheeler

Beyond Multi-Armed Bandits: Understanding Aampe's Semantic-Associative Agents

May 28, 2025
Schaun Wheeler

Beyond Multi-Armed Bandits: Understanding Aampe's Semantic-Associative Agents

People sometimes ask whether our system is a kind of multi-armed bandit. It’s not. But that’s not a bad place to start if you want a familiar reference point.

Our semantic-associative agents use the same basic intuition: take actions, observe outcomes, and update preferences. But two key differences make this something else entirely:


  1. Multi-dimensional action space

    In a typical bandit problem, the agent chooses from a flat set of discrete actions—pull arm A, B, or C. Each action is assumed to be atomic and independent. Even when bandits are extended into contextual or combinatorial forms, they still often treat each action as a point in a single, unified decision space.

    Real-world decision-making—especially in applications like customer engagement—isn’t like that. You’re not just choosing “an action.” You’re selecting a profile made up of choices across several intersecting dimensions: time of day, day of week, message channel, content theme, offer type, incentive level, tone, subject line, etc. Each of these is its own action set, and the agent must learn how these dimensions interact—both with each other and with user behavior. The task isn’t just to find the best arm, but to learn a combinatorial space of micro-preferences and then select a coherent, deliverable action bundle that fits.


  2. Non-ergodic learning

    Most bandit systems assume some form of ergodicity—the idea that statistical insights gained from one user’s behavior can generalize to another. In an ergodic system, learning can be pooled: we assume that averages across time and averages across the population converge. That makes for efficient learning, especially when individual data is sparse.

    But user behavior in domains like messaging or content interaction is not ergodic. People differ—not just in preferences, but in responsiveness, habits, intent, timing, and attention. Treating these differences as noise and trying to learn a global average flattens signal that actually matters. Our agents treat each user as their own environment. They don’t generalize across users. They build up individualized models based solely on that user’s interaction history, which lets them preserve and act on genuine behavioral variance instead of averaging it away.

So while it’s tempting to think of this as a fancy bandit setup, that framing misses what’s actually happening. It’s not a variant—it’s a structurally different approach. Bandits are a good metaphor to start with, but the differences are architectural, not cosmetic.

And to be clear: none of this depends on LLMs. An LLM is just an actor—it takes context and produces plausible outputs. Our learning agents run upstream of that. They’re responsible for producing the right context in the first place, based on what they’ve learned about how a particular user responds to different combinations of actions. That context can then drive the LLM, or be used to select from a content library indexed to the same profile space.

0

Related

Shaping the future of marketing with Aampe through innovation, data.

Jun 16, 2025

Schaun Wheeler

Discover how Aampe's agents employ causal analysis to accurately measure user engagement outcomes, even when influenced by external messages. By isolating the effects of their own actions from other variables, Aampe ensures precise attribution and effective decision-making in a complex messaging environment.

Jun 16, 2025

Schaun Wheeler

Discover how Aampe's agents employ causal analysis to accurately measure user engagement outcomes, even when influenced by external messages. By isolating the effects of their own actions from other variables, Aampe ensures precise attribution and effective decision-making in a complex messaging environment.

Jun 16, 2025

Schaun Wheeler

Discover how Aampe's agents employ causal analysis to accurately measure user engagement outcomes, even when influenced by external messages. By isolating the effects of their own actions from other variables, Aampe ensures precise attribution and effective decision-making in a complex messaging environment.

Jun 16, 2025

Schaun Wheeler

Discover how Aampe's agents employ causal analysis to accurately measure user engagement outcomes, even when influenced by external messages. By isolating the effects of their own actions from other variables, Aampe ensures precise attribution and effective decision-making in a complex messaging environment.

Jun 10, 2025

Schaun Wheeler

Explore how agentic systems define and execute decisions. This article delves into the five key decision types—Go/No-Go, Context, Creative Policy, Item Recommendation, and Freshness—that guide autonomous agents in delivering personalized user experiences. Learn how these systems prioritize meaningful choices to enhance engagement and effectiveness.

Jun 10, 2025

Schaun Wheeler

Explore how agentic systems define and execute decisions. This article delves into the five key decision types—Go/No-Go, Context, Creative Policy, Item Recommendation, and Freshness—that guide autonomous agents in delivering personalized user experiences. Learn how these systems prioritize meaningful choices to enhance engagement and effectiveness.

Jun 10, 2025

Schaun Wheeler

Explore how agentic systems define and execute decisions. This article delves into the five key decision types—Go/No-Go, Context, Creative Policy, Item Recommendation, and Freshness—that guide autonomous agents in delivering personalized user experiences. Learn how these systems prioritize meaningful choices to enhance engagement and effectiveness.

Jun 10, 2025

Schaun Wheeler

Explore how agentic systems define and execute decisions. This article delves into the five key decision types—Go/No-Go, Context, Creative Policy, Item Recommendation, and Freshness—that guide autonomous agents in delivering personalized user experiences. Learn how these systems prioritize meaningful choices to enhance engagement and effectiveness.

Jun 5, 2025

Schaun Wheeler

Explore Aampe's innovative approach to personalization, combining classical recommender systems for item selection with real-time reinforcement learning agents for dynamic message composition. Learn how this dual-path strategy enhances user engagement and content relevance.

Jun 5, 2025

Schaun Wheeler

Explore Aampe's innovative approach to personalization, combining classical recommender systems for item selection with real-time reinforcement learning agents for dynamic message composition. Learn how this dual-path strategy enhances user engagement and content relevance.

Jun 5, 2025

Schaun Wheeler

Explore Aampe's innovative approach to personalization, combining classical recommender systems for item selection with real-time reinforcement learning agents for dynamic message composition. Learn how this dual-path strategy enhances user engagement and content relevance.

Jun 5, 2025

Schaun Wheeler

Explore Aampe's innovative approach to personalization, combining classical recommender systems for item selection with real-time reinforcement learning agents for dynamic message composition. Learn how this dual-path strategy enhances user engagement and content relevance.

Jun 3, 2025

Schaun Wheeler

Explore why focusing solely on short-term metrics like lift can be misleading when assessing adaptive systems, and discover alternative approaches for meaningful evaluation.

Jun 3, 2025

Schaun Wheeler

Explore why focusing solely on short-term metrics like lift can be misleading when assessing adaptive systems, and discover alternative approaches for meaningful evaluation.

Jun 3, 2025

Schaun Wheeler

Explore why focusing solely on short-term metrics like lift can be misleading when assessing adaptive systems, and discover alternative approaches for meaningful evaluation.

Jun 3, 2025

Schaun Wheeler

Explore why focusing solely on short-term metrics like lift can be misleading when assessing adaptive systems, and discover alternative approaches for meaningful evaluation.

Load More

Load More

Load More

Load More