The shape of Henry
Henry is a system that combines:- A customer-supplied AI provider. OpenAI or Gemini. Brand Atlas does not run the model; the customer’s account does.
- A system prompt. Brand Atlas’s instructions to the model, defining Henry’s behaviour and constraints.
- Brand-record grounding. The atlas’s content, supplied to the model at each interaction, scoped to what the asking user can read.
- No tool-use loop with write tools. Henry can read; Henry cannot write.
- Conversation memory per session. Within a session, Henry remembers context. Between sessions, memory is fresh.
- Citation in every answer. Henry’s responses cite the section, the line, or the request the answer is drawn from.
The read-only boundary, in detail
The read-only boundary is enforced at three layers:- The system prompt. Henry is instructed that it cannot edit the brand record, cannot create Update Requests on behalf of the user, cannot generate Guest Passes, cannot create Horizons. The instruction is explicit and reinforced.
- The tool set. Henry is given read tools (
search,read_section,read_history) and not given any write tools. The tools Henry uses are the only tools Henry has. - The application layer. Brand Atlas’s API rejects any write-shaped request from a Henry session, regardless of how the request was formed. The boundary is defended in code, not just in instruction.
Scoped read access
Henry’s read access is scoped to what the asking user can read. A team member talking to Henry can only get Henry to surface material the team member already has rights to. A Guest talking to Henry (where the Guest Pass permits Henry use) can only get Henry to surface material within the pass scope. This is enforced at the same three layers as the write boundary: in the system prompt, in the read tools (which take the user’s role as a parameter), and at the application layer (which filters responses against the user’s permissions).Conversation memory
Within a session, Henry remembers the conversation. The team member can ask a follow-up question without re-stating the context: what about the dark variant? after a question about logo usage. Henry knows the conversation is about logos. Between sessions, Henry starts fresh. There is no long-term memory; Henry does not remember the question the user asked last week. The default session window is the current conversation, from open to close. Closing the panel ends the session; a new panel open starts a new session. For Guardian customers, the session window can extend to “until reset by the user,” meaning the conversation persists across panel-open events. Useful for users who want continuity across the day. The setting is per-user.Citation behaviour
Every Henry response includes citations. A citation contains:- The section name (e.g. “Colour System”).
- The specific element if relevant (e.g. “Primary Blue swatch”).
- A link to the section.
What is sent to the provider
Each Henry interaction sends to the customer’s provider:- The system prompt.
- The brand-record context relevant to the question (grounding).
- The conversation so far in the session.
- The user’s prompt.
- Other atlases the user has access to.
- The plaintext form of any secrets stored in the atlas.
- The team’s email, calendar, or external system data.
The grounding model
Henry receives the brand-record context as part of the prompt sent to the provider. The context is:- The Glossary.
- The standard sections relevant to the question.
- Active Horizons relevant to the question.
- Recent Update Request history relevant to the question.
- Henry’s answers are traceable. Every claim Henry makes can (and should) be tied to a citation.
- Henry knows what it does not know. When the record is silent, Henry says so.
The cost model
Each Henry interaction has a cost in tokens (input + output) at the provider’s rates. The customer is billed by the provider at the end of the billing period. Brand Atlas does not mark up the AI cost. Brand Atlas’s revenue is the Keeper or Guardian subscription. The provider’s bill is the customer’s responsibility. A useful estimate: a single Henry question and response is typically 1,000 to 3,000 tokens combined. At current OpenAI and Gemini rates, that is a fraction of a cent. A heavy team using Henry several hundred times a day might see usage in the tens of dollars per month; a typical team is closer to $10-20.Rate limits
Henry inherits the rate limits set by the customer’s provider account. If the team exhausts the provider’s per-minute rate limit, Henry returns a rate-limit message and asks the team member to retry shortly. Brand Atlas does not set its own per-team rate limit on top. For Guardian customers running high volume, configure a higher rate limit with the provider or use the dedicated tier their account permits.Provider failover
Henry is configured with a single provider per atlas. If the provider has an outage, Henry is unavailable for the duration. There is no automatic failover to a different provider; failover would require duplicating the key arrangement and the customer’s setup. If your team’s tolerance for Henry downtime is low, the right answer is to have a tested second key from the alternative provider, ready to paste in during an outage. See Switching providers.Related pages
What Henry does
Capabilities.
Switching providers
Provider failover.
BYOK Policy
Key handling.