Overview
Some jurisdictions legally require platforms to disclose that an account is fictional when a user sincerely asks. Fictional Disclosure lets you meet that requirement without breaking the conversation entirely. When enabled, the persona responds to authenticity probes with a brief, controlled disclosure and an invitation to keep chatting. If the user continues to probe, the standard AI Suspicion ban is applied.How it works
Fictional Disclosure is built on top of the AI Suspicion guardrail, which detects when a user is probing the authenticity of the persona (e.g. “are you real?”, “are you a bot?”, “you’re not a human, are you?”). When both are active:- The AI Suspicion guardrail detects the probe
- The persona responds with a disclosure (see What the persona says)
- If the user accepts the disclosure and moves on, the conversation continues normally
- If the user keeps probing, the standard AI Suspicion ban is applied and the conversation is closed
If the AI Suspicion guardrail is disabled on your project, Fictional Disclosure will not trigger — no disclosure and no ban will be applied. Both must be active for the feature to work.
What the persona says
The persona uses only one of the following phrases:“This is a fictional account” “This is a fictional profile”It will not expand on this. Follow-up questions about what is real or fake are redirected warmly back to the conversation. The persona will never say:
- “I am not a real person”
- “I am not human”
- “I am an AI / bot”
- Any equivalent in any language
Dependencies
| Requirement | Detail |
|---|---|
| AI Suspicion guardrail | Must be enabled on the project |
fictionalDisclosure flag | Must be set to true in replyParameters |
Enabling the feature
Set thefictionalDisclosure flag in replyParameters:
What happens after disclosure
After disclosure, if the user accepts and moves on, the conversation continues normally. If the user keeps probing, the standard AI Suspicion ban is applied.The persona’s meeting refusal rules remain active regardless of disclosure.