Glossary¶
Key terms used throughout the EpsimoAI platform.
Access Mode : Controls how an assistant can be accessed. Assistants can be private (project-scoped) or public (accessible by anyone with the assistant ID or via the public API).
Agent
: A LangGraph-compiled graph that orchestrates LLM calls and tool usage. Agent types include chatbot (simple conversation), tools (function-calling agent), and retrieval (RAG-based agent).
Assistant
: A saved configuration that defines an agent's behavior — which LLM to use, what tools are enabled, and the system prompt. Stored in the assistants table.
Checkpoint : A serialized snapshot of a LangGraph execution state. Stored in PostgreSQL, checkpoints allow conversations to be resumed and replayed. Each run creates one or more checkpoints.
Cognito Exchange : An authentication flow (RS256) designed for exchanging AWS Cognito tokens for EpsimoAI JWTs. Currently defined but not deployed in production.
Embedding
: A dense vector representation of text, generated by an embedding model (e.g. OpenAI text-embedding-ada-002). Used for semantic similarity search in the vector store.
JWT (JSON Web Token)
: The authentication token format used by EpsimoAI. Tokens are signed with HS256 using JWT_SECRET_KEY, valid for 30 days, and include the user ID, email, and project ID as claims.
LangGraph : A framework built on LangChain for creating stateful, multi-step AI agent workflows as directed graphs. EpsimoAI uses LangGraph to orchestrate tool-calling loops, streaming, and checkpointing.
MCP (Model Context Protocol) : An open protocol for connecting AI assistants to external tools and data sources. EpsimoAI supports MCP servers via both stdio and SSE transport, allowing assistants to use third-party tool servers.
Project
: A workspace that groups assistants and threads under a single owner. Every user has a default main project created on first login. Projects enable logical separation of different use cases.
Public Token
: A mechanism for accessing public assistants without user authentication. Public assistants can be invoked via the OpenAI-compatible API endpoint (/public/openai/v1/chat/completions) using the assistant ID.
Run
: A single execution of an agent on a thread. A run takes input messages, invokes the LangGraph agent, streams the response, and saves the resulting checkpoint. Runs consume usage tokens from the user's thread_counter.
Thread : A conversation container that holds the message history between a user and an assistant. Threads are scoped to a project and optionally linked to a specific assistant.
Tool : A function that an agent can invoke during a run. Built-in tools include web search (DuckDuckGo, Tavily), retrieval (vector search), Wikipedia, ArXiv, PubMed, DALL-E, and MCP servers. Tools are configured per-assistant.
Vector Store
: A PostgreSQL-based store (using the pgvector extension) that holds document embeddings for retrieval-augmented generation. Documents are chunked, embedded, and stored when files are ingested via the /ingest endpoint.
Related Pages¶
- System Overview — Architecture diagrams
- Data Model — Database tables
- Configuration Reference — Environment variables