Overview
Knowledge Bases help you make unstructured information more usable inside AI workflows. A Knowledge Base lets users upload information (in the form of documents). split them in smaller parts, convert them into embeddings, and store them in a vector database. Once processed, this content can be retrieved as and when required by AI workflows. A Knowledge Base:- Stores uploaded files
- Manages how these files are processed
- Manages storage of processed data
- Helps contextual retrieval
What a Knowledge Base Includes
Each Knowledge Base combines configuration, storage, and content into a single unit:- File ingestion - Upload documents directly into the platform
- Chunking logic - Break documents into smaller, overlapping segments to get better context
- Embedding configuration - Define how text is converted into vectors
- Vector storage - Store embeddings in a hosted or customer-managed database
- Retrieval interface - Make the content available to workflows via semantic search or queries
Supported Knowledge Base Types
AI Squared supports two Knowledge Base types:- Vector Store
Designed for semantic search and retrieval. Uploaded documents are embedded and stored as vectors, enabling similarity-based queries. - Semantic Data Model
Designed for non-vector or structured retrieval scenarios, where queries are executed directly against a connected database.
Creating a Knowledge Base
When creating a Knowledge Base, users define how their documents should be processed and stored. Some of the key configuration options are:- Embedding provider - Service used to generate embeddings (default: OpenAI)
- Embedding model - Specific model used for vector generation
- Chunk size - Maximum size of each document chunk
- Chunk overlap - Overlap between chunks to preserve context
- Vector storage - Choose between a hosted vector store or an external database
- Storage schema - Define which database columns store vectors, text, and metadata
.png?fit=max&auto=format&n=j6yGvcGXn1wUVvcn&q=85&s=5cd6f9290ac4e5015610fc2f4ea56c90)
File Upload and Processing
Uploading a file to a Knowledge Base triggers an automated processing pipeline:- The file is uploaded and verified
- Content is extracted and split into smaller chunks
- Each chunk is converted into an embedding
- Embeddings, text, and metadata are written to the selected storage
- File status is updated once processing completes
.png?fit=max&auto=format&n=j6yGvcGXn1wUVvcn&q=85&s=57a00ef091f7a1fcecaf64b0f3493a2d)
.png?fit=max&auto=format&n=j6yGvcGXn1wUVvcn&q=85&s=5fa8bb931d87583b5935ed38b54de809)
Updating a Knowledge Base
Knowledge Bases cannot be changed once they are created. If you need to make changes in the configuration, you must create a new Knowledge Base and re-upload files with the updated settings.Deleting Knowledge Bases and Files
Knowledge Bases and their files can be safely deleted when they are no longer in use. When individual files are deleted, the embeddings associated with that file are also removed.Retrieval and Usage in Workflows
Knowledge Bases are designed in a way that they can be used directly by AI workflows. Retrieval is designed to ensure that AI responses are grounded in authoritative, relevant, and permitted data, rather than relying on general model knowledge. When a query is received:- Content is retrieved based on meaning allowing workflows to work effectively with unstructured information
- Semantic retrieval can be combined with simple keyword matching and metadata filters to improve accuracy and recall
- Access controls and permissions are enforced, ensuring users only retrieve data they are allowed to see
- Retrieved results can include references back to the original documents or records, maintaining traceability
Knowledge Bases with agents
When used within agent-driven workflows, Knowledge Bases can be invoked dynamically as part of the agent’s reasoning process. This allows agents to decide when retrieval is needed, query the Knowledge Base with the right context, and use the returned information to guide multi-step reasoning. Even in these cases, retrieval remains controlled and predictable - agents can only access Knowledge Bases explicitly connected to the workflow, and all retrieved content is returned in a structured, auditable form.Why are Knowledge Bases Important
Knowledge Bases help translate raw data into data that is usable for AI workflows. They help teams to:- Bring unstructured knowledge into AI systems
- Maintain clean separation between data, configuration, and workflows
- Ensure consistent, compliant access to knowledge within the company