Knowledge & Data Sources
π§ Powering Your Agent with Domain Knowledge
The Knowledge tab is where you transform your AI agent from a general assistant into a domain expert. By connecting relevant data sources, documents, and knowledge bases, you give your agent access to the specific information it needs to provide accurate, contextual responses.
π― Knowledge Source Types
π Document Upload
Upload files directly to your agent's knowledge base for semantic search and retrieval.
Supported File Types
PDF Documents: Research papers, manuals, reports
Word Documents: (.docx) Policies, procedures, guides
Text Files: (.txt, .md) Documentation, notes, plain text
HTML Files: (.html) Web pages, formatted documentation
Spreadsheets: (.xlsx, .xls, .csv) Data tables, catalogs, structured data
Document Processing Features
Intelligent Chunking: Configurable chunking strategies for optimal knowledge retrieval
Text Extraction: Automatic text extraction from supported formats
Space Normalization: Remove extra whitespace for cleaner text
Best Practices for Document Upload
π Table Upload
Upload structured data in tabular format for precise lookups and semantic search.
Supported Table Formats
CSV Files: Comma-separated values
Excel Files: (.xlsx, .xls) Spreadsheets with single or multiple sheets
Manual Entry: Create tables directly in the interface
Table Configuration
Column Types: TEXT, NUMBER, INTEGER, BOOLEAN, DATE
Semantic Columns: Mark columns for semantic search indexing
Column Sequencing: Define display order for columns
Schema Analysis: Automatic type detection from uploaded files
Table Use Cases
Product catalogs with specifications and pricing
Customer records and interaction history
FAQ databases with questions and answers
Knowledge articles with categorization
Configuration and settings databases
ποΈ Database Schema (Manual)
Create database-like schemas for structured knowledge organization.
Database Format Features
Custom Schema Design: Define your own table structures
Column Type Support: TEXT, NUMBER, INTEGER, BOOLEAN, DATE
Manual Data Entry: Populate data through the interface
Structured Queries: Enable precise data retrieval
βοΈ Knowledge Processing Settings
Chunking Strategy
Control how documents are broken down for processing and retrieval.
Chunking Configuration Options
Chunk Type: Strategy for dividing content
Max Tokens: Maximum size per chunk (configurable)
Separator: Custom separator for chunk boundaries
Remove Extra Spaces: Clean up whitespace
Remove URLs/Emails: Filter out contact information
Best Practices
π Agent Knowledge Configuration
Configure how your agent retrieves and uses knowledge during conversations.
Retrieval Mode
Auto Retrieval (Default: Off)
Manual Tool Call (Default: On)
Search Strategy
Hybrid Search (Default - Recommended)
Semantic Search Only
Full-Text Search Only
Retrieval Parameters
Top K (Default: 5, Range: 1-10)
Threshold (Default: 0.5, Range: 0.0-1.0)
Query Rewrite (Default: On)
Rerank (Default: Off)
Connected Datasets
Multiple Dataset Support
Connect up to 100 datasets per agent
Each dataset appears as a searchable knowledge source
Datasets maintain their own:
Name and ID
Source type (UPLOAD, MANUAL)
Format type (TEXT, TABLE, DATABASE)
Processing status
Dataset Information Display
For each connected dataset, the agent has access to:
Dataset name (user-friendly identifier)
Dataset ID (unique identifier)
Source type (how data was added)
Status (PENDING, SUCCESS, FAILURE)
Format type (TEXT, TABLE, DATABASE)
π Knowledge Analytics & Management
Dataset Status Monitoring
Processing States
PENDING: Dataset creation or update in progress
SUCCESS: Dataset ready for use
FAILURE: Processing encountered errors
Progress Tracking
Monitor document import progress
Track embedding generation status
View chunk processing metrics
Embedding Updates
Manual Embedding Refresh
π§ Knowledge Configuration Best Practices
Initial Setup Process
Audit Existing Information: Catalog what knowledge you have
Choose Dataset Format: TEXT for documents, TABLE for structured data
Configure Processing: Set chunking and parsing options
Select Embedding Model: Choose based on language and domain
Test Retrieval: Verify agent responses with sample queries
Dataset Organization Strategies
By Topic
By Source Type
Optimization Guidelines
Document Preparation
Table Design
Retrieval Tuning
π Advanced Features
Multi-Dataset Retrieval
When connecting multiple datasets to an agent:
Agent can search across all connected datasets
Results merged and ranked by relevance
Each result includes source dataset information
Useful for comprehensive knowledge coverage
Semantic Column Configuration
For TABLE and DATABASE formats:
Mark specific columns for semantic search indexing
Non-semantic columns remain queryable but not embedded
Reduces embedding costs for large tables
Improves search focus on relevant fields
π― Knowledge Integration Checklist
Before activating your agent's knowledge base:
Your agent's knowledge is its competitive advantageβinvest in building a comprehensive, well-organized knowledge base that enables intelligent, accurate responses.
Last updated
Was this helpful?