Knowledge & Data Sources

🧠 Powering Your Agent with Domain Knowledge

The Knowledge tab is where you transform your AI agent from a general assistant into a domain expert. By connecting relevant data sources, documents, and knowledge bases, you give your agent access to the specific information it needs to provide accurate, contextual responses.


🎯 Knowledge Source Types

πŸ“„ Document Upload

Upload files directly to your agent's knowledge base for semantic search and retrieval.

Supported File Types

  • PDF Documents: Research papers, manuals, reports

  • Word Documents: (.docx) Policies, procedures, guides

  • Text Files: (.txt, .md) Documentation, notes, plain text

  • HTML Files: (.html) Web pages, formatted documentation

  • Spreadsheets: (.xlsx, .xls, .csv) Data tables, catalogs, structured data

Document Processing Features

  • Intelligent Chunking: Configurable chunking strategies for optimal knowledge retrieval

  • Text Extraction: Automatic text extraction from supported formats

  • Space Normalization: Remove extra whitespace for cleaner text

Best Practices for Document Upload

πŸ“Š Table Upload

Upload structured data in tabular format for precise lookups and semantic search.

Supported Table Formats

  • CSV Files: Comma-separated values

  • Excel Files: (.xlsx, .xls) Spreadsheets with single or multiple sheets

  • Manual Entry: Create tables directly in the interface

Table Configuration

  • Column Types: TEXT, NUMBER, INTEGER, BOOLEAN, DATE

  • Semantic Columns: Mark columns for semantic search indexing

  • Column Sequencing: Define display order for columns

  • Schema Analysis: Automatic type detection from uploaded files

Table Use Cases

  • Product catalogs with specifications and pricing

  • Customer records and interaction history

  • FAQ databases with questions and answers

  • Knowledge articles with categorization

  • Configuration and settings databases

πŸ—„οΈ Database Schema (Manual)

Create database-like schemas for structured knowledge organization.

Database Format Features

  • Custom Schema Design: Define your own table structures

  • Column Type Support: TEXT, NUMBER, INTEGER, BOOLEAN, DATE

  • Manual Data Entry: Populate data through the interface

  • Structured Queries: Enable precise data retrieval


βš™οΈ Knowledge Processing Settings

Chunking Strategy

Control how documents are broken down for processing and retrieval.

Chunking Configuration Options

  • Chunk Type: Strategy for dividing content

  • Max Tokens: Maximum size per chunk (configurable)

  • Separator: Custom separator for chunk boundaries

  • Remove Extra Spaces: Clean up whitespace

  • Remove URLs/Emails: Filter out contact information

Best Practices


πŸ” Agent Knowledge Configuration

Configure how your agent retrieves and uses knowledge during conversations.

Retrieval Mode

Auto Retrieval (Default: Off)

Manual Tool Call (Default: On)

Search Strategy

Semantic Search Only

Full-Text Search Only

Retrieval Parameters

Top K (Default: 5, Range: 1-10)

Threshold (Default: 0.5, Range: 0.0-1.0)

Query Rewrite (Default: On)

Rerank (Default: Off)

Connected Datasets

Multiple Dataset Support

  • Connect up to 100 datasets per agent

  • Each dataset appears as a searchable knowledge source

  • Datasets maintain their own:

    • Name and ID

    • Source type (UPLOAD, MANUAL)

    • Format type (TEXT, TABLE, DATABASE)

    • Processing status

Dataset Information Display

For each connected dataset, the agent has access to:

  • Dataset name (user-friendly identifier)

  • Dataset ID (unique identifier)

  • Source type (how data was added)

  • Status (PENDING, SUCCESS, FAILURE)

  • Format type (TEXT, TABLE, DATABASE)


πŸ“Š Knowledge Analytics & Management

Dataset Status Monitoring

Processing States

  • PENDING: Dataset creation or update in progress

  • SUCCESS: Dataset ready for use

  • FAILURE: Processing encountered errors

Progress Tracking

  • Monitor document import progress

  • Track embedding generation status

  • View chunk processing metrics

Embedding Updates

Manual Embedding Refresh


πŸ”§ Knowledge Configuration Best Practices

Initial Setup Process

  1. Audit Existing Information: Catalog what knowledge you have

  2. Choose Dataset Format: TEXT for documents, TABLE for structured data

  3. Configure Processing: Set chunking and parsing options

  4. Select Embedding Model: Choose based on language and domain

  5. Test Retrieval: Verify agent responses with sample queries

Dataset Organization Strategies

By Topic

By Source Type

Optimization Guidelines

Document Preparation

Table Design

Retrieval Tuning


πŸš€ Advanced Features

Multi-Dataset Retrieval

When connecting multiple datasets to an agent:

  • Agent can search across all connected datasets

  • Results merged and ranked by relevance

  • Each result includes source dataset information

  • Useful for comprehensive knowledge coverage

Semantic Column Configuration

For TABLE and DATABASE formats:

  • Mark specific columns for semantic search indexing

  • Non-semantic columns remain queryable but not embedded

  • Reduces embedding costs for large tables

  • Improves search focus on relevant fields


🎯 Knowledge Integration Checklist

Before activating your agent's knowledge base:


Your agent's knowledge is its competitive advantageβ€”invest in building a comprehensive, well-organized knowledge base that enables intelligent, accurate responses.

Last updated

Was this helpful?