Data Connectors Overview

ZenSearch supports 17 data connectors across cloud storage, collaboration tools, development tools, CRM systems, databases, and web content. Connect your organization's data sources to create a unified, searchable knowledge base.

Connector Categories

Cloud Storage

Store and manage files in cloud platforms:

| Connector | Description | Auth Method |
| --- | --- | --- |
| Amazon S3 | S3 and S3-compatible storage (MinIO) | IAM Role, Access Keys |
| Google Drive | Personal and Workspace drives | OAuth 2.0 |
| SharePoint | Microsoft SharePoint Online | OAuth 2.0 |
| Azure Blob | Azure Blob Storage containers | Connection String, SAS |

Collaboration Tools

Connect team collaboration platforms:

| Connector | Description | Auth Method |
| --- | --- | --- |
| Confluence | Atlassian Confluence (Cloud/Server) | API Token, OAuth |
| Notion | Pages, databases, and blocks | OAuth 2.0 |
| Slack | Channels, threads, and files | OAuth 2.0 |

Development Tools

Index code and project management:

| Connector | Description | Auth Method |
| --- | --- | --- |
| GitHub | Repositories, code, and issues | OAuth 2.0, PAT |
| Jira | Issues, comments, and worklogs | API Token, OAuth |

CRM Systems

Connect customer relationship management platforms:

| Connector | Description | Auth Method |
| --- | --- | --- |
| Salesforce | Accounts, contacts, opportunities | OAuth 2.0 |
| HubSpot | Contacts, companies, deals | OAuth 2.0 |
| SAP | S/4HANA ERP data | Basic Auth, OAuth |

Databases

Query structured data sources:

| Connector | Description | Auth Method |
| --- | --- | --- |
| PostgreSQL | PostgreSQL databases | Username/Password |
| MySQL | MySQL and MariaDB | Username/Password |
| ClickHouse | ClickHouse analytics DB | Username/Password |
| MS SQL Server | Microsoft SQL Server | SQL/Windows Auth |

Web Content

Crawl and index websites:

| Connector | Description | Auth Method |
| --- | --- | --- |
| Web Crawler | Website crawling | None/Basic Auth |

Common Configuration

Basic Settings

All connectors share common configuration options:

| Setting | Description |
| --- | --- |
| Name | Display name for the connector |
| Collection | Target collection for documents |
| Enabled | Whether sync is active |
| Schedule | Sync frequency (if applicable) |
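
As a rough illustration, the four shared settings could be modeled as a small record with basic validation. The class name, field names, and the `validate` helper below are hypothetical and chosen for this sketch; they are not ZenSearch's actual API or configuration schema.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ConnectorSettings:
    """Hypothetical model of the shared settings; not a ZenSearch API."""

    name: str                       # display name for the connector
    collection: str                 # target collection for documents
    enabled: bool = True            # whether sync is active
    schedule: Optional[str] = None  # sync frequency, if applicable

    def validate(self) -> list[str]:
        """Return a list of problems; an empty list means the settings look usable."""
        errors = []
        if not self.name.strip():
            errors.append("name must not be empty")
        if not self.collection.strip():
            errors.append("collection must not be empty")
        return errors
```

Leaving `schedule` unset mirrors the "if applicable" note: a connector without a schedule would only sync by some other trigger.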

Authentication

OAuth Connectors

For OAuth-based connectors (Google Drive, Notion, Slack, etc.):

  1. Click Authorize during setup
  2. Sign in to the external service
  3. Grant requested permissions
  4. Return to ZenSearch automatically

API Key/Token

For API key-based connectors:

  1. Generate an API key in the source platform
  2. Enter the key in ZenSearch
  3. Test the connection

Username/Password

For database and basic auth connectors:

  1. Enter host and port
  2. Provide username and password
  3. Configure SSL if needed
  4. Test the connection

Adding a Connector

Step-by-Step

  1. Navigate to Knowledge > Data Sources
  2. Click Add Data Source
  3. Select connector type
  4. Configure authentication
  5. Set source-specific options
  6. Choose target collection
  7. Test connection
  8. Create connector

Configuration Wizard

The wizard guides you through:

┌─────────────────────────────────────────┐
│ Step 1: Select Connector Type           │
│  ┌─────┐ ┌─────┐ ┌─────┐ ┌─────┐        │
│  │ S3  │ │Drive│ │Slack│ │ ... │        │
│  └─────┘ └─────┘ └─────┘ └─────┘        │
├─────────────────────────────────────────┤
│ Step 2: Configure Connection            │
│   [Authentication fields...]            │
├─────────────────────────────────────────┤
│ Step 3: Source Options                  │
│   [Filters, paths, scopes...]           │
├─────────────────────────────────────────┤
│ Step 4: Collection Assignment           │
│   [Select or create collection]         │
└─────────────────────────────────────────┘

Sync Behavior

Initial Sync

When a connector is created:

  1. Full content fetch from source
  2. Document parsing and analysis
  3. Embedding generation
  4. Index population

Incremental Sync

Subsequent syncs:

  1. Detect changed/new content
  2. Process only updates
  3. Remove deleted content
  4. Update search index

Webhook Sync (Real-time)

For supported connectors:

  1. Configure webhooks in source
  2. Receive instant update notifications
  3. Process changes immediately
  4. Near real-time search updates

Supported Sync Methods by Connector

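| Connector | Scheduled | Incremental | Webhooks |
| --- | --- | --- | --- |
| S3 | Yes | Yes | Yes |
| Google Drive | Yes | Yes | Yes |
| SharePoint | Yes | Yes | Yes |
| Confluence | Yes | Yes | Yes |
| Notion | Yes | Yes | No |
| Slack | Yes | Yes | Yes |
| GitHub | Yes | Yes | Yes |
| Jira | Yes | Yes | Yes |
| Salesforce | Yes | Yes | Yes |
| HubSpot | Yes | Yes | Yes |
| SAP | Yes | Yes | No |
| PostgreSQL | Yes | No | No |
| MySQL | Yes | No | No |
| ClickHouse | Yes | No | No |
| MS SQL | Yes | No | No |
| Web Crawler | Yes | Yes | No |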
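
For planning purposes, the support matrix can be captured as a lookup that returns the most responsive method a connector offers. The data below is copied from the table; the `SYNC_SUPPORT` name, the `preferred_sync` helper, and the preference order (webhooks, then incremental, then scheduled) are assumptions made for this sketch.

```python
# (scheduled, incremental, webhooks) per connector, as listed in the table.
SYNC_SUPPORT = {
    "S3": (True, True, True),
    "Google Drive": (True, True, True),
    "SharePoint": (True, True, True),
    "Confluence": (True, True, True),
    "Notion": (True, True, False),
    "Slack": (True, True, True),
    "GitHub": (True, True, True),
    "Jira": (True, True, True),
    "Salesforce": (True, True, True),
    "HubSpot": (True, True, True),
    "SAP": (True, True, False),
    "PostgreSQL": (True, False, False),
    "MySQL": (True, False, False),
    "ClickHouse": (True, False, False),
    "MS SQL": (True, False, False),
    "Web Crawler": (True, True, False),
}


def preferred_sync(connector: str) -> str:
    """Pick the most responsive supported method (assumed preference order)."""
    scheduled, incremental, webhooks = SYNC_SUPPORT[connector]
    if webhooks:
        return "webhooks"
    if incremental:
        return "incremental"
    return "scheduled"
```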

Permission Sync

Supported Platforms

These connectors sync document-level permissions:

  • Google Drive: File and folder sharing
  • SharePoint: Site and document permissions
  • Confluence: Space and page restrictions
  • Salesforce: Record sharing rules
  • Slack: Channel membership

Permission Modes

| Mode | Description |
| --- | --- |
| Strict | Show only documents the user can access in the source |
| Permissive | Show all documents (for internal use) |
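
The difference between the two modes can be sketched as a filter applied to search results. Everything here is illustrative: the `PermissionMode` enum, the `visible_documents` function, and the `allowed_users` field are hypothetical names, and the sketch assumes the synced permissions are available as a set of user identifiers per document.

```python
from enum import Enum


class PermissionMode(Enum):
    STRICT = "strict"
    PERMISSIVE = "permissive"


def visible_documents(docs: list[dict], user: str, mode: PermissionMode) -> list[dict]:
    """Filter results according to the permission mode.

    Each document is assumed to carry an `allowed_users` set that was
    populated by permission sync (a hypothetical field for this sketch).
    """
    if mode is PermissionMode.PERMISSIVE:
        # Permissive: every document is shown, regardless of source permissions.
        return list(docs)
    # Strict: keep only documents the user can access in the source.
    return [doc for doc in docs if user in doc["allowed_users"]]
```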

Content Filtering

Path/Prefix Filters

For storage connectors:

Include: /documents/public/*
Exclude: /documents/archive/*
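
A minimal sketch of how include and exclude patterns like these might be evaluated, using shell-style wildcards from Python's standard `fnmatch` module. Two assumptions are made here that the text above does not state: exclude patterns take precedence over include patterns, and `*` also matches across `/`, so nested paths under a prefix are covered. The `path_allowed` name is hypothetical.

```python
from fnmatch import fnmatchcase


def path_allowed(path: str, include: list[str], exclude: list[str]) -> bool:
    """Return True if `path` passes the include/exclude patterns.

    Assumptions for this sketch: exclude wins over include, and an
    empty include list means "include everything".
    """
    if any(fnmatchcase(path, pattern) for pattern in exclude):
        return False
    if not include:
        return True
    return any(fnmatchcase(path, pattern) for pattern in include)
```

With the patterns shown above, `/documents/public/handbook.pdf` would pass, while anything under `/documents/archive/` or outside `/documents/public/` would be skipped.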

File Type Filters

Limit by file extension:

Include: .pdf, .docx, .txt, .md
Exclude: .exe, .zip
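
An extension filter of this kind can be sketched as a case-insensitive suffix check. The `extension_allowed` name is hypothetical, and the sketch assumes that matching ignores case and that an exclude entry overrides an include entry; the actual ZenSearch behavior may differ.

```python
from pathlib import PurePosixPath


def extension_allowed(filename: str, include: set[str], exclude: set[str]) -> bool:
    """Check a file's extension against include/exclude sets.

    Extensions are compared in lowercase with the leading dot, e.g. ".pdf".
    An empty include set is treated as "allow any extension not excluded".
    """
    extension = PurePosixPath(filename).suffix.lower()
    if extension in exclude:
        return False
    return not include or extension in include
```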

Date Filters

Sync content from specific periods:

Modified after: 2024-01-01
Created within: Last 90 days
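
The two date rules can be sketched with the standard `datetime` module. The function and parameter names are hypothetical, and the boundary handling is an assumption: "modified after" is treated as strictly later than the given date, and "created within" as inclusive of the day exactly N days back.

```python
from datetime import date, timedelta
from typing import Optional


def date_allowed(
    modified: date,
    created: date,
    today: date,
    modified_after: Optional[date] = None,
    created_within_days: Optional[int] = None,
) -> bool:
    """Apply the optional date filters; a filter set to None is not applied."""
    if modified_after is not None and modified <= modified_after:
        return False
    if created_within_days is not None:
        cutoff = today - timedelta(days=created_within_days)
        if created < cutoff:
            return False
    return True
```

Passing `today` explicitly, rather than calling `date.today()` inside the function, keeps the check deterministic and easy to test.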

Custom Filters

Connector-specific filtering:

  • Slack: Specific channels
  • Confluence: Selected spaces
  • GitHub: Specific branches
  • Jira: JQL filter queries

Best Practices

Security

  1. Use minimal required permissions
  2. Rotate API keys regularly
  3. Use OAuth when available
  4. Enable permission sync

Performance

  1. Start with smaller scopes
  2. Enable incremental sync
  3. Use webhooks when available
  4. Filter unnecessary content

Organization

  1. One connector per purpose
  2. Group related connectors in collections
  3. Use descriptive names
  4. Document filter settings

Troubleshooting

Connection Failed

  1. Verify credentials are correct
  2. Check network connectivity
  3. Ensure required permissions are granted
  4. Review firewall/proxy settings

Sync Errors

  1. Check source accessibility
  2. Verify API rate limits
  3. Review error messages
  4. Try manual re-sync

Missing Content

  1. Check filter settings
  2. Verify permissions
  3. Wait for sync completion
  4. Review supported content types

Next Steps

Choose a connector guide for detailed setup instructions: