Ingestion API Overview

Introduction

Welcome to the eGain Ingestion API guide. These APIs let you bulk-import, validate, and manage your knowledge content within the eGain AI Knowledge Hub programmatically. They are designed to handle large volumes of content from external sources, which makes them well suited to migrations and automated content synchronization.

The entire process is asynchronous, which means that when you submit a request to import or validate content, the API creates a background job to handle the processing. This allows you to manage large datasets without blocking your applications and provides a robust way to track the job's progress through to completion.
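
As an illustration of this asynchronous pattern, the client side usually reduces to a polling loop around a status call. The sketch below is a minimal example under stated assumptions, not the official client: the status values ("IN_PROGRESS", "COMPLETED", "FAILED"), the shape of the status report, and the function name wait_for_job are all hypothetical. The status call itself is passed in as a function, so no endpoint or authentication details are assumed here.

```python
import time
from typing import Callable, Dict

# Hypothetical terminal status names; the real API may use different values.
TERMINAL_STATUSES = frozenset({"COMPLETED", "FAILED"})


def wait_for_job(
    fetch_status: Callable[[str], Dict],
    job_id: str,
    poll_seconds: float = 5.0,
    max_polls: int = 120,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict:
    """Call fetch_status(job_id) repeatedly until it reports a terminal status."""
    for attempt in range(max_polls):
        report = fetch_status(job_id)
        if report.get("status") in TERMINAL_STATUSES:
            return report
        # Wait between polls, but not after the final attempt.
        if attempt < max_polls - 1:
            sleep(poll_seconds)
    raise TimeoutError(f"job {job_id} did not finish after {max_polls} polls")


if __name__ == "__main__":
    # Stand-in for a real status request: returns canned reports in order.
    canned = iter([
        {"status": "IN_PROGRESS"},
        {"status": "IN_PROGRESS"},
        {"status": "COMPLETED", "errors": 0},
    ])
    result = wait_for_job(
        lambda _job_id: next(canned),
        "example-job",
        sleep=lambda _seconds: None,  # skip real waiting in this demo
    )
    print(result["status"])
```

In a real integration, fetch_status would wrap the HTTP request described in the status guide, and the polling interval and poll limit would be tuned to the expected size of the import.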

What is the Ingestion API for?

The primary purpose of the Ingestion API is to streamline the process of getting your external content into eGain Knowledge. Whether you are moving from a different system or have an existing content repository you need to sync, these APIs provide the necessary tools to automate the workflow.

Common Use Cases

- Legacy System Migration: Migrating thousands of articles from a legacy knowledge management system into the eGain platform.
- Automated Content Updates: Setting up a recurring job to pull the latest product documentation from a content repository and update the knowledge base automatically.
- Third-Party Content Integration: Regularly importing content feeds from external providers or different business units.
- Pre-Production Validation: Integrating the validation API into your content creation pipeline to perform automated quality assurance checks before content is published.

The Ingestion Workflow

A typical end-to-end process for using the Ingestion API follows these steps:

1. Format Your Content: Before you can import anything, your content must be structured in a specific way. This involves organizing your files into a defined folder structure and creating manifest files (manifest.json, articles.json, knowledge.json).
   - Learn more: Format Guide
2. Upload Your Data: Place your fully formatted and zipped content package into your chosen data source, which can be an AWS S3 bucket or an accessible Shared File Path (such as SFTP).
3. Validate the Content (Recommended): Before running the actual import, run a validation job. This "dry run" checks your package for formatting errors, broken links, and other potential issues without writing any data to the knowledge base.
   - Learn how: Guide: Validating Content Before Import
4. Start the Import Job: Once you have uploaded your data (and preferably validated it), you can initiate the import job. Choose the API endpoint that corresponds to your data source.
   - For S3: Guide: Starting an Ingestion Job with an S3 Bucket Datasource
   - For Shared Paths: Guide: Starting an Ingestion Job with a Shared File Path Datasource
5. Monitor the Job Status: After you start a validation or import job, the API immediately returns a job_id. Use this ID to check the status of the job periodically until it is complete. The status endpoint reports real-time progress and, upon completion, a summary of results, including any errors or warnings.
   - Learn how: Guide: Checking the Status of an Import or Validation Job
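
The formatting and upload steps above end with a zipped package, and a small pre-flight script can catch a missing manifest before anything is uploaded. The following is a rough sketch only: the function name build_package is hypothetical, and the check merely confirms that the three manifest file names mentioned above appear somewhere in the folder. It does not verify their contents or their required location, both of which are defined by the Format Guide.

```python
import zipfile
from pathlib import Path
from typing import List

# Manifest file names taken from the workflow description above.
REQUIRED_MANIFESTS = ("manifest.json", "articles.json", "knowledge.json")


def build_package(content_dir: Path, archive_path: Path) -> List[str]:
    """Zip content_dir into archive_path and return the archived relative paths."""
    content_dir = Path(content_dir)
    # Collect the file list up front so the archive itself is never included.
    files = sorted(p for p in content_dir.rglob("*") if p.is_file())

    # Assumption: presence anywhere in the tree is enough for this quick check.
    present = {p.name for p in files}
    missing = [name for name in REQUIRED_MANIFESTS if name not in present]
    if missing:
        raise FileNotFoundError(f"missing manifest files: {', '.join(missing)}")

    names = []
    with zipfile.ZipFile(archive_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for path in files:
            # Store paths relative to the package root, with forward slashes.
            arcname = path.relative_to(content_dir).as_posix()
            archive.write(path, arcname)
            names.append(arcname)
    return names
```

The resulting archive can then be placed in the S3 bucket or shared file path, and a validation job remains the authoritative check of the package's structure.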

API Guides and Resources

To get started, please refer to the detailed guides below for step-by-step instructions and cURL examples.

Data Formatting
- Format Guide - Start here! Your content must follow this structure.
Core API Functions