Data Ingestion is the first layer in Unbody. It brings data from various sources (third-party services, local folders, remote websites, and databases) into the unified pipeline, so that all raw input is available for downstream parsing, enhancement, and indexing.
Data Provider Plugins offer native integrations for common data sources, making it simple to ingest content from popular platforms and storage solutions.
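To make the idea of a provider plugin concrete, here is a minimal sketch of several sources feeding one pipeline. It is an illustration only and not Unbody's plugin API: `RawRecord`, `DataProvider`, `InMemoryProvider`, and `ingest` are hypothetical names introduced for this example.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator, Protocol


@dataclass(frozen=True)
class RawRecord:
    """Hypothetical unit of raw input handed to downstream stages."""
    source: str     # name of the provider that produced the record
    uri: str        # provider-specific identifier of the item
    content: bytes  # unparsed payload


class DataProvider(Protocol):
    """Hypothetical interface: anything with a name and a fetch() method."""
    name: str

    def fetch(self) -> Iterable[RawRecord]:
        ...


class InMemoryProvider:
    """Stand-in provider backed by a dict, used here instead of a real service."""

    def __init__(self, name: str, items: Dict[str, bytes]) -> None:
        self.name = name
        self._items = items

    def fetch(self) -> Iterable[RawRecord]:
        for uri, content in self._items.items():
            yield RawRecord(source=self.name, uri=uri, content=content)


def ingest(providers: Iterable[DataProvider]) -> Iterator[RawRecord]:
    """Merge the output of every provider into a single stream of records."""
    for provider in providers:
        yield from provider.fetch()
```

Because `ingest` depends only on the `fetch()` contract, a new source could be added in this sketch without changing downstream stages.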
Integrations with popular external cloud services.
modules
Module Name | Module Description | Open Source Status | Cloud Status |
---|---|---|---|
Google Drive | Retrieves documents and images from a connected Google Drive account. | Alpha | Available |
GitHub | Imports issues/repository data from GitHub for analysis and processing. | Alpha | Available |
Discord | Gathers messages/files from a specified Discord channel or server. | Backlog | Available |
Google Cal | Retrieves events from Google Calendar for scheduling or knowledge extraction. | Backlog | Available |
Notion | Imports pages and databases from Notion for further processing within Unbody. | Icebox | Icebox |
Integration with local file systems and network drives.
modules
Module Name | Module Description | Open Source Status | Cloud Status |
---|---|---|---|
Local Folder | Ingests files from a designated local directory or network drive. | Alpha | Backlog |
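As a rough illustration of what ingesting from a designated directory involves, the sketch below walks a folder recursively and yields one record per file. It is not the Local Folder module's implementation; `RawFile` and `scan_folder` are hypothetical names, and the MIME type is guessed from the file extension only.

```python
import mimetypes
from dataclasses import dataclass
from pathlib import Path
from typing import Iterator


@dataclass(frozen=True)
class RawFile:
    """Hypothetical record for one discovered file (not an Unbody type)."""
    path: Path
    mime_type: str
    size_bytes: int


def scan_folder(root: Path) -> Iterator[RawFile]:
    """Yield a record for every regular file under root, in sorted path order."""
    for path in sorted(root.rglob("*")):
        if not path.is_file():
            continue  # skip directories and other non-file entries
        # Guess the type from the extension; fall back to a generic binary type.
        mime, _ = mimetypes.guess_type(path.name)
        yield RawFile(
            path=path,
            mime_type=mime or "application/octet-stream",
            size_bytes=path.stat().st_size,
        )
```

Calling `list(scan_folder(Path("/data/inbox")))` would return the records for that directory tree; the path here is only an example.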
Integration with remote websites or endpoints over HTTP/HTTPS.
modules
Module Name | Module Description | Open Source Status | Cloud Status |
---|---|---|---|
Crawlee | | Backlog | Backlog |
Firecrawl | | Not Planned | Available |
Connectors for relational or NoSQL databases.