Changelog
Fixed a reliability issue where Excel files were sometimes misidentified during ingestion, causing them to fail silently. Excel files are now correctly recognized and processed.
Resource limits for a partition are now surfaced in a dedicated Limits tab on the partition details page, making it easier to review at a glance.
Introducing Ragie Parse (Early Access), a new Agentic OCR pipeline that uses vision AI to extract structured content with higher fidelity than traditional OCR, supporting 25+ element types including tables, forms, signatures, key-value pairs, barcodes, and stamps.
The web crawler connector can now be created and managed programmatically through the public API without going through the dashboard (Enterprise only).

Spreadsheets now include embedded images and charts alongside table data, with automatic separation of table and freeform regions for cleaner, more complete extraction.
A new endpoint (GET /webhook_endpoints) to retrieve a list of all your configured webhook endpoints.
Ragie MCP Bridge (early access), a permissioned bridge that brings RBAC, SSO, SCIM provisioning, and model-agnostic retrieval to internal employees.
Open-source ragie-mcp-server (v1.0) delivering a full MCP toolset backed by Ragie retrieval.
ragie-cli binaries for macOS, Linux, and Windows that bulk-ingest local files, archives, YouTube content, WordPress docs, and more.
Promptie, a playground for iterating on prompts, retrieval settings, and source inspection.
Early access to Audio & Video RAG with multilingual transcription, on-screen analysis, and support for media formats such as MP3, WAV, MP4, and AVI.
SharePoint and Backblaze connectors plus Google Drive file-level uploads so enterprises can ingest exactly the assets they need.
Sync Connections API so engineers can programmatically trigger connector refreshes without visiting the UI.
Connector page-limit controls to stop runaway ingestion and manage spend per tenant or integration.
Document Chunks API, exposing the exact context Ragie generated and enabling "chunk fill" optimizations.
Additional webhook statuses announcing when documents reach the indexed state, enabling latency-sensitive applications.