Uploading Documents

Upload files to your knowledge base and let the ingestion pipeline handle parsing, chunking, embedding, and indexing automatically.

note

Uploading documents requires an API key with write permission. See Authentication for details.

How It Works

You send a POST request with your file to /api/v1/documents.
The API stores the file, creates a document record with status pending, and returns immediately.
The ingestion pipeline processes the file asynchronously: parse the raw content, chunk it into passages, embed each chunk into a vector, and index everything into the search layer.
When processing completes, the document status changes to processed.

Upload → S3 → Message Queue → Parse → Chunk → Embed → Index
                                                         ↓
                                              PostgreSQL + Weaviate

Supported File Types

Format	MIME Type	Extension
PDF	`application/pdf`	`.pdf`
Word (DOCX)	`application/vnd.openxmlformats-officedocument.wordprocessingml.document`	`.docx`
Excel (XLSX)	`application/vnd.openxmlformats-officedocument.spreadsheetml.sheet`	`.xlsx`
Plain Text	`text/plain`	`.txt`
Markdown	`text/markdown`	`.md`
CSV	`text/csv`	`.csv`
JSON	`application/json`	`.json`

Maximum file size: 50 MB

Upload a Document

Send a multipart/form-data request with your file in the file field.

cURL
Python
JavaScript

curl -X POST https://api.inherent.systems/api/v1/documents \
  -H "X-API-Key: $INHERENT_API_KEY" \
  -F "file=@./quarterly-report.pdf"

import requests

url = "https://api.inherent.systems/api/v1/documents"
headers = {"X-API-Key": os.environ["INHERENT_API_KEY"]}

with open("quarterly-report.pdf", "rb") as f:
    response = requests.post(url, headers=headers, files={"file": f})

doc = response.json()
print(doc["id"], doc["status"])  # "abc123" "pending"

const fs = require("fs");
const FormData = require("form-data");

const form = new FormData();
form.append("file", fs.createReadStream("quarterly-report.pdf"));

const response = await fetch("https://api.inherent.systems/api/v1/documents", {
  method: "POST",
  headers: {
    "X-API-Key": process.env.INHERENT_API_KEY,
    ...form.getHeaders(),
  },
  body: form,
});

const doc = await response.json();
console.log(doc.id, doc.status); // "abc123" "pending"

Response

{
  "id": "doc_abc123",
  "name": "quarterly-report.pdf",
  "status": "pending",
  "content_type": "application/pdf",
  "size_bytes": 2048576,
  "metadata": {},
  "created_at": "2026-04-03T10:30:00Z"
}

Adding Metadata

Pass an optional metadata field as a JSON string to attach key-value pairs to the document. Metadata is returned in search results and can help you organize documents.

curl -X POST https://api.inherent.systems/api/v1/documents \
  -H "X-API-Key: $INHERENT_API_KEY" \
  -F "file=@./quarterly-report.pdf" \
  -F 'metadata={"department": "finance", "quarter": "Q1-2026"}'

Checking Document Status

After uploading, poll GET /api/v1/documents to check when processing completes.

curl https://api.inherent.systems/api/v1/documents \
  -H "X-API-Key: $INHERENT_API_KEY"

{
  "documents": [
    {
      "id": "doc_abc123",
      "name": "quarterly-report.pdf",
      "status": "processed",
      "chunk_count": 47,
      "content_type": "application/pdf",
      "size_bytes": 2048576,
      "metadata": {"department": "finance", "quarter": "Q1-2026"},
      "created_at": "2026-04-03T10:30:00Z",
      "processed_at": "2026-04-03T10:30:45Z"
    }
  ]
}

Document Statuses

Status	Meaning
`pending`	Queued for processing
`processing`	Ingestion pipeline is actively working on it
`processed`	Ready to search and retrieve
`failed`	Processing failed — check the dashboard for details

tip

A typical PDF processes in under 60 seconds. If your document stays in pending for more than a few minutes, check the dashboard for errors.

Best Practices

Use descriptive file names. The file name is stored as the document name and appears in search results. q1-2026-revenue-report.pdf is more useful than document.pdf.
Stick to supported formats. Unsupported file types are rejected at upload time with a 400 error.
Handle large files. Files close to the 50 MB limit may take longer to process. Consider splitting very large documents into logical sections before uploading.
Attach metadata. Adding metadata like department, source, or date makes it easier to filter and organize your knowledge base later.
Wait for processed before searching. Documents in pending or processing status are not yet available for search or chunk retrieval.

How It Works​

Supported File Types​

Upload a Document​

Response​

Adding Metadata​

Checking Document Status​

Document Statuses​

Best Practices​