This repository was archived by the owner on Jul 4, 2025. It is now read-only.

bug: OOM issue - [0.5.10] RAG feature failed with Engine Is Not Loaded Yet error #1857

Open
@imtuyethan

Description

Jan version

0.5.10

Describe the Bug

https://discord.com/channels/1107178041848909847/1313496019475894363

When using the Retrieval (RAG) feature with PDF files, the engine fails to load and reports error code 322122505 with the message "Engine Is Not Loaded Yet". As a result, PDF analysis does not work.

Steps to Reproduce

  1. Set up Embedding Model as Llama 3.2 1B Instruct Q8
  2. Set up Vector Database as HNSWLib
  3. Upload a PDF file (issue persists with files as small as 27.51KB)
  4. Attempt to use RAG/PDF analysis feature

Error Messages

  • "Engine Is Not Loaded Yet"
  • "llama_decode_internal: invalid token"
  • "llama_decode: failed to decode"
  • "Internal error catched Input prompt is too big compared to KV size"
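
The last message above suggests that the text sent for embedding exceeded the model's context (KV cache) size. One possible mitigation, sketched here as an assumption and not as Jan's or cortex's actual implementation, is to split the extracted PDF text into smaller chunks before embedding. The function name `chunk_words` and its parameters are hypothetical, and whitespace-separated words are only a rough proxy for model tokens, so the budget should be set well below the real context size.

```python
def chunk_words(text: str, max_tokens: int, overlap: int = 0) -> list[str]:
    """Split text into chunks of at most `max_tokens` whitespace-separated words.

    Hypothetical helper for illustration only. Words are a rough stand-in
    for tokens; a real tokenizer usually yields more tokens than words.
    """
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    if not 0 <= overlap < max_tokens:
        raise ValueError("overlap must be in [0, max_tokens)")

    words = text.split()
    step = max_tokens - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_tokens]))
        if start + max_tokens >= len(words):
            break  # the last window already reached the end of the text
    return chunks


if __name__ == "__main__":
    sample = " ".join(str(i) for i in range(10))
    for chunk in chunk_words(sample, max_tokens=4, overlap=1):
        print(chunk)
```

Each chunk would then be embedded separately, so no single request exceeds the configured context length. Whether this avoids the reported error depends on how the RAG pipeline actually builds its prompts, which the logs would need to confirm.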

Logs

  • Error code: 322122505 from cortex-server.exe process

Expected behavior

  • Engine should load successfully
  • PDF should be processed for RAG functionality

Screenshots / Logs

message (4).txt
app (2).log
cortex (2).log

Device specs

  • Machine: x86
  • OS: Windows (based on .exe reference)
  • Memory: 32.00 GB (24.84 GB used)
  • CPU Usage: 18%

What is your OS?

  • MacOS
  • Windows
  • Linux

Metadata

Assignees

Labels

  • category: tools (RAG, function calling, etc.)
  • type: bug (Something isn't working)

Type

No type

Projects

Status

Investigating

Status

Eng Planning

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests
