This repository was archived by the owner on Jul 4, 2025. It is now read-only.
This repository was archived by the owner on Jul 4, 2025. It is now read-only.
bug: OOM issue - [0.5.10] RAG feature failed with Engine Is Not Loaded Yet
error #1857
Open
Description
Jan version
0.5.10
Describe the Bug
https://discord.com/channels/1107178041848909847/1313496019475894363
When using the Retrieval (RAG) feature with PDF files, the engine fails to load with error code 322122505 Engine Is Not Loaded Yet
. This prevents PDF analysis functionality from working.
Steps to Reproduce
- Set up Embedding Model as Llama 3.2 1B Instruct Q8
- Set up Vector Database as HNSWLib
- Upload a PDF file (issue persists with files as small as 27.51KB)
- Attempt to use RAG/PDF analysis feature
Error Messages
- "Engine Is Not Loaded Yet"
- "llama_decode_internal: invalid token"
- "llama_decode: failed to decode"
- "Internal error catched Input prompt is too big compared to KV size"
Logs
- Error code: 322122505 from cortex-server.exe process
Expected behavior
- Engine should load successfully
- PDF should be processed for RAG functionality
Screenshots / Logs
Logs
message (4).txt
app (2).log
cortex (2).log
Device specs
- Machine: x86
- OS: Windows (based on .exe reference)
- Memory: 32.00 GB (24.84 GB used)
- CPU Usage: 18%
What is your OS?
- MacOS
- Windows
- Linux