project-nomad

mirror of https://github.com/Crosstalk-Solutions/project-nomad.git synced 2026-05-29 23:56:49 +02:00

History

Chris Sherwood 2dec5bf676 fix(AI): pre-cap embed input + log fallback reason (#881 ) The OpenAI-compatible /v1/embeddings fallback path can't pass `truncate:true` / `num_ctx:8192` to the model, so any chunk that exceeds the model's loaded context_length (often 2048 for nomic-embed-text:v1.5) returns a 400 BadRequestError and is silently dropped from Qdrant. Two CPU-only ingestion runs on NOMAD1 hit this on dense technical content (medlineplus, arduino.stackexchange) even after PR #763's num_ctx fix on the native path. Pre-cap each input string at 4000 chars before either backend call. That's ~1000-2000 tokens depending on density, comfortably under the model's 2048 default. The chunker in RagService is sized for MAX_SAFE_TOKENS=1600 (3200 chars at its conservative 2 chars/token estimate), so well-formed inputs are never touched; this is purely a runtime safety net for the edge cases that slip through. Also stop swallowing the original error in the catch. The bare `} catch {}` here has masked recurring "input length exceeds context length" failures for months (#369, #670, #881). Capture and warn-log the message so future investigations see why we fell back. Same root cause as #369 and #670 which were closed without an actual fix to the fallback path.		2026-05-20 10:16:00 -07:00
..
controllers	fix(KB): add re-embed and reset & rebuild opts to fix broken embeddings (#886 )	2026-05-20 10:16:00 -07:00
exceptions	fix(Docs): documentation renderer fixes	2025-12-23 16:00:33 -08:00
jobs	fix(RAG): anchor continuation-batch initial progress to overall-file frame (#889 )	2026-05-20 10:16:00 -07:00
middleware	fix(API): skip compression for Server-Sent Events (#798 )	2026-05-20 10:16:00 -07:00
models	feat(KB): per-file ingest state machine (Phase 1 of RFC #883 ) (#888 )	2026-05-20 10:16:00 -07:00
services	fix(AI): pre-cap embed input + log fallback reason (#881 )	2026-05-20 10:16:00 -07:00
utils	feat(KB): per-file ingest state machine (Phase 1 of RFC #883 ) (#888 )	2026-05-20 10:16:00 -07:00
validators	feat(Content): custom ZIM library sources with pre-seeded mirrors (#593 )	2026-05-20 10:16:00 -07:00