project-nomad

mirror of https://github.com/Crosstalk-Solutions/project-nomad.git synced 2026-05-30 08:06:51 +02:00

Author	SHA1	Message	Date
Jake Turner	a0047c1555	fix(KB): surface file-warning compute failures instead of masking as healthy (PR #895 review) `computeFileWarnings()` previously caught all errors and returned an empty map, which the frontend rendered as "every file is healthy" — reintroducing exactly the silent-failure mode this surface exists to expose. Return `{ ok, warnings }`; flip `ok: false` from the catch. KB modal renders an inline amber notice under the Stored Files header when `ok === false`, leaving per-row warning rendering untouched. Transient failures self-heal on the next 30s poll; no toast spam.	2026-05-20 10:16:00 -07:00
Chris Sherwood	563f86a22b	feat(KB): conditional warnings A + B on Stored Files (RFC #883 §6) Surfaces two silent failure modes that the prior binary "any-chunks-in-Qdrant ⇒ embedded" check could not distinguish from healthy ingestion: - Warning A — Zero-chunk file (file_size > 100 MB, chunks = 0) Fires on video-only / image-only ZIMs (`lrnselfreliance_en_all`, TED talks, etc.) that the pipeline completes "successfully" with no extractable text. AI Assistant literally cannot reference these. - Warning B — Partial-embed stall (chunks < 50% of expected from the ratio registry). Surfaces the simple_wiki "266 of 600,000 chunks" case observed during NOMAD1 ingestion testing — previously these looked identical to fully-completed embeds in the UI. Both warnings render only when their condition is met (silent by default; noisy only on real problems). Base is `feat/kb-ratio-registry` (#891) because Warning B's "expected chunks" estimate comes from `KbRatioRegistry.estimateChunks()`. GitHub fast-forwards to `rc` once #891 merges. - `app/utils/kb_warning_decision.ts` — pure `decideWarnings(inputs)` with thresholds (`100 MB`, `0.5×`) as exported constants. 10 unit tests cover the healthy case, both warnings, the under/at/over boundary, the registry-miss suppression, and the video-only registry case (`expectedChunks: 0` correctly skips Warning B). - `RagService.computeFileWarnings()` — single Qdrant scroll tallies chunks per source, filesystem walk fills in zero-chunk files, ratio registry estimates the expectation, decision function emits. - New endpoint `GET /api/rag/file-warnings` returns `Record<source, FileWarning[]>` (sources with no warnings are omitted, so the frontend can `warnings[source] ?? []` for clean defaults). - KB modal: warnings render inline under the file name as amber-tinted pills. Polled every 30s alongside the existing health check. - Warning C — chunks skipped due to length. PR #890 (#881 fix) prevents the silent drop at the embed boundary, so the underlying condition shouldn't fire anymore. If we still want to surface "we truncated N chunks to fit", that needs separate `skipped_count` tracking in EmbedFileJob — a Phase 2 follow-up. - Suppressing Warning B during active mid-ingestion. The user can cross- reference the Processing Queue to know it's in-flight; suppressing warnings while a job runs would mask real stalls where the job died mid-batch. Will revisit when per-card status is wired through. - Use of `kb_ingest_state.chunks_embedded` (#888) as the chunk count source. This PR uses Qdrant scroll directly so it can land independently of #888. - 10 new unit tests on `decideWarnings`, all pass - Type-check clean - Hot-patch + browser smoke test deferred until #891 lands (the ratio registry needs to exist in the DB for `estimateChunks()` to return non-null estimates — without it, only Warning A fires which is still useful but Warning B stays dormant)	2026-05-20 10:16:00 -07:00
Chris Sherwood	e68c753e39	feat(KB): surface embedding-disk estimate in curated tier-change modal (RFC #883 §1) When a user picks a tier in TierSelectionModal, show how much additional disk space the AI Assistant will need if the new ZIMs are indexed, plus a policy-aware footer explaining whether they'll auto-index (Always) or wait for opt-in (Manual). Estimates consume #891's KbRatioRegistry via a new POST /api/rag/estimate-batch endpoint. Backend - New POST /api/rag/estimate-batch route + RagController.estimateBatch - VineJS schema accepting array of {filename, sizeBytes}, capped at 500 - KbRatioRegistry.estimateBatch aggregates via the existing prefix-match lookup, returns {totalChunks, totalBytes, hasUnknown} - New BYTES_PER_CHUNK_ON_DISK constant (~8 KB: 3 KB vector + ~3 KB chunk text + ~2 KB payload/index overhead). Tunable; will be replaced by Phase 4 self-calibration once we have real measurements. - Controller normalizes incoming filenames via path.basename so callers that send full paths or URLs still match registry prefixes correctly. Frontend - api.estimateEmbeddingBatch() client method - TierSelectionModal: when localSelectedSlug is set, resolve the tier's resources (incl. inherited tiers), POST to /estimate-batch, and render a new info block with the +~X GB figure + ingest-policy copy. Also fetches rag.defaultIngestPolicy so the same block surfaces whether indexing will fire automatically or wait for the user. - resourceFilename() helper extracts the basename from the resource URL so the registry lookup hits the right prefix regardless of mirror. Tests - 4 new cases in tests/unit/kb_ratio_lookup.spec.ts covering the estimateBatch aggregator: standard sum, unknown-flagging, video-only ZIM (0 chunks but known, hasUnknown stays false), empty input. Stacks on feat/kb-ratio-registry (#891) — consumes the registry table seeded by that PR. Once #891 merges to rc, this PR auto-rebases. Out of scope for this PR (deferred to follow-ups): - Per-batch opt-in checkbox (RFC §1's '☑ Also index these for AI') needs a per-batch policy override path and is a separate PR - Guardrail modal at 50 GB / 10% free / 6 hr thresholds (RFC §7) is also separate; this PR is informational, not gating - Time-to-embed estimate awaits a chunks-per-second metric per host	2026-05-20 10:16:00 -07:00
Jake Turner	4c211964e0	fix(KB): add re-embed and reset & rebuild opts to fix broken embeddings (#886 )	2026-05-20 10:16:00 -07:00
Chris Sherwood	43645e4bbc	fix(AI): rewrite RAG query on first follow-up (off-by-one in skip-rewrite threshold) The short-conversation skip in `rewriteQueryWithContext` used `userMessages.length <= 2`, which short-circuits both the very first turn AND the first follow-up. The follow-up is the moment the rewriter matters most — it's where pronouns and shorthand ("the bars", "how long does it last?") need to be resolved against earlier turns before the embedding search runs. With the rewriter skipped, RAG queries against the raw last message, scores nothing above the 0.3 threshold, and no context gets injected for that turn. The visible symptom is the assistant treating the first follow-up in any chat as a brand-new question — e.g. "great - they threw up 2 of the bars it looks like" answered as if it were a recipe-bars question, with no carry-forward of the prior chocolate- poisoning context. Threshold lowered to `< 2`: skip only when there's exactly one user message (nothing to rewrite from). From the first follow-up onward the rewriter runs, as originally intended before commit `96e5027`. Validated against `mistral-nemo:12b` on NOMAD3 by hot-patching the compiled controller and replaying the dog-chocolate scenario. Post-patch response correctly threads "3 Hershey's bars" from turn 1 into turn 2's answer; pre-patch (per reporter's screenshot) pivoted to peanut butter bar recipes. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-20 10:16:00 -07:00
chriscrosstalk	62e75fdb54	feat(Content): custom ZIM library sources with pre-seeded mirrors (#593 ) * feat(content): add custom ZIM library sources with pre-seeded mirrors Users reported slow download speeds from the default Kiwix CDN. This adds the ability to browse and download ZIM files from alternative Kiwix mirrors or self-hosted repositories, all through the GUI. - Add "Custom Libraries" button next to "Browse the Kiwix Library" - Source dropdown to switch between Default (Kiwix) and custom libraries - Browsable directory structure with breadcrumb navigation - 5 pre-seeded official Kiwix mirrors (US, DE, DK, UK, Global CDN) - Built-in mirrors protected from deletion - Downloads use existing pipeline (progress, cancel, Kiwix restart) - Source selection persists across page loads via localStorage - Scrollable directory browser (600px max) with sticky header - SSRF protection on all custom library URLs Closes #576 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(content): recognize Wikipedia downloads from mirror sources When Wikipedia is downloaded via a custom mirror instead of the default Kiwix server, the completion callback now matches by filename instead of exact URL. This ensures the Wikipedia selector correctly shows "Installed" status and triggers old-version cleanup regardless of which mirror was used. Also handles the case where no Wikipedia selection exists yet (file downloaded before visiting the selector), creating the record automatically. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(ZIM): use cheerio for custom mirror directory parsing * fix(ZIM): use URL constructor for more robust joining --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Jake Turner <jturner@cosmistack.com>	2026-05-20 10:16:00 -07:00
0xGlitch	94059b0aaf	feat(Maps): regional map downloads via go-pmtiles extract (#780 ) * feat(maps): add regional map downloads via go-pmtiles extract * address Copilot review feedback on PR #780 - auto-refresh preflight on selection/maxzoom change with 400ms debounce and requestId stale-safety so the confirm button no longer requires a two-step "Estimate Size" -> "Start Download" dance - safeUpdateProgress helper replaces fire-and-forget updateProgress().catch() pattern so cancelled-job errors (code -1) can't surface as unhandled rejections - gate world basemap source on worldBasemapReady - when ensureWorldBasemap() fails we already delete world.pmtiles, so emitting the source was producing 404s on every tile request - verify go-pmtiles binary SHA256 at image build time; upstream doesn't ship a checksums file so per-arch hashes are pinned as build args with a regenerate note when bumping PMTILES_VERSION	2026-05-20 10:16:00 -07:00
Henry Estela	2d8a02f257	fix(RAG): add start button in kb modal and ensure restart policy exists (#700 ) Adds a check to RAG health to make sure nomad_qdrant is online, if not then the user will be blocked from clicking any buttons in the KB modal until they click the start qdrant button and let the container start There is a new file qdrant_restart_policy_provider.ts, which tries to ensure that the restart policy always exists for the nomad_qdrant container even though the policy should have been there when the container is created.	2026-05-20 10:16:00 -07:00
John Scherer	132ec9c98a	fix(API): accept notes, marker_type, and position on markers endpoints (#770 ) The VineJS validators in createMarker and updateMarker silently dropped fields not in their schema. The MapMarker model and DB include notes and marker_type, and GET responses return them, but POST and PATCH would not persist them. updateMarker additionally did not accept latitude/longitude, so markers could not be repositioned via the API after creation. - Add notes and marker_type to both validators and model assignments. - Add latitude/longitude to the update validator. - Add coordinate range validation on both endpoints. Closes #768	2026-05-20 10:16:00 -07:00
chriscrosstalk	6646b3480b	fix(AI): stop local nomad_ollama container when remote Ollama is configured (#744 ) When users set a remote Ollama URL via AI Settings, the local nomad_ollama container continued running and competed with the remote host for port 11434 and GPU access. Now configureRemote stops the local container on set and restores it on clear (if still present). Container and its models volume are preserved so the local install can be re-enabled later. Closes #662 Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-21 14:26:28 -07:00
Luís Miguel	806b2c1714	fix(security): SSRF validation for map downloads and error sanitization (CWE-918, CWE-209) (#552 ) * fix(security): add SSRF validation to map download URLs from manifest * fix(security): sanitize verbose error in rag controller scan endpoint * fix(security): sanitize verbose errors in benchmark controller * fix(security): sanitize verbose error in system controller version fetch * fix(security): sanitize verbose errors in chats controller (6 instances) * fix(security): sanitize verbose errors in docker service (6 instances) * fix(security): sanitize verbose error in system update service * fix(security): sanitize verbose errors in collection update service --------- Co-authored-by: Jake Turner <52841588+jakeaturner@users.noreply.github.com>	2026-04-21 14:26:28 -07:00
Henry Estela	6510f42184	fix(AI): qwen2.5 loading on every chat message (#649 ) Use the currently loaded model for chat title generation and query rewrite.	2026-04-21 14:26:28 -07:00
chriscrosstalk	0183b42d71	feat(maps): add scale bar and location markers (#636 ) Add distance scale bar and user-placed location pins to the offline maps viewer. - Scale bar (bottom-left) shows distance reference that updates with zoom level - Click anywhere on map to place a named pin with color selection (6 colors) - Collapsible "Saved Locations" panel lists all pins with fly-to navigation - Full dark mode support for popups and panel via CSS overrides - New `map_markers` table with future-proofed columns for routing (marker_type, route_id, route_order, notes) to avoid a migration when routes are added later - CRUD endpoints: GET/POST /api/maps/markers, PATCH/DELETE /api/maps/markers/:id - VineJS validation on create/update - MapMarker Lucid model Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 14:26:50 -07:00
Henry Estela	0edfdead90	feat(AI): enable flash_attn by default and disable ollama cloud (#616 ) New defaults: OLLAMA_NO_CLOUD=1 - "Ollama can run in local only mode by disabling Ollama’s cloud features. By turning off Ollama’s cloud features, you will lose the ability to use Ollama’s cloud models and web search." https://ollama.com/blog/web-search https://docs.ollama.com/faq#how-do-i-disable-ollama%E2%80%99s-cloud-features example output: ``` ollama run minimax-m2.7:cloud Error: ollama cloud is disabled: remote model details are unavailable ``` This setting can be safely disabled as you have to click on a link to login to ollama cloud and theres no real way to do that in nomad outside of looking at the nomad_ollama logs. This one can be disabled in settings in case theres a model out there that doesn't play nice. but that doesnt seem necessary so far. OLLAMA_FLASH_ATTENTION=1 - "Flash Attention is a feature of most modern models that can significantly reduce memory usage as the context size grows. " Tested with llama3.2: ``` docker logs nomad_ollama --tail 1000 2>&1 \|grep --color -i flash_attn llama_context: flash_attn = enabled ``` And with second_constantine/deepseek-coder-v2 with is based on https://huggingface.co/lmstudio-community/DeepSeek-Coder-V2-Lite-Instruct-GGUF which is a model that specifically calls out that you should disable flash attention, but during testing it seems ollama can do this for you automatically: ``` docker logs nomad_ollama --tail 1000 2>&1 \|grep --color -i flash_attn llama_context: flash_attn = disabled ```	2026-04-03 14:26:50 -07:00
Jake Turner	f4beb9a18a	fix(Maps): remove unused import	2026-04-03 14:26:50 -07:00
chriscrosstalk	bac53e28dc	feat(downloads): rich progress, friendly names, cancel, and live status (#554 ) * feat(downloads): rich progress, friendly names, cancel, and live status Redesign the Active Downloads UI with four improvements: - Rich progress: BullMQ jobs now report downloadedBytes/totalBytes instead of just a percentage, showing "2.3 GB / 5.1 GB" instead of "78% / 100%" - Friendly names: dispatch title metadata from curated categories, Content Explorer library, Wikipedia selector, and map collections - Cancel button: Redis-based cross-process abort signal lets users cancel active downloads with file cleanup. Confirmation step prevents accidents. - Live status indicator: green pulsing dot with transfer speed for active downloads, orange stall warning after 60s of no data, gray dot for queued Backward compatible with in-flight jobs that have integer-only progress. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(downloads): fix cancel, dismiss, speed, and retry bugs - Speed indicator: only set prevBytesRef on first observation to prevent intermediate re-renders from inflating the calculated speed - Cancel: throw UnrecoverableError on abort to prevent BullMQ retries - Dismiss: remove stale BullMQ lock before job.remove() so cancelled jobs can actually be dismissed - Retry: add getActiveByUrl() helper that checks job state before blocking re-download, auto-cleans terminal jobs - Wikipedia: reset selection status to failed on cancel so the "downloading" state doesn't persist Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat(downloads): improve cancellation logic and surface true BullMQ job states --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Jake Turner <jturner@cosmistack.com>	2026-04-03 14:26:50 -07:00
David Gross	b65b6d6b35	fix(Maps): add x-forwarded-proto support to handle https termination (#600 )	2026-04-03 14:26:50 -07:00
Luís Miguel	b183bc6745	fix(security): validate key parameter on settings read endpoint#517 Co-authored-by: Jake Turner <52841588+jakeaturner@users.noreply.github.com>	2026-04-03 14:26:50 -07:00
0xGlitch	789fdfe95d	feat(maps): add global map download from Protomaps (#525 ) * feat(maps): add global map download from Protomaps * fix: add path traversal check to global map download	2026-04-03 14:26:50 -07:00
Henry Estela	69c15b8b1e	feat(AI): enable remote AI chat host	2026-04-03 14:26:50 -07:00
Tom Boucher	6b558531be	fix: surface actual error message when service installation fails Backend returned { error: message } on 400 but frontend expected { message }. catchInternal swallowed Axios errors and returned undefined, causing a generic 'An internal error occurred' message instead of the real reason (already installed, already in progress, not found). - Fix 400 response shape to { success: false, message } in controller - Replace catchInternal with direct error handling in installService, affectService, and forceReinstallService API methods - Extract error.response.data.message from Axios errors so callers see the actual server message	2026-03-25 16:30:35 -07:00
Chris Sherwood	023e3f30af	fix(downloads): allow users to dismiss failed downloads Failed download jobs persist in BullMQ forever with no way to clear them, leaving stale error notifications in Content Explorer and Easy Setup. Adds a dismiss button (X) on failed download cards that removes the job from the queue via a new DELETE endpoint. - Backend: DELETE /api/downloads/jobs/:jobId endpoint - Frontend: X button on failed download cards with immediate refresh Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 11:46:10 -07:00
Chris Sherwood	e4fde22dd9	feat(UI): add Debug Info modal for bug reporting Add a "Debug Info" link to the footer and settings sidebar that opens a modal with non-sensitive system information (version, OS, hardware, GPU, installed services, internet status, update availability). Users can copy the formatted text and paste it into GitHub issues. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-20 11:46:10 -07:00
Jake Turner	9220b4b83d	fix(maps): respect request protocol for reverse proxy HTTPS support	2026-03-20 11:46:10 -07:00
Chris Sherwood	6a737ed83f	feat(UI): add Support the Project settings page Adds a new settings page with Ko-fi donation link, Rogue Support banner, and community contribution options (GitHub, Discord). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-20 11:46:10 -07:00
Jake Turner	96e5027055	feat(AI Assistant): performance improvements and smarter RAG context usage	2026-03-11 14:08:09 -07:00
Jake Turner	460756f581	feat(AI Assistant): improved state management and performance	2026-03-11 14:08:09 -07:00
Chris Sherwood	75106a8f61	fix(security): path traversal and SSRF protections from pre-launch audit Fixes 4 high-severity findings from a comprehensive security audit: 1. Path traversal on ZIM file delete — resolve()+startsWith() containment 2. Path traversal on Map file delete — same pattern 3. Path traversal on docs read — same pattern (already used in rag_service) 4. SSRF on download endpoints — block private/internal IPs, require TLD Also adds assertNotPrivateUrl() to content update endpoints. Full audit report attached as admin/docs/security-audit-v1.md. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 14:08:09 -07:00
Jake Turner	58b106f388	feat: support for updating services	2026-03-11 14:08:09 -07:00
Jake Turner	db69428193	fix(AI): allow force refresh of models list	2026-03-11 14:08:09 -07:00
Jake Turner	dfa896e86b	feat(RAG): allow deletion of files from KB	2026-03-04 20:05:14 -08:00
Jake Turner	99b96c3df7	feat(RAG): display embedding queue and improve progress tracking	2026-03-04 20:05:14 -08:00
Jake Turner	96beab7e69	feat(AI Assistant): custom name option for AI Assistant	2026-03-04 20:05:14 -08:00
Jake Turner	6817e2e47e	fix: improve type-safety for KVStore values	2026-03-03 20:51:38 -08:00
Jake Turner	00bd864831	fix(AI): improved perf via rewrite and streaming logic	2026-03-03 20:51:38 -08:00
Jake Turner	6874a2824f	feat(Models): paginate available models endpoint	2026-03-03 20:51:38 -08:00
Jake Turner	98b65c421c	feat(AI): thinking and response streaming	2026-02-18 21:22:53 -08:00
Jake Turner	279ee1254c	fix(Benchmark): improved error reporting and fix sysbench race condition	2026-02-11 22:09:31 -08:00
Jake Turner	d55ff7b466	feat: curated content update checking	2026-02-11 21:49:46 -08:00
Jake Turner	32d206cfd7	feat: curated content system overhaul	2026-02-11 15:44:46 -08:00
Jake Turner	4747863702	feat(AI Assistant): allow manual scan and resync KB	2026-02-09 15:16:18 -08:00
Jake Turner	276bdcd0b2	feat(AI Assistant): query rewriting for enhanced context retrieval	2026-02-08 16:19:27 -08:00
Jake Turner	8726700a0a	feat: zim content embedding	2026-02-08 13:20:10 -08:00
Jake Turner	2e0ab10075	feat: cron job for system update checks	2026-02-06 15:40:30 -08:00
Jake Turner	36b6d8ed7a	fix: rework content tier system to dynamically determine install status Removes the InstalledTier model and instead checks presence of files on-the-fly. Avoid broken state by handling on the server-side vs. marking as installed by client-side API call	2026-02-04 22:58:21 -08:00
Jake Turner	d4cbc0c2d5	feat(AI): add fuzzy search to models list	2026-02-04 16:45:12 -08:00
Jake Turner	fa8300b5df	fix(Maps): ensure asset urls resolve correctly	2026-02-03 23:34:32 -08:00
Jake Turner	5de3c5f261	fix: hide chat button and page unless AI Assistant installed	2026-02-03 23:15:39 -08:00
Chris Sherwood	2c4fc59428	feat(ContentManager): Display friendly names instead of filenames Content Manager now shows Title and Summary columns from Kiwix metadata instead of just raw filenames. Metadata is captured when files are downloaded from Content Explorer and stored in a new zim_file_metadata table. Existing files without metadata gracefully fall back to showing the filename. Changes: - Add zim_file_metadata table and model for storing title, summary, author - Update download flow to capture and store metadata from Kiwix library - Update Content Manager UI to display Title and Summary columns - Clean up metadata when ZIM files are deleted Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-03 23:14:28 -08:00
Jake Turner	d1f40663d3	feat(RAG): initial beta with preprocessing, embedding, semantic retrieval, and ctx passage	2026-02-01 23:59:21 +00:00

1 2

82 Commits