project-nomad

mirror of https://github.com/Crosstalk-Solutions/project-nomad.git synced 2026-05-26 22:35:05 +02:00

Author	SHA1	Message	Date
Chris Sherwood	563f86a22b	feat(KB): conditional warnings A + B on Stored Files (RFC #883 §6) Surfaces two silent failure modes that the prior binary "any-chunks-in-Qdrant ⇒ embedded" check could not distinguish from healthy ingestion: - Warning A — Zero-chunk file (file_size > 100 MB, chunks = 0) Fires on video-only / image-only ZIMs (`lrnselfreliance_en_all`, TED talks, etc.) that the pipeline completes "successfully" with no extractable text. AI Assistant literally cannot reference these. - Warning B — Partial-embed stall (chunks < 50% of expected from the ratio registry). Surfaces the simple_wiki "266 of 600,000 chunks" case observed during NOMAD1 ingestion testing — previously these looked identical to fully-completed embeds in the UI. Both warnings render only when their condition is met (silent by default; noisy only on real problems). Base is `feat/kb-ratio-registry` (#891) because Warning B's "expected chunks" estimate comes from `KbRatioRegistry.estimateChunks()`. GitHub fast-forwards to `rc` once #891 merges. - `app/utils/kb_warning_decision.ts` — pure `decideWarnings(inputs)` with thresholds (`100 MB`, `0.5×`) as exported constants. 10 unit tests cover the healthy case, both warnings, the under/at/over boundary, the registry-miss suppression, and the video-only registry case (`expectedChunks: 0` correctly skips Warning B). - `RagService.computeFileWarnings()` — single Qdrant scroll tallies chunks per source, filesystem walk fills in zero-chunk files, ratio registry estimates the expectation, decision function emits. - New endpoint `GET /api/rag/file-warnings` returns `Record<source, FileWarning[]>` (sources with no warnings are omitted, so the frontend can `warnings[source] ?? []` for clean defaults). - KB modal: warnings render inline under the file name as amber-tinted pills. Polled every 30s alongside the existing health check. - Warning C — chunks skipped due to length. 
PR #890 (#881 fix) prevents the silent drop at the embed boundary, so the underlying condition shouldn't fire anymore. If we still want to surface "we truncated N chunks to fit", that needs separate `skipped_count` tracking in EmbedFileJob — a Phase 2 follow-up. - Suppressing Warning B during active mid-ingestion. The user can cross-reference the Processing Queue to know it's in-flight; suppressing warnings while a job runs would mask real stalls where the job died mid-batch. Will revisit when per-card status is wired through. - Use of `kb_ingest_state.chunks_embedded` (#888) as the chunk count source. This PR uses Qdrant scroll directly so it can land independently of #888. - 10 new unit tests on `decideWarnings`, all pass - Type-check clean - Hot-patch + browser smoke test deferred until #891 lands (the ratio registry needs to exist in the DB for `estimateChunks()` to return non-null estimates — without it, only Warning A fires which is still useful but Warning B stays dormant)	2026-05-20 10:16:00 -07:00
Chris Sherwood	e68c753e39	feat(KB): surface embedding-disk estimate in curated tier-change modal (RFC #883 §1) When a user picks a tier in TierSelectionModal, show how much additional disk space the AI Assistant will need if the new ZIMs are indexed, plus a policy-aware footer explaining whether they'll auto-index (Always) or wait for opt-in (Manual). Estimates consume #891's KbRatioRegistry via a new POST /api/rag/estimate-batch endpoint. Backend - New POST /api/rag/estimate-batch route + RagController.estimateBatch - VineJS schema accepting array of {filename, sizeBytes}, capped at 500 - KbRatioRegistry.estimateBatch aggregates via the existing prefix-match lookup, returns {totalChunks, totalBytes, hasUnknown} - New BYTES_PER_CHUNK_ON_DISK constant (~8 KB: 3 KB vector + ~3 KB chunk text + ~2 KB payload/index overhead). Tunable; will be replaced by Phase 4 self-calibration once we have real measurements. - Controller normalizes incoming filenames via path.basename so callers that send full paths or URLs still match registry prefixes correctly. Frontend - api.estimateEmbeddingBatch() client method - TierSelectionModal: when localSelectedSlug is set, resolve the tier's resources (incl. inherited tiers), POST to /estimate-batch, and render a new info block with the +~X GB figure + ingest-policy copy. Also fetches rag.defaultIngestPolicy so the same block surfaces whether indexing will fire automatically or wait for the user. - resourceFilename() helper extracts the basename from the resource URL so the registry lookup hits the right prefix regardless of mirror. Tests - 4 new cases in tests/unit/kb_ratio_lookup.spec.ts covering the estimateBatch aggregator: standard sum, unknown-flagging, video-only ZIM (0 chunks but known, hasUnknown stays false), empty input. Stacks on feat/kb-ratio-registry (#891) — consumes the registry table seeded by that PR. Once #891 merges to rc, this PR auto-rebases. 
Out of scope for this PR (deferred to follow-ups): - Per-batch opt-in checkbox (RFC §1's '☑ Also index these for AI') needs a per-batch policy override path and is a separate PR - Guardrail modal at 50 GB / 10% free / 6 hr thresholds (RFC §7) is also separate; this PR is informational, not gating - Time-to-embed estimate awaits a chunks-per-second metric per host	2026-05-20 10:16:00 -07:00
Jake Turner	4c211964e0	fix(KB): add re-embed and reset & rebuild opts to fix broken embeddings (#886)	2026-05-20 10:16:00 -07:00
chriscrosstalk	62e75fdb54	feat(Content): custom ZIM library sources with pre-seeded mirrors (#593) * feat(content): add custom ZIM library sources with pre-seeded mirrors Users reported slow download speeds from the default Kiwix CDN. This adds the ability to browse and download ZIM files from alternative Kiwix mirrors or self-hosted repositories, all through the GUI. - Add "Custom Libraries" button next to "Browse the Kiwix Library" - Source dropdown to switch between Default (Kiwix) and custom libraries - Browsable directory structure with breadcrumb navigation - 5 pre-seeded official Kiwix mirrors (US, DE, DK, UK, Global CDN) - Built-in mirrors protected from deletion - Downloads use existing pipeline (progress, cancel, Kiwix restart) - Source selection persists across page loads via localStorage - Scrollable directory browser (600px max) with sticky header - SSRF protection on all custom library URLs Closes #576 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(content): recognize Wikipedia downloads from mirror sources When Wikipedia is downloaded via a custom mirror instead of the default Kiwix server, the completion callback now matches by filename instead of exact URL. This ensures the Wikipedia selector correctly shows "Installed" status and triggers old-version cleanup regardless of which mirror was used. Also handles the case where no Wikipedia selection exists yet (file downloaded before visiting the selector), creating the record automatically. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(ZIM): use cheerio for custom mirror directory parsing * fix(ZIM): use URL constructor for more robust joining --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Jake Turner <jturner@cosmistack.com>	2026-05-20 10:16:00 -07:00
0xGlitch	94059b0aaf	feat(Maps): regional map downloads via go-pmtiles extract (#780) * feat(maps): add regional map downloads via go-pmtiles extract * address Copilot review feedback on PR #780 - auto-refresh preflight on selection/maxzoom change with 400ms debounce and requestId stale-safety so the confirm button no longer requires a two-step "Estimate Size" -> "Start Download" dance - safeUpdateProgress helper replaces fire-and-forget updateProgress().catch() pattern so cancelled-job errors (code -1) can't surface as unhandled rejections - gate world basemap source on worldBasemapReady - when ensureWorldBasemap() fails we already delete world.pmtiles, so emitting the source was producing 404s on every tile request - verify go-pmtiles binary SHA256 at image build time; upstream doesn't ship a checksums file so per-arch hashes are pinned as build args with a regenerate note when bumping PMTILES_VERSION	2026-05-20 10:16:00 -07:00
Henry Estela	2d8a02f257	fix(RAG): add start button in kb modal and ensure restart policy exists (#700) Adds a check to RAG health to make sure nomad_qdrant is online. If it is not, the user is blocked from clicking any buttons in the KB modal until they click the start qdrant button and let the container start. A new file, qdrant_restart_policy_provider.ts, tries to ensure that the restart policy always exists for the nomad_qdrant container, even though the policy should already be in place from when the container was created.	2026-05-20 10:16:00 -07:00
chriscrosstalk	0183b42d71	feat(maps): add scale bar and location markers (#636) Add distance scale bar and user-placed location pins to the offline maps viewer. - Scale bar (bottom-left) shows distance reference that updates with zoom level - Click anywhere on map to place a named pin with color selection (6 colors) - Collapsible "Saved Locations" panel lists all pins with fly-to navigation - Full dark mode support for popups and panel via CSS overrides - New `map_markers` table with future-proofed columns for routing (marker_type, route_id, route_order, notes) to avoid a migration when routes are added later - CRUD endpoints: GET/POST /api/maps/markers, PATCH/DELETE /api/maps/markers/:id - VineJS validation on create/update - MapMarker Lucid model Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 14:26:50 -07:00
Jake Turner	877fb1276a	feat: gzip compression by default for all registered routes	2026-04-03 14:26:50 -07:00
chriscrosstalk	bac53e28dc	feat(downloads): rich progress, friendly names, cancel, and live status (#554) * feat(downloads): rich progress, friendly names, cancel, and live status Redesign the Active Downloads UI with four improvements: - Rich progress: BullMQ jobs now report downloadedBytes/totalBytes instead of just a percentage, showing "2.3 GB / 5.1 GB" instead of "78% / 100%" - Friendly names: dispatch title metadata from curated categories, Content Explorer library, Wikipedia selector, and map collections - Cancel button: Redis-based cross-process abort signal lets users cancel active downloads with file cleanup. Confirmation step prevents accidents. - Live status indicator: green pulsing dot with transfer speed for active downloads, orange stall warning after 60s of no data, gray dot for queued Backward compatible with in-flight jobs that have integer-only progress. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(downloads): fix cancel, dismiss, speed, and retry bugs - Speed indicator: only set prevBytesRef on first observation to prevent intermediate re-renders from inflating the calculated speed - Cancel: throw UnrecoverableError on abort to prevent BullMQ retries - Dismiss: remove stale BullMQ lock before job.remove() so cancelled jobs can actually be dismissed - Retry: add getActiveByUrl() helper that checks job state before blocking re-download, auto-cleans terminal jobs - Wikipedia: reset selection status to failed on cancel so the "downloading" state doesn't persist Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat(downloads): improve cancellation logic and surface true BullMQ job states --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Jake Turner <jturner@cosmistack.com>	2026-04-03 14:26:50 -07:00
0xGlitch	789fdfe95d	feat(maps): add global map download from Protomaps (#525) * feat(maps): add global map download from Protomaps * fix: add path traversal check to global map download	2026-04-03 14:26:50 -07:00
Henry Estela	69c15b8b1e	feat(AI): enable remote AI chat host	2026-04-03 14:26:50 -07:00
Chris Sherwood	023e3f30af	fix(downloads): allow users to dismiss failed downloads Failed download jobs persist in BullMQ forever with no way to clear them, leaving stale error notifications in Content Explorer and Easy Setup. Adds a dismiss button (X) on failed download cards that removes the job from the queue via a new DELETE endpoint. - Backend: DELETE /api/downloads/jobs/:jobId endpoint - Frontend: X button on failed download cards with immediate refresh Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 11:46:10 -07:00
Chris Sherwood	e4fde22dd9	feat(UI): add Debug Info modal for bug reporting Add a "Debug Info" link to the footer and settings sidebar that opens a modal with non-sensitive system information (version, OS, hardware, GPU, installed services, internet status, update availability). Users can copy the formatted text and paste it into GitHub issues. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-20 11:46:10 -07:00
Chris Sherwood	6a737ed83f	feat(UI): add Support the Project settings page Adds a new settings page with Ko-fi donation link, Rogue Support banner, and community contribution options (GitHub, Discord). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-20 11:46:10 -07:00
Jake Turner	58b106f388	feat: support for updating services	2026-03-11 14:08:09 -07:00
Jake Turner	dfa896e86b	feat(RAG): allow deletion of files from KB	2026-03-04 20:05:14 -08:00
Jake Turner	99b96c3df7	feat(RAG): display embedding queue and improve progress tracking	2026-03-04 20:05:14 -08:00
Jake Turner	e75d54bd69	fix(UI): gracefully handle legacy docs and knowledge-base paths	2026-02-18 14:52:06 -08:00
Jake Turner	d55ff7b466	feat: curated content update checking	2026-02-11 21:49:46 -08:00
Jake Turner	32d206cfd7	feat: curated content system overhaul	2026-02-11 15:44:46 -08:00
Jake Turner	4747863702	feat(AI Assistant): allow manual scan and resync KB	2026-02-09 15:16:18 -08:00
Jake Turner	6745dbf3d1	feat: move KB UI into AI Assistant UI	2026-02-08 13:20:10 -08:00
Jake Turner	36b6d8ed7a	fix: rework content tier system to dynamically determine install status Removes the InstalledTier model and instead checks presence of files on-the-fly. Avoid broken state by handling on the server-side vs. marking as installed by client-side API call	2026-02-04 22:58:21 -08:00
Jake Turner	d1f40663d3	feat(RAG): initial beta with preprocessing, embedding, semantic retrieval, and ctx passage	2026-02-01 23:59:21 +00:00
Jake Turner	1923cd4cde	feat(AI): chat suggestions and assistant settings	2026-02-01 07:24:21 +00:00
Chris Sherwood	68f374e3a8	feat: Add dedicated Wikipedia Selector with smart package management Adds a standalone Wikipedia selection section that appears prominently in both the Easy Setup Wizard and Content Explorer. Features include: - Six Wikipedia package options ranging from Quick Reference (313MB) to Complete Wikipedia with Full Media (99.6GB) - Card-based radio selection UI with clear size indicators - Smart replacement: downloads new package before deleting old one - Status tracking: shows Installed, Selected, or Downloading badges - "No Wikipedia" option for users who want to skip or remove Wikipedia Technical changes: - New wikipedia_selections database table and model - New /api/zim/wikipedia and /api/zim/wikipedia/select endpoints - WikipediaSelector component with consistent styling - Integration with existing download queue system - Callback updates status to 'installed' on successful download - Wikipedia removed from tiered category system to avoid duplication UI improvements: - Added section dividers and icons (AI Models, Wikipedia, Additional Content) - Consistent spacing between major sections in Easy Setup Wizard - Content Explorer gets matching Wikipedia section with submit button Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 21:00:51 -08:00
Jake Turner	243f749090	feat: [wip] native AI chat interface	2026-01-31 20:39:49 -08:00
Jake Turner	50174d2edb	feat(RAG): [wip] RAG capabilities	2026-01-31 20:39:49 -08:00
Jake Turner	8cfe490b57	feat: subscribe to release notes	2026-01-27 23:22:26 -08:00
Jake Turner	c8de767052	feat(Maps): automatically download base assets if missing	2026-01-27 20:49:56 -08:00
chriscrosstalk	7a5a254dd5	feat(benchmark): Require full benchmark with AI for community sharing (#99) * feat(benchmark): Require full benchmark with AI for community sharing Only allow users to share benchmark results with the community leaderboard when they have completed a full benchmark that includes AI performance data. Frontend changes: - Add AI Assistant installation check via service API query - Show pre-flight warning when clicking Full Benchmark without AI installed - Disable AI Only button when AI Assistant not installed - Show "Partial Benchmark" info alert for non-shareable results - Only display "Share with Community" for full benchmarks with AI data - Add note about AI installation requirement with link to Apps page Backend changes: - Validate benchmark_type is 'full' before allowing submission - Require ai_tokens_per_second > 0 for community submission - Return clear error messages explaining requirements Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> * fix(benchmark): UI improvements and GPU detection fix - Fix GPU detection to properly identify AMD discrete GPUs - Fix gauge colors (high scores now green, low scores red) - Fix gauge centering (SVG size matches container) - Add info tooltips for Tokens/sec and Time to First Token Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> * fix(benchmark): Extract iGPU from AMD APU CPU name as fallback When systeminformation doesn't detect graphics controllers (common on headless Linux), extract the integrated GPU name from AMD APU CPU model strings like "AMD Ryzen AI 9 HX 370 w/ Radeon 890M". 
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> * feat(benchmark): Add Builder Tag system for community leaderboard - Add builder_tag column to benchmark_results table - Create BuilderTagSelector component with word dropdowns + randomize - Add 50 adjectives and 50 nouns for NOMAD-themed tags (e.g., Tactical-Llama-1234) - Add anonymous sharing option checkbox - Add builder tag display in Benchmark Details section - Add Benchmark History section showing all past benchmarks - Update submission API to accept anonymous flag - Add /api/benchmark/builder-tag endpoint to update tags Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> * feat(benchmark): Add HMAC signing for leaderboard submissions Sign benchmark submissions with HMAC-SHA256 to prevent casual API abuse. Includes X-NOMAD-Timestamp and X-NOMAD-Signature headers. Note: Since NOMAD is open source, a determined attacker could extract the secret. This provides protection against casual abuse only. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-25 00:24:31 -08:00
Chris Sherwood	5afc3a270a	feat: Improve curated collections UX with persistent tier selection - Add installed_tiers table to persist user's tier selection per category - Change tier selection behavior: clicking a tier now highlights it locally, user must click "Submit" to confirm (previously clicked = immediate download) - Remove "Recommended" badge and asterisk (*) from tier displays - Highlight installed tier instead of recommended tier in CategoryCard - Add "Click to choose" hint when no tier is installed - Save installed tier when downloading from Content Explorer or Easy Setup - Pass installed tier to modal as default selection Database: - New migration: create installed_tiers table (category_slug unique, tier_slug) - New model: InstalledTier Backend: - ZimService.listCuratedCategories() now includes installedTierSlug - New ZimService.saveInstalledTier() method - New POST /api/zim/save-installed-tier endpoint Frontend: - TierSelectionModal: local selection state, "Close" → "Submit" button - CategoryCard: highlight based on installedTierSlug, add "Click to choose" - Content Explorer: save tier after download, refresh categories - Easy Setup: save tiers on wizard completion Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-24 15:33:50 -08:00
Jake Turner	f49b9abb81	fix(Maps): static path resolution	2026-01-23 14:17:25 -08:00
Chris Sherwood	755807f95e	feat: Add system benchmark feature with NOMAD Score Add comprehensive benchmarking capability to measure server performance: Backend: - BenchmarkService with CPU, memory, disk, and AI benchmarks using sysbench - Database migrations for benchmark_results and benchmark_settings tables - REST API endpoints for running benchmarks and retrieving results - CLI commands: benchmark:run, benchmark:results, benchmark:submit - BullMQ job for async benchmark execution with SSE progress updates - Synchronous mode option (?sync=true) for simpler local dev setup Frontend: - Benchmark settings page with circular gauges for scores - NOMAD Score display with weighted composite calculation - System Performance section (CPU, Memory, Disk Read/Write) - AI Performance section (tokens/sec, time to first token) - Hardware Information display - Expandable Benchmark Details section - Progress simulation during sync benchmark execution Easy Setup Integration: - Added System Benchmark to Additional Tools section - Built-in capability pattern for non-Docker features - Click-to-navigate behavior for built-in tools Fixes: - Docker log multiplexing issue (Tty: true) for proper output parsing - Consolidated disk benchmarks into single container execution Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 21:48:12 -08:00
Jake Turner	9bb4ff5afc	feat: force-reinstall option for apps	2026-01-19 22:50:15 -08:00
Jake Turner	937da5d869	feat(Open WebUI): manage models via Command Center	2026-01-19 22:15:52 -08:00
Jake Turner	b3ef977484	feat: [wip] Open WebUI manipulation	2026-01-19 22:15:52 -08:00
Jake Turner	b6e6e10328	fix(CuratedCategories): improve fetching from Github	2026-01-19 14:41:51 -08:00
copilot-swe-agent[bot]	f905871392	Add NOMAD_STORAGE_PATH schema definition to start/env.ts Co-authored-by: jakeaturner <52841588+jakeaturner@users.noreply.github.com>	2026-01-19 10:29:24 -08:00
Jake Turner	393c177af1	feat: [wip] self updates	2026-01-15 15:54:59 -08:00
Jake Turner	5793fc2139	feat: [wip] easy setup wizard	2026-01-15 15:54:59 -08:00
Jake Turner	df55b48e1c	fix(admin): container healthcheck	2026-01-13 06:58:05 -08:00
Jake Turner	a2206b8c13	feat(System): check internet status on backend and allow custom test url	2025-12-24 12:00:32 -08:00
Jake Turner	6ac9d147cf	feat(Collections): map region collections	2025-12-23 16:00:33 -08:00
Jake Turner	7569aa935d	feat: background job overhaul with bullmq	2025-12-06 23:59:01 -08:00
Jake Turner	dd4e7c2c4f	feat: curated zim collections	2025-12-05 15:47:22 -08:00
Jake Turner	606dd3ad0b	feat: [wip] custom map and zim downloads	2025-12-02 08:25:09 -08:00
Jake Turner	12a6f2230d	feat: [wip] new maps system	2025-11-30 22:29:16 -08:00
Jake Turner	07a198f918	feat(Settings): display system information	2025-08-20 23:05:19 -07:00
Jake Turner	377f49162f	feat(Settings): add legal notices page	2025-08-20 23:05:19 -07:00
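
The two Stored Files warnings described in commit 563f86a22b can be read as a small pure function. The following is only a sketch of that decision, not the code in `app/utils/kb_warning_decision.ts`: the constant names, the `WarningInputs` shape, the string warning codes, and the choice of 100 × 1024 × 1024 bytes for "100 MB" are all assumptions made for illustration. Only the function name `decideWarnings`, the two thresholds, and the registry-miss and `expectedChunks: 0` behaviour come from the commit message.

```typescript
// Hypothetical constant names; the commit only says the thresholds
// (100 MB, 0.5x) are exported constants.
export const ZERO_CHUNK_MIN_FILE_BYTES = 100 * 1024 * 1024; // assumed binary MB
export const PARTIAL_EMBED_RATIO = 0.5;

// Hypothetical warning codes standing in for the real FileWarning type.
export type FileWarning = "zero_chunks" | "partial_embed";

// Hypothetical input shape for illustration.
export interface WarningInputs {
  fileSizeBytes: number;
  chunks: number; // chunks counted for this source
  expectedChunks: number | null; // null models a ratio-registry miss
}

export function decideWarnings(inputs: WarningInputs): FileWarning[] {
  const warnings: FileWarning[] = [];

  // Warning A: a large file that produced no chunks at all.
  if (inputs.fileSizeBytes > ZERO_CHUNK_MIN_FILE_BYTES && inputs.chunks === 0) {
    warnings.push("zero_chunks");
  }

  // Warning B: fewer than half the expected chunks. Skipped on a registry
  // miss (null) and when the registry expects 0 chunks (video-only content).
  // Assumption: A and B are evaluated independently, so both may fire.
  const expected = inputs.expectedChunks;
  if (expected !== null && expected > 0 && inputs.chunks < PARTIAL_EMBED_RATIO * expected) {
    warnings.push("partial_embed");
  }

  return warnings;
}
```

Under these assumptions, the "266 of 600,000 chunks" case from the commit message yields only the partial-embed warning, while a large video-only file with `expectedChunks: 0` yields only the zero-chunk warning.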

56 Commits