70 Commits

Author SHA1 Message Date
Mutasem Aldmour
fdceec21b9
feat: Add pairwise workflow eval pipeline (#29123)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Co-authored-by: Jaakko Husso <jaakko@n8n.io>
2026-05-04 13:26:27 +00:00
Luca Mattiazzi
9078bb2306
feat(ai-builder): Add a binary check to avoid code import in code blocks (no-changelog) (#28382)
Co-authored-by: cubic-dev-ai[bot] <191113872+cubic-dev-ai[bot]@users.noreply.github.com>
2026-04-14 08:02:41 +00:00
Eugene
00014420b1
refactor(core): Remove multi-agent architecture entry point from AI workflow builder (no-changelog) (#27925)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-02 16:32:19 +00:00
Eugene
6294b0e56f
feat(ai-builder): Add agent text response evaluation and workflow changes binary check (no-changelog) (#27755)
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-31 07:49:10 +00:00
Eugene
6314cd4842
feat(ai-builder): Support dataset context and conversation history in evaluations (no-changelog) (#27618)
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-authored-by: cubic-dev-ai[bot] <191113872+cubic-dev-ai[bot]@users.noreply.github.com>
2026-03-30 08:14:01 +00:00
oleg
834966e145
feat(ai-builder): Add binary-checks evaluation suite (no-changelog) (#26415)
Signed-off-by: Oleg Ivaniv <me@olegivaniv.com>
2026-03-04 08:42:05 +00:00
Benjamin Schroth
dfc4fac1ce
fix(ai-builder): Fix more evals executions of workflows (no-changelog) (#26043)
Co-authored-by: oleg <me@olegivaniv.com>
2026-02-20 13:19:54 +00:00
Benjamin Schroth
ce32754088
feat(ai-builder): Add subgraph evaluation framework for responder (no-changelog) (#25419)
2026-02-18 12:33:34 +00:00
Albert Alises
9fb5d437d3
feat(core): Merge Ask and Build into unified multi-agent chat experience (#25692)
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-18 10:16:24 +00:00
Benjamin Schroth
e4ac345eda
feat(ai-builder): Implement workflow execution in evaluations (no-changelog) (#25814)
2026-02-18 08:41:56 +00:00
Eugene
892f086579
feat(core): Add introspection diagnostic tool for AI workflow builder (#25172)
2026-02-12 10:57:44 +00:00
Mutasem Aldmour
cd175ddda0
refactor(ai-builder): Remove community node support from code builder (#25575)
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-10 14:47:01 +00:00
Mutasem Aldmour
9729c2a5da
feat(ai-builder): Add code-base workflow builder (#24535)
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-10 12:24:27 +00:00
oleg
794a8d6af4
feat(ai-builder): Planning mode (#25498)
Signed-off-by: Oleg Ivaniv <me@olegivaniv.com>
2026-02-09 08:56:01 +00:00
Michael Drury
832e580b39
chore(ai-builder): Add CSV output for evaluation results (#25193)
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-04 09:02:14 +00:00
Michael Drury
21e4364920
feat(ai-builder): Adding node parameter (required and options) programmatic validations to WFB (#25101)
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-02 17:16:43 +00:00
oleg
24bb638982
refactor(ai-builder): Consolidate AI Workflow Builder agents and simplify prompts (no-changelog) (#25020)
Signed-off-by: Oleg Ivaniv <me@olegivaniv.com>
Co-authored-by: cubic-dev-ai[bot] <191113872+cubic-dev-ai[bot]@users.noreply.github.com>
2026-02-02 13:57:48 +00:00
Michael Drury
d12e9318d0
feat(ai-builder): Workflow builder "respond to webhook" understanding (#24641)
2026-01-27 14:38:09 +00:00
Albert Alises
341976947f
feat(ai-builder): Add webhook notifications with HMAC authentication for AI evaluation results (#24766)
Co-authored-by: cubic-dev-ai[bot] <191113872+cubic-dev-ai[bot]@users.noreply.github.com>
2026-01-26 10:16:05 +00:00
Albert Alises
09222733e1
feat(ai-builder): Add webhook notifications for AI evaluation results (#24653)
2026-01-23 09:39:04 +00:00
Albert Alises
99cb5982a0
ci(core): Add automated AI workflow builder evaluations (#24582)
2026-01-21 15:30:45 +00:00
Albert Alises
70ef256188
feat(ai-builder): Make credentials validation minor instead of major (#24529)
2026-01-19 16:15:29 +00:00
oleg
448522142c
feat(ai-builder): Add per-stage model configuration for evaluations (no-changelog) (#24344)
Signed-off-by: Oleg Ivaniv <me@olegivaniv.com>
2026-01-15 12:58:49 +00:00
Michael Drury
3f225cafca
fix(ai-builder): Clarify with spec eval judge node type potential mis-matches (no-changelog) (#24345)
2026-01-15 12:35:45 +00:00
oleg
fb5392cdd4
fix(ai-builder): Use LangSmith SDK defaults for batch settings (no-changelog) (#24305)
2026-01-14 11:29:55 +00:00
oleg
734bed4f84
fix(ai-builder): Remove pairwise multi-gen evals and improve logs (no-changelog) (#24270)
Signed-off-by: Oleg Ivaniv <me@olegivaniv.com>
2026-01-14 08:44:09 +00:00
oleg
f880a74d99
refactor(ai-builder): Implement unified evaluations harness (#23955)
Signed-off-by: Oleg Ivaniv <me@olegivaniv.com>
Co-authored-by: cubic-dev-ai[bot] <191113872+cubic-dev-ai[bot]@users.noreply.github.com>
2026-01-13 12:11:13 +00:00
oleg
3504b982b5
chore(ai-builder): Remove legacy agent and make multi-agent default (no-changelog) (#24076)
Signed-off-by: Oleg Ivaniv <me@olegivaniv.com>
2026-01-13 10:24:27 +00:00
Eugene
755a1f2708
fix(ai-builder): Examples omitted in spec evals (#24171)
2026-01-12 13:20:44 +00:00
Michael Drury
c8391ac6b8
fix(ai-builder): Reducing confusion in spec evals from double negative don't criteria (#24099)
2026-01-12 09:58:00 +00:00
Michael Drury
64aed9fb27
chore(ai-builder): Support filtering by do/don't criteria in pairwise evals (#23914)
2026-01-06 16:40:28 +00:00
Eugene
ec34459e37
chore(ai-builder): Add evaluation run type metadata (#23793)
2026-01-02 15:21:09 +01:00
Eugene
1199904b15
feat(ai-builder): Improve generation across data_transformation category (#23609)
2025-12-29 17:30:16 +01:00
Michael Drury
94dd47b637
feat(ai-builder): Multi-agent workflow builder improvements, best practices and better template usage (#23417)
2025-12-19 14:00:06 +00:00
Michael Drury
0bd7a94d0c
fix(ai-builder): Connection validation respects node version and type (#23349)
2025-12-17 12:38:22 +00:00
oleg
be6d68408d
refactor(ai-builder): Improve pairwise evaluation architecture and LangSmith integration (no-changelog) (#23084)
2025-12-15 12:11:25 +01:00
Benjamin Schroth
537747bd63
chore: Make workflow comparison run in CI (#23011)
2025-12-10 16:42:39 +01:00
Michael Drury
272a5d4435
feat(ai-builder): WF builder programmatic validation for HTTP request and set nodes (correct credential usage) (#22941)
2025-12-10 10:35:54 +00:00
Mutasem Aldmour
dcbc24d3f7
feat(ai-builder): Consolidate builder telemetry (#22644)
2025-12-08 10:56:38 +01:00
oleg
b97b5d5bf8
feat(ai-builder): Add multi-judge pairwise evals and local mode (no-changelog) (#22708)
Signed-off-by: Oleg Ivaniv <me@olegivaniv.com>
2025-12-05 17:12:18 +01:00
Michael Drury
33a6aa665c
fix(ai-builder): Allow setting evaluation feature flags via environment variables (no-changelog) (#22813)
2025-12-05 13:37:56 +00:00
oleg
f2930e2db9
chore(ai-builder): Add workflow naming, compaction, and session cleanup to multi-agent (no-changelog) (#22646)
Signed-off-by: Oleg Ivaniv <me@olegivaniv.com>
2025-12-03 17:50:22 +01:00
oleg
ccd974ea22
feat(ai-builder): Implement Core Subgraph Infrastructure (no-changelog) (#22325)
Signed-off-by: Oleg Ivaniv <me@olegivaniv.com>
2025-12-01 07:53:55 +01:00
oleg
769c1c9ec2
test(ai-builder): Add pairwise evaluations (no-changelog) (#22438)
Signed-off-by: Oleg Ivaniv <me@olegivaniv.com>
2025-11-28 15:35:53 +01:00
Mutasem Aldmour
4319da6f1c
fix(ai-builder): Fix import of multiple nodes with maxNode, add validation (#22348)
2025-11-26 17:34:34 +01:00
Benjamin Schroth
5f5181ec28
test(ai-builder): Integrate structural workflow comparison (#22209)
2025-11-24 15:25:12 +01:00
Benjamin Schroth
246387e3c9
feat(ai-builder): Add python script for workflow comparison (#21927)
2025-11-21 14:56:37 +01:00
Michael Drury
da2446ead3
fix(ai-builder): Improving workflow builder following model instructions and using AI agent node (#22011)
2025-11-19 14:58:12 +00:00
Eugene
0a355ccadb
feat(ai-builder): Send workflow validation issues to telemetry (#21837)
Co-authored-by: Michael Drury <michael.drury@n8n.io>
2025-11-18 16:30:16 +00:00
Michael Drury
72dfa553a1
feat(ai-builder): Updating prompt suggestions for the workflow builder (#21690)
2025-11-10 19:51:06 +02:00