* FIX: make AI helper more robust
- If the JSON for structured output is malformed, fall back to a more forgiving parser
- Gemini 2.5 Flash does not support temperature; support opting out of sending it
- Evals for the assistant were broken; fix the interface
- Add some missing LLMs
- The translator was not mapped correctly to its feature; fix that
- Don't mix XML into the translator prompt
* lint
* correct logic
* simplify code
* implement best-effort JSON parsing directly in the structured output object
A more deterministic way to make sure the LLM detects the correct language (instead of relying on the prompt to tell the LLM which elements to ignore) is to take the cooked HTML and strip the unwanted elements before detection.
In this commit we:
- remove quotes, image captions, etc. and only keep the remaining text, falling back to the unadulterated cooked content
- update the prompts related to detection and translation
- /152465/12
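As an illustrative sketch only (the real implementation is server-side; the selectors here are assumptions), the idea looks roughly like this:

```javascript
// Illustrative sketch: strip quotes, image captions, code, etc. from the cooked
// HTML and keep only the remaining text for language detection.
// Selector names are assumptions; the actual implementation lives in the plugin.
function textForLanguageDetection(cookedHtml) {
  const doc = new DOMParser().parseFromString(cookedHtml, "text/html");
  doc
    .querySelectorAll("aside.quote, .lightbox-wrapper .meta, img, pre, code")
    .forEach((el) => el.remove());
  const text = doc.body.textContent.replace(/\s+/g, " ").trim();
  // Fall back to the unadulterated cooked content when nothing usable remains.
  return text.length > 0 ? text : cookedHtml;
}
```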
Also renames the Mixtral tokenizer to Mistral.
See gem at github.com/discourse/discourse_ai-tokenizers
Co-authored-by: Roman Rizzi <roman@discourse.org>
- Add support for `chain.streamCustomRaw(text)`, which can be used to stream text from a JS tool directly to the composer
- Add support for llm params in `llm.generate`, which unlocks things like structured outputs
- Add `discourse.createStagedUser`, `discourse.createTopic` and `discourse.createPost` for content creation
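A minimal sketch of a custom tool combining these new APIs (argument shapes and parameter names are assumptions; only the API names come from this change):

```javascript
// Sketch of a custom tool's invoke function exercising the new APIs.
// Argument shapes below are assumptions, not documented signatures.
function invoke(params) {
  // llm params can now be passed to llm.generate, e.g. to request structured output.
  const report = llm.generate("Summarize: " + params.text, {
    response_format: {
      // assumed option shape for a JSON-schema structured response
      type: "json_schema",
      json_schema: {
        name: "summary",
        schema: {
          type: "object",
          properties: { summary: { type: "string" } },
          required: ["summary"],
        },
      },
    },
  });

  // Stream raw text from the tool directly to the composer.
  chain.streamCustomRaw(report);

  // Content creation helpers (field names are assumptions).
  const author = discourse.createStagedUser({ email: params.email, username: params.username });
  const topic = discourse.createTopic({ title: params.title, raw: report, username: author.username });
  discourse.createPost({ topic_id: topic.id, raw: "Follow-up notes", username: author.username });
  return "done";
}
```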
In hybrid mode, AI artifacts can optionally run automatically.
This is useful for cases where you want to embed a survey and the like.
Additionally, artifacts now allow for better fidelity around display:
<div class="ai-artifact" data-ai-artifact-id="501" data-ai-artifact-height="300px" data-ai-artifact-autorun data-ai-artifact-seamless></div>
Users can supply a height and a seamless mode so the artifact is rendered seamlessly, without the box shadow and the show-fullscreen button.
Introduces a persistent, user-scoped key-value storage system for
AI Artifacts, enabling them to be stateful and interactive. This
transforms artifacts from static content into mini-applications that can
save user input, preferences, and other data.
The core components of this feature are:
1. **Model and API**:
- A new `AiArtifactKeyValue` model and corresponding database table to
store data associated with a user and an artifact.
- A new `ArtifactKeyValuesController` provides a RESTful API for
CRUD operations (`index`, `set`, `destroy`) on the key-value data.
- Permissions are enforced: users can only modify their own data but
can view public data from other users.
2. **Secure JavaScript Bridge**:
- A `postMessage` communication bridge is established between the
sandboxed artifact `iframe` and the parent Discourse window.
- A JavaScript API is exposed to the artifact as `window.discourseArtifact`
  with async methods: `get(key)`, `set(key, value, options)`,
  `delete(key)`, and `index(filter)` (see the sketch at the end of this description).
- The parent window handles these requests, makes authenticated calls to the
new controller, and returns the results to the iframe. This ensures
security by keeping untrusted JS isolated.
3. **AI Tool Integration**:
- The `create_artifact` tool is updated with a `requires_storage`
boolean parameter.
- If an artifact requires storage, its metadata is flagged, and the
system prompt for the code-generating AI is augmented with detailed
documentation for the new storage API.
4. **Configuration**:
- Adds hidden site settings `ai_artifact_kv_value_max_length` and
`ai_artifact_max_keys_per_user_per_artifact` for throttling.
This also includes a minor fix to use `jsonb_set` when updating
artifact metadata, ensuring other metadata fields are preserved.
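A sketch of artifact-side code using the storage bridge (the option and filter shapes are assumptions; only the method names come from this change):

```javascript
// Sketch: persist and read back per-user state from inside a sandboxed artifact.
// Option and filter shapes are assumptions.
async function saveAnswer(questionId, answer) {
  await window.discourseArtifact.set(`answer:${questionId}`, JSON.stringify(answer), {
    public: true, // assumption: make the value readable by other users
  });
}

async function loadAnswers() {
  // List stored key/values (optionally filtered), then read a single key back.
  const all = await window.discourseArtifact.index({ prefix: "answer:" }); // filter shape is an assumption
  const mine = await window.discourseArtifact.get("answer:1");
  return { all, mine: mine ? JSON.parse(mine) : null };
}

async function clearAnswer(questionId) {
  await window.discourseArtifact.delete(`answer:${questionId}`);
}
```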
Adds context length controls to researcher (max tokens per post and batch)
Allow picking LLM for researcher
Fix bug where unicode usernames were not working
Fix documentation of OR logic
- add sleep function for tool polling with rate limits (see the sketch after this list)
- Support base64 encoding for HTTP requests and uploads
- Enhance forum researcher with cost warnings and comprehensive planning
- Add cancellation support for research operations
- Include feature_name parameter for bot analytics
- richer research support (OR queries)
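A sketch of how a custom tool might use the new sleep helper to poll a rate-limited API (the sleep signature and the http call shape are assumptions):

```javascript
// Sketch: poll a rate-limited endpoint from a custom tool, pausing between attempts.
// The sleep(ms) helper's exact signature and the http.get response shape are assumptions.
function invoke(params) {
  for (let attempt = 0; attempt < 5; attempt++) {
    const response = http.get(`https://example.com/api/jobs/${params.job_id}`);
    const body = JSON.parse(response.body);
    if (body.status === "done") {
      return body.result;
    }
    // Back off before the next poll to stay within rate limits.
    sleep(2000);
  }
  return { error: "timed out waiting for the job to finish" };
}
```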
* FEATURE: add inferred concepts system
This commit adds a new inferred concepts system that:
- Creates a model for storing concept labels that can be applied to topics
- Provides AI personas for finding new concepts and matching existing ones
- Adds jobs for generating concepts from popular topics
- Includes a scheduled job that automatically processes engaging topics
* FEATURE: Extend inferred concepts to include posts
* Adds support for concepts to be inferred from and applied to posts
* Replaces daily task with one that handles both topics and posts
* Adds database migration for posts_inferred_concepts join table
* Updates PersonaContext to include inferred concepts
Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>
Co-authored-by: Keegan George <kgeorge13@gmail.com>
* FEATURE: support upload.getUrl in custom tools
Some tools need to share images with an API, and a common pattern
is for APIs to expect a URL.
This allows converting upload://123123 to a proper, CDN-friendly
URL from within a custom tool.
* secure uploads are not supported, so be explicit about it.
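A sketch of a custom tool resolving an upload before calling an external API (parameter names and the http call shape are assumptions):

```javascript
// Sketch: convert a short upload reference (upload://...) to a CDN-friendly URL
// before handing it to an external API. Parameter names are assumptions.
function invoke(params) {
  const imageUrl = upload.getUrl(params.image); // e.g. params.image === "upload://123123"
  if (!imageUrl) {
    // assumption: resolution can fail, e.g. because secure uploads are not supported
    return { error: "could not resolve upload" };
  }
  return http.post("https://example.com/api/describe", {
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ image_url: imageUrl }),
  });
}
```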
* Small fix, reasoning is now available on Claude 4 models
* fix: invalid filters should raise, and the topic filter was not working
* fix spec so we are consistent
This change fixes two bugs and adds a safeguard.
The first issue is that the schema Gemini expected differed from the one sent, resulting in 400 errors when performing completions.
The second issue was that creating a new persona wouldn't define a method
for `response_format`. This has to be explicitly defined when we wrap it inside the Persona class. Also, there was a mismatch between the default value and what we stored in the DB: some parts of the code expected symbols as keys and others expected strings.
Finally, we add a safeguard for when the model, even if asked to, refuses to reply with valid JSON. In this case, we make a best effort to recover and stream the raw response.
* FEATURE: allow researcher to also research specific topics
Also improve UI around research with more accurate info
* this ensures that PMs are never included under any conditions
This commit introduces a new Forum Researcher persona specialized in deep forum content analysis along with comprehensive improvements to our AI infrastructure.
Key additions:
- New Forum Researcher persona with advanced filtering and analysis capabilities
- Robust filtering system supporting tags, categories, dates, users, and keywords
- LLM formatter to efficiently process and chunk research results
Infrastructure improvements:
- Implemented CancelManager class to centrally manage AI completion cancellations
- Replaced callback-based cancellation with a more robust pattern
- Added systematic cancellation monitoring with callbacks
Other improvements:
- Added configurable default_enabled flag to control which personas are enabled by default
- Updated translation strings for the new researcher functionality
- Added comprehensive specs for the new components
- Renames Researcher -> Web Researcher
This change makes our AI platform more stable while adding powerful research capabilities that can analyze forum trends and surface relevant content.
Examples simulate previous interactions with an LLM and come
right after the system prompt. This helps ground the model and
produce better responses.
* DEV: Use structured responses for summaries
* Fix system specs
* Make response_format a first-class citizen and update endpoints to support it
* Response format can be specified in the persona
* lint
* switch to jsonb and make column nullable
* Reify structured output chunks. Move JSON parsing to the depths of Completion
* Switch to JsonStreamingTracker for partial JSON parsing
System personas leaned on reused classes; this was a problem
in a multisite environment because state, such as "enabled",
ended up being shared between sites.
The new implementation ensures state is pristine between sites in
a multisite setup.
* more handling for new superclass story
* small oversight: the display name should be used for display
This commit enhances the AI image generation functionality by adding support for:
1. OpenAI's GPT-based image generation model (gpt-image-1)
2. Image editing capabilities through the OpenAI API
3. A new "Designer" persona specialized in image generation and editing
4. Two new AI tools: CreateImage and EditImage
Technical changes include:
- Renaming `ai_openai_dall_e_3_url` to `ai_openai_image_generation_url` with a migration
- Adding `ai_openai_image_edit_url` setting for the image edit API endpoint
- Refactoring image generation code to handle both DALL-E and the newer GPT models
- Supporting multipart/form-data for image editing requests
* wild guess, but maybe quantization is breaking the test sometimes;
this increases the allowed distance
* Update lib/personas/designer.rb
Co-authored-by: Alan Guo Xiang Tan <gxtan1990@gmail.com>
* simplify and de-flake code
* fix: in chat we need enough context so we know exactly which uploads a user uploaded.
* Update lib/personas/tools/edit_image.rb
Co-authored-by: Alan Guo Xiang Tan <gxtan1990@gmail.com>
* cleanup downloaded files right away
* fix implementation
---------
Co-authored-by: Alan Guo Xiang Tan <gxtan1990@gmail.com>
Add API methods to AI tools for reading and updating personas, enabling
more flexible AI workflows. This allows custom tools to:
- Fetch persona information through discourse.getPersona()
- Update personas with modified settings via discourse.updatePersona()
- Update a fetched persona directly via persona.update()
These APIs enable new use cases like "trainable" moderation bots, where
users with appropriate permissions can set and refine moderation rules
through direct chat interactions, without needing admin panel access.
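A sketch of the trainable moderation bot idea using these APIs (field names on the persona object and the lookup key are assumptions):

```javascript
// Sketch: append a new moderation rule to a persona's system prompt from a custom tool.
// Only getPersona/updatePersona/persona.update come from this change; field names are assumptions.
function invoke(params) {
  const persona = discourse.getPersona("moderation_bot"); // lookup key is an assumption
  const updatedPrompt = persona.system_prompt + "\n- " + params.new_rule;

  // Either update through the discourse helper...
  discourse.updatePersona("moderation_bot", { system_prompt: updatedPrompt });
  // ...or directly on the fetched persona object:
  // persona.update({ system_prompt: updatedPrompt });

  return { added_rule: params.new_rule };
}
```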
Also adds a special API scope which allows people to lean on the API
for similar actions.
Additionally adds a rather powerful hidden feature that allows custom tools
to inject content into the context unconditionally; it can be used for memory and similar features.
* REFACTOR: Move personas into their own module.
* WIP: Use personas for summarization
* Prioritize persona default LLM or fallback to newest one
* Simplify summarization strategy
* Keep ai_summarization_model as a fallback
This change moves all the personas code into its own module. We want to treat them as a building block that features can be built on top of, the same as `Completions::Llm`.
The code to title a message was moved from `Bot` to `Playground`.