discourse-ai

mirror of https://github.com/discourse/discourse-ai.git synced 2025-09-08 20:50:38 +00:00

Author	SHA1	Message	Date
Sam	32dc45ba4f	FIX: never block spam scanning user (#1437) Previously, staff and bots would get scanned if their TL was low. Additionally, if the spam scanner user was somehow blocked (deactivated, silenced, banned), the feature would stop working. This adds an override that unconditionally ensures the user is set up correctly prior to scanning.	2025-06-17 14:51:27 +10:00
Rafael dos Santos Silva	bc8e57d7e8	DEV: Move title suggestion to an array (#1435 )	2025-06-16 18:06:54 -03:00
Natalie Tay	b5e8277083	DEV: Move AI translation feature into an AI Feature (#1424 ) This PR moves translations into an AI Feature See https://github.com/discourse/discourse-ai/pull/1424 for screenshots	2025-06-13 10:17:27 +08:00
Keegan George	9be1049de6	DEV: Log AI related configuration to staff action log (#1416) This update adds logging for changes made in the AI admin panel. When making configuration changes to Embeddings, LLMs, Personas, Tools, or Spam that aren't site setting related, changes will now be logged in Admin > Logs & Screening. This will help admins debug issues related to AI. This update also creates a helper lib called `AiStaffActionLogger`, which can easily be used in the future to add logging support for any other admin config we need logged for AI.	2025-06-12 12:39:58 -07:00
Natalie Tay	fc83bed7cd	FIX: When allowing private content translation, only translate group PMs and not personal PMs (#1432 ) We want to avoid translating PMs that are not group PMs. This condition is applied when `SiteSetting.ai_translation_backfill_limit_to_public_content = false`	2025-06-13 00:55:52 +08:00
Roman Rizzi	9b7f1e6ee9	FIX: Helper wasn't working when the persona doesn't use structured output (#1433 )	2025-06-12 12:33:12 -03:00
Sam	ed311de937	FIX: various bugs in AI interface (#1430 ) * FIX: improve transition logic in forms previously back button would take you back to the /new route * FIX: enum selection not working for persona tools * seed information correctly in the DB * fix broken spec * Update assets/javascripts/discourse/components/ai-tool-editor-form.gjs Co-authored-by: Alan Guo Xiang Tan <gxtan1990@gmail.com> --------- Co-authored-by: Alan Guo Xiang Tan <gxtan1990@gmail.com>	2025-06-12 13:50:52 +10:00
Roman Rizzi	8c8fd969ef	FIX: Don't check for #blank? when manipulating chunks (#1428 )	2025-06-11 20:38:58 -03:00
Joffrey JAFFEUX	26217e51f9	DEV: a real selection change has a pointerup event (#1427 ) This is needed for https://github.com/discourse/discourse/pull/33143 as we now rely on this pointerup event.	2025-06-12 00:59:21 +02:00
Sam	a907bc891a	FIX: improve admin api for artifact key values (#1425) Previously we had a logic error and were showing admins keys that are not theirs when querying for all keys. This makes the API cleaner: to get all results you always need to be explicit.	2025-06-11 19:33:34 +10:00
Sam	d97307e99b	FEATURE: optionally support OpenAI responses API (#1423) OpenAI shipped a new API for completions called the "Responses API". Certain models (o3-pro) require this API. Additionally, certain features are only made available through the new API. This allows enabling it per LLM. See: https://platform.openai.com/docs/api-reference/responses	2025-06-11 17:12:25 +10:00
Natalie Tay	35d62a659b	FIX: Skip edits if localization exists (#1422 ) We will fine tune updating an outdated localization in the future. For now we are seeing that quick edits are happening and we need to prevent the job from being too trigger-happy.	2025-06-11 11:00:22 +08:00
Sam	fdf0ff8a25	FEATURE: persistent key-value storage for AI Artifacts (#1417 ) Introduces a persistent, user-scoped key-value storage system for AI Artifacts, enabling them to be stateful and interactive. This transforms artifacts from static content into mini-applications that can save user input, preferences, and other data. The core components of this feature are: 1. Model and API: - A new `AiArtifactKeyValue` model and corresponding database table to store data associated with a user and an artifact. - A new `ArtifactKeyValuesController` provides a RESTful API for CRUD operations (`index`, `set`, `destroy`) on the key-value data. - Permissions are enforced: users can only modify their own data but can view public data from other users. 2. Secure JavaScript Bridge: - A `postMessage` communication bridge is established between the sandboxed artifact `iframe` and the parent Discourse window. - A JavaScript API is exposed to the artifact as `window.discourseArtifact` with async methods: `get(key)`, `set(key, value, options)`, `delete(key)`, and `index(filter)`. - The parent window handles these requests, makes authenticated calls to the new controller, and returns the results to the iframe. This ensures security by keeping untrusted JS isolated. 3. AI Tool Integration: - The `create_artifact` tool is updated with a `requires_storage` boolean parameter. - If an artifact requires storage, its metadata is flagged, and the system prompt for the code-generating AI is augmented with detailed documentation for the new storage API. 4. Configuration: - Adds hidden site settings `ai_artifact_kv_value_max_length` and `ai_artifact_max_keys_per_user_per_artifact` for throttling. This also includes a minor fix to use `jsonb_set` when updating artifact metadata, ensuring other metadata fields are preserved.	2025-06-11 06:59:46 +10:00
Roman Rizzi	f7e0ea888d	DEV: Use a PORO to represent modules/features. (#1421 ) Additional changes: Adds a "#features" method in AiPersona to find which features are using that persona. Serializes a basic version of a LlmModel in the persona's "#default_llm" serializer attribute.	2025-06-10 14:37:53 -03:00
Roman Rizzi	98afd7f8c3	FEATURE: Display features that rely on multiple personas. (#1411 ) * FEATURE: Display features that rely on multiple personas. This change makes the previously hidden feature page visible while displaying features, like the AI helper, which relies on multiple personas. * Fix system specs	2025-06-09 16:13:09 -03:00
Keegan George	33fd6801e5	DEV: Add back validator for Spam setting (#1415 ) ## 🔍 Overview This update re-introduces the validator used on the `ai_spam_detection_enabled` setting. It was initially added here: https://github.com/discourse/discourse-ai/pull/1374 to prevent Spam from being enabled without creating an `AiModerationSetting` value in the database. However, due to issues with backups/migrations we temporarily removed it here: https://github.com/discourse/discourse-ai/pull/1393. Now with some internal fixes, we can re-introduce it. We also update the validator so that it only validates when trying to turn on rather than when turning off too.	2025-06-06 10:56:36 -07:00
Natalie Tay	6827147362	DEV: Add topic and post id when using completions for traceability to AiApiAuditLog (#1414 ) The AiApiAuditLog per translation event doesn't trace back easily to a post or topic. This commit adds support to that, and also switches the translators to named arguments rather than positional arguments.	2025-06-06 23:24:24 +08:00
Natalie Tay	8a3a247b11	DEV: Also detect locale of categories and do not translate if already in the locale (#1413) Previously I had omitted adding `locale` to the category, as categories tended to be just a single word, and I did not think it would be worth carrying locale information. Because certain LLMs do poorly at translation, category descriptions got pretty messy. We added locale support here - https://github.com/discourse/discourse/pull/32962. This PR adds the automatic locale detection, and skips translating to the category's locale.	2025-06-06 22:41:48 +08:00
Sam	6817866de9	FEATURE: allow access to assigns from forum researcher (#1412 ) * FEATURE: allow access to assigns from forum researcher * FIX: should properly be checking for empty * finish PR	2025-06-06 16:59:00 +10:00
Sam	b3d78a6a10	FIX: when tool options are added they should be available (#1406 ) Fixes a regression where tool option editor was not showing all tools	2025-06-05 12:05:55 +10:00
Roman Rizzi	c885e5697f	review feedback	2025-06-04 14:23:00 -03:00
Roman Rizzi	0338dbea23	FEATURE: Use different personas to power AI helper features. You can now edit each AI helper prompt individually through personas, limit access to specific groups, set different LLMs, etc.	2025-06-04 14:23:00 -03:00
David Taylor	cab39839fd	Revert "DEV: Patch `Net::BufferedIO` to help debug spec flakes (#1375 )" (#1403 ) This reverts commit ca78b1a1c588bd8708418bc42855837aafc6ab15. Problem resolved by https://github.com/discourse/discourse-perspective-api/pull/110	2025-06-04 14:13:45 +01:00
Sam	3e74eea1e5	FEATURE: add context and llm controls to researcher, fix username filter (#1401) Adds context length controls to researcher (max tokens per post and batch). Allows picking the LLM for researcher. Fixes a bug where unicode usernames were not working. Fixes documentation of OR logic.	2025-06-04 16:39:43 +10:00
Kris	fa51e9d948	REFACTOR: update AI conversation sidebar to use sidebar sections for date grouping (#1389 )	2025-06-03 09:40:52 -05:00
Joffrey JAFFEUX	306fec2b24	FIX: edit-topic is not invisible on desktop (#1394 ) Fix due to https://github.com/discourse/discourse/pull/32941	2025-06-03 16:30:19 +02:00
Sam	4dffd0b2c5	DEV: improve tool infra, improve forum researcher prompts, improve logging (#1391 ) - add sleep function for tool polling with rate limits - Support base64 encoding for HTTP requests and uploads - Enhance forum researcher with cost warnings and comprehensive planning - Add cancellation support for research operations - Include feature_name parameter for bot analytics - richer research support (OR queries)	2025-06-03 15:17:55 +10:00
Rafael dos Santos Silva	27de71fc4f	FIX: Proper default LLM detection for inferred concepts (#1392 )	2025-06-02 17:56:47 -03:00
Rafael dos Santos Silva	478f31de47	FEATURE: add inferred concepts system (#1330 ) * FEATURE: add inferred concepts system This commit adds a new inferred concepts system that: - Creates a model for storing concept labels that can be applied to topics - Provides AI personas for finding new concepts and matching existing ones - Adds jobs for generating concepts from popular topics - Includes a scheduled job that automatically processes engaging topics * FEATURE: Extend inferred concepts to include posts * Adds support for concepts to be inferred from and applied to posts * Replaces daily task with one that handles both topics and posts * Adds database migration for posts_inferred_concepts join table * Updates PersonaContext to include inferred concepts Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com> Co-authored-by: Keegan George <kgeorge13@gmail.com>	2025-06-02 14:29:20 -03:00
Keegan George	34c98de864	FIX: Exporting overall sentiment fails (#1388 ) ## 🔍 Overview When exporting an Overall Sentiment report in the admin panel, the export fails with: ```ruby Job exception: no implicit conversion of Symbol into Integer ``` This was happening because we are passing a single _Hash_ to `report.data` however, exports expect `report.data` to be an _Array of Hashes_. This update fixes this issue by wrapping the data in an array.	2025-05-30 11:58:28 -07:00
Sam	b5d393b4bc	FIX: custom tools incorrectly setting all fields to blank enum (#1385 ) Previous to this change, enum was set to [] which broke all non enum tools	2025-05-30 17:12:24 +10:00
Sam	77ae426d95	FEATURE: support upload.getUrl in custom tools (#1384 ) * FEATURE: support upload.getUrl in custom tools Some tools need to share images with an API. A common pattern is for APIs to expect a URL. This allows converting upload://123123 to a proper CDN friendly URL from within a custom tool * no support for secure uploads, so be explicit about it.	2025-05-30 15:47:07 +10:00
Natalie Tay	373e2305d6	FEATURE: Automatic translation and localization of posts, topics, categories (#1376 ) Related: https://github.com/discourse/discourse-translator/pull/310 This commit includes all the jobs and event hooks to localize posts, topics, and categories. A few notes: - `feature_name: "translation"` because the site setting is `ai-translation` and module is `Translation` - we will switch to proper ai-feature in the near future, and can consider using the persona_user as `localization.localizer_user_id` - keeping things flat within the module for now as we will be moving to ai-feature soon and have to rearrange - Settings renamed/introduced are: - ai_translation_backfill_rate (0) - ai_translation_backfill_limit_to_public_content (true) - ai_translation_backfill_max_age_days (5) - ai_translation_verbose_logs (false)	2025-05-29 17:28:06 +08:00
David Taylor	ca78b1a1c5	DEV: Patch `Net::BufferedIO` to help debug spec flakes (#1375 ) Internal `/t/154170`	2025-05-28 10:24:07 +01:00
Sam	70b0db2871	FIX: improve researcher tool - fix topic filters (#1368 ) * Small fix, reasoning is now available on Claude 4 models * fix invalid filters should raise, topic filter not working * fix spec so we are consistent	2025-05-26 16:00:44 +10:00
Mark VanLandingham	cead887480	FIX: Don't error when navigating from AI Bot topic to regular (#1366 ) We were getting an error in this logic causing Ember to fail to render the non-bot-topic that we navigate to. I believe this is because the getter of participants is re-calculating (due to this.header.topicInfo being updated) before the args to this connector changes. Adding some safe navigation here fixes the issue.	2025-05-23 13:30:08 -05:00
Roman Rizzi	0ce17a122f	FIX: Correctly pass tool_choice when using Claude models. (#1364 ) The `ClaudePrompt` object couldn't access the original prompt's tool_choice attribute, affecting both Anthropic and Bedrock.	2025-05-23 10:36:52 -03:00
Sam	cf220c530c	FIX: Improve MessageBus efficiency and correctly stop streaming (#1362 ) * FIX: Improve MessageBus efficiency and correctly stop streaming This commit enhances the message bus implementation for AI helper streaming by: - Adding client_id targeting for message bus publications to ensure only the requesting client receives streaming updates - Limiting MessageBus backlog size (2) and age (60 seconds) to prevent Redis bloat - Replacing clearTimeout with Ember's cancel method for proper runloop management, we were leaking a stop - Adding tests for client-specific message delivery These changes improve memory usage and make streaming more reliable by ensuring messages are properly directed to the requesting client. * composer suggestion needed a fix as well. * backlog size of 2 is risky here cause same channel name is reused between clients	2025-05-23 16:23:06 +10:00
Roman Rizzi	d72ad84f8f	FIX: Retry parsing escaped inner JSON to handle control chars. (#1357 ) The structured output JSON comes embedded inside the API response, which is also a JSON. Since we have to parse the response to process it, any control characters inside the structured output are unescaped into regular characters, leading to invalid JSON and breaking during parsing. This change adds a retry mechanism that escapes the string again if parsing fails, preventing the parser from breaking on malformed input and working around this issue. For example: ``` original = '{ "a": "{\\"key\\":\\"value with \\n newline\\"}" }' JSON.parse(original) => { "a" => "{\"key\":\"value with \n newline\"}" } # At this point, the inner JSON string contains an actual newline. ```	2025-05-21 11:25:59 -03:00
Roman Rizzi	e207eba1a4	FIX: Don't dig on nil when checking for the gemini schema (#1356 )	2025-05-21 08:30:47 -03:00
Sam	af18603b21	DEV: cancel manager should bypass webmock (#1350 ) Webmock can be a bit flaky under certain use cases.	2025-05-20 13:01:55 +10:00
Roman Rizzi	2fb691cba8	FEATURE: Triage can hide posts after adding them to the review queue (#1348 )	2025-05-20 08:19:00 +10:00
Joffrey JAFFEUX	296aa24df1	DEV: rewrites artifact spec with capybara waiters (#1347) Generally speaking we never want to do: ``` expect(element.text).to eq("foo") ``` These are plain RSpec matchers and do not add any Capybara-style waiting for the text content to become present.	2025-05-20 07:27:15 +10:00
Sam	3ac2359ff1	FEATURE: allow passing in data attributes to an artifact (#1346 ) Also allow artifact access to current username Usage inside artifact is: 1. await window.discourseArtifactReady; 2. access data via window.discourseArtifactData;	2025-05-19 15:44:37 +10:00
David Taylor	381fa14158	DEV: Rename spec (#1344 ) This has nothing to do with `assets:precompile`. Likely a copy/paste from another spec	2025-05-16 09:40:08 +01:00
Keegan George	dfea784fc4	DEV: Improve diff streaming accuracy with safety checker (#1338 ) This update adds a safety checker which scans the streamed updates. It ensures that incomplete segments of text are not sent yet over message bus as this will cause breakage with the diff streamer. It also updates the diff streamer to handle a thinking state for when we are waiting for message bus updates.	2025-05-15 11:38:46 -07:00
Roman Rizzi	ff2e18f9ca	FIX: Structured output discrepancies. (#1340) This change fixes two bugs and adds a safeguard. The first issue is that the schema Gemini expected differed from the one sent, resulting in 400 errors when performing completions. The second issue was that creating a new persona didn't define a method for `response_format`. This has to be explicitly defined when we wrap it inside the Persona class. Also, there was a mismatch between the default value and what we stored in the DB. Some parts of the code expected symbols as keys and others expected strings. Finally, we add a safeguard for when the model, even if asked to, refuses to reply with valid JSON. In this case, we make a best-effort attempt to recover and stream the raw response.	2025-05-15 11:32:10 -03:00
Sam	1b3fdad5c7	FEATURE: allow researcher to also research specific topics (#1339 ) * FEATURE: allow researcher to also research specific topics Also improve UI around research with more accurate info * this ensures that under no conditions PMs will be included	2025-05-15 17:48:21 +10:00
Sam	2c6459429f	DEV: use a proper object for tool definition (#1337 ) * DEV: use a proper object for tool definition This moves away from using a loose hash to define tools, which is error prone. Instead given a proper object we will also be able to coerce the return values to match tool definition correctly * fix xml tools * fix anthropic tools * fix specs... a few more to go * specs are passing * FIX: coerce values for XML tool calls * Update spec/lib/completions/tool_definition_spec.rb Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-05-15 17:32:39 +10:00
Sam	c34fcc8a95	FEATURE: forum researcher persona for deep research (#1313) This commit introduces a new Forum Researcher persona specialized in deep forum content analysis along with comprehensive improvements to our AI infrastructure. Key additions: New Forum Researcher persona with advanced filtering and analysis capabilities; Robust filtering system supporting tags, categories, dates, users, and keywords; LLM formatter to efficiently process and chunk research results. Infrastructure improvements: Implemented CancelManager class to centrally manage AI completion cancellations; Replaced callback-based cancellation with a more robust pattern; Added systematic cancellation monitoring with callbacks. Other improvements: Added configurable default_enabled flag to control which personas are enabled by default; Updated translation strings for the new researcher functionality; Added comprehensive specs for the new components; Renames Researcher -> Web Researcher. This change makes our AI platform more stable while adding powerful research capabilities that can analyze forum trends and surface relevant content.	2025-05-14 12:36:16 +10:00

730 Commits