discourse-ai

mirror of https://github.com/discourse/discourse-ai.git synced 2025-09-08 20:50:38 +00:00

Author	SHA1	Message	Date
Natalie Tay	3e87e92631	DEV: Remove 'experimental' from translation features (#1439 ) * DEV: Remove 'experimental' from translation features * include compat * include compat	2025-06-19 12:23:56 +08:00
Natalie Tay	d7a2af5505	DEV: Prevent multiple translation per post (#1443 ) We're seeing an aggressive number of translations being enqueued for a single post and locale. Historically, we trigger translation on `cooked` not `raw`, but that has changed a while back. ``` # from AiApiAuditLog, the same post is getting translated to the same locale within a few secs of each other zh_CN - 2025-06-17 13:02:31 UTC zh_CN - 2025-06-17 13:02:34 UTC zh_CN - 2025-06-17 13:02:35 UTC zh_CN - 2025-06-17 13:02:36 UTC zh_CN - 2025-06-17 13:02:38 UTC zh_CN - 2025-06-17 13:02:39 UTC zh_CN - 2025-06-17 13:02:40 UTC zh_CN - 2025-06-17 13:02:40 UTC zh_CN - 2025-06-17 13:02:43 UTC zh_CN - 2025-06-17 13:02:44 UTC ``` This PR prevents this from happening.	2025-06-18 13:24:02 +08:00
Rafael dos Santos Silva	9dccc1eb93	FEATURE: Add Qwen3 tokenizer and update Gemma to version 3 (#1440 )	2025-06-17 10:25:03 -03:00
Keegan George	9be1049de6	DEV: Log AI related configuration to staff action log (#1416 ) is update adds logging for changes made in the AI admin panel. When making configuration changes to Embeddings, LLMs, Personas, Tools, or Spam that aren't site setting related, changes will now be logged in Admin > Logs & Screening. This will help admins debug issues related to AI. In this update a helper lib is created called `AiStaffActionLogger` which can be easily used in the future to add logging support for any other admin config we need logged for AI.	2025-06-12 12:39:58 -07:00
Natalie Tay	fc83bed7cd	FIX: When allowing private content translation, only translate group PMs and not personal PMs (#1432 ) We want to avoid translating PMs that are not group PMs. This condition is applied when `SiteSetting.ai_translation_backfill_limit_to_public_content = false`	2025-06-13 00:55:52 +08:00
Kris	22da440130	UX: add features to persona list and other style updates (#1405 )	2025-06-12 08:23:10 -04:00
Sam	02bc9f645e	FEATURE: hybrid artifact security mode (#1431 ) In hybrid mode ai artifacts can optionally automatically run. This is useful for cases where you may want to embed a survey and so on. Additionally, artifacts now allow for better fidelity around display: <div class="ai-artifact" data-ai-artifact-id="501" data-ai-artifact-height="300px" data-ai-artifact-autorun data-ai-artifact-seamless></div> User can supply height and seamless mode to be seamlessly rendered with no box shadow and show full screen button.	2025-06-12 20:04:48 +10:00
Sam	a907bc891a	FIX: improve admin api for artifact key values (#1425 ) Previously we had a logic error and were showing admins keys that are not theirs when querying for all keys This makes the API cleaner, to get all results you need to be explicit always	2025-06-11 19:33:34 +10:00
Sam	d97307e99b	FEATURE: optionally support OpenAI responses API (#1423 ) OpenAI ship a new API for completions called "Responses API" Certain models (o3-pro) require this API. Additionally certain features are only made available to the new API. This allow enabling it per LLM. see: https://platform.openai.com/docs/api-reference/responses	2025-06-11 17:12:25 +10:00
Natalie Tay	35d62a659b	FIX: Skip edits if localization exists (#1422 ) We will fine tune updating an outdated localization in the future. For now we are seeing that quick edits are happening and we need to prevent the job from being too trigger-happy.	2025-06-11 11:00:22 +08:00
Sam	fdf0ff8a25	FEATURE: persistent key-value storage for AI Artifacts (#1417 ) Introduces a persistent, user-scoped key-value storage system for AI Artifacts, enabling them to be stateful and interactive. This transforms artifacts from static content into mini-applications that can save user input, preferences, and other data. The core components of this feature are: 1. Model and API: - A new `AiArtifactKeyValue` model and corresponding database table to store data associated with a user and an artifact. - A new `ArtifactKeyValuesController` provides a RESTful API for CRUD operations (`index`, `set`, `destroy`) on the key-value data. - Permissions are enforced: users can only modify their own data but can view public data from other users. 2. Secure JavaScript Bridge: - A `postMessage` communication bridge is established between the sandboxed artifact `iframe` and the parent Discourse window. - A JavaScript API is exposed to the artifact as `window.discourseArtifact` with async methods: `get(key)`, `set(key, value, options)`, `delete(key)`, and `index(filter)`. - The parent window handles these requests, makes authenticated calls to the new controller, and returns the results to the iframe. This ensures security by keeping untrusted JS isolated. 3. AI Tool Integration: - The `create_artifact` tool is updated with a `requires_storage` boolean parameter. - If an artifact requires storage, its metadata is flagged, and the system prompt for the code-generating AI is augmented with detailed documentation for the new storage API. 4. Configuration: - Adds hidden site settings `ai_artifact_kv_value_max_length` and `ai_artifact_max_keys_per_user_per_artifact` for throttling. This also includes a minor fix to use `jsonb_set` when updating artifact metadata, ensuring other metadata fields are preserved.	2025-06-11 06:59:46 +10:00
Roman Rizzi	f7e0ea888d	DEV: Use a PORO to represent modules/features. (#1421 ) Additional changes: Adds a "#features" method in AiPersona to find which features are using that persona. Serializes a basic version of a LlmModel in the persona's "#default_llm" serializer attribute.	2025-06-10 14:37:53 -03:00
Roman Rizzi	98afd7f8c3	FEATURE: Display features that rely on multiple personas. (#1411 ) * FEATURE: Display features that rely on multiple personas. This change makes the previously hidden feature page visible while displaying features, like the AI helper, which relies on multiple personas. * Fix system specs	2025-06-09 16:13:09 -03:00
Natalie Tay	8a3a247b11	DEV: Also detect locale of categories and do not translate if already in the locale (#1413 ) Previously I had omitted to add `locale` to the category, as categories tended to be just a single word, and I did not find it would be worth to carry locale information. Due to certain LLMs that do poorer at translation, category descriptions got pretty messy. We added locale support here - https://github.com/discourse/discourse/pull/32962. This PR adds the automatic locale detection, and skips translating to the category's locale.	2025-06-06 22:41:48 +08:00
Roman Rizzi	c885e5697f	review feedback	2025-06-04 14:23:00 -03:00
Roman Rizzi	0338dbea23	FEATURE: Use different personas to power AI helper features. You can now edit each AI helper prompt individually through personas, limit access to specific groups, set different LLMs, etc.	2025-06-04 14:23:00 -03:00
Sam	4dffd0b2c5	DEV: improve tool infra, improve forum researcher prompts, improve logging (#1391 ) - add sleep function for tool polling with rate limits - Support base64 encoding for HTTP requests and uploads - Enhance forum researcher with cost warnings and comprehensive planning - Add cancellation support for research operations - Include feature_name parameter for bot analytics - richer research support (OR queries)	2025-06-03 15:17:55 +10:00
Rafael dos Santos Silva	478f31de47	FEATURE: add inferred concepts system (#1330 ) * FEATURE: add inferred concepts system This commit adds a new inferred concepts system that: - Creates a model for storing concept labels that can be applied to topics - Provides AI personas for finding new concepts and matching existing ones - Adds jobs for generating concepts from popular topics - Includes a scheduled job that automatically processes engaging topics * FEATURE: Extend inferred concepts to include posts * Adds support for concepts to be inferred from and applied to posts * Replaces daily task with one that handles both topics and posts * Adds database migration for posts_inferred_concepts join table * Updates PersonaContext to include inferred concepts Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com> Co-authored-by: Keegan George <kgeorge13@gmail.com>	2025-06-02 14:29:20 -03:00
Sam	b5d393b4bc	FIX: custom tools incorrectly setting all fields to blank enum (#1385 ) Previous to this change, enum was set to [] which broke all non enum tools	2025-05-30 17:12:24 +10:00
Sam	77ae426d95	FEATURE: support upload.getUrl in custom tools (#1384 ) * FEATURE: support upload.getUrl in custom tools Some tools need to share images with an API. A common pattern is for APIs to expect a URL. This allows converting upload://123123 to a proper CDN friendly URL from within a custom tool * no support for secure uploads, so be explicit about it.	2025-05-30 15:47:07 +10:00
Natalie Tay	373e2305d6	FEATURE: Automatic translation and localization of posts, topics, categories (#1376 ) Related: https://github.com/discourse/discourse-translator/pull/310 This commit includes all the jobs and event hooks to localize posts, topics, and categories. A few notes: - `feature_name: "translation"` because the site setting is `ai-translation` and module is `Translation` - we will switch to proper ai-feature in the near future, and can consider using the persona_user as `localization.localizer_user_id` - keeping things flat within the module for now as we will be moving to ai-feature soon and have to rearrange - Settings renamed/introduced are: - ai_translation_backfill_rate (0) - ai_translation_backfill_limit_to_public_content (true) - ai_translation_backfill_max_age_days (5) - ai_translation_verbose_logs (false)	2025-05-29 17:28:06 +08:00
Keegan George	d99c335dab	DEV: Ensure enabling/disabling spam is set and logged (#1378 ) Since we enable/disable `ai_spam_detection_enabled` setting in a custom Spam tab UI in AI, we want to ensure we retain the setting and logging features. To preserve that, we want to update the controller to use `SiteSetting.set_and_log` instead of setting the value directly.	2025-05-28 10:12:21 -07:00
Sam	cf220c530c	FIX: Improve MessageBus efficiency and correctly stop streaming (#1362 ) * FIX: Improve MessageBus efficiency and correctly stop streaming This commit enhances the message bus implementation for AI helper streaming by: - Adding client_id targeting for message bus publications to ensure only the requesting client receives streaming updates - Limiting MessageBus backlog size (2) and age (60 seconds) to prevent Redis bloat - Replacing clearTimeout with Ember's cancel method for proper runloop management, we were leaking a stop - Adding tests for client-specific message delivery These changes improve memory usage and make streaming more reliable by ensuring messages are properly directed to the requesting client. * composer suggestion needed a fix as well. * backlog size of 2 is risky here cause same channel name is reused between clients	2025-05-23 16:23:06 +10:00
Sam	3ac2359ff1	FEATURE: allow passing in data attributes to an artifact (#1346 ) Also allow artifact access to current username Usage inside artifact is: 1. await window.discourseArtifactReady; 2. access data via window.discourseArtifactData;	2025-05-19 15:44:37 +10:00
David Taylor	77d3a38e49	FIX: AI share page assets via CDN on login-required sites (#1343 ) AI share page assets are loaded via the app CDN, which means the requests have no authentication and will never appear to the app as "logged in". Therefore we should skip the `redirect_to_login_if_required` before_action.	2025-05-16 09:30:38 +01:00
Roman Rizzi	ff2e18f9ca	FIX: Structured output discrepancies. (#1340 ) This change fixes two bugs and adds a safeguard. The first issue is that the schema Gemini expected differed from the one sent, resulting in 400 errors when performing completions. The second issue was that creating a new persona won't define a method for `response_format`. This has to be explicitly defined when we wrap it inside the Persona class. Also, There was a mismatch between the default value and what we stored in the DB. Some parts of the code expected symbols as keys and others as strings. Finally, we add a safeguard when, even if asked to, the model refuses to reply with a valid JSON. In this case, we are making a best-effort to recover and stream the raw response.	2025-05-15 11:32:10 -03:00
Roman Rizzi	aef84bc5bb	FEATURE: Examples support for personas. (#1334 ) Examples simulate previous interactions with an LLM and come right after the system prompt. This helps grounding the model and producing better responses.	2025-05-13 10:06:16 -03:00
Sam	2a62658248	FEATURE: support configurable thinking tokens for Gemini (#1322 )	2025-05-08 07:39:50 +10:00
Roman Rizzi	c0a2d4c935	DEV: Use structured responses for summaries (#1252 ) * DEV: Use structured responses for summaries * Fix system specs * Make response_format a first class citizen and update endpoints to support it * Response format can be specified in the persona * lint * switch to jsonb and make column nullable * Reify structured output chunks. Move JSON parsing to the depths of Completion * Switch to JsonStreamingTracker for partial JSON parsing	2025-05-06 10:09:39 -03:00
Sam	491dac298f	FIX: system persona state leaking between sites (#1304 ) System personas leaned on reused classes, this was a problem in a multisite environement cause state, such as "enabled" ended up being reused between sites. New implementation ensures state is pristine between sites in a multisite * more handling for new superclass story * small oversight, display name should be used for display	2025-05-01 13:24:53 +10:00
Keegan George	ab67299acb	FIX: Invalid access error should be populated to user (#1303 ) Invalid access error should be populated to user when trying to search for something they do not have permissions for (i.e. anons searching `in:messages`	2025-04-30 12:10:10 -07:00
Isaac Janzen	cd0cfc0bfc	DEV: Group PMs by date (#1287 ) # Preview https://github.com/user-attachments/assets/3fe3ac8f-c938-4df4-9afe-11980046944d # Details - Group pms by `last_posted_at`. In this first iteration we are group by `7 days`, `30 days`, then by month beyond that. - I inject a sidebar section link with the relative (last_posted_at) date and then update a tracked value to ensure we don't do it again. Then for each month beyond the first 30days, I add a value to the `loadedMonthLabels` set and we reference that (plus the year) to see if we need to load a new month label. - I took the creative liberty to remove the `Conversations` section label - this had no purpose - I hid the _collapse all sidebar sections_ carrot. This had no purpose. - Swap `BasicTopicSerializer` to `ListableTopicSerializer` to get access to `last_posted_at`	2025-04-25 13:20:18 -05:00
Mark VanLandingham	298ebee7dd	DEV: Migration to backfill bot PM custom field (#1282 ) In the last commit, I introduced a topic_custom_field to determine if a PM is indeed a bot PM. This commit adds a migration to backfill any PM that is between 1 real user, and 1 bot. The correct topic_custom_field is added for these, so they will appear on the bot conversation sidebar properly. We can also drop the joining to topic_users in the controller for sidebar conversations, and the isPostFromAiBot logic from the sidebar.	2025-04-24 13:02:43 -05:00
Isaac Janzen	e8b0f86300	FEATURE: Bot Conversation Homepage (#1273 )	2025-04-22 10:22:03 -05:00
Mark VanLandingham	244ec9d61e	REVERT: "FEATURE: Experimental Private Message Bot Homepage (#1159 )" (#1272 ) This reverts commit 5fec8fe79eeac7dae40013ff05f07ef18b568e38.	2025-04-21 16:42:05 -05:00
Mark VanLandingham	5fec8fe79e	FEATURE: Experimental Private Message Bot Homepage (#1159 ) Overview This PR introduces a Bot Homepage that was first introduced at https://ask.discourse.org/. Key Features: Add a bot homepage: /discourse-ai/ai-bot/conversations Display a sidebar with previous bot conversations Infinite scroll for large counts Sidebar still visible when navigation mode is header_dropdown Sidebar visible on homepage and bot PM show view Add New Question button to the bottom of sidebar on bot PM show view Add persona picker to homepage	2025-04-21 15:17:10 -05:00
Keegan George	d26c7ac48d	FEATURE: Add spending metrics to AI usage (#1268 ) This update adds metrics for estimated spending in AI usage. To make use of it, admins must add cost details to the LLM config page (input, output, and cached input costs per 1M tokens). After doing so, the metrics will appear in the AI usage dashboard as the AI plugin is used.	2025-04-17 15:09:48 -07:00
Keegan George	e2b0287333	FEATURE: Enhance LLM context window settings (#1271 ) ### 🔍 Overview This update performs some enhancements to the LLM configuration screen. In particular, it renames the UI for the number of tokens for the prompt to "Context window" since the naming can be confusing to the user. Additionally, it adds a new optional field called "Max output tokens".	2025-04-17 14:44:15 -07:00
Keegan George	1300cc8a36	FEATURE: Add streaming to composer helper (#1256 ) This update adding streaming to the AI helper inside the composer.	2025-04-14 08:18:50 -07:00
Roman Rizzi	df63e36ad8	FEATURE: Make Mixtral tokenizer available for embeddings (#1258 )	2025-04-11 12:01:38 -03:00
Keegan George	4de39a07e5	FEATURE: Configure persona backed features in admin panel (#1245 ) In this feature update, we add the UI for the ability to easily configure persona backed AI-features. The feature will still be hidden until structured responses are complete.	2025-04-10 08:16:31 -07:00
Roman Rizzi	0d60aca6ef	FEATURE: Personas powered summaries. (#1232 ) * REFACTOR: Move personas into it's own module. * WIP: Use personas for summarization * Prioritize persona default LLM or fallback to newest one * Simplify summarization strategy * Keep ai_sumarization_model as a fallback	2025-04-02 12:54:47 -03:00
Keegan George	bf5ccb452c	FEATURE: Continue conversation from Discobot discovery (#1234 ) This feature update allows for continuing the conversation with Discobot Discoveries in an AI bot chat. After discoveries gives you a response to your search you can continue with the existing context.	2025-04-01 10:22:39 -07:00
Roman Rizzi	30242a27e6	REFACTOR: Move personas into its own module. (#1233 ) This change moves all the personas code into its own module. We want to treat them as a building block features can built on top of, same as `Completions::Llm`. The code to title a message was moved from `Bot` to `Playground`.	2025-03-31 14:42:33 -03:00
Sam	5b6d39a206	FEATURE: flexible image handling within messages (#1214 ) * DEV: refactor bot internals This introduces a proper object for bot context, this makes it simpler to improve context management as we go cause we have a nice object to work with Starts refactoring allowing for a single message to have multiple uploads throughout * transplant method to message builder * chipping away at inline uploads * image support is improved but not fully fixed yet partially working in anthropic, still got quite a few dialects to go * open ai and claude are now working * Gemini is now working as well * fix nova * more dialects... * fix ollama * fix specs * update artifact fixed * more tests * spam scanner * pass more specs * bunch of specs improved * more bug fixes. * all the rest of the tests are working * improve tests coverage and ensure custom tools are aware of new context object * tests are working, but we need more tests * resolve merge conflict * new preamble and expanded specs on ai tool * remove concept of "standalone tools" This is no longer needed, we can set custom raw, tool details are injected into tool calls	2025-03-31 12:39:07 -03:00
Keegan George	bab6f0be43	FIX: Ensure category badging present in sentiment reports (#1222 ) This PR ensures that the category badges are present in the sentiment analysis report. Since the core change in https://github.com/discourse/discourse/pull/31795, there was a regression in the post list drill-down where category badges were not being shown. This PR fixes that and also ensures icons/emojis are shown when categories make use of them. This PR also adds the category badge in the table list.	2025-03-26 12:37:41 -07:00
Keegan George	6aaf8a0619	DEV: Use existing topic embeddings when suggesting tags/categories on edit (#1189 ) When editing a topic (instead of creating one) and using the tag/category suggestion buttons. We want to use existing topic embeddings instead of creating new ones.	2025-03-12 18:52:07 -07:00
Keegan George	b17c688162	DEV: Improve title suggester suggestions when editing topic (#1182 ) This update ensures topic title suggestions when suggesting from edit topic take into account the whole topic for more accurate title suggestions.	2025-03-11 11:16:06 -07:00
Sam	8f4cd2fcbd	FEATURE: allow disabling of top_p and temp for thinking models (#1184 ) thinking models such as Claude 3.7 Thinking and o1 / o3 do not support top_p or temp. Previously you would have to carefully remove it from everywhere by having it be a provider param we now support blanker removing without forcing people to update automation rules or personas	2025-03-11 16:54:02 +11:00
David Taylor	1b570fcd01	PERF: Move sentiment analysis to "low" sidekiq queue (#1173 )	2025-03-07 14:12:15 +00:00

1 2 3 4 5 ...

299 Commits