discourse-ai

mirror of https://github.com/discourse/discourse-ai.git synced 2025-06-29 19:12:15 +00:00

Author	SHA1	Message	Date
Roman Rizzi	b35f9bcc7c	FEATURE: Use Persona's when scanning posts for spam (#1465 )	2025-06-27 10:35:47 -03:00
Sam	73768ce920	FEATURE: Display bot in feature list (#1466 ) - allows features to have multiple llms and multiple personas - sorts module list - adds Bot as a first class module - fixes issue where search module was always configured - some tests	2025-06-27 12:35:41 +10:00
Sam	471f96f972	FEATURE: allow seeing configured LLM on feature page (#1460 ) This is an interim fix so we can at least tell what feature is being used for what LLM. It also adds some test coverage to the feature page.	2025-06-24 17:42:47 +10:00
Sam	9f2a4094f5	FEATURE: persona/tool import and export (#1450 ) Introduces import/export feature for tools and personas. Uploads are omitted for now, and will be added in a future PR * Backend: * Adds `import` and `export` actions to `Admin::AiPersonasController` and `Admin::AiToolsController`. * Introduces `DiscourseAi::PersonaExporter` and `DiscourseAi::PersonaImporter` services to manage JSON serialization and deserialization. * The export format for a persona embeds its associated custom tools. To ensure portability, `AiTool` references are serialized using their `tool_name` rather than their internal database `id`. * The import logic detects conflicts by name. A `force=true` parameter can be passed to overwrite existing records. * Frontend: * `AiPersonaListEditor` and `AiToolListEditor` components now include an "Import" button that handles file selection and POSTs the JSON data to the respective `import` endpoint. * `AiPersonaEditorForm` and `AiToolEditorForm` components feature an "Export" button that triggers a download of the serialized record. * Handles import conflicts (HTTP `409` for tools, `422` for personas) by showing a `dialog.confirm` prompt to allow the user to force an overwrite. * Testing: * Adds comprehensive request specs for the new controller actions (`#import`, `#export`). * Includes unit specs for the `PersonaExporter` and `PersonaImporter` services. * Persona import and export implemented	2025-06-24 12:41:10 +10:00
Natalie Tay	683bb5725b	DEV: Split content based on llmmodel's max_output_tokens (#1456 ) In discourse/discourse-translator#249 we introduced splitting content (post.raw) prior to sending to translation as we were using a sync api. Now that we're streaming thanks to #1424, we'll chunk based on the LlmModel.max_output_tokens.	2025-06-23 21:11:20 +08:00
Natalie Tay	740be26625	DEV: Also make sure locale detection skips PMs that are not group PMs when public content only (#1457 ) In the earlier PR https://github.com/discourse/discourse-ai/pull/1432, when `SiteSetting.ai_translation_backfill_limit_to_public_content = false`, we translate PMs but skip translating PMs that do not involve groups. This commit covers the missing case on locale detection.	2025-06-23 19:07:40 +08:00
Natalie Tay	e2d7ca0bb9	DEV: Indicate backfill rate for translations is hourly (#1451 ) * DEV: Indicate backfill rate for translations is hourly * add ai_translation_max_post_length * default value update	2025-06-21 15:45:09 +08:00
Natalie Tay	3e87e92631	DEV: Remove 'experimental' from translation features (#1439 ) * DEV: Remove 'experimental' from translation features * include compat * include compat	2025-06-19 12:23:56 +08:00
Natalie Tay	d7a2af5505	DEV: Prevent multiple translation per post (#1443 ) We're seeing an aggressive number of translations being enqueued for a single post and locale. Historically, we trigger translation on `cooked` not `raw`, but that has changed a while back. ``` # from AiApiAuditLog, the same post is getting translated to the same locale within a few secs of each other zh_CN - 2025-06-17 13:02:31 UTC zh_CN - 2025-06-17 13:02:34 UTC zh_CN - 2025-06-17 13:02:35 UTC zh_CN - 2025-06-17 13:02:36 UTC zh_CN - 2025-06-17 13:02:38 UTC zh_CN - 2025-06-17 13:02:39 UTC zh_CN - 2025-06-17 13:02:40 UTC zh_CN - 2025-06-17 13:02:40 UTC zh_CN - 2025-06-17 13:02:43 UTC zh_CN - 2025-06-17 13:02:44 UTC ``` This PR prevents this from happening.	2025-06-18 13:24:02 +08:00
Rafael dos Santos Silva	9dccc1eb93	FEATURE: Add Qwen3 tokenizer and update Gemma to version 3 (#1440 )	2025-06-17 10:25:03 -03:00
Keegan George	9be1049de6	DEV: Log AI related configuration to staff action log (#1416 ) is update adds logging for changes made in the AI admin panel. When making configuration changes to Embeddings, LLMs, Personas, Tools, or Spam that aren't site setting related, changes will now be logged in Admin > Logs & Screening. This will help admins debug issues related to AI. In this update a helper lib is created called `AiStaffActionLogger` which can be easily used in the future to add logging support for any other admin config we need logged for AI.	2025-06-12 12:39:58 -07:00
Natalie Tay	fc83bed7cd	FIX: When allowing private content translation, only translate group PMs and not personal PMs (#1432 ) We want to avoid translating PMs that are not group PMs. This condition is applied when `SiteSetting.ai_translation_backfill_limit_to_public_content = false`	2025-06-13 00:55:52 +08:00
Kris	22da440130	UX: add features to persona list and other style updates (#1405 )	2025-06-12 08:23:10 -04:00
Sam	02bc9f645e	FEATURE: hybrid artifact security mode (#1431 ) In hybrid mode ai artifacts can optionally automatically run. This is useful for cases where you may want to embed a survey and so on. Additionally, artifacts now allow for better fidelity around display: <div class="ai-artifact" data-ai-artifact-id="501" data-ai-artifact-height="300px" data-ai-artifact-autorun data-ai-artifact-seamless></div> User can supply height and seamless mode to be seamlessly rendered with no box shadow and show full screen button.	2025-06-12 20:04:48 +10:00
Sam	a907bc891a	FIX: improve admin api for artifact key values (#1425 ) Previously we had a logic error and were showing admins keys that are not theirs when querying for all keys This makes the API cleaner, to get all results you need to be explicit always	2025-06-11 19:33:34 +10:00
Sam	d97307e99b	FEATURE: optionally support OpenAI responses API (#1423 ) OpenAI ship a new API for completions called "Responses API" Certain models (o3-pro) require this API. Additionally certain features are only made available to the new API. This allow enabling it per LLM. see: https://platform.openai.com/docs/api-reference/responses	2025-06-11 17:12:25 +10:00
Natalie Tay	35d62a659b	FIX: Skip edits if localization exists (#1422 ) We will fine tune updating an outdated localization in the future. For now we are seeing that quick edits are happening and we need to prevent the job from being too trigger-happy.	2025-06-11 11:00:22 +08:00
Sam	fdf0ff8a25	FEATURE: persistent key-value storage for AI Artifacts (#1417 ) Introduces a persistent, user-scoped key-value storage system for AI Artifacts, enabling them to be stateful and interactive. This transforms artifacts from static content into mini-applications that can save user input, preferences, and other data. The core components of this feature are: 1. Model and API: - A new `AiArtifactKeyValue` model and corresponding database table to store data associated with a user and an artifact. - A new `ArtifactKeyValuesController` provides a RESTful API for CRUD operations (`index`, `set`, `destroy`) on the key-value data. - Permissions are enforced: users can only modify their own data but can view public data from other users. 2. Secure JavaScript Bridge: - A `postMessage` communication bridge is established between the sandboxed artifact `iframe` and the parent Discourse window. - A JavaScript API is exposed to the artifact as `window.discourseArtifact` with async methods: `get(key)`, `set(key, value, options)`, `delete(key)`, and `index(filter)`. - The parent window handles these requests, makes authenticated calls to the new controller, and returns the results to the iframe. This ensures security by keeping untrusted JS isolated. 3. AI Tool Integration: - The `create_artifact` tool is updated with a `requires_storage` boolean parameter. - If an artifact requires storage, its metadata is flagged, and the system prompt for the code-generating AI is augmented with detailed documentation for the new storage API. 4. Configuration: - Adds hidden site settings `ai_artifact_kv_value_max_length` and `ai_artifact_max_keys_per_user_per_artifact` for throttling. This also includes a minor fix to use `jsonb_set` when updating artifact metadata, ensuring other metadata fields are preserved.	2025-06-11 06:59:46 +10:00
Roman Rizzi	f7e0ea888d	DEV: Use a PORO to represent modules/features. (#1421 ) Additional changes: Adds a "#features" method in AiPersona to find which features are using that persona. Serializes a basic version of a LlmModel in the persona's "#default_llm" serializer attribute.	2025-06-10 14:37:53 -03:00
Roman Rizzi	98afd7f8c3	FEATURE: Display features that rely on multiple personas. (#1411 ) * FEATURE: Display features that rely on multiple personas. This change makes the previously hidden feature page visible while displaying features, like the AI helper, which relies on multiple personas. * Fix system specs	2025-06-09 16:13:09 -03:00
Natalie Tay	8a3a247b11	DEV: Also detect locale of categories and do not translate if already in the locale (#1413 ) Previously I had omitted to add `locale` to the category, as categories tended to be just a single word, and I did not find it would be worth to carry locale information. Due to certain LLMs that do poorer at translation, category descriptions got pretty messy. We added locale support here - https://github.com/discourse/discourse/pull/32962. This PR adds the automatic locale detection, and skips translating to the category's locale.	2025-06-06 22:41:48 +08:00
Roman Rizzi	c885e5697f	review feedback	2025-06-04 14:23:00 -03:00
Roman Rizzi	0338dbea23	FEATURE: Use different personas to power AI helper features. You can now edit each AI helper prompt individually through personas, limit access to specific groups, set different LLMs, etc.	2025-06-04 14:23:00 -03:00
Sam	4dffd0b2c5	DEV: improve tool infra, improve forum researcher prompts, improve logging (#1391 ) - add sleep function for tool polling with rate limits - Support base64 encoding for HTTP requests and uploads - Enhance forum researcher with cost warnings and comprehensive planning - Add cancellation support for research operations - Include feature_name parameter for bot analytics - richer research support (OR queries)	2025-06-03 15:17:55 +10:00
Rafael dos Santos Silva	478f31de47	FEATURE: add inferred concepts system (#1330 ) * FEATURE: add inferred concepts system This commit adds a new inferred concepts system that: - Creates a model for storing concept labels that can be applied to topics - Provides AI personas for finding new concepts and matching existing ones - Adds jobs for generating concepts from popular topics - Includes a scheduled job that automatically processes engaging topics * FEATURE: Extend inferred concepts to include posts * Adds support for concepts to be inferred from and applied to posts * Replaces daily task with one that handles both topics and posts * Adds database migration for posts_inferred_concepts join table * Updates PersonaContext to include inferred concepts Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com> Co-authored-by: Keegan George <kgeorge13@gmail.com>	2025-06-02 14:29:20 -03:00
Sam	b5d393b4bc	FIX: custom tools incorrectly setting all fields to blank enum (#1385 ) Previous to this change, enum was set to [] which broke all non enum tools	2025-05-30 17:12:24 +10:00
Sam	77ae426d95	FEATURE: support upload.getUrl in custom tools (#1384 ) * FEATURE: support upload.getUrl in custom tools Some tools need to share images with an API. A common pattern is for APIs to expect a URL. This allows converting upload://123123 to a proper CDN friendly URL from within a custom tool * no support for secure uploads, so be explicit about it.	2025-05-30 15:47:07 +10:00
Natalie Tay	373e2305d6	FEATURE: Automatic translation and localization of posts, topics, categories (#1376 ) Related: https://github.com/discourse/discourse-translator/pull/310 This commit includes all the jobs and event hooks to localize posts, topics, and categories. A few notes: - `feature_name: "translation"` because the site setting is `ai-translation` and module is `Translation` - we will switch to proper ai-feature in the near future, and can consider using the persona_user as `localization.localizer_user_id` - keeping things flat within the module for now as we will be moving to ai-feature soon and have to rearrange - Settings renamed/introduced are: - ai_translation_backfill_rate (0) - ai_translation_backfill_limit_to_public_content (true) - ai_translation_backfill_max_age_days (5) - ai_translation_verbose_logs (false)	2025-05-29 17:28:06 +08:00
Keegan George	d99c335dab	DEV: Ensure enabling/disabling spam is set and logged (#1378 ) Since we enable/disable `ai_spam_detection_enabled` setting in a custom Spam tab UI in AI, we want to ensure we retain the setting and logging features. To preserve that, we want to update the controller to use `SiteSetting.set_and_log` instead of setting the value directly.	2025-05-28 10:12:21 -07:00
Sam	cf220c530c	FIX: Improve MessageBus efficiency and correctly stop streaming (#1362 ) * FIX: Improve MessageBus efficiency and correctly stop streaming This commit enhances the message bus implementation for AI helper streaming by: - Adding client_id targeting for message bus publications to ensure only the requesting client receives streaming updates - Limiting MessageBus backlog size (2) and age (60 seconds) to prevent Redis bloat - Replacing clearTimeout with Ember's cancel method for proper runloop management, we were leaking a stop - Adding tests for client-specific message delivery These changes improve memory usage and make streaming more reliable by ensuring messages are properly directed to the requesting client. * composer suggestion needed a fix as well. * backlog size of 2 is risky here cause same channel name is reused between clients	2025-05-23 16:23:06 +10:00
Sam	3ac2359ff1	FEATURE: allow passing in data attributes to an artifact (#1346 ) Also allow artifact access to current username Usage inside artifact is: 1. await window.discourseArtifactReady; 2. access data via window.discourseArtifactData;	2025-05-19 15:44:37 +10:00
David Taylor	77d3a38e49	FIX: AI share page assets via CDN on login-required sites (#1343 ) AI share page assets are loaded via the app CDN, which means the requests have no authentication and will never appear to the app as "logged in". Therefore we should skip the `redirect_to_login_if_required` before_action.	2025-05-16 09:30:38 +01:00
Roman Rizzi	ff2e18f9ca	FIX: Structured output discrepancies. (#1340 ) This change fixes two bugs and adds a safeguard. The first issue is that the schema Gemini expected differed from the one sent, resulting in 400 errors when performing completions. The second issue was that creating a new persona won't define a method for `response_format`. This has to be explicitly defined when we wrap it inside the Persona class. Also, There was a mismatch between the default value and what we stored in the DB. Some parts of the code expected symbols as keys and others as strings. Finally, we add a safeguard when, even if asked to, the model refuses to reply with a valid JSON. In this case, we are making a best-effort to recover and stream the raw response.	2025-05-15 11:32:10 -03:00
Roman Rizzi	aef84bc5bb	FEATURE: Examples support for personas. (#1334 ) Examples simulate previous interactions with an LLM and come right after the system prompt. This helps grounding the model and producing better responses.	2025-05-13 10:06:16 -03:00
Sam	2a62658248	FEATURE: support configurable thinking tokens for Gemini (#1322 )	2025-05-08 07:39:50 +10:00
Roman Rizzi	c0a2d4c935	DEV: Use structured responses for summaries (#1252 ) * DEV: Use structured responses for summaries * Fix system specs * Make response_format a first class citizen and update endpoints to support it * Response format can be specified in the persona * lint * switch to jsonb and make column nullable * Reify structured output chunks. Move JSON parsing to the depths of Completion * Switch to JsonStreamingTracker for partial JSON parsing	2025-05-06 10:09:39 -03:00
Sam	491dac298f	FIX: system persona state leaking between sites (#1304 ) System personas leaned on reused classes, this was a problem in a multisite environement cause state, such as "enabled" ended up being reused between sites. New implementation ensures state is pristine between sites in a multisite * more handling for new superclass story * small oversight, display name should be used for display	2025-05-01 13:24:53 +10:00
Keegan George	ab67299acb	FIX: Invalid access error should be populated to user (#1303 ) Invalid access error should be populated to user when trying to search for something they do not have permissions for (i.e. anons searching `in:messages`	2025-04-30 12:10:10 -07:00
Isaac Janzen	cd0cfc0bfc	DEV: Group PMs by date (#1287 ) # Preview https://github.com/user-attachments/assets/3fe3ac8f-c938-4df4-9afe-11980046944d # Details - Group pms by `last_posted_at`. In this first iteration we are group by `7 days`, `30 days`, then by month beyond that. - I inject a sidebar section link with the relative (last_posted_at) date and then update a tracked value to ensure we don't do it again. Then for each month beyond the first 30days, I add a value to the `loadedMonthLabels` set and we reference that (plus the year) to see if we need to load a new month label. - I took the creative liberty to remove the `Conversations` section label - this had no purpose - I hid the _collapse all sidebar sections_ carrot. This had no purpose. - Swap `BasicTopicSerializer` to `ListableTopicSerializer` to get access to `last_posted_at`	2025-04-25 13:20:18 -05:00
Mark VanLandingham	298ebee7dd	DEV: Migration to backfill bot PM custom field (#1282 ) In the last commit, I introduced a topic_custom_field to determine if a PM is indeed a bot PM. This commit adds a migration to backfill any PM that is between 1 real user, and 1 bot. The correct topic_custom_field is added for these, so they will appear on the bot conversation sidebar properly. We can also drop the joining to topic_users in the controller for sidebar conversations, and the isPostFromAiBot logic from the sidebar.	2025-04-24 13:02:43 -05:00
Isaac Janzen	e8b0f86300	FEATURE: Bot Conversation Homepage (#1273 )	2025-04-22 10:22:03 -05:00
Mark VanLandingham	244ec9d61e	REVERT: "FEATURE: Experimental Private Message Bot Homepage (#1159 )" (#1272 ) This reverts commit 5fec8fe79eeac7dae40013ff05f07ef18b568e38.	2025-04-21 16:42:05 -05:00
Mark VanLandingham	5fec8fe79e	FEATURE: Experimental Private Message Bot Homepage (#1159 ) Overview This PR introduces a Bot Homepage that was first introduced at https://ask.discourse.org/. Key Features: Add a bot homepage: /discourse-ai/ai-bot/conversations Display a sidebar with previous bot conversations Infinite scroll for large counts Sidebar still visible when navigation mode is header_dropdown Sidebar visible on homepage and bot PM show view Add New Question button to the bottom of sidebar on bot PM show view Add persona picker to homepage	2025-04-21 15:17:10 -05:00
Keegan George	d26c7ac48d	FEATURE: Add spending metrics to AI usage (#1268 ) This update adds metrics for estimated spending in AI usage. To make use of it, admins must add cost details to the LLM config page (input, output, and cached input costs per 1M tokens). After doing so, the metrics will appear in the AI usage dashboard as the AI plugin is used.	2025-04-17 15:09:48 -07:00
Keegan George	e2b0287333	FEATURE: Enhance LLM context window settings (#1271 ) ### 🔍 Overview This update performs some enhancements to the LLM configuration screen. In particular, it renames the UI for the number of tokens for the prompt to "Context window" since the naming can be confusing to the user. Additionally, it adds a new optional field called "Max output tokens".	2025-04-17 14:44:15 -07:00
Keegan George	1300cc8a36	FEATURE: Add streaming to composer helper (#1256 ) This update adding streaming to the AI helper inside the composer.	2025-04-14 08:18:50 -07:00
Roman Rizzi	df63e36ad8	FEATURE: Make Mixtral tokenizer available for embeddings (#1258 )	2025-04-11 12:01:38 -03:00
Keegan George	4de39a07e5	FEATURE: Configure persona backed features in admin panel (#1245 ) In this feature update, we add the UI for the ability to easily configure persona backed AI-features. The feature will still be hidden until structured responses are complete.	2025-04-10 08:16:31 -07:00
Roman Rizzi	0d60aca6ef	FEATURE: Personas powered summaries. (#1232 ) * REFACTOR: Move personas into it's own module. * WIP: Use personas for summarization * Prioritize persona default LLM or fallback to newest one * Simplify summarization strategy * Keep ai_sumarization_model as a fallback	2025-04-02 12:54:47 -03:00
Keegan George	bf5ccb452c	FEATURE: Continue conversation from Discobot discovery (#1234 ) This feature update allows for continuing the conversation with Discobot Discoveries in an AI bot chat. After discoveries gives you a response to your search you can continue with the existing context.	2025-04-01 10:22:39 -07:00

1 2 3 4 5 ...

306 Commits