discourse-ai

mirror of https://github.com/discourse/discourse-ai.git synced 2025-07-06 06:22:19 +00:00

Author	SHA1	Message	Date
Sam	ab5edae121	FIX: make AI helper more robust (#1484 ) * FIX: make AI helper more robust - If JSON is broken for structured output then lean on a more forgiving parser - Gemini 2.5 flash does not support temp, support opting out - Evals for assistant were broken, fix interface - Add some missing LLMs - Translator was not mapped correctly to the feature - fix that - Don't mix XML in prompt for translator * lint * correct logic * simplify code * implement best effort json parsing direct in the structured output object	2025-07-04 14:47:11 +10:00
Joffrey JAFFEUX	0f904977a4	FIX: correctly translate automation name (#1485 ) * FIX: correctly translation automation name The key is persona, not name. * simplify	2025-07-04 13:01:45 +10:00
Natalie Tay	2b9a4f9232	FIX: Ignore captions and quotes when detecting locale and update prompts (#1483 ) A more deterministic way of making sure the LLM detects the correct language (instead of relying on prompt to LLM to ignore it) is to take the cooked and remove unwanted elements. In this commit - we remove quotes, image captions, etc. and only take the remaining text, falling back to the unadulterated cooked - and update prompts related to detection and translation - /152465/12	2025-07-03 22:57:48 +08:00
moin-Jana	8b4f401a7b	FIX: Typo in custom_tool_exists text (#1480 )	2025-07-03 10:17:31 -03:00
Discourse Translator Bot	92e3615378	Update translations (#1479 )	2025-07-02 22:36:46 +02:00
Rafael dos Santos Silva	d792919ddf	DEV: Move tokenizers to a gem (#1481 ) Also renames the Mixtral tokenizer to Mistral. See gem at github.com/discourse/discourse_ai-tokenizers Co-authored-by: Roman Rizzi <roman@discourse.org>	2025-07-02 14:43:03 -03:00
Roman Rizzi	75fb37144f	FEATURE: Use personas for generating hypothetical posts (#1482 ) * FEATURE: Use personas for generating hypothetica posts * Update prompt	2025-07-02 10:56:38 -03:00
Sam	40fa527633	FIX: cross talk when in ai helper (#1478 ) Previous to this change we reused channels for proofreading progress and ai helper progress The new changeset ensures each POST to stream progress gets a dedicated message bus channel This fixes a class of issues where the wrong information could be displayed to end users on subsequent proofreading or helper calls * fix tests * fix implementation (got to subscribe at 0)	2025-07-01 18:02:16 +10:00
moin-Jana	897f31e564	UX: Fix typo in bot.description text (#1474 ) Introducing a typo isn't the right way to bypass the check that blocks the term "private messages". Nice try, though ;) I changed it to "personal messages".	2025-07-01 17:24:05 +10:00
Yuriy Kurant	8527279594	dev: removes messages section from sidebar (#1477 )	2025-07-01 17:23:11 +10:00
Kris	4ad64ed3b6	DEV: replace sortBy with toSorted (#1476 )	2025-06-30 16:41:59 -04:00
Roman Rizzi	5ca7d5f256	FIX: Strip uploads from msg when searching for rag fragments (#1475 )	2025-06-30 15:03:17 -03:00
Natalie Tay	a94daa14e2	FIX: Return no topics when embeddings is disabled (#1473 ) When an invalid model is set for embeddings, topics do not load even if embeddings is disabled. Error: ## RuntimeError in TopicsController#show Invalid embeddings selected model This commit checks for valid settings before attempting to load related topics.	2025-06-30 17:45:04 +08:00
Kris	262bd8b145	UX: add filter to features page, update styles (#1471 ) * UX: add filter to features page, update styles * merge fix * update toggle spec * test fix	2025-06-30 09:26:53 +10:00
Roman Rizzi	57b00526f8	FIX: Clarify spam response expectations. (#1470 )	2025-06-27 16:59:55 -03:00
Roman Rizzi	8d943fa29d	FEATURE: Display spam module on features list. (#1469 )	2025-06-27 14:18:01 -03:00
Roman Rizzi	b35f9bcc7c	FEATURE: Use Persona's when scanning posts for spam (#1465 )	2025-06-27 10:35:47 -03:00
Sam	cc4e9e030f	FIX: normalize keys in structured output (#1468 ) * FIX: normalize keys in structured output Previously we did not validate the hash passed in to structured outputs which could either be string based or symbol base Specifically this broke structured outputs for Gemini in some specific cases. * comment out flake	2025-06-27 15:42:48 +10:00
Sam	73768ce920	FEATURE: Display bot in feature list (#1466 ) - allows features to have multiple llms and multiple personas - sorts module list - adds Bot as a first class module - fixes issue where search module was always configured - some tests	2025-06-27 12:35:41 +10:00
Rafael dos Santos Silva	a40e2d3156	FEATURE: Update OpenAI tokenizer to GPT-4o and later (#1467 )	2025-06-26 15:26:09 -03:00
Kris	2fe99a0bec	UX: add missing translation for uploads (#1464 )	2025-06-25 11:36:00 -04:00
Sam	3e74f09d06	FEATURE: improve custom tool infra (#1463 ) - Add support for `chain.streamCustomRaw(test)` that can be used to stream text from a JS tool direct to composer - Add support for llm params in `llm.generate` which unlocks stuff like structured outputs - Add discourse.createStagedUser, discourse.createTopic and discourse.createPost - for content creation	2025-06-25 16:25:44 +10:00
Discourse Translator Bot	3cfc749fad	Update translations (#1462 )	2025-06-24 16:29:23 +02:00
Jarek Radosz	5735f063a3	FIX: A typo in bot filtration in ai-bot-header-icon (#1455 ) * FIX: A typo in bot filtration in ai-bot-header-icon * FIX: Show header icon when there's only one persona with a default LLM set --------- Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>	2025-06-24 10:51:07 -03:00
Kris	757c93e514	UX: make topic list gists link to the topic (#1459 )	2025-06-24 09:14:18 -04:00
Natalie Tay	4c1cd5d819	UX: Align llm button in ai features (#1461 )	2025-06-24 17:23:58 +08:00
Sam	471f96f972	FEATURE: allow seeing configured LLM on feature page (#1460 ) This is an interim fix so we can at least tell what feature is being used for what LLM. It also adds some test coverage to the feature page.	2025-06-24 17:42:47 +10:00
dependabot[bot]	1f851bb2e1	bump rack from 3.1.14 to 3.1.16 (#1408 ) --- updated-dependencies: - dependency-name: rack dependency-version: 3.1.16 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-06-24 12:42:22 +10:00
Sam	9f2a4094f5	FEATURE: persona/tool import and export (#1450 ) Introduces import/export feature for tools and personas. Uploads are omitted for now, and will be added in a future PR * Backend: * Adds `import` and `export` actions to `Admin::AiPersonasController` and `Admin::AiToolsController`. * Introduces `DiscourseAi::PersonaExporter` and `DiscourseAi::PersonaImporter` services to manage JSON serialization and deserialization. * The export format for a persona embeds its associated custom tools. To ensure portability, `AiTool` references are serialized using their `tool_name` rather than their internal database `id`. * The import logic detects conflicts by name. A `force=true` parameter can be passed to overwrite existing records. * Frontend: * `AiPersonaListEditor` and `AiToolListEditor` components now include an "Import" button that handles file selection and POSTs the JSON data to the respective `import` endpoint. * `AiPersonaEditorForm` and `AiToolEditorForm` components feature an "Export" button that triggers a download of the serialized record. * Handles import conflicts (HTTP `409` for tools, `422` for personas) by showing a `dialog.confirm` prompt to allow the user to force an overwrite. * Testing: * Adds comprehensive request specs for the new controller actions (`#import`, `#export`). * Includes unit specs for the `PersonaExporter` and `PersonaImporter` services. * Persona import and export implemented	2025-06-24 12:41:10 +10:00
Roman Rizzi	eea96d6df9	FIX: Include JSON instructions in Helper default personas (#1458 )	2025-06-23 11:57:50 -03:00
Natalie Tay	683bb5725b	DEV: Split content based on llmmodel's max_output_tokens (#1456 ) In discourse/discourse-translator#249 we introduced splitting content (post.raw) prior to sending to translation as we were using a sync api. Now that we're streaming thanks to #1424, we'll chunk based on the LlmModel.max_output_tokens.	2025-06-23 21:11:20 +08:00
Natalie Tay	740be26625	DEV: Also make sure locale detection skips PMs that are not group PMs when public content only (#1457 ) In the earlier PR https://github.com/discourse/discourse-ai/pull/1432, when `SiteSetting.ai_translation_backfill_limit_to_public_content = false`, we translate PMs but skip translating PMs that do not involve groups. This commit covers the missing case on locale detection.	2025-06-23 19:07:40 +08:00
Natalie Tay	e2d7ca0bb9	DEV: Indicate backfill rate for translations is hourly (#1451 ) * DEV: Indicate backfill rate for translations is hourly * add ai_translation_max_post_length * default value update	2025-06-21 15:45:09 +08:00
Keegan George	238538c405	DEV: Remove deprecated integer duration in toasts (#1453 ) This update replaces deprecated integer duration with standardized duration of either `short` or `long` throughout the plugin usages of `FloatKit` toasts.	2025-06-20 12:42:08 -07:00
Keegan George	a4194d3fb2	FIX: AI preferences tab button not appearing unless Helper enabled (#1452 ) This update fixes an issue where the AI user preferences tab was not appearing unless `SiteSetting.ai_helper_enabled` was `true`. This is because we previously checked for it's presence when user preferences only had a single setting related to Helper. However, since then, we've also added search discoveries setting there too. As such, we don't want it to depend on Helper. We also sneak in this update a modernization of converting the preferences template from `.hbs` to `.gjs`.	2025-06-20 10:12:08 -07:00
Sam	eab6dd3f8e	DEV: re-implement bulk sentiment classifier (#1449 ) New implementation uses core concurrent job queue, it is more robust and predictable than the one shipped in Concurrent. Additionally: - Trickles through updates during bulk classification - Reports errors if we fail during a bulk classification * push concurrency down to 40. 100 feels quite high.	2025-06-20 16:06:03 +10:00
Keegan George	baaa3d199a	FIX: streaming related specs (#1448 ) ## 🔍 Overview This update fixes an issue where message bus streaming related specs were not working correctly. To do so we pass the `last_id` when subscribing to `MessageBus` which allows us to unskip those broken tests. --------- Co-authored-by: Joffrey JAFFEUX <j.jaffeux@gmail.com>	2025-06-19 07:41:18 -07:00
Joffrey JAFFEUX	6a33e5154d	DEV: makes ai menu helper a standalone menu (#1434 ) The current menu was rendering inside the post text toolbar (on desktop). This is not ideal as the post text toolbar rendering is conditioned on the presence of text selection, when you click a button on the toolbar, by design of the web browsers you will lose your text selection, making all of this super tricky. This commit makes desktop and mobile behave in the same way by rendering their own menu and capturing the quote state when we render the post text selection toolbar, this allows us to reason a much simpler way about the AI helper. This commit also removes what appears to be an unused file and corrects which was seemingly copy/paste mistakes. ⚠️ Technical note, this commit is correcting the message bus subscription which amongst other things allows to write specs which are not flaky. However due to the current implementation we have a channel per post, which means we need to serialize on last message bus id per post. We have two possible solutions here: - subscribe at the topic level - refactor the code to be able to use `MessageBus.last_ids` to be able to grab multiple posts at once instead of having to call `MessageBus.last_id` and done one Redis call per post --------- Co-authored-by: Keegan George <kgeorge13@gmail.com>	2025-06-19 11:56:00 +02:00
Sam	37dbd48513	FIX: implement max_output tokens (anthropic/openai/bedrock/gemini/open router) (#1447 ) * FIX: implement max_output tokens (anthropic/openai/bedrock/gemini/open router) Previously this feature existed but was not implemented Also updates a bunch of models to in our preset to point to latest * implementing in base is safer, simpler and easier to manage * anthropic 3.5 is getting older, lets use 4.0 here and fix spec	2025-06-19 16:00:11 +10:00
Natalie Tay	3e87e92631	DEV: Remove 'experimental' from translation features (#1439 ) * DEV: Remove 'experimental' from translation features * include compat * include compat	2025-06-19 12:23:56 +08:00
Mark VanLandingham	cd14b0c0be	FIX: Bring back empty state message when appropriate (#1446 ) The Today section was added always, but a side-effect was that we hid the empty state component. This commit brings back the empty state	2025-06-18 17:34:08 -05:00
Keegan George	cea8fd423e	FIX: unable to scroll AI bot persona selector (#1445 ) This update fixes a UX issue on the AI bot conversations page where the persona selector dropdown doesn't scroll when there are many items and the viewport is small. No tests as it's tricky to test scrolling.	2025-06-18 12:20:50 -07:00
Natalie Tay	d7a2af5505	DEV: Prevent multiple translation per post (#1443 ) We're seeing an aggressive number of translations being enqueued for a single post and locale. Historically, we trigger translation on `cooked` not `raw`, but that has changed a while back. ``` # from AiApiAuditLog, the same post is getting translated to the same locale within a few secs of each other zh_CN - 2025-06-17 13:02:31 UTC zh_CN - 2025-06-17 13:02:34 UTC zh_CN - 2025-06-17 13:02:35 UTC zh_CN - 2025-06-17 13:02:36 UTC zh_CN - 2025-06-17 13:02:38 UTC zh_CN - 2025-06-17 13:02:39 UTC zh_CN - 2025-06-17 13:02:40 UTC zh_CN - 2025-06-17 13:02:40 UTC zh_CN - 2025-06-17 13:02:43 UTC zh_CN - 2025-06-17 13:02:44 UTC ``` This PR prevents this from happening.	2025-06-18 13:24:02 +08:00
Discourse Translator Bot	6dbe19a772	Update translations (#1441 )	2025-06-17 23:07:50 +02:00
Keegan George	62d746662a	FIX: Cleanup properties on closing `DiffModal` (#1442 ) This update ensures that we reset the tracked properties when closing the DiffModal so that the state doesn't leak when triggering the AI suggestions again. We also reset before suggesting new changes, thus if regeneration is called there shouldn't be any leaks either. No tests in this PR as tests currently not working great due to streaming/animation issues. Will do a broader PR following up with various specs to improve test coverage here.	2025-06-17 13:57:46 -07:00
Rafael dos Santos Silva	9dccc1eb93	FEATURE: Add Qwen3 tokenizer and update Gemma to version 3 (#1440 )	2025-06-17 10:25:03 -03:00
Natalie Tay	df925f8304	DEV: Move examples out of prompt (#1438 ) * DEV: Move examples out of prompt	2025-06-17 16:12:52 +08:00
Sam	32dc45ba4f	FIX: never block spam scanning user (#1437 ) Previously staff and bots would get scanned if TL was low Additionally if somehow spam scanner user was blocked (deactivated, silenced, banned) it would stop the feature from working This adds an override that ensures unconditionally the user is setup correctly prior to scanning	2025-06-17 14:51:27 +10:00
Rafael dos Santos Silva	bc8e57d7e8	DEV: Move title suggestion to an array (#1435 )	2025-06-16 18:06:54 -03:00
Kris	24416c5b87	UX: focus conversation input on route transition and button click (#1404 )	2025-06-13 17:45:51 -04:00

1 2 3 4 5 ...

1430 Commits