discourse-ai

mirror of https://github.com/discourse/discourse-ai.git synced 2025-09-08 20:50:38 +00:00

Author	SHA1	Message	Date
Jarek Radosz	d4501f2928	stylelint fix	2025-06-20 23:54:40 +02:00
Kris	22da440130	UX: add features to persona list and other style updates (#1405 )	2025-06-12 08:23:10 -04:00
Sam	02bc9f645e	FEATURE: hybrid artifact security mode (#1431 ) In hybrid mode ai artifacts can optionally automatically run. This is useful for cases where you may want to embed a survey and so on. Additionally, artifacts now allow for better fidelity around display: <div class="ai-artifact" data-ai-artifact-id="501" data-ai-artifact-height="300px" data-ai-artifact-autorun data-ai-artifact-seamless></div> User can supply height and seamless mode to be seamlessly rendered with no box shadow and show full screen button.	2025-06-12 20:04:48 +10:00
Sam	fdf0ff8a25	FEATURE: persistent key-value storage for AI Artifacts (#1417 ) Introduces a persistent, user-scoped key-value storage system for AI Artifacts, enabling them to be stateful and interactive. This transforms artifacts from static content into mini-applications that can save user input, preferences, and other data. The core components of this feature are: 1. Model and API: - A new `AiArtifactKeyValue` model and corresponding database table to store data associated with a user and an artifact. - A new `ArtifactKeyValuesController` provides a RESTful API for CRUD operations (`index`, `set`, `destroy`) on the key-value data. - Permissions are enforced: users can only modify their own data but can view public data from other users. 2. Secure JavaScript Bridge: - A `postMessage` communication bridge is established between the sandboxed artifact `iframe` and the parent Discourse window. - A JavaScript API is exposed to the artifact as `window.discourseArtifact` with async methods: `get(key)`, `set(key, value, options)`, `delete(key)`, and `index(filter)`. - The parent window handles these requests, makes authenticated calls to the new controller, and returns the results to the iframe. This ensures security by keeping untrusted JS isolated. 3. AI Tool Integration: - The `create_artifact` tool is updated with a `requires_storage` boolean parameter. - If an artifact requires storage, its metadata is flagged, and the system prompt for the code-generating AI is augmented with detailed documentation for the new storage API. 4. Configuration: - Adds hidden site settings `ai_artifact_kv_value_max_length` and `ai_artifact_max_keys_per_user_per_artifact` for throttling. This also includes a minor fix to use `jsonb_set` when updating artifact metadata, ensuring other metadata fields are preserved.	2025-06-11 06:59:46 +10:00
Martin Brennan	1fb358a9d0	UX: Style tweaks for RAG uploader and form width (#1407 ) This commit changes the RAG uploader form elements to be @format="full" instead of doing a hardcoded 500px width, which was causing a horizontal scrollbar in the tools form on mobile. Also, it moves the 80% max width for the tools form into the new viewport CSS API, and only applies it on desktop, because this was also causing width issues on mobile.	2025-06-05 12:40:00 +10:00
David Taylor	4ce8973e56	PERF: Optimize `.ai-debug-modal__tokens` selector (#1390 ) This is showing as the most expensive CSS selector in Discourse at the moment. Adding specific classes and dropping the general `span` selector will make this much cheaper.	2025-05-30 21:47:30 +01:00
Roman Rizzi	c0a2d4c935	DEV: Use structured responses for summaries (#1252 ) * DEV: Use structured responses for summaries * Fix system specs * Make response_format a first class citizen and update endpoints to support it * Response format can be specified in the persona * lint * switch to jsonb and make column nullable * Reify structured output chunks. Move JSON parsing to the depths of Completion * Switch to JsonStreamingTracker for partial JSON parsing	2025-05-06 10:09:39 -03:00
Kris	81a664b3da	UX: put full page search discoveries in sidebar (#1289 )	2025-04-30 12:01:21 -04:00
Sam	7dc3c30fa4	FEATURE: correctly decorate AI bots (#1300 ) AI bots come in 2 flavors 1. An LLM and LLM user, in this case we should decorate posts with persona name 2. A Persona user, in this case, in PMs we decorate with LLM name (2) is a significant improvement, cause previously when creating a conversation you could not tell which LLM you were talking to by simply looking at the post, you would have to scroll to the top of the page. * lint * translation missing	2025-04-30 16:36:38 +10:00
Joffrey JAFFEUX	d2002f81a7	DEV: migrates tools form to form-kit (#1204 ) This PR is a retry of: #1135, where we migrate AiTools form to FormKit. The previous PR accidentally removed code related to setting enum values, and as a result was reverted. This update includes enums correctly along with the previous updates.	2025-04-22 09:23:25 -07:00
Kris	32da999144	FIX: less generic animation names (#1243 )	2025-04-02 11:28:10 -04:00
Kris	7b56d7d4fc	UX: adjust artifact UI styles (#1240 )	2025-04-01 16:11:36 -04:00
Keegan George	bf5ccb452c	FEATURE: Continue conversation from Discobot discovery (#1234 ) This feature update allows for continuing the conversation with Discobot Discoveries in an AI bot chat. After discoveries gives you a response to your search you can continue with the existing context.	2025-04-01 10:22:39 -07:00
Kris	5331b6dd8e	UX: wider search pane, border, smaller font size (#1238 )	2025-04-01 12:57:32 -04:00
Keegan George	f3e78f0d80	FIX: Search discoveries improvements (#1228 ) This update makes a few improvements to search discoveries: - [x] in search menu panel: search discoveries should still be triggered when no regular results are present - [x] in full page search: search discoveries should still be triggered when no regular results are present - [x] flakiness in search discoveries sometimes not working properly. --------- Co-authored-by: awesomerobot <kris.aubuchon@discourse.org>	2025-03-31 08:38:40 -07:00
Roman Rizzi	2a8be6e2d7	REFACTOR: Migrate Personas' form to FormKit (#1178 ) * REFACTOR: Migrate Personas' form to FormKit We re-arranged fields into sections so we can better differentiate which options are specific to the AI bot. * few form-kit improvements https://github.com/discourse/discourse/pull/31934 --------- Co-authored-by: Joffrey JAFFEUX <j.jaffeux@gmail.com>	2025-03-21 14:46:33 -03:00
Sam	b6483e416d	Revert "DEV: Convert tool editor to form kit (#1135 )" (#1201 ) This reverts commit 107f14456b0e4b51fd2b934ee7cceced78b2e0cc. enum was not handled, so reverting for now	2025-03-18 18:07:04 +11:00
Keegan George	107f14456b	DEV: Convert tool editor to form kit (#1135 ) * DEV: Make tool presets a dropdown * DEV: Select tool presets via DMenu instead * WIP * WIP: Add parameter types, uploader, script, etc. * WIP * updates * fix lint * FIX: spec * fixes	2025-03-17 11:38:25 -03:00
Kris	24e6aa52bb	UX: try AI search to side on large screens (#1196 )	2025-03-14 14:11:02 -04:00
Jarek Radosz	ec8018333e	DEV: Update linting (#1191 )	2025-03-13 13:25:38 +00:00
Penar Musaraj	ac29d3080f	DEV: Fix SCSS linting issue (#1187 )	2025-03-12 13:25:17 -04:00
Keegan George	bb32d0d737	FEATURE: Add ability to disable search discoveries (#1177 ) This update adds the ability to disable search discoveries. This can be done through a tooltip when search discoveries are shown. It can also be done in the AI user preferences, which has also been updated to accommodate more than just the one image caption setting.	2025-03-10 14:17:58 -07:00
Keegan George	e15952031d	UX: Smoother streaming for discoveries (#1154 ) ## 🔍 Overview This update ensures that the streaming for discoveries is smoother, especially on first update. ## ➕ More details To help with smoother streaming, the discovery preview (which was being tracked as a separate property in the JS logic) will be removed and the entire discovery content will be shown/hidden via the existing CSS. The preview was already receiving the full update even though it was visually hidden, so removing the separate property shouldn't have any negative performance hit. Visually hiding it with CSS only will help simplify the component and also allow for smoother streaming. We will instead remove the buffered streaming approach and instead use typing timers similar to what we did for streaming summarization. No related tests as streaming animations are difficult to test.	2025-02-27 07:32:39 -08:00
Kris	55dde0a9e6	UX: minor adjustments to search bot (#1146 )	2025-02-21 19:40:53 -05:00
Roman Rizzi	f922012499	UX: Display a tooltip signalling this is an AI powered feature (#1141 )	2025-02-20 16:21:26 -03:00
Roman Rizzi	6765a13a40	FEATURE: Experimental search results from an AI Persona. (#1139 ) * FEATURE: Experimental search results from an AI Persona. When a user searches discourse, we'll send the query to an AI Persona to provide additional context and enrich the results. The feature depends on the user being a member of a group to which the persona has access. * Update assets/stylesheets/common/ai-blinking-animation.scss Co-authored-by: Keegan George <kgeorge13@gmail.com> --------- Co-authored-by: Keegan George <kgeorge13@gmail.com>	2025-02-20 14:37:58 -03:00
Sam	7ca21cc329	FEATURE: first class support for OpenRouter (#1011 ) * FEATURE: first class support for OpenRouter This new implementation supports picking quantization and provider pref Also: - Improve logging for summary generation - Improve error message when contacting LLMs fails * Better support for full screen artifacts on iPad Support back button to close full screen	2024-12-10 05:59:19 +11:00
Sam	0d7f353284	FEATURE: AI artifacts (#898 ) This is a significant PR that introduces AI Artifacts functionality to the discourse-ai plugin along with several other improvements. Here are the key changes: 1. AI Artifacts System: - Adds a new `AiArtifact` model and database migration - Allows creation of web artifacts with HTML, CSS, and JavaScript content - Introduces security settings (`strict`, `lax`, `disabled`) for controlling artifact execution - Implements artifact rendering in iframes with sandbox protection - New `CreateArtifact` tool for AI to generate interactive content 2. Tool System Improvements: - Adds support for partial tool calls, allowing incremental updates during generation - Better handling of tool call states and progress tracking - Improved XML tool processing with CDATA support - Fixes for tool parameter handling and duplicate invocations 3. LLM Provider Updates: - Updates for Anthropic Claude models with correct token limits - Adds support for native/XML tool modes in Gemini integration - Adds new model configurations including Llama 3.1 models - Improvements to streaming response handling 4. UI Enhancements: - New artifact viewer component with expand/collapse functionality - Security controls for artifact execution (click-to-run in strict mode) - Improved dialog and response handling - Better error management for tool execution 5. Security Improvements: - Sandbox controls for artifact execution - Public/private artifact sharing controls - Security settings to control artifact behavior - CSP and frame-options handling for artifacts 6. Technical Improvements: - Better post streaming implementation - Improved error handling in completions - Better memory management for partial tool calls - Enhanced testing coverage 7. Configuration: - New site settings for artifact security - Extended LLM model configurations - Additional tool configuration options This PR significantly enhances the plugin's capabilities for generating and displaying interactive content while maintaining security and providing flexible configuration options for administrators.	2024-11-19 09:22:39 +11:00
Sam	bdf3b6268b	FEATURE: smarter persona tethering (#832 ) Splits persona permissions so you can allow a persona on: - chat dms - personal messages - topic mentions - chat channels (any combination is allowed) Previously we did not have this flexibility. Additionally, adds the ability to "tether" a language model to a persona so it will always be used by the persona. This allows people to use a cheaper language model for one group of people and more expensive one for other people	2024-10-16 07:20:31 +11:00
Sam	5cbc9190eb	FEATURE: RAG search within tools (#802 ) This allows custom tools access to uploads and sophisticated searches using embedding. It introduces: - A shared front end for listing and uploading files (shared with personas) - Backend implementation of index.search function within a custom tool. Custom tools now may search through uploaded files: `function invoke(params) { return index.search(params.query) }` This means that RAG implementers now may preload tools with knowledge and have high fidelity over the search. The search function supports: - specifying max results - specifying a subset of files to search (from uploads) Also - Improved documentation for tools (when creating a tool a preamble explains all the functionality) - uploads were a bit finicky, fixed an edge case where the UI would not show them as updated	2024-09-30 17:27:50 +10:00
Keegan George	fdadfa029e	FEATURE: smooth streaming animation for summarization (#778 )	2024-08-29 15:07:07 -07:00
Sam	b863ddc94b	FEATURE: custom user defined tools (#677 ) Introduces custom AI tools functionality. 1. Why it was added: The PR adds the ability to create, manage, and use custom AI tools within the Discourse AI system. This feature allows for more flexibility and extensibility in the AI capabilities of the platform. 2. What it does: - Introduces a new `AiTool` model for storing custom AI tools - Adds CRUD (Create, Read, Update, Delete) operations for AI tools - Implements a tool runner system for executing custom tool scripts - Integrates custom tools with existing AI personas - Provides a user interface for managing custom tools in the admin panel 3. Possible use cases: - Creating custom tools for specific tasks or integrations (stock quotes, currency conversion etc...) - Allowing administrators to add new functionalities to AI assistants without modifying core code - Implementing domain-specific tools for particular communities or industries 4. Code structure: The PR introduces several new files and modifies existing ones: a. Models: - `app/models/ai_tool.rb`: Defines the AiTool model - `app/serializers/ai_custom_tool_serializer.rb`: Serializer for AI tools b. Controllers: - `app/controllers/discourse_ai/admin/ai_tools_controller.rb`: Handles CRUD operations for AI tools c. Views and Components: - New Ember.js components for tool management in the admin interface - Updates to existing AI persona management components to support custom tools d. Core functionality: - `lib/ai_bot/tool_runner.rb`: Implements the custom tool execution system - `lib/ai_bot/tools/custom.rb`: Defines the custom tool class e. Routes and configurations: - Updates to route configurations to include new AI tool management pages f. Migrations: - `db/migrate/20240618080148_create_ai_tools.rb`: Creates the ai_tools table g. Tests: - New test files for AI tool functionality and integration The PR integrates the custom tools system with the existing AI persona framework, allowing personas to use both built-in and custom tools. It also includes safety measures such as timeouts and HTTP request limits to prevent misuse of custom tools. Overall, this PR significantly enhances the flexibility and extensibility of the Discourse AI system by allowing administrators to create and manage custom AI tools tailored to their specific needs. Co-authored-by: Martin Brennan <martin@discourse.org>	2024-06-27 17:27:40 +10:00
Sam	52a7dd2a4b	FEATURE: optional tool detail blocks (#662 ) This is a rather huge refactor with 1 new feature (tool details can be suppressed) Previously we use the name "Command" to describe "Tools", this unifies all the internal language and simplifies the code. We also amended the persona UI to use less DToggles which aligns with our design guidelines. Co-authored-by: Martin Brennan <martin@discourse.org>	2024-06-11 18:14:14 +10:00
Sam	f8381c0e8a	UX: suppress "this is a warning" (#636 ) When triggering a PM from new-message route, we still had the UI for "this is an official warning" This removes that UI from bot messages, which is all clutter.	2024-05-23 12:55:33 +08:00
Roman Rizzi	d8ebed8fb5	UX: Follow plugin user interface UI guidelines. (#628 )	2024-05-16 14:28:57 -03:00
Sam	cb23ae614f	UX: Remove multi llm selector from header and move to composer (#619 ) LLM selector control had no memory and was awkward to click. Instead we now: - Clearly display which llm you are talking to - Allow you to change llm direct from composer	2024-05-14 17:54:54 +10:00
Sam	514823daca	FIX: streaming broken in bedrock when chunks are not aligned (#609 ) Also - Stop caching llm list - this cause llm list in persona to be incorrect - Add more UI to debug screen so you can properly see raw response	2024-05-09 12:11:50 +10:00
Sam	e4b326c711	FEATURE: support Chat with AI Persona via a DM (#488 ) Add support for chat with AI personas - Allow enabling chat for AI personas that have an associated user - Add new setting `allow_chat` to AI persona to enable/disable chat - When a message is created in a DM channel with an allowed AI persona user, schedule a reply job - AI replies to chat messages using the persona's `max_context_posts` setting to determine context - Store tool calls and custom prompts used to generate a chat reply on the `ChatMessageCustomPrompt` table - Add tests for AI chat replies with tools and context At the moment unlike posts we do not carry tool calls in the context. No @mention support yet for ai personas in channels, this is future work	2024-05-06 09:49:02 +10:00
Sam	4a29f8ed1c	FEATURE: Enhance AI debugging capabilities and improve interface adjustments (#577 ) * FIX: various RAG edge cases - Nicer text to describe RAG, avoids the word RAG - Do not attempt to save persona when removing uploads and it is not created - Remove old code that avoided touching rag params on create * FIX: Missing pause button for persona users * Feature: allow specific users to debug ai request / response chains This can help users easily tune RAG and figure out what is going on with requests. * discourse helper so it does not explode * fix test * simplify implementation	2024-04-15 23:22:06 +10:00
Sam	f6ac5cd0a8	FEATURE: allow tuning of RAG generation (#565 ) * FEATURE: allow tuning of RAG generation - change chunking to be token based vs char based (which is more accurate) - allow control over overlap / tokens per chunk and conversation snippets inserted - UI to control new settings * improve ui a bit * fix various reindex issues * reduce concurrency * try ultra low queue ... concurrency 1 is too slow.	2024-04-12 10:32:46 -03:00
Roman Rizzi	aa8918911d	UX: Display the indexing progress for RAG uploads (#557 )	2024-04-09 11:03:07 -03:00
Roman Rizzi	1f1c94e5c6	FEATURE: AI Bot RAG support. (#537 ) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations	2024-04-01 13:43:34 -03:00
Sam	61e4c56e1a	FEATURE: Add vision support to AI personas (Claude 3) (#546 ) This commit adds the ability to enable vision for AI personas, allowing them to understand images that are posted in the conversation. For personas with vision enabled, any images the user has posted will be resized to be within the configured max_pixels limit, base64 encoded and included in the prompt sent to the AI provider. The persona editor allows enabling/disabling vision and has a dropdown to select the max supported image size (low, medium, high). Vision is disabled by default. This initial vision support has been tested and implemented with Anthropic's claude-3 models which accept images in a special format as part of the prompt. Other integrations will need to be updated to support images. Several specs were added to test the new functionality at the persona, prompt building and API layers. - Gemini is omitted, pending API support for Gemini 1.5. Current Gemini bot is not performing well, adding images is unlikely to make it perform any better. - Open AI is omitted, vision support on GPT-4 is limited in that the API has no tool support when images are enabled so we would need to fall back to a different prompting technique, something that would add lots of complexity --------- Co-authored-by: Martin Brennan <martin@discourse.org>	2024-03-27 14:30:11 +11:00
Martin Brennan	fb0d56324f	FEATURE: Improve admin plugin UI and use new plugins show route (#512 ) This commit changes Discourse AI's admin plugin page to use the new plugin show route. The UI for persona editing has also been improved for consistency, and other plugin UIs will follow suit: Settings for the plugin are now listed in the plugin UI and can be changed from there directly after core PR discourse/discourse#26154 is merged. See also: * https://github.com/discourse/discourse/pull/26024 * https://github.com/discourse/discourse/pull/26154 * https://github.com/discourse/discourse/pull/26254	2024-03-21 14:29:56 +10:00
Sam	a03bc6ddec	FEATURE: Share conversations with AI via a URL (#521 ) This allows users to share a static page of an AI conversation with the rest of the world. By default this feature is disabled, it is enabled by turning on ai_bot_allow_public_sharing via site settings Precautions are taken when sharing 1. We make a carbonite copy 2. We minimize work generating page 3. We limit to 100 interactions 4. Many security checks - including disallowing if there is a mix of users in the PM. * Bonus commit, large PRs like this PR did not work with github tool large objects would destroy context Co-authored-by: Martin Brennan <martin@discourse.org>	2024-03-12 16:51:41 +11:00
David Taylor	114b96f2b4	DEV: Update to new header API and FloatKit (#516 )	2024-03-08 10:07:48 +00:00
Sam	3a8d95f6b2	FEATURE: mentionable personas and random picker tool, context limits (#466 ) 1. Personas are now optionally mentionable, meaning that you can mention them either from public topics or PMs - Mentioning from PMs helps "switch" persona mid conversation, meaning if you want to look up sites setting you can invoke the site setting bot, or if you want to generate an image you can invoke dall e - Mentioning outside of PMs allows you to inject a bot reply in a topic trivially - We also add the support for max_context_posts this allow you to limit the amount of context you feed in, which can help control costs 2. Add support for a "random picker" tool that can be used to pick random numbers 3. Clean up routing ai_personas -> ai-personas 4. Add Max Context Posts so users can control how much history a persona can consume (this is important for mentionable personas) Co-authored-by: Martin Brennan <martin@discourse.org>	2024-02-15 16:37:59 +11:00
Kris	900df4e8c8	UX: start progress dot animation instantly if it's the only content (#437 )	2024-01-22 13:10:51 -05:00
Sam	8df966e9c5	FEATURE: smooth streaming of AI responses on the client (#413 ) This PR introduces 3 things: 1. Fake bot that can be used on local so you can test LLMs, to enable on dev use: SiteSetting.ai_bot_enabled_chat_bots = "fake" 2. More elegant smooth streaming of progress on LLM completion This leans on JavaScript to buffer and trickle llm results through. It also amends it so the progress dot is much more consistently rendered 3. It fixes the Claude dialect Claude needs newlines exactly at the right spot, amended so it is happy --------- Co-authored-by: Martin Brennan <martin@discourse.org>	2024-01-11 15:56:40 +11:00
Sam	05f7808057	FEATURE: more elegant progress (#409 ) Previous to this change it was very hard to tell if completion was stuck or not. This introduces a "dot" that follows the completion and starts flashing after 5 seconds.	2024-01-09 09:20:28 -03:00
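
Commit 5cbc9190eb above quotes a custom tool script that calls `index.search` from `invoke`. The sketch below shows that shape in a form that runs on its own. The `index` object here is a hypothetical stand-in (a plain substring match over an assumed `uploadedFragments` array), not the plugin's embedding-based search, and the return shape is likewise an assumption since the log does not specify it.

```javascript
// Hypothetical sample data standing in for fragments of uploaded files.
const uploadedFragments = [
  "Refunds are processed within 5 business days.",
  "Shipping is free for orders over $50.",
  "Contact support to request a refund.",
];

// Hypothetical stand-in for the `index` object the tool runtime provides.
// The real search uses embeddings; this stub only does a case-insensitive
// substring match so the example is self-contained.
const index = {
  search(query) {
    const needle = query.toLowerCase();
    return uploadedFragments.filter((text) =>
      text.toLowerCase().includes(needle)
    );
  },
};

// Same shape as the tool script quoted in the commit message.
function invoke(params) {
  return index.search(params.query);
}

console.log(invoke({ query: "refund" }));
```

In an actual custom tool only the `invoke` function would be written; `index` is supplied by the tool runner described in the commit.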

61 Commits