discourse-ai

mirror of https://github.com/discourse/discourse-ai.git synced 2025-02-18 17:34:52 +00:00

Author	SHA1	Message	Date
Sam	5e80f93e4c	FEATURE: PDF support for rag pipeline (#1118 ) This PR introduces several enhancements and refactorings to the AI Persona and RAG (Retrieval-Augmented Generation) functionalities within the discourse-ai plugin. Here's a breakdown of the changes: 1. LLM Model Association for RAG and Personas: - New Database Columns: Adds `rag_llm_model_id` to both `ai_personas` and `ai_tools` tables. This allows specifying a dedicated LLM for RAG indexing, separate from the persona's primary LLM. Adds `default_llm_id` and `question_consolidator_llm_id` to `ai_personas`. - Migration: Includes a migration (`20250210032345_migrate_persona_to_llm_model_id.rb`) to populate the new `default_llm_id` and `question_consolidator_llm_id` columns in `ai_personas` based on the existing `default_llm` and `question_consolidator_llm` string columns, and a post migration to remove the latter. - Model Changes: The `AiPersona` and `AiTool` models now `belong_to` an `LlmModel` via `rag_llm_model_id`. The `LlmModel.proxy` method now accepts an `LlmModel` instance instead of just an identifier. `AiPersona` now has `default_llm_id` and `question_consolidator_llm_id` attributes. - UI Updates: The AI Persona and AI Tool editors in the admin panel now allow selecting an LLM for RAG indexing (if PDF/image support is enabled). The RAG options component displays an LLM selector. - Serialization: The serializers (`AiCustomToolSerializer`, `AiCustomToolListSerializer`, `LocalizedAiPersonaSerializer`) have been updated to include the new `rag_llm_model_id`, `default_llm_id` and `question_consolidator_llm_id` attributes. 2. PDF and Image Support for RAG: - Site Setting: Introduces a new hidden site setting, `ai_rag_pdf_images_enabled`, to control whether PDF and image files can be indexed for RAG. This defaults to `false`. - File Upload Validation: The `RagDocumentFragmentsController` now checks the `ai_rag_pdf_images_enabled` setting and allows PDF, PNG, JPG, and JPEG files if enabled. 
Error handling is included for cases where PDF/image indexing is attempted with the setting disabled. - PDF Processing: Adds a new utility class, `DiscourseAi::Utils::PdfToImages`, which uses ImageMagick (`magick`) to convert PDF pages into individual PNG images. A maximum PDF size and conversion timeout are enforced. - Image Processing: A new utility class, `DiscourseAi::Utils::ImageToText`, is included to handle OCR for the images and PDFs. - RAG Digestion Job: The `DigestRagUpload` job now handles PDF and image uploads. It uses `PdfToImages` and `ImageToText` to extract text and create document fragments. - UI Updates: The RAG uploader component now accepts PDF and image file types if `ai_rag_pdf_images_enabled` is true. The UI text is adjusted to indicate supported file types. 3. Refactoring and Improvements: - LLM Enumeration: The `DiscourseAi::Configuration::LlmEnumerator` now provides a `values_for_serialization` method, which returns a simplified array of LLM data (id, name, vision_enabled) suitable for use in serializers. This avoids exposing unnecessary details to the frontend. - AI Helper: The `AiHelper::Assistant` now takes optional `helper_llm` and `image_caption_llm` parameters in its constructor, allowing for greater flexibility. - Bot and Persona Updates: Several updates were made across the codebase, changing the string based association to a LLM to the new model based. - Audit Logs: The `DiscourseAi::Completions::Endpoints::Base` now formats raw request payloads as pretty JSON for easier auditing. - Eval Script: An evaluation script is included. 4. Testing: - The PR introduces a new eval system for LLMs, this allows us to test how functionality works across various LLM providers. This lives in `/evals`	2025-02-14 12:15:07 +11:00
Joffrey JAFFEUX	e2afbc26d3	FIX: correctly handle provider edit (#1125 ) Prior to this commit, editing the provider wouldn't recompute the provider params. It would also not correctly recompute the "canEditURL" property. To make this possible, this commit has: - made a fix in core: https://github.com/discourse/discourse/pull/31329 - ensured the provider params are recomputed when provider is changed - made the check on `canEditURL` based on form state and not initial model value Tests have been added to confirm the expected behavior.	2025-02-13 12:03:13 +01:00
Joffrey JAFFEUX	40e996b174	DEV: converts llm admin page to use form kit (#1099 ) This also converts the quota editor, and the quota modal.	2025-02-04 11:51:01 +01:00
Kris	99e73f09ff	UX: improve embeddings config styles (#1085 ) * WIP: improve embeddings config styles * switch to textarea, fix back button * remove log, update button, fix tests * stree * fix spec * spec fix * remove comment	2025-01-24 16:24:59 +11:00
Roman Rizzi	3b66fb3e87	FIX: Restore the accidentally deleted query prefix. (#1079 ) Additionally, we add a prefix for embedding generation. Both are stored in the definitions table.	2025-01-21 14:10:31 -03:00
Roman Rizzi	f5cf1019fb	FEATURE: configurable embeddings (#1049 ) * Use AR model for embeddings features * endpoints * Embeddings CRUD UI * Add presets. Hide a couple more settings * system specs * Seed embedding definition from old settings * Generate search bit index on the fly. cleanup orphaned data * support for seeded models * Fix run test for new embedding * fix selected model not set correctly	2025-01-21 12:23:19 -03:00
Roman Rizzi	46fcdb6ba5	FIX: Make summaries backfill job more resilient. (#1071 ) To quickly select backfill candidates without comparing SHAs, we compare the last summarized post to the topic's highest_post_number. However, hiding or deleting a post and adding a small action will update this column, causing the job to stall and re-generate the same summary repeatedly until someone posts a regular reply. On top of this, this is not always true for topics with `best_replies`, as this last reply isn't necessarily included. Since this is not evident at first glance and each summarization strategy picks its targets differently, I'm opting to simplify the backfill logic and how we track potential candidates. The first step is dropping `content_range`, which serves no purpose and it's there because summary caching was supposed to work differently at the beginning. So instead, I'm replacing it with a column called `highest_target_number`, which tracks `highest_post_number` for topics and could track other things like channel's `message_count` in the future. Now that we have this column when selecting every potential backfill candidate, we'll check if the summary is truly outdated by comparing the SHAs, and if it's not, we just update the column and move on	2025-01-16 09:42:53 -03:00
Kris	d15876025f	UX: disabled preseeded edit button, add description (#1038 )	2024-12-20 19:33:45 -05:00
Martin Brennan	f35db8068b	DEV: Change to use DPageSubheader (#1033 ) Previously was AdminPageSubheader until https://github.com/discourse/discourse/pull/30146	2024-12-18 17:39:31 +10:00
Krzysztof Kotlarek	04c4ff8cf0	UX: No admin header for edit personas tools or llms (#1021 ) In this PR, we added functionality to hide the admin header for edit/new actions - https://github.com/discourse/discourse/pull/30175 To make it work properly, we have to rename `show` to `edit` which is also a more accurate name.	2024-12-12 10:48:58 +11:00
Sam	47f5da7e42	FEATURE: Add AI-powered spam detection for new user posts (#1004 ) This introduces a comprehensive spam detection system that uses LLM models to automatically identify and flag potential spam posts. The system is designed to be both powerful and configurable while preventing false positives. Key Features: * Automatically scans first 3 posts from new users (TL0/TL1) * Creates dedicated AI flagging user to distinguish from system flags * Tracks false positives/negatives for quality monitoring * Supports custom instructions to fine-tune detection * Includes test interface for trying detection on any post Technical Implementation: * New database tables: - ai_spam_logs: Stores scan history and results - ai_moderation_settings: Stores LLM config and custom instructions * Rate limiting and safeguards: - Minimum 10-minute delay between rescans - Only scans significant edits (>10 char difference) - Maximum 3 scans per post - 24-hour maximum age for scannable posts * Admin UI features: - Real-time testing capabilities - 7-day statistics dashboard - Configurable LLM model selection - Custom instruction support Security and Performance: * Respects trust levels - only scans TL0/TL1 users * Skips private messages entirely * Stops scanning users after 3 successful public posts * Includes comprehensive test coverage * Maintains audit log of all scan attempts --------- Co-authored-by: Keegan George <kgeorge13@gmail.com> Co-authored-by: Martin Brennan <martin@discourse.org>	2024-12-12 09:17:25 +11:00
Keegan George	d6beac48f8	DEV: Improve explain suggestion footnote replacement (#999 ) Previously, when clicking add footnote on an explain suggestion it would replace the selected word by finding the first occurrence of the word. This results in issues when there is more than one occurrence of a word in a post. This is not trivial to solve, so this PR instead prevents incorrect text replacements by only allowing the replacement if it's unique. We use the same logic here that we use to determine if something can be fast edited. In this PR we also update tests for post helper explain suggestions. For a while, we haven't had tests here due to streaming/timing issues, we've been skipping our system specs. In this PR, we add acceptance tests to handle this which gives us improved ability to publish message bus updates in the testing environment so that it can be better tested without issues.	2024-12-04 11:41:34 -08:00
Kris	8203bdfbc9	UX: move topic summary from DMenu to DModal (#992 ) Co-authored-by: Keegan George <kgeorge13@gmail.com>	2024-12-03 13:30:15 -05:00
Keegan George	fc88bb08ab	FIX: Tag suggester is suggesting already assigned tags (#990 ) This PR fixes an issue where the tag suggester for edit title topic area was suggesting tags that are already assigned on a post. It also updates the amount of suggested tags to 7 so that there is still a decent amount of tags suggested when tags are already assigned.	2024-12-03 07:25:04 +11:00
Rafael dos Santos Silva	3828370679	DEV: Cleanup deprecations (#952 )	2024-12-02 14:18:03 -03:00
Keegan George	6b7d7c1179	REFACTOR: Helper suggestions (#914 ) This PR adds some updates to the Helper suggestions to improve its functionality and modernize some of the codebase.	2024-11-27 12:21:03 -08:00
Martin Brennan	2f7895bb91	UX: Applying more admin UI guidelines (#956 ) This commit applies further admin UI guidelines, now that they have been more fleshed out in core, to the AI admin UI: * Tools * LLMs * Personas The changes include but are not limited to: * Applying the table CSS classes, for desktop and mobile * Adding a description and learn more link for each tab * Adding an empty list placeholder with CTA using `AdminConfigAreaEmptyList` * Replacing custom headings with `AdminPageSubheader`	2024-11-27 13:34:56 +10:00
Rafael dos Santos Silva	6c25718a7f	FEATURE: Add links to filtered emotion view on emotion dashboard table (#953 )	2024-11-25 15:51:01 -03:00
Sam	0d7f353284	FEATURE: AI artifacts (#898 ) This is a significant PR that introduces AI Artifacts functionality to the discourse-ai plugin along with several other improvements. Here are the key changes: 1. AI Artifacts System: - Adds a new `AiArtifact` model and database migration - Allows creation of web artifacts with HTML, CSS, and JavaScript content - Introduces security settings (`strict`, `lax`, `disabled`) for controlling artifact execution - Implements artifact rendering in iframes with sandbox protection - New `CreateArtifact` tool for AI to generate interactive content 2. Tool System Improvements: - Adds support for partial tool calls, allowing incremental updates during generation - Better handling of tool call states and progress tracking - Improved XML tool processing with CDATA support - Fixes for tool parameter handling and duplicate invocations 3. LLM Provider Updates: - Updates for Anthropic Claude models with correct token limits - Adds support for native/XML tool modes in Gemini integration - Adds new model configurations including Llama 3.1 models - Improvements to streaming response handling 4. UI Enhancements: - New artifact viewer component with expand/collapse functionality - Security controls for artifact execution (click-to-run in strict mode) - Improved dialog and response handling - Better error management for tool execution 5. Security Improvements: - Sandbox controls for artifact execution - Public/private artifact sharing controls - Security settings to control artifact behavior - CSP and frame-options handling for artifacts 6. Technical Improvements: - Better post streaming implementation - Improved error handling in completions - Better memory management for partial tool calls - Enhanced testing coverage 7. 
Configuration: - New site settings for artifact security - Extended LLM model configurations - Additional tool configuration options This PR significantly enhances the plugin's capabilities for generating and displaying interactive content while maintaining security and providing flexible configuration options for administrators.	2024-11-19 09:22:39 +11:00
Sérgio Saquetim	9583964676	DEV: Added compatibility with the Glimmer Post Menu (#887 )	2024-11-12 15:46:17 -03:00
Keegan George	644141ff08	FIX: Regenerate summary button still shows cached summary (#903 ) This PR fixes an issue where clicking to regenerate a summary was still showing the cached summary. To resolve this we call resetSummary() to reset all the summarization related properties before creating a new request.	2024-11-07 16:01:18 -08:00
Rafael dos Santos Silva	820b506910	DEV: Hide soon to be deprecated modules settings (#872 )	2024-10-28 14:27:25 -03:00
Sam	12869f2146	FIX: testing tool was not showing rag results (#867 ) This changeset contains 4 fixes: 1. We were allowing running tests on unsaved tools, this is problematic because uploads are not yet associated or indexed leading to confusing results. We now only show the test button when tool is saved. 2. We were not properly scoping rag document fragments, this meant that personas and ai tools could get results from other unrelated tools, just to be filtered out later 3. index.search showed options as "optional" but implementation required the second option 4. When testing tools searching through document fragments was not working at all because we did not properly load the tool	2024-10-25 16:01:25 +11:00
Sam	4923837165	FIX: Llm selector / forced tools / search tool (#862 ) * FIX: Llm selector / forced tools / search tool This fixes a few issues: 1. When search was not finding any semantic results we would break the tool 2. Gemini / Anthropic models did not implement forced tools previously despite it being an API option 3. Mechanics around displaying llm selector were not right. If you disabled LLM selector server side persona PM did not work correctly. 4. Disabling native tools for anthropic model moved out of a site setting. This deliberately does not migrate cause this feature is really rare to need now, people who had it set probably did not need it. 5. Updates anthropic model names to latest release * linting * fix a couple of tests I missed * clean up conditional	2024-10-25 06:24:53 +11:00
Keegan George	9af0c2e719	UX: Improve seeded LLM edit page (#856 )	2024-10-23 13:58:27 -07:00
Sam	a1f859a415	FEATURE: improve visibility of AI usage in LLM page (#845 ) This changeset: 1. Corrects some issues with "force_default_llm" not applying 2. Expands the LLM list page to show LLM usage 3. Clarifies better what "enabling a bot" on an llm means (you get it in the selector)	2024-10-22 11:16:02 +11:00
Roman Rizzi	c7acb4a6a0	REFACTOR: Support of different summarization targets/prompts. (#835 ) * DEV: Add summary types * Refactor for different summary types * Use enum for summary types * Update lib/summarization/strategies/topic_summary.rb Co-authored-by: Penar Musaraj <pmusaraj@gmail.com> * Update lib/summarization/strategies/topic_gist.rb Co-authored-by: Penar Musaraj <pmusaraj@gmail.com> * Update lib/summarization/strategies/chat_messages.rb Co-authored-by: Penar Musaraj <pmusaraj@gmail.com> * Fix chat_messages single prompt * Small tweak to the chat summarization prompt --------- Co-authored-by: Penar Musaraj <pmusaraj@gmail.com>	2024-10-15 13:53:26 -03:00
Sam	6c4c96e83c	FEATURE: allow persona to only force tool calls on limited replies (#827 ) This introduces another configuration that allows operators to limit the amount of interactions with forced tool usage. Forced tools are very handy in initial llm interactions, but as conversation progresses they can hinder by slowing down stuff and adding confusion.	2024-10-11 07:23:42 +11:00
Rafael dos Santos Silva	95e70474fd	DEV: Skip flaky test (#829 )	2024-10-10 12:02:31 -03:00
Sam	e1a0eb6131	FEATURE: support chain halting and upload creation support (#821 ) This adds chain halting (ability to terminate llm chain in a tool) and the ability to create uploads in a tool Together this lets us integrate custom image generators into a custom tool.	2024-10-09 08:17:45 +11:00
Sam	545500b329	FEATURE: allows forced LLM tool use (#818 ) * FEATURE: allows forced LLM tool use Sometimes we need to force LLMs to use tools, for example in RAG like use cases we may want to force an unconditional search. The new framework allows the backend to force tool usage. Front end commit to follow * UI for forcing tools now works, but it does not react right * fix bugs * fix tests, this is now ready for review	2024-10-05 09:46:57 +10:00
Kris	18ecc843e5	UX: move templates to main LLM config tab, restyle (#813 ) Restructures LLM config page so it is far clearer. Also corrects bugs around adding LLMs and having LLMs not editable post addition --------- Co-authored-by: Sam Saffron <sam.saffron@gmail.com>	2024-09-30 17:15:11 +10:00
Keegan George	493d65af1f	FIX: Diff modal closing along with composer menu on mobile (#803 ) The `DiffModal` is triggered after selecting an option in the composer helper menu. After selecting an option, we should close the composer helper menu and only show the diff modal. On mobile, there was an edge-case where `this.args.close()` was causing the closing of both the `DiffModal` and the `AiComposerHelperMenu`. This PR resolves that by ensuring the menu is closed _first_ asynchronously, followed by opening the relevant modal.	2024-09-16 14:00:41 -07:00
Keegan George	9cd14b0003	DEV: Move composer AI helper to toolbar (#796 ) Previously we had moved the AI helper from the options menu to a selection menu that appears when selecting text in the composer. This had the benefit of making the AI helper a more discoverable feature. Now that some time has passed and the AI helper is more recognized, we will be moving it back to the composer toolbar. This is better because: - It is consistent with other behavior and ways of accessing tools in the composer - It has an improved mobile experience - It reduces unnecessary code and keeps things easier to migrate when we have composer V2. - It allows for easily triggering AI helper for all content by clicking the button instead of having to select everything.	2024-09-13 11:59:30 -07:00
Roman Rizzi	c4c9dc2034	FIX: Display cached summaries with our new streamer. (#792 ) Make sure the summary box is in the DOM before attempting to display a cached summary.	2024-09-03 18:45:28 -03:00
Sam	41054c4fb8	FEATURE: improve site setting search (#780 ) This improves the site setting search so it performs a somewhat fuzzy match. Previously it did not handle separators such as "space" and a term such as "min_post_length" would not find "min_first_post_length". A more liberal search algorithm makes it easier for the AI to navigate settings. * Minor fix, {{and parameter.enum parameter.enum.length}} is non-obviously broken. If parameter.enum is a tracked array it will return the object because of Ember's `and` helper implementation. This corrects an issue where enum keeps on selecting itself by mistake.	2024-08-29 16:05:38 +10:00
Keegan George	943504049c	FIX: Prevent proofreading when there is no content (#779 )	2024-08-28 12:21:34 -07:00
Sam	f148452f4c	FEATURE: single click proofreading (#769 ) Previously there was too much work proofreading text, new implementation provides a single shortcut and easy way of proofreading text. Co-authored-by: Martin Brennan <martin@discourse.org>	2024-08-26 15:43:40 +10:00
Roman Rizzi	9019e90b87	FIX: urlEditable must be true for all providers except Bedrock (#766 )	2024-08-22 11:31:28 -03:00
Keegan George	bfe3b1c3b8	FIX: Modals in composer helper menu not working (#755 )	2024-08-16 10:08:58 -07:00
Keegan George	23b88537d9	FIX: Prevent AI caption setting from showing unless all criteria is met (#753 )	2024-08-14 10:17:36 -07:00
Keegan George	f72ab12761	DEV: Clearly separate post/composer helper settings (#747 )	2024-08-12 15:40:23 -07:00
Keegan George	1d6a6c9f8f	FEATURE: Stream other post helper options (#745 )	2024-08-08 11:32:39 -07:00
Keegan George	1254d7c7d0	REFACTOR: AI Composer Helper Menu (#715 )	2024-08-06 10:57:39 -07:00
Roman Rizzi	5c196bca89	FEATURE: Track if a model can do vision in the llm_models table (#725 ) * FEATURE: Track if a model can do vision in the llm_models table * Data migration	2024-07-24 16:29:47 -03:00
Keegan George	08355ea5d8	FEATURE: Show post helper as bottom modal on mobile (#704 )	2024-07-10 11:01:05 -07:00
Martin Brennan	da6d70da8f	FEATURE: Add breadcrumbs to LLMs and Persona admin pages (#666 ) Followup to https://github.com/discourse/discourse-ai/pull/656, adding these back in with the new core component.	2024-07-10 10:56:13 +10:00
Sam	1320eed9b2	FEATURE: move summary to use llm_model (#699 ) This allows summary to use the new LLM models and migrates off API key based model selection. Claude 3.5 etc... all work now. --------- Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>	2024-07-04 10:48:18 +10:00
Keegan George	1b0ba9197c	DEV: Add summarization logic from core (#658 )	2024-07-02 08:51:59 -07:00
Sam	b863ddc94b	FEATURE: custom user defined tools (#677 ) Introduces custom AI tools functionality. 1. Why it was added: The PR adds the ability to create, manage, and use custom AI tools within the Discourse AI system. This feature allows for more flexibility and extensibility in the AI capabilities of the platform. 2. What it does: - Introduces a new `AiTool` model for storing custom AI tools - Adds CRUD (Create, Read, Update, Delete) operations for AI tools - Implements a tool runner system for executing custom tool scripts - Integrates custom tools with existing AI personas - Provides a user interface for managing custom tools in the admin panel 3. Possible use cases: - Creating custom tools for specific tasks or integrations (stock quotes, currency conversion etc...) - Allowing administrators to add new functionalities to AI assistants without modifying core code - Implementing domain-specific tools for particular communities or industries 4. Code structure: The PR introduces several new files and modifies existing ones: a. Models: - `app/models/ai_tool.rb`: Defines the AiTool model - `app/serializers/ai_custom_tool_serializer.rb`: Serializer for AI tools b. Controllers: - `app/controllers/discourse_ai/admin/ai_tools_controller.rb`: Handles CRUD operations for AI tools c. Views and Components: - New Ember.js components for tool management in the admin interface - Updates to existing AI persona management components to support custom tools d. Core functionality: - `lib/ai_bot/tool_runner.rb`: Implements the custom tool execution system - `lib/ai_bot/tools/custom.rb`: Defines the custom tool class e. Routes and configurations: - Updates to route configurations to include new AI tool management pages f. Migrations: - `db/migrate/20240618080148_create_ai_tools.rb`: Creates the ai_tools table g. 
Tests: - New test files for AI tool functionality and integration The PR integrates the custom tools system with the existing AI persona framework, allowing personas to use both built-in and custom tools. It also includes safety measures such as timeouts and HTTP request limits to prevent misuse of custom tools. Overall, this PR significantly enhances the flexibility and extensibility of the Discourse AI system by allowing administrators to create and manage custom AI tools tailored to their specific needs. Co-authored-by: Martin Brennan <martin@discourse.org>	2024-06-27 17:27:40 +10:00


121 Commits
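
The fuzzy site setting search described in commit 41054c4fb8 can be illustrated with a minimal sketch. This is not the plugin's implementation (which is not shown in this log); the function names `tokenize`, `fuzzy_match`, and `search_settings` are hypothetical, and the matching rule (every query token must appear in the setting name, in order) is an assumption chosen to reproduce the example given in the commit message.

```python
import re


def tokenize(text):
    """Split on underscores, whitespace, and hyphens; lowercase; drop empties."""
    return [t for t in re.split(r"[_\s\-]+", text.lower()) if t]


def fuzzy_match(query, setting_name):
    """Return True if every query token occurs in setting_name, in order.

    Assumed rule for illustration only: tokens may be separated by
    arbitrary extra text in the setting name.
    """
    name = setting_name.lower()
    pos = 0
    for token in tokenize(query):
        idx = name.find(token, pos)
        if idx == -1:
            return False
        pos = idx + len(token)
    return True


def search_settings(query, setting_names):
    """Filter setting names down to those that fuzzily match the query."""
    return [n for n in setting_names if fuzzy_match(query, n)]
```

With this rule, both `min_post_length` and `min post length` match `min_first_post_length`, which is the behavior the commit message says was previously missing.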