discourse-ai

Commit Graph

Author	SHA1	Message	Date
David Taylor	890b85bff3	DEV: Update icons for FA6 (#1074 )	2025-01-17 10:12:57 +00:00
Roman Rizzi	4784e7fe43	FIX: Set default for existing records. (#1073 ) We'll later copy the correct value from content_range. 1 should be the min highest post number a topic has.	2025-01-16 10:38:53 -03:00
Roman Rizzi	46fcdb6ba5	FIX: Make summaries backfill job more resilient. (#1071 ) To quickly select backfill candidates without comparing SHAs, we compare the last summarized post to the topic's highest_post_number. However, hiding or deleting a post and adding a small action will update this column, causing the job to stall and re-generate the same summary repeatedly until someone posts a regular reply. On top of this, this is not always true for topics with `best_replies`, as this last reply isn't necessarily included. Since this is not evident at first glance and each summarization strategy picks its targets differently, I'm opting to simplify the backfill logic and how we track potential candidates. The first step is dropping `content_range`, which serves no purpose and it's there because summary caching was supposed to work differently at the beginning. So instead, I'm replacing it with a column called `highest_target_number`, which tracks `highest_post_number` for topics and could track other things like channel's `message_count` in the future. Now that we have this column when selecting every potential backfill candidate, we'll check if the summary is truly outdated by comparing the SHAs, and if it's not, we just update the column and move on	2025-01-16 09:42:53 -03:00
Rafael dos Santos Silva	f9aa2de413	FIX: AWS Bedrock non-streaming calls response log (#1072 )	2025-01-15 18:51:25 -03:00
Sam	81b952d56e	FIX: only hide posts detected explicitly as spam (#1070 ) When enabling spam scanner it there may be old unscanned posts this can create a risky situation where spam scanner operates on legit posts during false positives To keep this a lot safer we no longer try to hide old stuff by the spammers.	2025-01-15 16:50:41 +11:00
Natalie Tay	c881f8b361	DEV: Add rake task to send topics or posts to spam scanner (#1059 )	2025-01-15 11:48:57 +08:00
Rafael dos Santos Silva	92f122c54d	SECURITY: Fix XSS on Shared AI Conversations local Onebox (#1069 )	2025-01-14 18:05:37 -03:00
Roman Rizzi	cd03874b4d	FIX: Missing table check in post_migration (#1068 )	2025-01-14 17:33:01 -03:00
Roman Rizzi	65456c8b30	DEV: Migration to remove old embeddings tables~ (#1067 ) * DEV: Migration to remove old embeddings tables~ * Check for table existence	2025-01-14 17:13:34 -03:00
Roman Rizzi	c4d2b7de1d	PERF: Optimize backfill query to prevent statement timeouts (#1066 )	2025-01-14 15:39:19 -03:00
Roman Rizzi	6721c6751d	FIX: Do batches for backfilling huge embeddings tables (#1065 )	2025-01-14 14:42:40 -03:00
Keegan George	bbae790c2b	FIX: Composer helper not appearing on tablets (#1064 ) This update fixes an issue when the composer helper menu was not being shown on tablets in desktop mode. Updating the `z-index` to use the modal-dialog case is more appropriate here.	2025-01-14 09:35:31 -08:00
Roman Rizzi	356ea77201	FIX: Split backfill into separate migrations to use independent transactions (#1063 )	2025-01-14 13:30:52 -03:00
Roman Rizzi	09ca123757	FIX: Split statements to avoid timeout (#1062 )	2025-01-14 12:54:18 -03:00
Discourse Translator Bot	c029bc8979	Update translations (#1060 )	2025-01-14 16:20:00 +01:00
Roman Rizzi	65bbcd71fc	DEV: Embedding tables' model_id has to be a bigint (#1058 ) * DEV: Embedding tables' model_id has to be a bigint * Drop old search_bit indexes * copy rag fragment embeddings created during deploy window	2025-01-14 10:53:06 -03:00
Sam	d07cf51653	FEATURE: llm quotas (#1047 ) Adds a comprehensive quota management system for LLM models that allows: - Setting per-group (applied per user in the group) token and usage limits with configurable durations - Tracking and enforcing token/usage limits across user groups - Quota reset periods (hourly, daily, weekly, or custom) - Admin UI for managing quotas with real-time updates This system provides granular control over LLM API usage by allowing admins to define limits on both total tokens and number of requests per group. Supports multiple concurrent quotas per model and automatically handles quota resets. Co-authored-by: Keegan George <kgeorge13@gmail.com>	2025-01-14 15:54:09 +11:00
Sam	20612fde52	FEATURE: add the ability to disable streaming on an Open AI LLM Disabling streaming is required for models such o1 that do not have streaming enabled yet It is good to carry this feature around in case various apis decide not to support streaming endpoints and Discourse AI can continue to work just as it did before. Also: fixes issue where sharing artifacts would miss viewport leading to tiny artifacts on mobile	2025-01-13 17:01:01 +11:00
Jarek Radosz	7e9c0dc076	FIX: Invalid locale yaml (#1057 )	2025-01-11 23:31:33 +01:00
Mark VanLandingham	2d1ce01320	DEV: Add no-results className to full page search toggle (#1055 )	2025-01-10 10:59:18 -06:00
Keegan George	b24669c810	DEV: Add structure for errors in spam (#1054 ) This update adds some structure for handling errors in the spam config while also handling a specific error related to the spam scanning user not being an admin account.	2025-01-09 09:17:06 -08:00
Keegan George	24b69bf840	FIX: Update spam controller action should consider seeded LLM properly (#1053 ) The seeded LLM setting: `SiteSetting.ai_spam_detection_model_allowed_seeded_models` returns a _string_ with IDs separated by pipes. running `_map` on it will return an array with strings. We were previously checking for the id with custom prefix identifier, but instead we should be checking the stringified ID.	2025-01-08 13:41:25 -08:00
Guhyoun Nam	404092a68c	DEV: Add appEvents trigger when Ai search results toggled (#1052 ) This PR adds appEvents triggers when Ai search results are toggled.	2025-01-08 12:17:25 -06:00
Mark VanLandingham	327adbde29	UX: Full page search -- always show tooltip & add msg (#1051 )	2025-01-08 09:05:30 -06:00
Kris	749af40fad	UX: close summary modal on click outside (#1050 )	2025-01-07 11:24:27 -05:00
Discourse Translator Bot	61758ff8a6	Update translations (#1040 )	2025-01-03 14:01:41 +01:00
Mark VanLandingham	b6cefd10fa	DEV: Move semantic search from connector to component (#1048 )	2025-01-02 12:32:49 -06:00
Sam	11d0f60f1e	FEATURE: smart date support for AI helper (#1044 ) * FEATURE: smart date support for AI helper This feature allows conversion of human typed in dates and times to smart "Discourse" timezone friendly dates. * fix specs and lint * lint * address feedback * add specs	2024-12-31 08:04:25 +11:00
Sam	f9f89adac5	FIX: keep track of silence reason when spam detection flags user (#1046 ) Previously reason was blank for silencing user	2024-12-27 17:47:16 +11:00
Keegan George	b480f13a0f	FIX: Prevent LLM enumerator from erroring when spam enabled (#1045 ) This PR fixes an issue where LLM enumerator would error out when `SiteSetting.ai_spam_detection = true` but there was no `AiModerationSetting.spam` present. Typically, we add an `LlmDependencyValidator` for the setting itself, however, since Spam is unique in that it has it's model set in `AiModerationSetting` instead of a `SiteSetting`, we'll add a simple check here to prevent erroring out.	2024-12-27 09:12:29 +11:00
Sam	47ecf86aa1	FIX: embedding validation (#1043 )	2024-12-24 09:37:23 +11:00
Rafael dos Santos Silva	792df58fbc	FIX: AI Helper category / tag suggestion when user does not categories muted (#1042 )	2024-12-23 15:58:26 -03:00
Roman Rizzi	ceac6e5efb	FIX: Embeddings validator test needs to use the new Vector class. (#1041 )	2024-12-23 14:19:22 -03:00
Keegan George	bdb8f1d5e0	FIX: Custom prefix causing allowed seeded LLMs not to be shown (#1039 ) * FIX: Custom prefix causing allowed seeded LLMs not to be shown * DEV: update spec * not `_map` so should be string not array	2024-12-23 16:42:26 +11:00
Kris	d15876025f	UX: disabled preseeded edit button, add description (#1038 )	2024-12-20 19:33:45 -05:00
Rafael dos Santos Silva	7607477ff9	FIX: Cloudflare Workers AI embeddings (#1037 ) Regressed on `534b0df`	2024-12-20 17:45:27 -03:00
Keegan George	059b3fabb8	DEV: Unreachable LLM error shouldn't prevent setting (#1036 ) Previously we had the behaviour for model settings so that when you try and set a model, it runs a test and returns an error if it can't run the test successfully. The error then prevents you from setting the site setting. This results in some issues when we try and automate things. This PR updates that so that the test runs and discreetly logs the changes, but doesn't prevent the setting from being set. Instead we rely on "run test" in the LLM config along with ProblemChecks to catch issues.	2024-12-20 11:52:11 -08:00
Sam	6a7a45fd4f	FIX: properly spin down unused streamer threads (#1035 ) Previous version was prone to the bug: https://github.com/ruby-concurrency/concurrent-ruby/issues/1075 This is particularly bad cause we could have a DB connection attached to the thread and we never clear it up, so after N hours this could start exhibiting weird connection issues.	2024-12-20 12:09:42 +11:00
Kris	ac705b694b	UX: minor improvements to LLM page and admin tables (#1034 )	2024-12-19 18:14:22 -05:00
Discourse Translator Bot	a4033e2af9	Update translations (#1032 )	2024-12-18 15:19:47 +01:00
Martin Brennan	f35db8068b	DEV: Change to use DPageSubheader (#1033 ) Previously was AdminPageSubheader until https://github.com/discourse/discourse/pull/30146	2024-12-18 17:39:31 +10:00
Sam	fae2d5ff2c	FEATURE: link correctly to filters to assist in debugging spam (#1031 ) - Add spam_score_type to AiSpamSerializer for better integration with reviewables. - Introduce a custom filter for detecting AI spam false negatives in moderation workflows. - Refactor spam report generation to improve identification of false negatives. - Add tests to verify the custom filter and its behavior. - Introduce links for all spam counts in report	2024-12-17 11:02:18 +11:00
Keegan George	90ce942108	FEATURE: Add periodic problem checks for each LLM in use (#1020 ) This feature adds a periodic problem check which periodically checks for issues with LLMs that are in use. Periodically, we will run a test to see if the in use LLMs are still operational. If it is not, the LLM with the problem is surfaced to the admin so they can easily go and update the configuration.	2024-12-16 15:00:05 -08:00
Mark VanLandingham	24b107881a	FEATURE: Unavailable state for semantic search when sort is not Relevant (#1030 ) This commit adds an "unavailable" state for the AI semantic search toggle. Currently the AI toggle disappears when the sort by is anything but Relevance which makes the UI confusing for users looking for AI results. This should help!	2024-12-16 14:30:11 -06:00
Roman Rizzi	534b0df391	REFACTOR: Separation of concerns for embedding generation. (#1027 ) In a previous refactor, we moved the responsibility of querying and storing embeddings into the `Schema` class. Now, it's time for embedding generation. The motivation behind these changes is to isolate vector characteristics in simple objects to later replace them with a DB-backed version, similar to what we did with LLM configs.	2024-12-16 09:55:39 -03:00
Martin Brennan	222e2cf4f9	UX: Use new DStatTiles reusable component from core (#1025 ) For the Spam and Usage tabs in admin	2024-12-16 16:48:46 +10:00
Roman Rizzi	94b85ece80	FIX: Make sure gists are atleast five minutes old before updating them (#1029 ) * FIX: Make sure gists are atleast five minutes old before updating them * Update app/jobs/regular/fast_track_topic_gist.rb Co-authored-by: Keegan George <kgeorge13@gmail.com> --------- Co-authored-by: Keegan George <kgeorge13@gmail.com>	2024-12-13 19:36:34 -03:00
Roman Rizzi	1c40a698ca	FIX: get strategy version through vector_rep (#1028 )	2024-12-13 18:49:18 -03:00
David Taylor	fae1fbc796	DEV: Correct i18n lint violation (#1026 )	2024-12-13 16:01:04 +00:00
Roman Rizzi	eae527f99d	REFACTOR: A Simpler way of interacting with embeddings tables. (#1023 ) * REFACTOR: A Simpler way of interacting with embeddings' tables. This change adds a new abstraction called `Schema`, which acts as a repository that supports the same DB features `VectorRepresentation::Base` has, with the exception that removes the need to have duplicated methods per embeddings table. It is also a bit more flexible when performing a similarity search because you can pass it a block that gives you access to the builder, allowing you to add multiple joins/where conditions.	2024-12-13 10:15:21 -03:00

1 2 3 4 5 ...

1036 Commits All Branches Search

1036 Commits

All Branches