discourse-ai

mirror of https://github.com/discourse/discourse-ai.git synced 2025-02-18 17:34:52 +00:00

Author	SHA1	Message	Date
Loïc Guitaut	6ae4218a96	DEV: Fix new Rubocop offenses	2024-03-06 15:23:29 +01:00
Sam	8b382d6098	FEATURE: support for claude opus and sonnet (#508 ) This provides new support for messages API from Claude. It is required for latest model access. Also corrects implementation of function calls. * Fix message interleving * fix broken spec * add new models to automation	2024-03-06 06:04:37 +11:00
Sam	b7a96e3bcb	FIX: avoid all bot feedback loops (#507 ) We need to ensure that under no circumstances feedback loops between bots will emerge cause this can eat up a lot of tokens	2024-03-05 10:02:49 +11:00
Keegan George	cee1b3d275	FIX: Backspace in composer custom prompt closes menu (#505 )	2024-03-04 13:33:31 -08:00
Sam	77cf9e2cff	FIX: system persona non English save, missing bot pms - FIX: only update system attributes when updating system persona - FIX: update participant count by hand so bot messages show in inbox Co-authored-by: Joffrey JAFFEUX <j.jaffeux@gmail.com>	2024-03-04 09:56:59 +11:00
Sam	c02794cf2e	FIX: support multiple tool calls (#502 ) * FIX: support multiple tool calls Prior to this change we had a hard limit of 1 tool call per llm round trip. This meant you could not google multiple things at once or perform searches across two tools. Also: - Hint when Google stops working - Log topic_id / post_id when performing completions * Also track id for title	2024-03-02 07:53:21 +11:00
Kris	b72ee805b6	DEV: update caption test for core interaction change (#503 )	2024-03-01 14:58:33 -05:00
Sam	59bab2bba3	FIX: stream messages when directly PMing a persona (#500 ) previous to this fix we did not consider personas a bot in the front end	2024-03-01 07:53:42 +11:00
Sam	9fb1430e40	FIX: support spaces within arguments for Open AI (#499 ) Previous to this fix if a tool call ever streamed a SPACE alone, we would eat it and ignore it, breaking params Also fixes some tests to ensure they are actually called :)	2024-02-29 12:47:34 +11:00
Rafael dos Santos Silva	1b72a00d2c	FEATURE: Option for AI triage to send a post to the review queue (#498 ) Option for AI triage to send a post to the review queue	2024-02-29 12:33:28 +11:00
Keegan George	6a30b06a55	DEV: Cancel popup should abort request (#497 )	2024-02-28 13:32:45 -08:00
Sam	484fd1435b	DEV: improve internal design of ai persona and bug fix (#495 ) * DEV: improve internal design of ai persona and bug fix - Fixes bug where OpenAI could not describe images - Fixes bug where mentionable personas could not be mentioned unless overarching bot was enabled - Improves internal design of playground and bot to allow better for non "bot" users - Allow PMs directly to persona users (previously bot user would also have to be in PM) - Simplify internal code Co-authored-by: Martin Brennan <martin@discourse.org>	2024-02-28 16:46:32 +11:00
Sam	d036f3fb8e	FEATURE: AI helper support in non English languages (#489 ) * FEATURE: AI helper support in non English languages This attempts some prompt engineering to coerce AI helper to answer in the appropriate language. Note mileage will vary, in testing GPT-4 produces the best results GPT-3.5 can return OKish results. * Extend non english support for GPT-4V image caption * Update db/fixtures/ai_helper/603_completion_prompts.rb --------- Co-authored-by: Rafael Silva <xfalcox@gmail.com>	2024-02-27 16:31:51 -03:00
Sam	aabff87501	FIX: image generation in gemini was broken (#490 ) We need to inject blank model answers after tool calls if absent otherwise model will reject it.	2024-02-27 18:24:30 +11:00
Keegan George	2c7d34ff1f	DEV: Image caption system specs (#487 )	2024-02-26 12:22:09 -08:00
Roman Rizzi	94ba0dadc2	SECURITY: Place a SSRF protection when calling services from the plugin. (#485 ) The Faraday adapter and `FinalDestionation::HTTP` will protect us from admin-initiated SSRF attacks when interacting with the external services powering this plugin features.:	2024-02-21 17:14:50 -03:00
Keegan George	97f3cba603	DEV: Add attribution to AI captioned images (#483 )	2024-02-21 10:10:22 -08:00
Sam	becbe01f68	FIX: unable to share conversations with persona user (#479 ) Persona users are still bots, but we were not properly accounting for it and share icon was not showing up. This depends on a core change that adds .topic to transformed posts	2024-02-20 16:16:23 +11:00
Martin Brennan	0c1aad7850	DEV: Cleanup caption endpoint and account for secure uploads (#478 ) Utilizes the check for secure upload permissions from core PR https://github.com/discourse/discourse/pull/25758 and cleans up controller codes and spec code to reuse existing code and better reflect reality.	2024-02-19 23:43:39 -03:00
Keegan George	a9b2d6a30a	FEATURE: AI image caption (#470 ) This PR adds a new feature where you can generate captions for images in the composer using AI. --------- Co-authored-by: Rafael Silva <xfalcox@gmail.com>	2024-02-19 14:56:28 -03:00
Sam	1f74a77e17	DEV: correct flaky spec (#475 ) We were not properly expiring prompt cache	2024-02-19 15:21:55 +11:00
Sam	0fb87b00e2	FEATURE: new Discourse Helper persona (#473 ) This persona searches Discourse Meta for help with Discourse and points users at relevant posts. It is somewhat similar to using "Forum Helper" on meta, with the notable difference that we can not lean on semantic search so using some prompt engineering we try to keep it simple.	2024-02-19 14:52:12 +11:00
Keegan George	d66915ecc1	DEV: Make prompts available on `CurrentUserSerializer` (#472 )	2024-02-16 10:57:14 -08:00
Sam	3a8d95f6b2	FEATURE: mentionable personas and random picker tool, context limits (#466 ) 1. Personas are now optionally mentionable, meaning that you can mention them either from public topics or PMs - Mentioning from PMs helps "switch" persona mid conversation, meaning if you want to look up sites setting you can invoke the site setting bot, or if you want to generate an image you can invoke dall e - Mentioning outside of PMs allows you to inject a bot reply in a topic trivially - We also add the support for max_context_posts this allow you to limit the amount of context you feed in, which can help control costs 2. Add support for a "random picker" tool that can be used to pick random numbers 3. Clean up routing ai_personas -> ai-personas 4. Add Max Context Posts so users can control how much history a persona can consume (this is important for mentionable personas) Co-authored-by: Martin Brennan <martin@discourse.org>	2024-02-15 16:37:59 +11:00
Rafael dos Santos Silva	0dba6623a0	FIX: Better AI chat thread titles (#467 ) * FIX: Better AI chat thread titles - Fix quote removal when multi-line - Use XML tags for better LLM output parsing - Use stop_sequences for faster and less wasteful LLM calls - Adds truncation as the last line of defense	2024-02-09 14:49:28 -03:00
Rafael dos Santos Silva	bccb7efdd6	FIX: Use a dedicated prompt for thread titles (#464 )	2024-02-07 15:05:50 -03:00
Sam	ba3c3951cf	FIX: typo causing text_embedding_3_large to fail (#460 )	2024-02-05 11:16:36 +11:00
Sam	a3c827efcc	FEATURE: allow personas to supply top_p and temperature params (#459 ) * FEATURE: allow personas to supply top_p and temperature params Code assistance generally are more focused at a lower temperature This amends it so SQL Helper runs at 0.2 temperature vs the more common default across LLMs of 1.0. Reduced temperature leads to more focused, concise and predictable answers for the SQL Helper * fix tests * This is not perfect, but far better than what we do today Instead of fishing for 1. Draft sequence 2. Draft body We skip (2), this means the composer "only" needs 1 http request to open, we also want to eliminate (1) but it is a bit of a trickier core change, may figure out how to pull it off (defer it to first draft save) Value of bot drafts < value of opening bot conversations really fast	2024-02-03 07:09:34 +11:00
Keegan George	944fd6569c	DEV: Add granular control for AI composer helper features (#458 )	2024-02-01 14:58:04 -08:00
Roman Rizzi	fba9c1bf2c	UX: Re-introduce embedding settings validations (#457 ) * Revert "Revert "UX: Validate embeddings settings (#455)" (#456)" This reverts commit 392e2e8aef7d5b0d988b3c3bc5cc19f1d83c4491. * Resstore previous default	2024-02-01 16:54:09 -03:00
Roman Rizzi	392e2e8aef	Revert "UX: Validate embeddings settings (#455 )" (#456 ) This reverts commit 85fca89e011933a0479abaf4bf0945983fb948b8.	2024-02-01 14:06:51 -03:00
Roman Rizzi	85fca89e01	UX: Validate embeddings settings (#455 )	2024-02-01 13:05:38 -03:00
Sam	dcafc8032f	FIX: improve embedding generation (#452 ) 1. on failure we were queuing a job to generate embeddings, it had the wrong params. This is both fixed and covered in a test. 2. backfill embedding in the order of bumped_at, so newest content is embedded first, cover with a test 3. add a safeguard for hidden site setting that only allows batches of 50k in an embedding job run Previously old embeddings were updated in a random order, this changes it so we update in a consistent order	2024-01-31 10:38:47 -03:00
Sam	ab7e9e31aa	FEATURE: allow excluding tags and categories from LLM report (#447 ) Also - Better diagnostics, output model being used - Prompt LLM that true content is being injected in <context> tag	2024-01-30 15:55:05 +11:00
Roman Rizzi	bae71eb047	FIX: Include provider in automation models (#446 )	2024-01-29 18:07:29 -03:00
Roman Rizzi	0634b85a81	UX: Validations to LLM-backed features (except AI Bot) (#436 ) * UX: Validations to Llm-backed features (except AI Bot) This change is part of an ongoing effort to prevent enabling a broken feature due to lack of configuration. We also want to explicit which provider we are going to use. For example, Claude models are available through AWS Bedrock and Anthropic, but the configuration differs. Validations are: * You must choose a model before enabling the feature. * You must turn off the feature before setting the model to blank. * You must configure each model settings before being able to select it. * Add provider name to summarization options * vLLM can technically support same models as HF * Check we can talk to the selected model * Check for Bedrock instead of anthropic as a site could have both creds setup	2024-01-29 16:04:25 -03:00
Sam	b2b01185f2	FEATURE: add support for new OpenAI embedding models (#445 ) * FEATURE: add support for new OpenAI embedding models This adds support for just released text_embedding_3_small and large Note, we have not yet implemented truncation support which is a new API feature. (triggered using dimensions) * Tiny side fix, recalc bots when ai is enabled or disabled * FIX: downsample to 2000 items per vector which is a pgvector limitation	2024-01-29 13:24:30 -03:00
Rafael dos Santos Silva	04bc402aae	FEATURE: Setting to control per post embeddings (#439 ) * FEATURE: Setting to control per post embeddings	2024-01-23 22:09:27 -03:00
Jarek Radosz	4b4aedb50f	DEV: Use the new controller/period component for the dashboard (#435 )	2024-01-19 13:27:33 +01:00
Jarek Radosz	5802cd1a0c	DEV: Fix various typos (#434 )	2024-01-19 12:51:26 +01:00
Roman Rizzi	5bdf3dc1f4	DEV: Stop using shared_examples for endpoint specs (#430 )	2024-01-17 15:08:49 -03:00
Gerhard Schlager	8eb1e851fc	DEV: Spec didn't work correctly with translations (#429 )	2024-01-16 16:28:24 +01:00
Sam	05d8b021f1	FIX: scrub invalid prompts when truncating (#426 ) When you trim a prompt we never want to have a state where there is a "tool" reply without a corresponding tool call, it makes no sense Also - GPT-4-Turbo is 128k, fix that - Claude was not preserving username in prompt - We were throwing away unicode usernames instead of adding to message	2024-01-16 13:48:00 +11:00
Roman Rizzi	ff4da6ace8	FIX: Clean unicode usernames when adding messages through prompt's contrstuctor (#425 )	2024-01-15 12:01:40 -03:00
Ted Johansson	37e6ac169e	DEV: Update test setup to work with auto groups (#424 ) We're updating core to change TL based access settings to be group based. This requires some updates of tests to work correctly. (The existing test setup gives false positives.)	2024-01-15 20:18:56 +08:00
Sam	825f01cfb2	FEATURE: even smoother streaming (#420 ) Account properly for function calls, don't stream through <details> blocks - Rush cooked content back to client - Wait longer (up to 60 seconds) before giving up on streaming - Clean up message bus channels so we don't have leftover data - Make ai streamer much more reusable and much easier to read - If buffer grows quickly, rush update so you are not artificially waiting - Refine prompt interface - Fix lost system message when prompt gets long	2024-01-15 18:51:14 +11:00
Jarek Radosz	6b8a57d957	DEV: Update linting (#423 ) Co-authored-by: Keegan George <kgeorge13@gmail.com>	2024-01-13 00:28:06 +01:00
Roman Rizzi	04eae76f68	REFACTOR: Represent generic prompts with an Object. (#416 ) * REFACTOR: Represent generic prompts with an Object. * Adds a bit more validation for clarity * Rewrite bot title prompt and fix quirk handling --------- Co-authored-by: Sam Saffron <sam.saffron@gmail.com>	2024-01-12 14:36:44 -03:00
Rafael dos Santos Silva	3be76ebd7a	FEATURE: Move the default embeddings model to bge-large-en (#417 )	2024-01-11 14:16:25 -03:00
Sam	8df966e9c5	FEATURE: smooth streaming of AI responses on the client (#413 ) This PR introduces 3 things: 1. Fake bot that can be used on local so you can test LLMs, to enable on dev use: SiteSetting.ai_bot_enabled_chat_bots = "fake" 2. More elegant smooth streaming of progress on LLM completion This leans on JavaScript to buffer and trickle llm results through. It also amends it so the progress dot is much more consistently rendered 3. It fixes the Claude dialect Claude needs newlines exactly at the right spot, amended so it is happy --------- Co-authored-by: Martin Brennan <martin@discourse.org>	2024-01-11 15:56:40 +11:00

1 2 3 4 5

250 Commits