discourse-ai

Commit Graph

Author	SHA1	Message	Date
Rafael dos Santos Silva	7174fff7f0	FEATURE: Automatic embeddings backfill	2023-10-23 16:54:24 -03:00
Keegan George	76c356dba9	DEV: Make post/composer AI helper dependent on different settings	2023-10-12 14:06:42 -07:00
Keegan George	d68c586b91	DEV: Add front-end and lint	2023-10-04 15:56:20 -07:00
Rafael dos Santos Silva	cd46825057	adds explain	2023-10-04 18:42:05 -03:00
Rafael dos Santos Silva	84cc369552	FEATURE: Bge-large-en embeddings via Cloudflare Workers AI API (#241 ) * FEATURE: Bge-large-en embeddings via Cloudflare Workers AI API * forgot a file * lint	2023-10-04 13:47:51 -03:00
Discourse Translator Bot	05c256f65b	Update translations (#239 )	2023-10-04 09:54:32 +02:00
Sam	0cbf14e343	FEATURE: automation rule for triaging posts using LLM (#236 ) The new automation rule can be used to perform llm based classification and categorization of topics. You specify a system prompt (which has %%POST%% as an input), if it returns a particular piece of text then we will apply rules such as tagging, hiding, replying or categorizing. This can be used as a spam filter, a "oops you are in the wrong place" filter and so on. Co-authored-by: Joffrey JAFFEUX <j.jaffeux@gmail.com>	2023-10-03 08:55:30 +11:00
Rafael dos Santos Silva	102f47c1c4	FEATURE: Allow Anthropic inference via AWS Bedrock (#235 ) If a module LLM model is set to claude-2 and the ai_bedrock variables are all present we will use AWS Bedrock instead of Anthropic's own APIs. This is quite hacky, but will allow us to test the waters with AWS Bedrock early access with every module. This situation of "same module, completely different API" is quite far from what we had in the OpenAI/Azure separation, so it's more food for thought for when we start working on the LLM abstraction layer soon this year.	2023-10-02 12:58:36 -03:00
Sam	ed7d1f06d1	FIX: improve token counting (#234 ) We were running out of tokens under certain conditions (really long chains) Add more buffer.	2023-09-28 15:32:22 +10:00
Discourse Translator Bot	782600e64f	Update translations (#229 )	2023-09-27 11:03:11 +02:00
Sam	aa463d64f1	FEATURE: Add creative persona (#231 ) This adds a new creative persona that has access to the underlying model and no external integrations. It allows people to use Claude/GPT models in a Discourse agnostic way.	2023-09-27 10:48:38 +10:00
Keegan George	2e5a39360a	FEATURE: Create custom prompts with composer AI helper (#214 ) * DEV: Add icon support * DEV: Add basic setup for custom prompt menu * FEATURE: custom prompt backend * fix custom prompt param check * fix custom prompt replace * WIP * fix custom prompt usage * fixes * DEV: Update front-end * DEV: No more custom prompt state * DEV: Add specs * FIX: Title/Category/Tag suggestions Suggestion dropdowns broke because it `messages_with_user_input(user_input)` expects a hash now. * DEV: Apply syntax tree * DEV: Restrict custom prompts to configured groups * oops * fix tests * lint * I love tests * lint is cool tho --------- Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com>	2023-09-25 15:12:54 -03:00
Roman Rizzi	f57c1bb0f6	FEATURE: AI Helper endpoint to generate a thumbnail from text. (#224 ) We pass the text to the current LLM and ask it to generate a Stable Diffusion prompt. We'll use that to generate 4 samples, temporarily creating uploads and returning their short URLs.	2023-09-14 12:53:44 -03:00
Sam	9e94457154	FIX: Made bot more robust (#226 ) * FIX: Made bot more robust This is a collection of small fixes - Display "Searching for: ..." while searching instead of showing found 0 results. - Only allow 5 commands in lang chain - 6 feels like too much - On the 5th command stop informing the engine about functions, so it is forced to complete - Add another 30 tokens of buffer and explain why - Typo in command prompt Co-authored-by: Alan Guo Xiang Tan <gxtan1990@gmail.com>	2023-09-14 16:46:56 +10:00
Rafael dos Santos Silva	d1642533fb	FIX: Use "Related Topics" label consistently (#221 )	2023-09-12 16:23:24 -03:00
Discourse Translator Bot	0d761f4305	Update translations (#218 )	2023-09-12 15:27:58 +02:00
Rafael dos Santos Silva	2c0f535bab	FEATURE: HyDE-powered semantic search. (#136 ) * FEATURE: HyDE-powered semantic search. It relies on the new outlet added on discourse/discourse#23390 to display semantic search results in an unobtrusive way. We'll use a HyDE-backed approach for semantic search, which consists of generating a hypothetical document from the given keywords, which gets transformed into a vector and used in an asymmetric similarity topic search. This PR also reorganizes the internals to have fewer moving parts, maintaining one hierarchy of DAOish classes for vector-related operations like transformations and querying. Completions and vectors created by HyDE will remain cached on Redis for now, but we could later use Postgres instead. * Missing translation and rate limiting --------- Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>	2023-09-05 11:08:23 -03:00
Discourse Translator Bot	3d83d062a1	Update translations (#186 )	2023-09-05 15:42:46 +02:00
Sam	e3abbd9f46	FEATURE: add researcher persona (#181 ) The researcher persona has access to Google and can perform various internet research tasks. At the moment it can not read web pages, but that is under consideration	2023-09-04 12:05:27 +10:00
Sam	3f9973586e	FIX: ai_bot_allowed_groups now works with restricted visibility (#180 ) Previous to this change we relied on client side settings to determine if an end user has access to the ai bot. This meant that if a user was not aware they are a member of a group (as it is with restricted visibility ones) they would not see the bot button. All checking has now moved to the server side, and tests were added to cover.	2023-09-04 11:52:44 +10:00
Rafael dos Santos Silva	43e485cbd9	FEATURE: Additional AI suggestion options (#176 )	2023-09-01 17:10:58 -07:00
Sam	181113159b	FIX: setting explorer was exceeding token budget This refactor changes it so we only include minimal data in the system prompt, which leaves us lots of tokens for specific searches. The new search command allows us to pull in settings on demand. Descriptions are included in short search results, and names only in longer results. Also: * In dev it is important to tell when calls are made to OpenAI; this adds a console log to increase awareness around token usage * PERF: stop counting tokens so often This changes it so we only count tokens once per response. Previously each time we heard back from OpenAI we would count tokens, leading to unneeded delays * bug fix, commands may reach in for tokenizer * add logging to console for Anthropic calls as well * Update lib/shared/inference/openai_completions.rb Co-authored-by: Martin Brennan <mjrbrennan@gmail.com>	2023-09-01 11:48:51 +10:00
Sam	00d69b463e	FEATURE: new site setting explorer persona (#178 ) Also adds ai_bot_enabled_personas so admins can tweak which stock personas are enabled. The new persona has a full listing of all site settings and is able to get context for each setting. This means you can ask it to search through settings for something relevant. Security-wise there is no access to the actual configuration of settings, just to the names / description and implementation. Previously this was part of the forum helper persona, however it just clashes too much with other behaviors; isolating it makes it far more powerful. * sneaking this one in, user_emails is a non-obvious table in our structure. usually one would assume users has emails, so this clarifies it a bit better. plus it is a very common table to hit.	2023-08-31 17:02:03 +10:00
Sam	8e4347acba	DEV: rename ai_helper_add_ai_pm_to_header -> ai_bot_add_to_header (#177 ) Old name was very unclear, this setting is only used for the bot so now it follows the same convention others do	2023-08-31 14:42:28 +10:00
Sam	db19e37748	FEATURE: add initial support for personas (#172 ) This splits out a bunch of code that used to live inside bots into a dedicated concept called a Persona. This allows us to start playing with multiple personas for the bot Ships with: artist - for making images sql helper - for helping with data explorer general - for everything and anything Also includes a few fixes that make the generic LLM function implementation more robust	2023-08-30 16:15:03 +10:00
Rafael dos Santos Silva	6d69fb479e	DEV: Hide old embeddings pg setting (#169 ) * DEV: Hide old embeddings pg setting * fix yaml	2023-08-29 17:39:21 -03:00
Keegan George	4da4b5609f	FIX: Show warning when trying to generate suggestions without content (#175 )	2023-08-29 11:58:45 -07:00
Keegan George	7457feced8	FEATURE: Show suggested title prompt in new location (#171 )	2023-08-29 09:45:53 -07:00
Discourse Translator Bot	345bfed19f	Update translations (#173 )	2023-08-29 15:51:02 +02:00
Sam	b14cb864dc	FEATURE: add setting_context experimental command (#160 ) This command can be used to extract information about a discourse site setting directly from source. To operate it needs the rg binary in the container.	2023-08-29 10:43:58 +10:00
Rafael dos Santos Silva	e673b568d9	FEATURE: StableBeluga2 support for AiHelper (#162 ) * FEATURE: StableBeluga2 support for AiHelper * lint	2023-08-25 15:54:51 -03:00
Keegan George	7790313b1b	DEV: Add review menu state (#159 )	2023-08-24 17:49:24 -07:00
Keegan George	6df850d473	FEATURE: AI Helper Context Menu (#148 )	2023-08-23 10:35:40 -07:00
Discourse Translator Bot	95881fce74	Update translations (#149 )	2023-08-22 14:34:48 -03:00
Roman Rizzi	f111bcb35b	FIX: Display related topics when scrolling to the bottom of a topic. (#150 ) Besides updating the connector using the new tracking preference service interface, this PR fixes a bug where due to `ai_embeddings_semantic_related_topics_enabled` not having `client: true` the initializer never ran, and we didn't show the related topics list when scrolling to the bottom of a long topic.	2023-08-22 14:10:21 -03:00
Martin Brennan	486a130c25	DEV: Categorize plugin settings into discourse_ai (#144 ) Moving the plugin settings into a more specific category makes them easier to find in the plugin UI and removes them from the generic "Plugins" tab.	2023-08-21 14:46:34 -03:00
Sam	b4477ecdcd	FEATURE: support 16k and 32k variants for Azure GPT (#140 ) Azure requires a single HTTP endpoint per type of completion. The settings: `ai_openai_gpt35_16k_url` and `ai_openai_gpt4_32k_url` can be used now to configure the extra endpoints This amends token limit which was off a bit due to function calls and fixes a minor JS issue where we were not testing for a property	2023-08-17 11:00:11 +10:00
Sam	01f833f86e	FEATURE: optional warning attached to all AI bot conversations (#137 ) * FEATURE: optional warning attached to all AI bot conversations This commit introduces `ai_bot_enable_chat_warning` which can be used to warn people prior to starting a chat with the bot. In particular this is useful if moderators are regularly reading chat transcripts as it sets expectations early. By default this is disabled. Also: - Stops making ajax call prior to opening composer - Hides PM title when starting a bot PM Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com>	2023-08-17 06:29:58 +10:00
Discourse Translator Bot	525c8b0913	Update translations (#135 )	2023-08-15 21:25:07 +02:00
Régis Hanol	7077c31ab8	Typo in site setting's description (#132 )	2023-08-10 14:07:13 -03:00
Sam	7eedbf29e0	FIX: refine image and read command (#131 ) - Attempt to hint reading is done by sending complete:true - Do not include post_number in result unless it was sent in - Rush visual feedback when a command is run (ensure we always revise) - Include hyperlink in read command description - Stop round tripping to GPT after image generation (speeds up images by a lot) - Add a test for image command	2023-08-09 16:01:48 +10:00
Sam	958dfc360e	FEATURE: experimental read command for bot (#129 ) This command is useful for reading a topic's content. It allows us to perform critical analysis or suggest answers. Given the 8k token limit in GPT-4 I hardcoded reading to 1500 tokens, but we can follow up and allow larger windows on models that support more tokens. In local testing, even in this limited form, this can be very useful.	2023-08-09 07:19:56 +10:00
Discourse Translator Bot	b1987f279d	Update translations (#130 )	2023-08-08 15:42:39 +02:00
Rafael dos Santos Silva	eb7fff3a55	FEATURE: Add support for StableBeluga and Upstage Llama2 instruct (#126 ) * FEATURE: Add support for StableBeluga and Upstage Llama2 instruct This means we support all models in the top3 of the Open LLM Leaderboard Since some of those models have RoPE, we now have a setting so you can customize the token limit depending which model you use.	2023-08-03 15:29:30 -03:00
Rafael dos Santos Silva	8b157feea5	FEATURE: Compatibility with protected Hugging Face Endpoints (#123 ) * FEATURE: Compatibility with protected Hugging Face Endpoints	2023-08-02 17:00:00 -03:00
Sam	602bb843ea	FEATURE: add support for final stable diffusion xl model (#122 )	2023-08-02 16:53:28 -03:00
Discourse Translator Bot	c26d48e3b1	Update translations (#119 )	2023-08-01 16:05:55 +02:00
Roman Rizzi	c8de9495c8	UX: Update related-topics to follow <MoreTopics/> conventions (#118 )	2023-07-31 18:33:37 -03:00
Rafael dos Santos Silva	3e7c99de89	FEATURE: Support for locally inferred embeddings in 100 languages (#115 ) * FEATURE: Support for locally inferred embeddings in 100 languages * add table	2023-07-27 15:50:03 -03:00
Rafael dos Santos Silva	b25daed60b	FEATURE: Llama2 for summarization (#116 )	2023-07-27 13:55:32 -03:00

98 Commits