discourse-ai

Commit Graph

Author	SHA1	Message	Date
Sam	d75e3ca82b	FEATURE: include tag and category context in search (#217 ) Previous to this we just included title/body.. tags and category structure can be very critical for decision making.	2023-09-12 16:09:28 +10:00
Sam	b0310f90d3	FEATURE: add tags and categories to read context (#215 ) Note, we perform permission checks on tag list against anon to ensure we do not disclose information about private tags to the llm which could get extracted.	2023-09-12 11:06:55 +10:00
Sam	615eb8b440	FEATURE: add semantic search with hyde bot (#210 ) In specific scenarios (no special filters or limits) we will also always include 5 semantic results (at least) with every query. This effectively means that all very wide queries will always return 20 results, regardless of how complex they are. Also: FIX: embedding backfill rake task not working We renamed internals, this corrects the implementation	2023-09-07 13:25:26 +10:00
Sam	38af2ca63e	FIX: cut completion short after function call is found (#182 ) Previous to this change we would keep completing and throw away result	2023-09-05 10:37:58 +10:00
Sam	e3abbd9f46	FEATURE: add researcher persona (#181 ) The researcher persona has access to Google and can perform various internet research tasks. At the moment it can not read web pages, but that is under consideration	2023-09-04 12:05:27 +10:00
Sam	181113159b	FIX: setting explorer was exceeding token budget This refactor changes it so we only include minimal data in the system prompt which leaves us lots of tokens for specific searches The new search command allows us to pull in settings on demand Descriptions are include in short search results, and names only in longer results Also: * In dev it is important to tell when calls are made to open ai this adds a console log to increase awareness around token usage * PERF: stop counting tokens so often This changes it so we only count tokens once per response Previously each time we heard back from open ai we would count tokens, leading to uneeded delays * bug fix, commands may reach in for tokenizer * add logging to console for anthropic calls as well * Update lib/shared/inference/openai_completions.rb Co-authored-by: Martin Brennan <mjrbrennan@gmail.com>	2023-09-01 11:48:51 +10:00
Sam	00d69b463e	FEATURE: new site setting explorer persona (#178 ) Also adds ai_bot_enabled_personas so admins can tweak which stock personas are enabled. The new persona has a full listing of all site settings and is able to get context for each setting. This means you can ask it to search through settings for something relevant. Security wise there is no access to actual configuration of settings just to the names / description and implementation. Previously this was part of the forum helper persona however it just clashes too much with other behaviors, isolating it makes it far more powerful. * sneaking this one in, user_emails is a non obvious table in our structure. usually one would assume users has emails so the clarifies a bit better. plus it is a very common table to hit.	2023-08-31 17:02:03 +10:00
Sam	db19e37748	FEATURE: add initial support for personas (#172 ) This splits out a bunch of code that used to live inside bots into a dedicated concept called a Persona. This allows us to start playing with multiple personas for the bot Ships with: artist - for making images sql helper - for helping with data explorer general - for everything and anything Also includes a few fixes that make the generic LLM function implementation more robust	2023-08-30 16:15:03 +10:00
Sam	8fdb88604f	FIX: trim first space when getting a reply from anthropic (#164 ) Anthropic loves sending a pointless leading space with completions this throws off the command framework.	2023-08-29 10:57:36 +10:00
Sam	b14cb864dc	FEATURE: add setting_context experimental command (#160 ) This command can be used to extract information about a discourse site setting directly from source. To operate it needs the rg binary in the container.	2023-08-29 10:43:58 +10:00
Sam	7d943be7b2	FIX: automatic bot titles missing sometime (#151 ) This fixes 2 big issues: 1. No matter how hard you try, grounding anthropic title prompt is just too hard. This works around by only looking at the last sentence it returns and treating as title 2. Non English locales would be stuck with "generic" title, this ensures every bot message gets a title, using a custom field to track Also, slightly tunes some anthropic prompts.	2023-08-24 07:20:24 +10:00
Sam	f0e1c72aa7	FEATURE: implement command framework for non Open AI (#147 ) Open AI support function calling, this has a very specific shape that other LLMs have not quite adopted. This simulates a command framework using system prompts on LLMs that are not open AI. Features include: - Smart system prompt to steer the LLM - Parameter validation (we ensure all the params are specified correctly) This is being tested on Anthropic at the moment and intial results are promising.	2023-08-23 07:49:36 +10:00
Sam	20c1f2d788	FEATURE: basic progress for image generation (#133 ) previously you would have to wait quite a while to see the prompt this implements a very basic implementation of progress so you can see the API is working. Also: - Fix google progress. - Handle the incredibly rare, zero results from google. - Simplify command so it is less error prone - replace invoke and attache results with a invoke - ensure invoke can only ever be run once - pass in all the information a command needs in constructor - use new pattern throughout - test invocation in isolation	2023-08-14 16:30:12 +10:00
Sam	7eedbf29e0	FIX: refine image and read command (#131 ) - Attempt to hint reading is done by sending complete:true - Do not include post_number in result unless it was sent in - Rush visual feedback when a command is run (ensure we always revise) - Include hyperlink in read command description - Stop round tripping to GPT after image generation (speeds up images by a lot) - Add a test for image command	2023-08-09 16:01:48 +10:00
Sam	958dfc360e	FEATURE: experimental read command for bot (#129 ) This command is useful for reading a topics content. It allows us to perform critical analysis or suggest answers. Given 8k token limit in GPT-4 I hardcoded reading to 1500 tokens, but we can follow up and allow larger windows on models that support more tokens. On local testing even in this limited form this can be very useful.	2023-08-09 07:19:56 +10:00
Sam	03e689deb7	FIX: Google command was including full payload (#128 ) * FIX: Google command was including full payload Additionally there was no truncating happening meaning you could blow token budget easily on a single search. This made Google search mostly useless and it would mean that after using Google we would revert to a clean slate which is very confusing. * no need for nil there	2023-08-08 15:41:57 +10:00
Sam	7edb57c005	DEV: simplify command framework (#125 ) The command framework had some confusing dispatching where it would dispatch JSON blobs, this meant there was lots of parsing required in every command The refactor handles transforming the args prior to dispatch which makes consuming far simpler This is also general prep to supporting some basic command framework in other llms.	2023-08-04 09:37:58 +10:00
Sam	4b0c077ce5	FEATURE: port to use claude-2 for chat bot (#114 ) Claude 1 costs the same and is less good than Claude 2. Make use of Claude 2 in all spots ... This also fixes streaming so it uses the far more efficient streaming protocol.	2023-07-27 11:24:44 +10:00
Roman Rizzi	1b568f2391	FIX: Claude's max_tookens_to_sample is a required field (#97 )	2023-06-27 14:42:33 -03:00
Sam	9390fba768	FIX: adjust token limits to account for functions (#96 ) Reduce maximum replies to 2500 tokens and make them even for both GPT-3.5 and 4 Account for 400+ tokens in function definitions (this was unaccounted for)	2023-06-23 10:02:04 +10:00
Sam	a028309cbd	FEATURE: add ai_bot_enabled_chat commands and tune search (#94 ) * FEATURE: add ai_bot_enabled_chat commands and tune search This allows admins to disable/enable GPT command integrations. Also hones search results which were looping cause the result did not denote the failure properly (it lost context) * include more context for google command include more context for time command * type	2023-06-21 17:10:30 +10:00
Sam	30778d8af8	FIX: avoid storing corrupt prompts (#92 ) ``` prompt << build_message(bot_user.username, reply) ``` Would store a "cooked" prompt which is invalid, instead just store the raw values which are later passed to build_message Additionally: 1. Disable summary command which needs honing 2. Stop storing decorations (searched for X) in prompt which leads to straying 3. Ship username directly to model, avoiding "user: content" in prompts. This was causing GPT to stray	2023-06-20 15:44:03 +10:00
Sam	70c158cae1	FEATURE: add full bot support for GPT 3.5 (#87 ) Given latest GPT 3.5 16k which is both better steered and supports functions we can now support rich bot integration. Clunky system message based steering is removed and instead we use the function framework provided by Open AI	2023-06-20 08:45:31 +10:00
Sam	081231a6eb	FIX: support multiple command executions (#85 ) Previous to this change we were chaining stuff too late and would execute commands serially leading to very unexpected results This corrects this and allows us to run stuff like: > Search google 3/4 times on various permutations of QUERY and answer this question. We limit at 5 commands to ensure there are not pathological user cases where you lean on the LLM to flood us with results.	2023-06-06 07:09:33 +10:00
Sam	840968630e	FEATURE: disable smart commands on Claude and GPT 3.5 (#84 ) For the time being smart commands only work consistently on GPT 4. Avoid using any smart commands on the earlier models. Additionally adds better error handling to Claude which sometimes streams partial json and slightly tunes the search command.	2023-06-01 09:10:33 +10:00
Sam	96d521198b	FIX: missing localization (#81 ) blog.start_gpt_chat -> was on my blog This also slightly tunes the search prompt to support filtering by oldest and try a tiny bit harder to guide GPT 3.5 which is a bit of a losing battle Co-authored-by: Krzysztof Kotlarek <kotlarek.krzysztof@gmail.com>	2023-05-25 11:05:02 +10:00
Sam	d85b503ed4	FIX: guide GPT 3.5 better (#77 ) * FIX: guide GPT 3.5 better This limits search results to 10 cause we were blowing the whole token budget on search results, additionally it includes a quick exchange at the start of a session to try and guide GPT 3.5 to follow instructions Sadly GPT 3.5 drifts off very quickly but this does improve stuff a bit. It also attempts to correct some issues with anthropic, though it still is surprisingly hard to ground * add status:public, this is a bit of a hack but ensures that we can search for any filter provided * fix specs	2023-05-23 23:08:17 +10:00
Sam	e0cf7b7d70	FIX: results will be nil for invalid queries (#74 ) Previous to this change invalid searches would break the command.	2023-05-22 15:14:26 +10:00
Sam	92fb84e24d	iterate commands (#73 ) * FEATURE: introduce a more efficient formatter Previous formatting style was space inefficient given JSON consumes lots of tokens, the new format is now used consistently across commands Also fixes - search limited to 10 - search breaking on limit: non existent directive * Slight improvement to summarizer Stop blowing up context with custom prompts * ensure we include the guiding message * correct spec * langchain style summarizer ... much more accurate (albeit more expensive) * lint	2023-05-22 12:09:14 +10:00
Sam	d59ed1091b	FEATURE: add support for GPT <-> Forum integration This change-set connects GPT based chat with the forum it runs on. Allowing it to perform search, lookup tags and categories and summarize topics. The integration is currently restricted to public portions of the forum. Changes made: - Do not run ai reply job for small actions - Improved composable system prompt - Trivial summarizer for topics - Image generator - Google command for searching via Google - Corrected trimming of posts raw (was replacing with numbers) - Bypass of problem specs The feature works best with GPT-4 --------- Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>	2023-05-20 17:45:54 +10:00
Roman Rizzi	362f6167d1	FEATURE: Less friction for starting a conversation with an AI bot. (#63 ) * FEATURE: Less friction for starting a conversation with an AI bot. This PR adds a new header icon as a shortcut to start a conversation with one of our AI Bots. After clicking and selecting one from the dropdown menu, we'll open the composer with some fields already filled (recipients and title). If you leave the title as is, we'll queue a job after five minutes to update it using a bot suggestion. * Update assets/javascripts/initializers/ai-bot-replies.js Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com> * Update assets/javascripts/initializers/ai-bot-replies.js Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com> --------- Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com>	2023-05-16 14:38:21 -03:00
Roman Rizzi	7e3cb0ea16	FEATURE: Multi-model support for the AI Bot module. (#56 ) We'll create one bot user for each available model. When listed in the `ai_bot_enabled_chat_bots` setting, they will reply. This PR lets us use Claude-v1 in stream mode.	2023-05-11 10:03:03 -03:00
Roman Rizzi	71b105a1bb	FEATURE: Introduce the ai-bot module (#52 ) This module lets you chat with our GPT bot inside a PM. The bot only replies to members of the groups listed on the ai_bot_allowed_groups setting and only if you invite it to participate in the PM.	2023-05-05 15:28:31 -03:00

1 2

83 Commits