discourse-ai

mirror of https://github.com/discourse/discourse-ai.git synced 2025-02-18 17:34:52 +00:00

Author	SHA1	Message	Date
Sam	9390fba768	FIX: adjust token limits to account for functions (#96 ) Reduce maximum replies to 2500 tokens and make them even for both GPT-3.5 and 4 Account for 400+ tokens in function definitions (this was unaccounted for)	2023-06-23 10:02:04 +10:00
Sam	f8cabfad6b	FEATURE: Try to hone search so it reduces search terms in subsequent rounds (#95 ) Teach via system message that you can reduce search terms to get more results	2023-06-21 20:07:55 +10:00
Sam	a028309cbd	FEATURE: add ai_bot_enabled_chat commands and tune search (#94 ) * FEATURE: add ai_bot_enabled_chat commands and tune search This allows admins to disable/enable GPT command integrations. Also hones search results which were looping cause the result did not denote the failure properly (it lost context) * include more context for google command include more context for time command * type	2023-06-21 17:10:30 +10:00
Sam	30778d8af8	FIX: avoid storing corrupt prompts (#92 ) ``` prompt << build_message(bot_user.username, reply) ``` Would store a "cooked" prompt which is invalid, instead just store the raw values which are later passed to build_message Additionally: 1. Disable summary command which needs honing 2. Stop storing decorations (searched for X) in prompt which leads to straying 3. Ship username directly to model, avoiding "user: content" in prompts. This was causing GPT to stray	2023-06-20 15:44:03 +10:00
Sam	70c158cae1	FEATURE: add full bot support for GPT 3.5 (#87 ) Given latest GPT 3.5 16k which is both better steered and supports functions we can now support rich bot integration. Clunky system message based steering is removed and instead we use the function framework provided by Open AI	2023-06-20 08:45:31 +10:00
Sam	081231a6eb	FIX: support multiple command executions (#85 ) Previous to this change we were chaining stuff too late and would execute commands serially leading to very unexpected results This corrects this and allows us to run stuff like: > Search google 3/4 times on various permutations of QUERY and answer this question. We limit at 5 commands to ensure there are not pathological user cases where you lean on the LLM to flood us with results.	2023-06-06 07:09:33 +10:00
Sam	840968630e	FEATURE: disable smart commands on Claude and GPT 3.5 (#84 ) For the time being smart commands only work consistently on GPT 4. Avoid using any smart commands on the earlier models. Additionally adds better error handling to Claude which sometimes streams partial json and slightly tunes the search command.	2023-06-01 09:10:33 +10:00
Sam	96d521198b	FIX: missing localization (#81 ) blog.start_gpt_chat -> was on my blog This also slightly tunes the search prompt to support filtering by oldest and try a tiny bit harder to guide GPT 3.5 which is a bit of a losing battle Co-authored-by: Krzysztof Kotlarek <kotlarek.krzysztof@gmail.com>	2023-05-25 11:05:02 +10:00
Sam	d85b503ed4	FIX: guide GPT 3.5 better (#77 ) * FIX: guide GPT 3.5 better This limits search results to 10 cause we were blowing the whole token budget on search results, additionally it includes a quick exchange at the start of a session to try and guide GPT 3.5 to follow instructions Sadly GPT 3.5 drifts off very quickly but this does improve stuff a bit. It also attempts to correct some issues with anthropic, though it still is surprisingly hard to ground * add status:public, this is a bit of a hack but ensures that we can search for any filter provided * fix specs	2023-05-23 23:08:17 +10:00
Sam	074d00ca32	FEATURE: improve search prompt (#75 ) - We only support searching public topics - make it clear - Stop using bug/feature, cause is poisons system - these may not exist - Add after: and before: which are very handy for bounding search results	2023-05-23 07:52:14 +10:00
Sam	e0cf7b7d70	FIX: results will be nil for invalid queries (#74 ) Previous to this change invalid searches would break the command.	2023-05-22 15:14:26 +10:00
Sam	92fb84e24d	iterate commands (#73 ) * FEATURE: introduce a more efficient formatter Previous formatting style was space inefficient given JSON consumes lots of tokens, the new format is now used consistently across commands Also fixes - search limited to 10 - search breaking on limit: non existent directive * Slight improvement to summarizer Stop blowing up context with custom prompts * ensure we include the guiding message * correct spec * langchain style summarizer ... much more accurate (albeit more expensive) * lint	2023-05-22 12:09:14 +10:00
Sam	d59ed1091b	FEATURE: add support for GPT <-> Forum integration This change-set connects GPT based chat with the forum it runs on. Allowing it to perform search, lookup tags and categories and summarize topics. The integration is currently restricted to public portions of the forum. Changes made: - Do not run ai reply job for small actions - Improved composable system prompt - Trivial summarizer for topics - Image generator - Google command for searching via Google - Corrected trimming of posts raw (was replacing with numbers) - Bypass of problem specs The feature works best with GPT-4 --------- Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>	2023-05-20 17:45:54 +10:00
Roman Rizzi	362f6167d1	FEATURE: Less friction for starting a conversation with an AI bot. (#63 ) * FEATURE: Less friction for starting a conversation with an AI bot. This PR adds a new header icon as a shortcut to start a conversation with one of our AI Bots. After clicking and selecting one from the dropdown menu, we'll open the composer with some fields already filled (recipients and title). If you leave the title as is, we'll queue a job after five minutes to update it using a bot suggestion. * Update assets/javascripts/initializers/ai-bot-replies.js Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com> * Update assets/javascripts/initializers/ai-bot-replies.js Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com> --------- Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com>	2023-05-16 14:38:21 -03:00
Rafael dos Santos Silva	3c9513e754	Refinements to embeddings and tokenizers (#61 ) * Refinements to embeddings and tokenizers * lint * Truncate with tokenizers for summary * fix	2023-05-15 15:10:42 -03:00
Roman Rizzi	7e3cb0ea16	FEATURE: Multi-model support for the AI Bot module. (#56 ) We'll create one bot user for each available model. When listed in the `ai_bot_enabled_chat_bots` setting, they will reply. This PR lets us use Claude-v1 in stream mode.	2023-05-11 10:03:03 -03:00
Roman Rizzi	71b105a1bb	FEATURE: Introduce the ai-bot module (#52 ) This module lets you chat with our GPT bot inside a PM. The bot only replies to members of the groups listed on the ai_bot_allowed_groups setting and only if you invite it to participate in the PM.	2023-05-05 15:28:31 -03:00

17 Commits