discourse-ai

Commit Graph

Author	SHA1	Message	Date
Sam	9242da545e	FEATURE: support OpenAI-Organization header (#245 ) Per: https://platform.openai.com/docs/api-reference/authentication There is an organization option which is useful for large orgs > For users who belong to multiple organizations, you can pass a header to specify which organization is used for an API request. Usage from these API requests will count against the specified organization's subscription quota.	2023-10-06 10:23:18 +11:00
Sam	d87adcebea	FEATURE: Claude based scanning and OpenAI retries (#243 ) llm_triage supported Claude 2 in triage; this implements it. OpenAI rate limits frequently, so this introduces some exponential backoff (3 attempts - 3 seconds, 9 and 27). Also reduces temp of classifiers so they have consistent behavior.	2023-10-05 09:00:45 +11:00
Sam	181113159b	FIX: setting explorer was exceeding token budget. This refactor changes it so we only include minimal data in the system prompt, which leaves us lots of tokens for specific searches. The new search command allows us to pull in settings on demand. Descriptions are included in short search results, and names only in longer results. Also: * In dev it is important to tell when calls are made to OpenAI; this adds a console log to increase awareness around token usage * PERF: stop counting tokens so often. This changes it so we only count tokens once per response. Previously, each time we heard back from OpenAI we would count tokens, leading to unneeded delays * bug fix, commands may reach in for tokenizer * add logging to console for Anthropic calls as well * Update lib/shared/inference/openai_completions.rb Co-authored-by: Martin Brennan <mjrbrennan@gmail.com>	2023-09-01 11:48:51 +10:00
Sam	f0e1c72aa7	FEATURE: implement command framework for non Open AI (#147 ) OpenAI supports function calling; this has a very specific shape that other LLMs have not quite adopted. This simulates a command framework using system prompts on LLMs that are not OpenAI. Features include: - Smart system prompt to steer the LLM - Parameter validation (we ensure all the params are specified correctly) This is being tested on Anthropic at the moment and initial results are promising.	2023-08-23 07:49:36 +10:00
Sam	b4477ecdcd	FEATURE: support 16k and 32k variants for Azure GPT (#140 ) Azure requires a single HTTP endpoint per type of completion. The settings: `ai_openai_gpt35_16k_url` and `ai_openai_gpt4_32k_url` can be used now to configure the extra endpoints This amends token limit which was off a bit due to function calls and fixes a minor JS issue where we were not testing for a property	2023-08-17 11:00:11 +10:00
Roman Rizzi	b076e43d67	FEATURE: streaming mode for the FoldContent strategy. (#134 )	2023-08-11 15:08:54 -03:00
Sam	d1ab79e82f	FEATURE: Add Azure cognitive service support (#93 ) The new site settings: ai_openai_gpt35_url : distribution for GPT 16k ai_openai_gpt4_url: distribution for GPT 4 ai_openai_embeddings_url: distribution for ada2 If untouched we will simply use OpenAI endpoints. Azure requires 1 URL per model, OpenAI allows a single URL to serve multiple models. Hence the new settings.	2023-06-21 10:39:51 +10:00
Sam	70c158cae1	FEATURE: add full bot support for GPT 3.5 (#87 ) Given latest GPT 3.5 16k which is both better steered and supports functions we can now support rich bot integration. Clunky system message based steering is removed and instead we use the function framework provided by Open AI	2023-06-20 08:45:31 +10:00
Rafael dos Santos Silva	3c9513e754	Refinements to embeddings and tokenizers (#61 ) * Refinements to embeddings and tokenizers * lint * Truncate with tokenizers for summary * fix	2023-05-15 15:10:42 -03:00
Roman Rizzi	7e3cb0ea16	FEATURE: Multi-model support for the AI Bot module. (#56 ) We'll create one bot user for each available model. When listed in the `ai_bot_enabled_chat_bots` setting, they will reply. This PR lets us use Claude-v1 in stream mode.	2023-05-11 10:03:03 -03:00
Roman Rizzi	71b105a1bb	FEATURE: Introduce the ai-bot module (#52 ) This module lets you chat with our GPT bot inside a PM. The bot only replies to members of the groups listed on the ai_bot_allowed_groups setting and only if you invite it to participate in the PM.	2023-05-05 15:28:31 -03:00
Sam	2cd60a4b3b	FEATURE: add a table to audit OpenAI usage (#45 ) Still need to build a job to purge logs	2023-04-26 11:44:29 +10:00
Sam	057fbe1ce6	FEATURE: add internal support for streaming mode (#42 ) Also adds some tests around completions and supports additional params such as top_p, temperature and max_tokens This also migrates off Faraday to using Net::HTTP directly	2023-04-21 16:54:25 +10:00
Rafael dos Santos Silva	5549e4d5b3	FEATURE: Chat channel summarization. (#32 ) * start summary module * chat channel summarization * FEATURE: modal for channel summarization --------- Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>	2023-04-04 11:24:09 -03:00
Rafael dos Santos Silva	b942a18298	FEATURE: Support for GPT-4 in AI Helper module (#29 )	2023-03-28 23:22:34 -03:00
Roman Rizzi	4c960970fa	DEV: Log information about errors from the completions OpenAI API (#26 )	2023-03-22 16:00:28 -03:00
Sam	1d14f7ffaf	FEATURE: Add a markdown table AI helper (#25 )	2023-03-22 13:16:29 -03:00
Roman Rizzi	39f7f1f29e	FEATURE: Prompts can consist of multiple messages. (#21 ) A prompt with multiple messages leads to better results, as the AI can learn from given examples. Alongside this change, we provide a better default proofreading prompt.	2023-03-21 12:04:59 -03:00
Roman Rizzi	fea9041ee1	DEV: Use 10s timeout when using the completions API (#19 )	2023-03-20 16:43:51 -03:00
Roman Rizzi	f99fe7e1ed	FEATURE: Composer AI helper (#8 ) * FEATURE: Composer AI helper This change introduces a new composer button for the group members listed in the `ai_helper_allowed_groups` site setting. Users can use chatGPT to review, improve, or translate their posts to English. * Add a safeguard for PMs and don't rely on parentView	2023-03-15 17:02:20 -03:00
Roman Rizzi	aa2fca6086	DEV: DiscourseAI -> DiscourseAi rename to have consistent folders and files (#9 )	2023-03-14 16:03:50 -03:00
Rafael dos Santos Silva	510c6487e3	DEV: Preparation work for multiple inference providers (#5 )	2023-03-07 16:14:39 -03:00

22 Commits
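
Commit d87adcebea mentions exponential backoff with waits of 3, 9 and 27 seconds. The following is a minimal sketch of that schedule in Ruby, not the plugin's actual code: `RateLimited`, `backoff_delays`, `with_retries` and the `sleeper` parameter are hypothetical names introduced for illustration, and it assumes each wait is followed by one more attempt (the commit message does not say whether the 27-second wait is followed by a final call).

```ruby
# Hypothetical error class standing in for a rate-limit response.
class RateLimited < StandardError; end

# Delays of base**1, base**2, ... seconds; the defaults give 3, 9 and 27.
def backoff_delays(retries: 3, base: 3)
  (1..retries).map { |n| base**n }
end

# Runs the block, retrying on RateLimited after each delay in turn.
# Once the delays are exhausted, the last error is re-raised.
# `sleeper` is injectable so the schedule can be exercised without waiting.
def with_retries(delays: backoff_delays, sleeper: ->(seconds) { sleep(seconds) })
  remaining = delays.dup
  begin
    yield
  rescue RateLimited
    wait = remaining.shift
    raise if wait.nil?
    sleeper.call(wait)
    retry
  end
end
```

A caller would wrap the request in the block, for example `with_retries { perform_request }`, where `perform_request` is whatever code raises `RateLimited` when the API reports a rate limit.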