discourse-ai

Commit Graph

Author	SHA1	Message	Date
Roman Rizzi	e0691e70e8	DEV: Updates to the summarization strategy API (#301 ) Introduced by discourse/discourse#24489 In the future, this change will let us log who requested the summary in the `AiApiAuditLog`.:	2023-11-21 13:27:35 -03:00
Rafael dos Santos Silva	4df258ce7d	FIX: Follow fix for missing claude tags in `a7adce0` (#242 )	2023-10-03 19:56:29 -03:00
Rafael dos Santos Silva	a7adce0cf7	FIX: Fallback to whole response when Claude forgets tags (#240 )	2023-10-03 15:39:30 -03:00
Rafael dos Santos Silva	3c4a53b2cb	FEATURE: Better link in Claude summaries (#183 ) * FEATURE: Better link in Claude summaries * lint	2023-09-04 12:04:47 -03:00
Rafael dos Santos Silva	ea5a443588	FEATURE: Try to generate OpenAI Summaries in current language (#146 ) * FEATURE: Try to generate OpenAI Summaries in current language * lint	2023-08-21 15:40:32 -03:00
Rafael dos Santos Silva	49f2453c2d	FEATURE: Tweaks to Anthropic Summarization (#138 ) * FEATURE: Tweaks to Anthropic Summarization * fix specs	2023-08-16 15:09:52 -03:00
Roman Rizzi	b076e43d67	FEATURE: streaming mode for the FoldContent strategy. (#134 )	2023-08-11 15:08:54 -03:00
Rafael dos Santos Silva	eb7fff3a55	FEATURE: Add support for StableBeluga and Upstage Llama2 instruct (#126 ) * FEATURE: Add support for StableBeluga and Upstage Llama2 instruct This means we support all models in the top3 of the Open LLM Leaderboard Since some of those models have RoPE, we now have a setting so you can customize the token limit depending which model you use.	2023-08-03 15:29:30 -03:00
Rafael dos Santos Silva	8b157feea5	FEATURE: Compatibility with protected Hugging Face Endpoints (#123 ) * FEATURE: Compatibility with protected Hugging Face Endpoints	2023-08-02 17:00:00 -03:00
Rafael dos Santos Silva	b25daed60b	FEATURE: Llama2 for summarization (#116 )	2023-07-27 13:55:32 -03:00
Sam	4b0c077ce5	FEATURE: port to use claude-2 for chat bot (#114 ) Claude 1 costs the same and is less good than Claude 2. Make use of Claude 2 in all spots ... This also fixes streaming so it uses the far more efficient streaming protocol.	2023-07-27 11:24:44 +10:00
Roman Rizzi	473732c18a	FIX: Return base prompt instead of nil (#106 )	2023-07-13 21:48:25 -03:00
Roman Rizzi	5f0c617880	REFACTOR: Cohesive narrative for single-chunk summaries. (#103 ) Single and multi-chunk summaries end using different prompts for the last summary. This change detects when the summarized content fits in a single chunk and uses a slightly different prompt, which leads to more consistent summary formats. This PR also moves the chunk-splitting step to the `FoldContent` strategy as preparation for implementing streamed summaries.	2023-07-13 17:05:41 -03:00
Rafael dos Santos Silva	9d10a152b9	FEATURE: Claude 2 for summarization and AIHelper (#101 )	2023-07-13 12:32:08 -03:00
Roman Rizzi	fbe1bab980	FIX: typo while updating a section (#98 )	2023-06-27 17:57:58 -03:00
Roman Rizzi	9a79afcdbf	DEV: Better strategies for summarization (#88 ) * DEV: Better strategies for summarization The strategy responsibility needs to be "Given a collection of texts, I know how to summarize them most efficiently, using the minimum amount of requests and maximizing token usage". There are different token limits for each model, so it all boils down to two different strategies: Fold all these texts into a single one, doing the summarization in chunks, and then build a summary from those. Build it by combining texts in a single prompt, and truncate it according to your token limits. While the latter is less than ideal, we need it for "bart-large-cnn-samsum" and "flan-t5-base-samsum", both with low limits. The rest will rely on folding. * Expose summarized chunks to users	2023-06-27 12:26:33 -03:00
Rafael dos Santos Silva	8742535024	FEATURE: Allow using large context OpenAI models for summarization (#86 )	2023-06-13 15:23:48 -03:00
Roman Rizzi	3364fec425	DEV: Remove the summarization feature (#83 ) * DEV: Remove the summarization feature Instead, we'll register summarization implementations for OpenAI, Anthropic, and Discourse AI using the API defined in discourse/discourse#21813. Core and chat will implement features on top of these implementations instead of this plugin extending them. * Register instances that contain the model, requiring less site settings	2023-06-13 14:32:26 -03:00
Rafael dos Santos Silva	3c9513e754	Refinements to embeddings and tokenizers (#61 ) * Refinements to embeddings and tokenizers * lint * Truncate with tokenizers for summary * fix	2023-05-15 15:10:42 -03:00
Rafael dos Santos Silva	97124b30de	FEATURE: Update summarization token count and add Claude 100k (#58 )	2023-05-11 15:35:58 -03:00
Rafael dos Santos Silva	c96edc8a72	FIX: Pass correct API Key to summarization service (#50 )	2023-05-02 21:41:11 -03:00
Roman Rizzi	38e007a3a5	FEATURE: Topic summarization (#41 ) * FEATURE: Topic summarization Summarize topics using the TopicView's "summary" filter. The UI is similar to what we do for chat, but we don't allow the user to select a timeframe. Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com>	2023-04-19 17:57:31 -03:00
Rafael dos Santos Silva	bb0b829634	FEATURE: Anthropic Claude for AIHelper and Summarization modules (#39 )	2023-04-10 11:04:42 -03:00
Rafael dos Santos Silva	5549e4d5b3	FEATURE: Chat channel summarization. (#32 ) * start summary module * chat channel summarization * FEATURE: modal for channel summarization --------- Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>	2023-04-04 11:24:09 -03:00

24 Commits