discourse-ai

Commit Graph

Author	SHA1	Message	Date
Sam Saffron	d544f2101b	more tweaks...	2023-05-31 17:15:32 +10:00
Sam Saffron	2d299900b9	we certainly do not need search through pinned topics for now... This just causes bots to get confused.	2023-05-29 14:15:12 +10:00
Sam Saffron	c7781c57fc	Fix all specs Also ensure triage is more consistent by reducing temp	2023-05-29 14:11:32 +10:00
Sam Saffron	db11fe391d	FEATURE: use triage command to attempt to ground models on GPT 3.5 Previous to this change our results were ungrounded and simpler models like GPT 3.5 and claude had a real tough time figuring out how to run commands This splits responses into 2 phases: Phase 1: figure out if you need to run a command Phase 2: respond to user This seems to produce better results on both Claude and GPT 3.5 but still needs a fair bit of tuning.	2023-05-26 17:07:52 +10:00
Sam	96d521198b	FIX: missing localization (#81 ) blog.start_gpt_chat -> was on my blog This also slightly tunes the search prompt to support filtering by oldest and try a tiny bit harder to guide GPT 3.5 which is a bit of a losing battle Co-authored-by: Krzysztof Kotlarek <kotlarek.krzysztof@gmail.com>	2023-05-25 11:05:02 +10:00
Rafael dos Santos Silva	cfc6e388df	FIX: Ensure embeddings database outages are handled gracefully (#80 ) The rails_failover middleware will intercept all `PG::ConnectionBad` errors and put the cluster into readonly mode. It does not have any handling for multiple databases. Therefore, an issue with the embeddings database was taking the whole cluster into readonly. This commit fixes the issue by rescuing `PG::Error` from all AI database accesses, and re-raises errors with a different class. It also adds a spec to ensure that an embeddings database outage does not affect the functionality of the topics/show route. Co-authored-by: David Taylor <david@taylorhq.com>	2023-05-23 22:57:52 +01:00
Rafael dos Santos Silva	b213fe7f94	FIX: Give up trying to reuse the DB connection and rely on pgbouncer (#79 )	2023-05-23 15:12:59 -03:00
Sam	d85b503ed4	FIX: guide GPT 3.5 better (#77 ) * FIX: guide GPT 3.5 better This limits search results to 10 cause we were blowing the whole token budget on search results, additionally it includes a quick exchange at the start of a session to try and guide GPT 3.5 to follow instructions Sadly GPT 3.5 drifts off very quickly but this does improve stuff a bit. It also attempts to correct some issues with anthropic, though it still is surprisingly hard to ground * add status:public, this is a bit of a hack but ensures that we can search for any filter provided * fix specs	2023-05-23 23:08:17 +10:00
Sam	b82fc1e692	FIX: ensure we only attempt embedding once every 15 minutes (#76 ) This also heavily reduced log noise and ensures our exception handling is more surgical.	2023-05-23 10:43:24 +10:00
Sam	074d00ca32	FEATURE: improve search prompt (#75 ) - We only support searching public topics - make it clear - Stop using bug/feature, cause is poisons system - these may not exist - Add after: and before: which are very handy for bounding search results	2023-05-23 07:52:14 +10:00
Sam	e0cf7b7d70	FIX: results will be nil for invalid queries (#74 ) Previous to this change invalid searches would break the command.	2023-05-22 15:14:26 +10:00
Sam	92fb84e24d	iterate commands (#73 ) * FEATURE: introduce a more efficient formatter Previous formatting style was space inefficient given JSON consumes lots of tokens, the new format is now used consistently across commands Also fixes - search limited to 10 - search breaking on limit: non existent directive * Slight improvement to summarizer Stop blowing up context with custom prompts * ensure we include the guiding message * correct spec * langchain style summarizer ... much more accurate (albeit more expensive) * lint	2023-05-22 12:09:14 +10:00
Sam	d59ed1091b	FEATURE: add support for GPT <-> Forum integration This change-set connects GPT based chat with the forum it runs on. Allowing it to perform search, lookup tags and categories and summarize topics. The integration is currently restricted to public portions of the forum. Changes made: - Do not run ai reply job for small actions - Improved composable system prompt - Trivial summarizer for topics - Image generator - Google command for searching via Google - Corrected trimming of posts raw (was replacing with numbers) - Bypass of problem specs The feature works best with GPT-4 --------- Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>	2023-05-20 17:45:54 +10:00
Rafael dos Santos Silva	262ed4753e	FEATURE: Basic StableDiffusion text2img support (#72 )	2023-05-20 09:38:08 +10:00
Rafael dos Santos Silva	739b314312	Fixes for embeddings and truncate (#67 )	2023-05-18 09:21:28 +10:00
Rafael dos Santos Silva	e9ae28f773	FIX: Non instructor OSS embeddings was broken (#65 )	2023-05-17 12:10:10 -03:00
Roman Rizzi	362f6167d1	FEATURE: Less friction for starting a conversation with an AI bot. (#63 ) * FEATURE: Less friction for starting a conversation with an AI bot. This PR adds a new header icon as a shortcut to start a conversation with one of our AI Bots. After clicking and selecting one from the dropdown menu, we'll open the composer with some fields already filled (recipients and title). If you leave the title as is, we'll queue a job after five minutes to update it using a bot suggestion. * Update assets/javascripts/initializers/ai-bot-replies.js Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com> * Update assets/javascripts/initializers/ai-bot-replies.js Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com> --------- Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com>	2023-05-16 14:38:21 -03:00
Rafael dos Santos Silva	2ed1f874c2	Use correct API signature for instructor embeddings (#62 )	2023-05-15 17:18:11 -03:00
Rafael dos Santos Silva	3c9513e754	Refinements to embeddings and tokenizers (#61 ) * Refinements to embeddings and tokenizers * lint * Truncate with tokenizers for summary * fix	2023-05-15 15:10:42 -03:00
Rafael dos Santos Silva	97124b30de	FEATURE: Update summarization token count and add Claude 100k (#58 )	2023-05-11 15:35:58 -03:00
Rafael dos Santos Silva	66bf4c74c6	FEATURE: Handle invalid media in NSFW module (#57 ) * FEATURE: Handle invalid media in NSFW module * fix lint	2023-05-11 15:35:39 -03:00
Roman Rizzi	7e3cb0ea16	FEATURE: Multi-model support for the AI Bot module. (#56 ) We'll create one bot user for each available model. When listed in the `ai_bot_enabled_chat_bots` setting, they will reply. This PR lets us use Claude-v1 in stream mode.	2023-05-11 10:03:03 -03:00
Rafael dos Santos Silva	e5537d4c77	FEATURE: Allow excluding closed topics from semantic related (#55 )	2023-05-09 15:30:50 -03:00
Rafael dos Santos Silva	f1133f66a6	Updates to embedding rake tasks (#54 ) - Creates embeddings in topic ID order, so it's easier to stop and restart from where we stopped - Update index parameters with current best practices	2023-05-09 13:45:16 -03:00
Sam	e76fc77189	fixes (#53 ) * Minor... use username suggester in case username already exists * FIX: ensure we truncate long prompts Previously we 1. Used raw length instead of token counts for counting length 2. We totally dropped a prompt if it was too long New implementation will truncate "raw" if it gets too long maintaining meaning.	2023-05-06 07:31:53 -03:00
Roman Rizzi	71b105a1bb	FEATURE: Introduce the ai-bot module (#52 ) This module lets you chat with our GPT bot inside a PM. The bot only replies to members of the groups listed on the ai_bot_allowed_groups setting and only if you invite it to participate in the PM.	2023-05-05 15:28:31 -03:00
Rafael dos Santos Silva	c96edc8a72	FIX: Pass correct API Key to summarization service (#50 )	2023-05-02 21:41:11 -03:00
Rafael dos Santos Silva	89ac5d720a	FIX: Only send supported image types for classification (#49 ) * FIX: Only send supported image types for classification	2023-04-27 17:52:20 -03:00
Sam	2cd60a4b3b	FEATURE: add a table to audit OpenAI usage (#45 ) Still need to build a job to purge logs	2023-04-26 11:44:29 +10:00
David Taylor	a0542d1859	DEV: Resolve add_to_serializer deprecations (#46 ) `26b7f8a63b`	2023-04-24 16:07:17 +01:00
Sam	057fbe1ce6	FEATURE: add internal support for streaming mode (#42 ) Also adds some tests around completions and supports additional params such as top_p, temperature and max_tokens This also migrates off Faraday to using Net::HTTP directly	2023-04-21 16:54:25 +10:00
Meghna	14b21b4f4d	UX: add a custom sparkles icon for AI action buttons (#44 )	2023-04-20 20:41:24 +05:30
Roman Rizzi	38e007a3a5	FEATURE: Topic summarization (#41 ) * FEATURE: Topic summarization Summarize topics using the TopicView's "summary" filter. The UI is similar to what we do for chat, but we don't allow the user to select a timeframe. Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com>	2023-04-19 17:57:31 -03:00
Rafael dos Santos Silva	9783e3b025	FEATURE: Add a basic tokenizer API (#37 ) * FEATURE: Add a basic tokenizer API * Add tests * lint	2023-04-19 11:55:59 -03:00
Rafael dos Santos Silva	4368ef29d8	FIX: Sometimes Claude sends all titles suggestions in a single ai tag (#40 )	2023-04-10 16:02:44 -03:00
Rafael dos Santos Silva	bb0b829634	FEATURE: Anthropic Claude for AIHelper and Summarization modules (#39 )	2023-04-10 11:04:42 -03:00
Rafael dos Santos Silva	5549e4d5b3	FEATURE: Chat channel summarization. (#32 ) * start summary module * chat channel summarization * FEATURE: modal for channel summarization --------- Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>	2023-04-04 11:24:09 -03:00
Roman Rizzi	7a54455cf6	FIX: Use correct variable and method for embeddings (#35 )	2023-03-31 16:15:10 -03:00
Roman Rizzi	4e05763a99	FEATURE: Semantic assymetric full-page search (#34 ) Depends on discourse/discourse#20915 Hooks to the full-page-search component using an experimental API and performs an assymetric similarity search using our embeddings database.	2023-03-31 15:29:56 -03:00
Sam	6543c50758	FIX: stop returning self as a candidate for related topics (#31 )	2023-03-31 11:04:17 +10:00
Sam	0d80d9ec49	FEATURE: allow limiting results in related topics section (#30 ) Also: - Normalizes behavior between logged in and anon, we only show related topics in the related topic section - Renames "suggested" to "related" given this only exists in related section - Adds a spec section to ensure anon does not regress - Adds `ai_embeddings_semantic_related_topics` to limit related topics Renamed settings: ai_embeddings_semantic_suggested_model -> ai_embeddings_semantic_related_model ai_embeddings_semantic_suggested_topics_enabled -> ai_embeddings_semantic_related_topics_enabled Plugins is still in an experimental phase and not much is overidden hence avoiding adding site setting migrations. Co-authored-by: Krzysztof Kotlarek <kotlarek.krzysztof@gmail.com>	2023-03-31 11:04:34 +11:00
Sam	1d097b9d82	FEATURE: attempt to include related topics above suggested (#28 ) Allows related topics to show up for logged on users - Introduces a new "Related Topics" block above suggested when related topics exist - Renames `ai_embeddings_semantic_suggested_topics_anons_enabled` -> `ai_embeddings_semantic_suggested_topics_enabled` (given it is only deployed on 1 site not bothering with a migration) - Adds an integration test to ensure data arrives correctly on the client	2023-03-31 09:07:22 +11:00
Rafael dos Santos Silva	b942a18298	FEATURE: Support for GPT-4 in AI Helper module (#29 )	2023-03-28 23:22:34 -03:00
Rafael dos Santos Silva	45950f1bb4	FIX: Only show public visible topics as suggested for anons (#27 ) * FIX: Only show public visible topics as suggested for anons * DEV: Add tests for embeddings * Update spec/lib/modules/embeddings/semantic_suggested_spec.rb Co-authored-by: Bianca Nenciu <nbianca@users.noreply.github.com> * Update spec/lib/modules/embeddings/semantic_suggested_spec.rb Co-authored-by: Bianca Nenciu <nbianca@users.noreply.github.com> * move to top --------- Co-authored-by: Bianca Nenciu <nbianca@users.noreply.github.com>	2023-03-23 17:28:01 -03:00
Roman Rizzi	4c960970fa	DEV: Log information about errors from the completions OpenAI API (#26 )	2023-03-22 16:00:28 -03:00
Sam	1d14f7ffaf	FEATURE: Add a markdown table AI helper (#25 )	2023-03-22 13:16:29 -03:00
Rafael dos Santos Silva	bd342f538d	FEATURE: Try to generate embeddings for a topic when those aren't found (#23 )	2023-03-21 18:20:46 -03:00
Roman Rizzi	39f7f1f29e	FEATURE: Prompts can consist of multiple messages. (#21 ) A prompt with multiple messages leads to better results, as the AI can learn for given examples. Alongside this change, we provide a better default proofreading prompt.	2023-03-21 12:04:59 -03:00
Rafael dos Santos Silva	6bdbc0e32d	FIX: Proper flow when a topic doesn't have embeddings (#20 )	2023-03-20 16:44:55 -03:00
Roman Rizzi	fea9041ee1	DEV: Use 10s timeout when using the completions API (#19 )	2023-03-20 16:43:51 -03:00

1 2

70 Commits