discourse-ai

Commit Graph

Author	SHA1	Message	Date
Sam	b2b01185f2	FEATURE: add support for new OpenAI embedding models (#445 ) * FEATURE: add support for new OpenAI embedding models This adds support for just released text_embedding_3_small and large Note, we have not yet implemented truncation support which is a new API feature. (triggered using dimensions) * Tiny side fix, recalc bots when ai is enabled or disabled * FIX: downsample to 2000 items per vector which is a pgvector limitation	2024-01-29 13:24:30 -03:00
Rafael dos Santos Silva	16d666fe69	FIX: Misconfigured OpenAI API for embeddings shouldn't spam logs (#440 )	2024-01-24 15:57:18 -03:00
Jarek Radosz	5802cd1a0c	DEV: Fix various typos (#434 )	2024-01-19 12:51:26 +01:00
Jarek Radosz	6b8a57d957	DEV: Update linting (#423 ) Co-authored-by: Keegan George <kgeorge13@gmail.com>	2024-01-13 00:28:06 +01:00
Rafael dos Santos Silva	8fcba12fae	FEATURE: Support for SRV records for Discourse services (#414 ) This allows admins to configure services with multiple backends using DNS SRV records. This PR also adds support for shared secret auth via headers for TEI and vLLM endpoints, so they are inline with the other ones.	2024-01-10 19:23:07 -03:00
Roman Rizzi	f9d7d7f5f0	DEV: AI bot migration to the Llm pattern. (#343 ) * DEV: AI bot migration to the Llm pattern. We added tool and conversation context support to the Llm service in discourse-ai#366, meaning we met all the conditions to migrate this module. This PR migrates to the new pattern, meaning adding a new bot now requires minimal effort as long as the service supports it. On top of this, we introduce the concept of a "Playground" to separate the PM-specific bits from the completion, allowing us to use the bot in other contexts like chat in the future. Commands are called tools, and we simplified all the placeholder logic to perform updates in a single place, making the flow more one-wayish. * Followup fixes based on testing * Cleanup unused inference code * FIX: text-based tools could be in the middle of a sentence * GPT-4-turbo support * Use new LLM API	2024-01-04 10:44:07 -03:00
Rafael dos Santos Silva	1287ef4428	FEATURE: Support for Gemini Embeddings (#382 )	2023-12-28 10:28:01 -03:00
Sam	af2e692761	FIX: under certain conditions we would get duplicate data from llm (#373 ) Previously endpoint/base would `+=` decoded_chunk to leftover This could lead to cases where the leftover buffer had duplicate previously processed data Fix ensures we properly skip previously decoded data.	2023-12-20 14:28:05 -03:00
Rafael dos Santos Silva	4d7ccdda2f	FEATURE: DNS SRV support for TEI (#363 )	2023-12-18 13:21:21 -03:00
Sam	3c9901d43a	FEATURE: implement GPT-4 turbo support (#345 ) Keep in mind: - GPT-4 is only going to be fully released next year - so this hardcodes preview model for now - Fixes streaming bugs which became a big problem with GPT-4 turbo - Adds Azure endpoing for turbo as well Co-authored-by: Martin Brennan <martin@discourse.org>	2023-12-11 14:59:57 +11:00
Rafael dos Santos Silva	381b0d74ca	FIX: Handle truncation in HyDE search (#342 )	2023-12-07 10:36:56 -03:00
Rafael dos Santos Silva	252efdf142	FIX: Don't echo prompt back on HF/TGI (#338 ) * FIX: Don't echo prompt back on HF/TGI * teeeeests	2023-12-06 16:06:26 -03:00
Rafael dos Santos Silva	d8267d8da0	FIX: Many fixes for huggingface and llama2 inference (#335 )	2023-12-06 11:22:42 -03:00
Sam	6ddc17fd61	DEV: port directory structure to Zeitwerk (#319 ) Previous to this change we relied on explicit loading for a files in Discourse AI. This had a few downsides: - Busywork whenever you add a file (an extra require relative) - We were not keeping to conventions internally ... some places were OpenAI others are OpenAi - Autoloader did not work which lead to lots of full application broken reloads when developing. This moves all of DiscourseAI into a Zeitwerk compatible structure. It also leaves some minimal amount of manual loading (automation - which is loading into an existing namespace that may or may not be there) To avoid needing /lib/discourse_ai/... we mount a namespace thus we are able to keep /lib pointed at ::DiscourseAi Various files were renamed to get around zeitwerk rules and minimize usage of custom inflections Though we can get custom inflections to work it is not worth it, will require a Discourse core patch which means we create a hard dependency.	2023-11-29 15:17:46 +11:00

14 Commits