discourse-ai

Commit Graph

Author	SHA1	Message	Date
Rafael dos Santos Silva	04bc402aae	FEATURE: Setting to control per post embeddings (#439 ) * FEATURE: Setting to control per post embeddings	2024-01-23 22:09:27 -03:00
Roman Rizzi	04eae76f68	REFACTOR: Represent generic prompts with an Object. (#416 ) * REFACTOR: Represent generic prompts with an Object. * Adds a bit more validation for clarity * Rewrite bot title prompt and fix quirk handling --------- Co-authored-by: Sam Saffron <sam.saffron@gmail.com>	2024-01-12 14:36:44 -03:00
Rafael dos Santos Silva	705ef986b4	FIX: Set ivfflat.probes using topic count, not post count (#421 ) Fixes a regression from `140359c` which caused we to set this globally based on post count, rendering the cost of an index scan on the topics table too high and making the planner, correctly, not use the index anymore. Hopefully https://github.com/pgvector/pgvector/issues/235 lands soon.	2024-01-12 11:20:23 -03:00
Martin Brennan	37b957dbbb	DEV: Fix SemanticRelated module load error (#419 ) Followup `2636efcd1b`, whenever ruby code was changed locally this would break module loading, giving an "uninitialized constant DiscourseAi::Embeddings::EntryPoint::SemanticRelated" error.	2024-01-11 13:52:50 +10:00
Rafael dos Santos Silva	8fcba12fae	FEATURE: Support for SRV records for Discourse services (#414 ) This allows admins to configure services with multiple backends using DNS SRV records. This PR also adds support for shared secret auth via headers for TEI and vLLM endpoints, so they are inline with the other ones.	2024-01-10 19:23:07 -03:00
Rafael dos Santos Silva	23b2809638	FEATURE: Generate proper embeddings for posts/topics with embedded content (#401 )	2024-01-05 10:27:45 -03:00
Rafael dos Santos Silva	6fc1c9f7a6	FEATURE: Try to automatically handle larger embedding indexes (#403 ) * FEATURE: Try to automatically handle larger embedding indexes * linteeeeeeeer	2024-01-05 09:56:28 -03:00
Sam	03fc94684b	FIX: AI helper not working correctly with mixtral (#399 ) * FIX: AI helper not working correctly with mixtral This PR introduces a new function on the generic llm called #generate This will replace the implementation of completion! #generate introduces a new way to pass temperature, max_tokens and stop_sequences Then LLM implementers need to implement #normalize_model_params to ensure the generic names match the LLM specific endpoint This also adds temperature and stop_sequences to completion_prompts this allows for much more robust completion prompts * port everything over to #generate * Fix translation - On anthropic this no longer throws random "This is your translation:" - On mixtral this actually works * fix markdown table generation as well	2024-01-04 09:53:47 -03:00
Rafael dos Santos Silva	cec9bb8910	FIX: Skip embeddings for blank content (#392 )	2023-12-29 14:59:08 -03:00
Rafael dos Santos Silva	c778592da4	FIX: Corner cases on post embedding and crawler related (#391 ) * FIX: Simplify markup for crawler related * FIX: Handle title-less topics when truncating for embeddings * fix	2023-12-29 14:05:02 -03:00
Rafael dos Santos Silva	140359c2ef	FEATURE: Per post embeddings (#387 )	2023-12-29 12:28:45 -03:00
Rafael dos Santos Silva	2636efcd1b	FEATURE: Render Related Topics for Crawlers (#386 )	2023-12-28 15:32:03 -03:00
Rafael dos Santos Silva	1287ef4428	FEATURE: Support for Gemini Embeddings (#382 )	2023-12-28 10:28:01 -03:00
Rafael dos Santos Silva	4d7ccdda2f	FEATURE: DNS SRV support for TEI (#363 )	2023-12-18 13:21:21 -03:00
Rafael dos Santos Silva	381b0d74ca	FIX: Handle truncation in HyDE search (#342 )	2023-12-07 10:36:56 -03:00
Rafael dos Santos Silva	c8352f21ce	FIX: Fallback to whole LLM response when XML fail (#340 )	2023-12-06 18:58:26 -03:00
Sam	a0b9fb9721	FIX: explicitly load embedding strategies (#325 ) If not, sometimes during tests these constants may not be loaded leading to flaky tests	2023-11-29 16:36:56 +11:00
Sam	6ddc17fd61	DEV: port directory structure to Zeitwerk (#319 ) Previous to this change we relied on explicit loading for a files in Discourse AI. This had a few downsides: - Busywork whenever you add a file (an extra require relative) - We were not keeping to conventions internally ... some places were OpenAI others are OpenAi - Autoloader did not work which lead to lots of full application broken reloads when developing. This moves all of DiscourseAI into a Zeitwerk compatible structure. It also leaves some minimal amount of manual loading (automation - which is loading into an existing namespace that may or may not be there) To avoid needing /lib/discourse_ai/... we mount a namespace thus we are able to keep /lib pointed at ::DiscourseAi Various files were renamed to get around zeitwerk rules and minimize usage of custom inflections Though we can get custom inflections to work it is not worth it, will require a Discourse core patch which means we create a hard dependency.	2023-11-29 15:17:46 +11:00

18 Commits