discourse-ai

Commit Graph

Author	SHA1	Message	Date
Roman Rizzi	534b0df391	REFACTOR: Separation of concerns for embedding generation. (#1027 ) In a previous refactor, we moved the responsibility of querying and storing embeddings into the `Schema` class. Now, it's time for embedding generation. The motivation behind these changes is to isolate vector characteristics in simple objects to later replace them with a DB-backed version, similar to what we did with LLM configs.	2024-12-16 09:55:39 -03:00
Loïc Guitaut	6ae4218a96	DEV: Fix new Rubocop offenses	2024-03-06 15:23:29 +01:00
Rafael dos Santos Silva	2c0f535bab	FEATURE: HyDE-powered semantic search. (#136 ) * FEATURE: HyDE-powered semantic search. It relies on the new outlet added on discourse/discourse#23390 to display semantic search results in an unobtrusive way. We'll use a HyDE-backed approach for semantic search, which consists on generating an hypothetical document from a given keywords, which gets transformed into a vector and used in a asymmetric similarity topic search. This PR also reorganizes the internals to have less moving parts, maintaining one hierarchy of DAOish classes for vector-related operations like transformations and querying. Completions and vectors created by HyDE will remain cached on Redis for now, but we could later use Postgres instead. * Missing translation and rate limiting --------- Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>	2023-09-05 11:08:23 -03:00
Rafael dos Santos Silva	0738f67fa4	FIX: Fix embeddings truncation strategy (#139 )	2023-08-16 15:09:41 -03:00

4 Commits