discourse-ai

Commit Graph

Author	SHA1	Message	Date
Roman Rizzi	534b0df391	REFACTOR: Separation of concerns for embedding generation. (#1027) In a previous refactor, we moved the responsibility of querying and storing embeddings into the `Schema` class. Now, it's time for embedding generation. The motivation behind these changes is to isolate vector characteristics in simple objects to later replace them with a DB-backed version, similar to what we did with LLM configs.	2024-12-16 09:55:39 -03:00
Roman Rizzi	eae527f99d	REFACTOR: A Simpler way of interacting with embeddings tables. (#1023) This change adds a new abstraction called `Schema`, which acts as a repository that supports the same DB features `VectorRepresentation::Base` has, except that it removes the need for duplicated methods per embeddings table. It is also a bit more flexible when performing a similarity search because you can pass it a block that gives you access to the builder, allowing you to add multiple joins/where conditions.	2024-12-13 10:15:21 -03:00
Sam	bdf3b6268b	FEATURE: smarter persona tethering (#832) Splits persona permissions so you can allow a persona on chat DMs, personal messages, topic mentions, and chat channels (any combination is allowed). Previously we did not have this flexibility. Additionally, adds the ability to "tether" a language model to a persona so it will always be used by the persona. This allows people to use a cheaper language model for one group of people and a more expensive one for other people.	2024-10-16 07:20:31 +11:00
Sam	a5b5c3bebe	PERF: speed up spec (#794) ~500ms -> ~100ms. It is still not a super fast spec, given search is not free, but it is a bit faster and clearer.	2024-09-04 16:14:32 +10:00
Sam	cabecb801e	FEATURE: disable rate limiting when skipping hyde (#793) Embedding search is rate limited due to the potentially expensive HyDE operation (which requires LLM access). Embedding is generally very cheap by comparison (usually 100x cheaper). This raises the limit to 100 per minute for embedding searches, while keeping the old 4 per minute for HyDE-powered search.	2024-09-04 15:51:01 +10:00
Roman Rizzi	e408cd080c	FIX: coerce value before downcasing the hyde param (#787)	2024-08-30 12:13:29 -03:00
Sam	eee8e72756	FEATURE: API scope for semantic search (#785) The new API scope allows restricting access to semantic search only.	2024-08-30 09:35:20 +10:00
Sam	0687ec75c3	FEATURE: allow embedding based search without hyde (#777) This allows callers of embedding-based search to bypass HyDE. HyDE expands the search term using an LLM, but if an LLM is performing the search we can skip this expansion. It also introduces tests for the controller, which we did not previously have.	2024-08-28 14:17:34 +10:00

8 Commits