discourse-ai

Commit Graph

Author	SHA1	Message	Date
Sam	f6ac5cd0a8	FEATURE: allow tuning of RAG generation (#565 ) * FEATURE: allow tuning of RAG generation - change chunking to be token based vs char based (which is more accurate) - allow control over overlap / tokens per chunk and conversation snippets inserted - UI to control new settings * improve ui a bit * fix various reindex issues * reduce concurrency * try ultra low queue ... concurrency 1 is too slow.	2024-04-12 10:32:46 -03:00
Roman Rizzi	1f1c94e5c6	FEATURE: AI Bot RAG support. (#537 ) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations	2024-04-01 13:43:34 -03:00
Martin Brennan	c3b26ccb10	DEV: Move admin routes + templates to admin/assets/javascripts/ path (#545 ) This ensures these routes and templates are not loaded if the user isn't admin. Only affects Persona routes at this point in time.	2024-03-25 09:58:53 +10:00

Author

SHA1

Message

Date

Sam

f6ac5cd0a8

FEATURE: allow tuning of RAG generation (#565 )

* FEATURE: allow tuning of RAG generation

- change chunking to be token based vs char based (which is more accurate)
- allow control over overlap / tokens per chunk and conversation snippets inserted
- UI to control new settings

* improve ui a bit

* fix various reindex issues

* reduce concurrency

* try ultra low queue ... concurrency 1 is too slow.

2024-04-12 10:32:46 -03:00

Roman Rizzi

1f1c94e5c6

FEATURE: AI Bot RAG support. (#537 )

This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses.

For now, we'll only allow plain-text files, but this will change in the future.

Commits:

* FEATURE: RAG embeddings for the AI Bot

This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments,
and generate embeddings of. In a next commit, we'll use those to give the bot additional information during
conversations.

* Basic asymmetric similarity search to provide guidance in system prompt

* Fix tests and lint

* Apply reranker to fragments

* Uploads filter, css adjustments and file validations

* Add placeholder for rag fragments

* Update annotations

2024-04-01 13:43:34 -03:00

Martin Brennan

c3b26ccb10

DEV: Move admin routes + templates to admin/assets/javascripts/ path (#545 )

This ensures these routes and templates are not loaded if the user isn't
admin. Only affects Persona routes at this point in time.

2024-03-25 09:58:53 +10:00

3 Commits