discourse-ai/lib/embeddings
Sam dcafc8032f
FIX: improve embedding generation (#452)
1. On failure we were queuing a job to generate embeddings, but it had the wrong params. This is now fixed and covered by a test.
2. Backfill embeddings in order of bumped_at, so the newest content is embedded first. This is covered by a test.
3. Add a safeguard for the hidden site setting, so that an embedding job run only allows batches of 50k.

Previously, old embeddings were updated in a random order; this change updates them in a consistent order.
2024-01-31 10:38:47 -03:00
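
The ordering and batch-size safeguard described in points 2 and 3 of the commit message can be sketched roughly as follows. This is only an illustration under stated assumptions: the `Topic` struct, the `MAX_BATCH` constant, and both method names are hypothetical stand-ins, not the plugin's actual job code or setting names.

```ruby
# Hypothetical stand-in for a topic record (not the plugin's model).
Topic = Struct.new(:id, :bumped_at)

# Assumed ceiling, taken from the "50k" figure in the commit message.
MAX_BATCH = 50_000

# Clamp a configured batch size so one job run never exceeds the ceiling.
def effective_batch_size(configured)
  [configured, MAX_BATCH].min
end

# Choose the next topics to embed: most recently bumped first,
# limited to the clamped batch size.
def next_backfill_batch(topics, configured_limit)
  limit = effective_batch_size(configured_limit)
  topics.sort_by(&:bumped_at).reverse.first(limit)
end
```

With three topics bumped at different times and a configured limit of 2, `next_backfill_batch` would return the two most recently bumped topics, newest first. In a real database-backed job the same idea would more likely be expressed as an ordered, limited query rather than an in-memory sort.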
strategies FEATURE: Generate proper embeddings for posts/topics with embedded content (#401) 2024-01-05 10:27:45 -03:00
vector_representations FEATURE: add support for new OpenAI embedding models (#445) 2024-01-29 13:24:30 -03:00
entry_point.rb FEATURE: Setting to control per post embeddings (#439) 2024-01-23 22:09:27 -03:00
semantic_related.rb FIX: improve embedding generation (#452) 2024-01-31 10:38:47 -03:00
semantic_search.rb FIX: Add table name to remove ambiguous column reference in SQL (#449) 2024-01-30 15:50:26 -03:00
semantic_topic_query.rb DEV: port directory structure to Zeitwerk (#319) 2023-11-29 15:17:46 +11:00