discourse-ai

Author	SHA1	Message	Date
Roman Rizzi	534b0df391	REFACTOR: Separation of concerns for embedding generation. (#1027 ) In a previous refactor, we moved the responsibility of querying and storing embeddings into the `Schema` class. Now, it's time for embedding generation. The motivation behind these changes is to isolate vector characteristics in simple objects to later replace them with a DB-backed version, similar to what we did with LLM configs.	2024-12-16 09:55:39 -03:00
Roman Rizzi	79021252e9	REFACTOR: Tidy-up embedding endpoints config. (#937 ) Two changes worth mentioning: `#instance` returns a fully configured embedding endpoint ready to use. All endpoints respond to the same method and have the same signature - `perform!(text)` This makes it easier to reuse them when generating embeddings in bulk.	2024-11-25 13:12:43 -03:00
Rafael dos Santos Silva	1686a8a683	DEV: Move to single table per embeddings type (#561 ) Also move us to halfvecs for speed and disk usage gains	2024-08-08 11:55:20 -03:00
Rafael dos Santos Silva	eb93b21769	FEATURE: Add BGE-M3 embeddings support (#569 ) BAAI/bge-m3 is an interesting model, that is multilingual and with a context size of 8192. Even with a 16x larger context, it's only 4x slower to compute it's embeddings on the worst case scenario. Also includes a minor refactor of the rake task, including setting model and concurrency levels when running the backfill task.	2024-04-10 17:24:01 -03:00