discourse-ai/lib/embeddings/vector_representations/gemini.rb

# frozen_string_literal: true

module DiscourseAi
  module Embeddings
    module VectorRepresentations
      class Gemini < Base
        class << self
          def name
            "gemini"
          end

          def correctly_configured?
            SiteSetting.ai_gemini_api_key.present?
          end

          def dependant_setting_names
            %w[ai_gemini_api_key]
          end
        end

        def id
          5
        end

        def version
          1
        end

        def dimensions
          768
        end

        def max_sequence_length
          1536 # Gemini has a max sequence length of 2048, but the API has a limit of 10000 bytes, hence the lower value
        end

        def pg_function
          "<=>"
        end

        def pg_index_type
          "halfvec_cosine_ops"
        end

        def vector_from(text, asymetric: false)
          inference_client.perform!(text).dig(:embedding, :values)
        end

        # There is no public tokenizer for Gemini, and from the ones we already ship in the plugin
        # OpenAI gets the closest results. Gemini Tokenizer results in ~10% less tokens, so it's safe
        # to use OpenAI tokenizer since it will overestimate the number of tokens.
        def tokenizer
          DiscourseAi::Tokenizer::OpenAiTokenizer
        end

        def inference_client
          DiscourseAi::Inference::GeminiEmbeddings.instance
        end
      end
    end
  end
end
FEATURE: Support for Gemini Embeddings (#382) 2023-12-28 08:28:01 -05:00			`# frozen_string_literal: true`

			`module DiscourseAi`
			`module Embeddings`
			`module VectorRepresentations`
			`class Gemini < Base`
UX: Re-introduce embedding settings validations (#457) * Revert "Revert "UX: Validate embeddings settings (#455)" (#456)" This reverts commit 392e2e8aef7d5b0d988b3c3bc5cc19f1d83c4491. * Resstore previous default 2024-02-01 14:54:09 -05:00			`class << self`
			`def name`
			`"gemini"`
			`end`

			`def correctly_configured?`
			`SiteSetting.ai_gemini_api_key.present?`
			`end`

			`def dependant_setting_names`
			`%w[ai_gemini_api_key]`
			`end`
			`end`

FEATURE: Support for Gemini Embeddings (#382) 2023-12-28 08:28:01 -05:00			`def id`
			`5`
			`end`

			`def version`
			`1`
			`end`

			`def dimensions`
			`768`
			`end`

			`def max_sequence_length`
FIX: Lower truncation size for Gemini Embeddings (#493) 2024-02-27 16:52:53 -05:00			`1536 # Gemini has a max sequence length of 2048, but the API has a limit of 10000 bytes, hence the lower value`
FEATURE: Support for Gemini Embeddings (#382) 2023-12-28 08:28:01 -05:00			`end`

			`def pg_function`
			`"<=>"`
			`end`

			`def pg_index_type`
DEV: Move to single table per embeddings type (#561) Also move us to halfvecs for speed and disk usage gains 2024-08-08 10:55:20 -04:00			`"halfvec_cosine_ops"`
FEATURE: Support for Gemini Embeddings (#382) 2023-12-28 08:28:01 -05:00			`end`

FEATURE: AI Quick Semantic Search (#501) This PR adds AI semantic search to the search pop available on every page. It depends on several new and optional settings, like per post embeddings and a reranker model, so this is an experimental endeavour. --------- Co-authored-by: Rafael Silva <xfalcox@gmail.com> 2024-03-08 11:02:50 -05:00			`def vector_from(text, asymetric: false)`
REFACTOR: Tidy-up embedding endpoints config. (#937) Two changes worth mentioning: `#instance` returns a fully configured embedding endpoint ready to use. All endpoints respond to the same method and have the same signature - `perform!(text)` This makes it easier to reuse them when generating embeddings in bulk. 2024-11-25 11:12:43 -05:00			`inference_client.perform!(text).dig(:embedding, :values)`
FEATURE: Support for Gemini Embeddings (#382) 2023-12-28 08:28:01 -05:00			`end`

			`# There is no public tokenizer for Gemini, and from the ones we already ship in the plugin`
			`# OpenAI gets the closest results. Gemini Tokenizer results in ~10% less tokens, so it's safe`
			`# to use OpenAI tokenizer since it will overestimate the number of tokens.`
			`def tokenizer`
			`DiscourseAi::Tokenizer::OpenAiTokenizer`
			`end`
REFACTOR: Tidy-up embedding endpoints config. (#937) Two changes worth mentioning: `#instance` returns a fully configured embedding endpoint ready to use. All endpoints respond to the same method and have the same signature - `perform!(text)` This makes it easier to reuse them when generating embeddings in bulk. 2024-11-25 11:12:43 -05:00
			`def inference_client`
			`DiscourseAi::Inference::GeminiEmbeddings.instance`
			`end`
FEATURE: Support for Gemini Embeddings (#382) 2023-12-28 08:28:01 -05:00			`end`
			`end`
			`end`
			`end`