discourse-ai/lib/completions/dialects/gemini.rb

# frozen_string_literal: true
module DiscourseAi
  module Completions
    module Dialects
      class Gemini < Dialect
        class << self
          def can_translate?(model_name)
            %w[gemini-pro].include?(model_name)
          end

          def tokenizer
            DiscourseAi::Tokenizer::OpenAiTokenizer ## TODO Replace with GeminiTokenizer
          end
        end
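
        # Gemini's generateContent endpoint expects a `contents` array of
        # alternating user/model turns. A sketch of what #translate returns for
        # a system prompt followed by one user message (values are illustrative):
        #
        #   [
        #     { role: "user", parts: { text: "You are a helpful bot." } },
        #     { role: "model", parts: { text: "Ok." } },
        #     { role: "user", parts: { text: "Write a poem." } },
        #   ]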
        def translate
          # Gemini complains if we don't alternate model/user roles.
          noop_model_response = { role: "model", parts: { text: "Ok." } }

          messages = prompt.messages

          # Gemini doesn't use an assistant msg to improve long-context responses.
          messages.pop if messages.last[:type] == :model

          memo = []

          trim_messages(messages).each do |msg|
            if msg[:type] == :system
              # Gemini has no system role; send the system prompt as a user turn
              # followed by a canned model acknowledgement to keep roles alternating.
              memo << { role: "user", parts: { text: msg[:content] } }
              memo << noop_model_response.dup
            elsif msg[:type] == :model
              memo << { role: "model", parts: { text: msg[:content] } }
            elsif msg[:type] == :tool_call
              # A tool invocation the model made earlier, stored as JSON;
              # replay it as a functionCall part.
              call_details = JSON.parse(msg[:content], symbolize_names: true)

              memo << {
                role: "model",
                parts: {
                  functionCall: {
                    name: call_details[:name],
                    args: call_details[:arguments],
                  },
                },
              }
            elsif msg[:type] == :tool
              # The result of a tool invocation, sent back under the function role
              # so Gemini can match it to the earlier functionCall.
              memo << {
                role: "function",
                parts: {
                  functionResponse: {
                    name: msg[:id],
                    response: {
                      content: msg[:content],
                    },
                  },
                },
              }
            else
              # Gemini quirk. Doesn't accept tool -> user or user -> user msgs.
              previous_msg_role = memo.last&.dig(:role)
              if previous_msg_role == "user" || previous_msg_role == "function"
                memo << noop_model_response.dup
              end

              memo << { role: "user", parts: { text: msg[:content] } }
            end
          end

          memo
        end
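
        # Sketch of the payload #tools builds for Gemini function calling; the
        # weather tool below is a made-up example, not something the plugin ships:
        #
        #   [{ function_declarations: [{
        #        name: "get_weather",
        #        description: "Get the current weather for a city",
        #        parameters: {
        #          type: "object",
        #          required: ["location"],
        #          properties: { location: { type: "string" } },
        #        },
        #      }] }]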
        def tools
          return if prompt.tools.blank?

          translated_tools =
            prompt.tools.map do |t|
              tool = t.slice(:name, :description)

              if t[:parameters]
                tool[:parameters] = t[:parameters].reduce(
                  { type: "object", required: [], properties: {} },
                ) do |memo, p|
                  name = p[:name]
                  memo[:required] << name if p[:required]
                  memo[:properties][name] = p.except(:name, :required, :item_type)

                  # Array parameters carry their element type separately, as `items`.
                  memo[:properties][name][:items] = { type: p[:item_type] } if p[:item_type]
                  memo
                end
              end

              tool
            end

          [{ function_declarations: translated_tools }]
        end
        def max_prompt_tokens
          16_384 # 50% of model tokens
        end

        protected

        def calculate_message_token(context)
          self.class.tokenizer.size(context[:content].to_s + context[:name].to_s)
        end
      end
    end
  end
end
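
# Usage sketch. The constructor arguments here are an assumption based on the
# base Dialect class taking a prompt object plus a model name; not verified:
#
#   dialect = DiscourseAi::Completions::Dialects::Gemini.new(prompt, "gemini-pro")
#   payload = { contents: dialect.translate, tools: dialect.tools }.compact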