discourse-ai/lib/completions/dialects/open_ai_compatible.rb

# frozen_string_literal: true

module DiscourseAi
  module Completions
    module Dialects
      class OpenAiCompatible < Dialect
        class << self
          def can_translate?(_model_name)
            true
          end
        end

        def tokenizer
          llm_model&.tokenizer_class || DiscourseAi::Tokenizer::Llama3Tokenizer
        end

        def tools
          @tools ||= tools_dialect.translated_tools
        end

        def max_prompt_tokens
          return llm_model.max_prompt_tokens if llm_model&.max_prompt_tokens

          32_000
        end

        private

        def system_msg(msg)
          msg = { role: "system", content: msg[:content] }

          if tools_dialect.instructions.present?
            msg[:content] = msg[:content].dup << "\n\n#{tools_dialect.instructions}"
          end

          msg
        end

        def model_msg(msg)
          { role: "assistant", content: msg[:content] }
        end

        def tool_call_msg(msg)
          translated = tools_dialect.from_raw_tool_call(msg)
          { role: "assistant", content: translated }
        end

        def tool_msg(msg)
          translated = tools_dialect.from_raw_tool(msg)
          { role: "user", content: translated }
        end

        def user_msg(msg)
          content = +""
          content << "#{msg[:id]}: " if msg[:id]
          content << msg[:content]

          message = { role: "user", content: content }

          message[:content] = inline_images(message[:content], msg) if vision_support?

          message
        end

        def inline_images(content, message)
          encoded_uploads = prompt.encoded_uploads(message)
          return content if encoded_uploads.blank?

          content_w_imgs =
            encoded_uploads.reduce([]) do |memo, details|
              memo << {
                type: "image_url",
                image_url: {
                  url: "data:#{details[:mime_type]};base64,#{details[:base64]}",
                },
              }
            end

          content_w_imgs << { type: "text", text: message[:content] }
        end
      end
    end
  end
end
HACK: Llama3 support for summarization/AI helper. (#616) There are still some limitations to which models we can support with the `LlmModel` class. This will enable support for Llama3 while we sort those out. 2024-05-13 14:54:42 -04:00			`# frozen_string_literal: true`

			`module DiscourseAi`
			`module Completions`
			`module Dialects`
			`class OpenAiCompatible < Dialect`
			`class << self`
			`def can_translate?(_model_name)`
			`true`
			`end`
FEATURE: Set endpoint credentials directly from LlmModel. (#625) * FEATURE: Set endpoint credentials directly from LlmModel. Drop Llama2Tokenizer since we no longer use it. * Allow http for custom LLMs --------- Co-authored-by: Rafael Silva <xfalcox@gmail.com> 2024-05-16 08:50:22 -04:00			`end`
HACK: Llama3 support for summarization/AI helper. (#616) There are still some limitations to which models we can support with the `LlmModel` class. This will enable support for Llama3 while we sort those out. 2024-05-13 14:54:42 -04:00
FEATURE: Set endpoint credentials directly from LlmModel. (#625) * FEATURE: Set endpoint credentials directly from LlmModel. Drop Llama2Tokenizer since we no longer use it. * Allow http for custom LLMs --------- Co-authored-by: Rafael Silva <xfalcox@gmail.com> 2024-05-16 08:50:22 -04:00			`def tokenizer`
			`llm_model&.tokenizer_class \|\| DiscourseAi::Tokenizer::Llama3Tokenizer`
HACK: Llama3 support for summarization/AI helper. (#616) There are still some limitations to which models we can support with the `LlmModel` class. This will enable support for Llama3 while we sort those out. 2024-05-13 14:54:42 -04:00			`end`

			`def tools`
			`@tools \|\|= tools_dialect.translated_tools`
			`end`

			`def max_prompt_tokens`
FEATURE: Set endpoint credentials directly from LlmModel. (#625) * FEATURE: Set endpoint credentials directly from LlmModel. Drop Llama2Tokenizer since we no longer use it. * Allow http for custom LLMs --------- Co-authored-by: Rafael Silva <xfalcox@gmail.com> 2024-05-16 08:50:22 -04:00			`return llm_model.max_prompt_tokens if llm_model&.max_prompt_tokens`
HACK: Llama3 support for summarization/AI helper. (#616) There are still some limitations to which models we can support with the `LlmModel` class. This will enable support for Llama3 while we sort those out. 2024-05-13 14:54:42 -04:00
			`32_000`
			`end`

			`private`

			`def system_msg(msg)`
FIX: Add tool support to open ai compatible dialect and vllm (#734) * FIX: Add tool support to open ai compatible dialect and vllm Automatic tools are in progress in vllm see: https://github.com/vllm-project/vllm/pull/5649 Even when they are supported, initial support will be uneven, only some models have native tool support notably mistral which has some special tokens for tool support. After the above PR lands in vllm we will still need to swap to XML based tools on models without native tool support. * fix specs 2024-08-02 08:52:33 -04:00			`msg = { role: "system", content: msg[:content] }`

			`if tools_dialect.instructions.present?`
			`msg[:content] = msg[:content].dup << "\n\n#{tools_dialect.instructions}"`
			`end`

			`msg`
HACK: Llama3 support for summarization/AI helper. (#616) There are still some limitations to which models we can support with the `LlmModel` class. This will enable support for Llama3 while we sort those out. 2024-05-13 14:54:42 -04:00			`end`

			`def model_msg(msg)`
			`{ role: "assistant", content: msg[:content] }`
			`end`

			`def tool_call_msg(msg)`
FIX: Add tool support to open ai compatible dialect and vllm (#734) * FIX: Add tool support to open ai compatible dialect and vllm Automatic tools are in progress in vllm see: https://github.com/vllm-project/vllm/pull/5649 Even when they are supported, initial support will be uneven, only some models have native tool support notably mistral which has some special tokens for tool support. After the above PR lands in vllm we will still need to swap to XML based tools on models without native tool support. * fix specs 2024-08-02 08:52:33 -04:00			`translated = tools_dialect.from_raw_tool_call(msg)`
			`{ role: "assistant", content: translated }`
HACK: Llama3 support for summarization/AI helper. (#616) There are still some limitations to which models we can support with the `LlmModel` class. This will enable support for Llama3 while we sort those out. 2024-05-13 14:54:42 -04:00			`end`

			`def tool_msg(msg)`
FIX: Add tool support to open ai compatible dialect and vllm (#734) * FIX: Add tool support to open ai compatible dialect and vllm Automatic tools are in progress in vllm see: https://github.com/vllm-project/vllm/pull/5649 Even when they are supported, initial support will be uneven, only some models have native tool support notably mistral which has some special tokens for tool support. After the above PR lands in vllm we will still need to swap to XML based tools on models without native tool support. * fix specs 2024-08-02 08:52:33 -04:00			`translated = tools_dialect.from_raw_tool(msg)`
			`{ role: "user", content: translated }`
HACK: Llama3 support for summarization/AI helper. (#616) There are still some limitations to which models we can support with the `LlmModel` class. This will enable support for Llama3 while we sort those out. 2024-05-13 14:54:42 -04:00			`end`

			`def user_msg(msg)`
			`content = +""`
			`content << "#{msg[:id]}: " if msg[:id]`
			`content << msg[:content]`

FEATURE: Track if a model can do vision in the llm_models table (#725) * FEATURE: Track if a model can do vision in the llm_models table * Data migration 2024-07-24 15:29:47 -04:00			`message = { role: "user", content: content }`

			`message[:content] = inline_images(message[:content], msg) if vision_support?`

			`message`
			`end`

			`def inline_images(content, message)`
			`encoded_uploads = prompt.encoded_uploads(message)`
			`return content if encoded_uploads.blank?`

			`content_w_imgs =`
			`encoded_uploads.reduce([]) do \|memo, details\|`
			`memo << {`
			`type: "image_url",`
			`image_url: {`
			`url: "data:#{details[:mime_type]};base64,#{details[:base64]}",`
			`},`
			`}`
			`end`

			`content_w_imgs << { type: "text", text: message[:content] }`
HACK: Llama3 support for summarization/AI helper. (#616) There are still some limitations to which models we can support with the `LlmModel` class. This will enable support for Llama3 while we sort those out. 2024-05-13 14:54:42 -04:00			`end`
			`end`
			`end`
			`end`
			`end`