discourse-ai/app/models/rag_document_fragment.rb

# frozen_string_literal: true

class RagDocumentFragment < ActiveRecord::Base
  # TODO Jan 2025 - remove
  self.ignored_columns = %i[ai_persona_id]

  belongs_to :upload
  belongs_to :target, polymorphic: true

  class << self
    def link_target_and_uploads(target, upload_ids)
      return if target.blank?
      return if upload_ids.blank?
      return if !SiteSetting.ai_embeddings_enabled?

      UploadReference.ensure_exist!(upload_ids: upload_ids, target: target)

      upload_ids.each do |upload_id|
        Jobs.enqueue(
          :digest_rag_upload,
          target_id: target.id,
          target_type: target.class.to_s,
          upload_id: upload_id,
        )
      end
    end

    def update_target_uploads(target, upload_ids)
      return if target.blank?
      return if !SiteSetting.ai_embeddings_enabled?

      if upload_ids.blank?
        RagDocumentFragment.where(target: target).destroy_all
        UploadReference.where(target: target).destroy_all
      else
        RagDocumentFragment.where(target: target).where.not(upload_id: upload_ids).destroy_all
        link_target_and_uploads(target, upload_ids)
      end
    end

    def indexing_status(persona, uploads)
      truncation = DiscourseAi::Embeddings::Strategies::Truncation.new
      vector_rep =
        DiscourseAi::Embeddings::VectorRepresentations::Base.current_representation(truncation)

      embeddings_table = vector_rep.rag_fragments_table_name

      results =
        DB.query(
          <<~SQL,
        SELECT
          uploads.id,
          SUM(CASE WHEN (rdf.upload_id IS NOT NULL) THEN 1 ELSE 0 END) AS total,
          SUM(CASE WHEN (eft.rag_document_fragment_id IS NOT NULL) THEN 1 ELSE 0 END) as indexed,
          SUM(CASE WHEN (rdf.upload_id IS NOT NULL AND eft.rag_document_fragment_id IS NULL) THEN 1 ELSE 0 END) as left
        FROM uploads
        LEFT OUTER JOIN rag_document_fragments rdf ON uploads.id = rdf.upload_id AND rdf.target_id = :target_id
          AND rdf.target_type = :target_type
        LEFT OUTER JOIN #{embeddings_table} eft ON rdf.id = eft.rag_document_fragment_id
        WHERE uploads.id IN (:upload_ids)
        GROUP BY uploads.id
      SQL
          target_id: persona.id,
          target_type: persona.class.to_s,
          upload_ids: uploads.map(&:id),
        )

      results.reduce({}) do |acc, r|
        acc[r.id] = { total: r.total, indexed: r.indexed, left: r.left }
        acc
      end
    end

    def publish_status(upload, status)
      MessageBus.publish("/discourse-ai/rag/#{upload.id}", status, user_ids: [upload.user_id])
    end
  end
end

# == Schema Information
#
# Table name: rag_document_fragments
#
#  id              :bigint           not null, primary key
#  fragment        :text             not null
#  upload_id       :integer          not null
#  fragment_number :integer          not null
#  created_at      :datetime         not null
#  updated_at      :datetime         not null
#  metadata        :text
#  target_id       :bigint           not null
#  target_type     :string(800)      not null
#
# Indexes
#
#  index_rag_document_fragments_on_target_type_and_target_id  (target_type,target_id)
#
FEATURE: AI Bot RAG support. (#537) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations 2024-04-01 12:43:34 -04:00			`# frozen_string_literal: true`

			`class RagDocumentFragment < ActiveRecord::Base`
FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`# TODO Jan 2025 - remove`
			`self.ignored_columns = %i[ai_persona_id]`

FEATURE: AI Bot RAG support. (#537) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations 2024-04-01 12:43:34 -04:00			`belongs_to :upload`
FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`belongs_to :target, polymorphic: true`
FEATURE: AI Bot RAG support. (#537) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations 2024-04-01 12:43:34 -04:00
			`class << self`
FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`def link_target_and_uploads(target, upload_ids)`
			`return if target.blank?`
FEATURE: AI Bot RAG support. (#537) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations 2024-04-01 12:43:34 -04:00			`return if upload_ids.blank?`
			`return if !SiteSetting.ai_embeddings_enabled?`

FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`UploadReference.ensure_exist!(upload_ids: upload_ids, target: target)`
FEATURE: AI Bot RAG support. (#537) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations 2024-04-01 12:43:34 -04:00
			`upload_ids.each do \|upload_id\|`
FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`Jobs.enqueue(`
			`:digest_rag_upload,`
			`target_id: target.id,`
			`target_type: target.class.to_s,`
			`upload_id: upload_id,`
			`)`
FEATURE: AI Bot RAG support. (#537) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations 2024-04-01 12:43:34 -04:00			`end`
			`end`

FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`def update_target_uploads(target, upload_ids)`
			`return if target.blank?`
FEATURE: AI Bot RAG support. (#537) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations 2024-04-01 12:43:34 -04:00			`return if !SiteSetting.ai_embeddings_enabled?`

			`if upload_ids.blank?`
FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`RagDocumentFragment.where(target: target).destroy_all`
			`UploadReference.where(target: target).destroy_all`
FEATURE: AI Bot RAG support. (#537) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations 2024-04-01 12:43:34 -04:00			`else`
FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`RagDocumentFragment.where(target: target).where.not(upload_id: upload_ids).destroy_all`
			`link_target_and_uploads(target, upload_ids)`
FEATURE: AI Bot RAG support. (#537) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations 2024-04-01 12:43:34 -04:00			`end`
			`end`
UX: Display the indexing progress for RAG uploads (#557) 2024-04-09 10:03:07 -04:00
			`def indexing_status(persona, uploads)`
			`truncation = DiscourseAi::Embeddings::Strategies::Truncation.new`
			`vector_rep =`
			`DiscourseAi::Embeddings::VectorRepresentations::Base.current_representation(truncation)`

			`embeddings_table = vector_rep.rag_fragments_table_name`

FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`results =`
			`DB.query(`
			`<<~SQL,`
UX: Display the indexing progress for RAG uploads (#557) 2024-04-09 10:03:07 -04:00			`SELECT`
			`uploads.id,`
			`SUM(CASE WHEN (rdf.upload_id IS NOT NULL) THEN 1 ELSE 0 END) AS total,`
			`SUM(CASE WHEN (eft.rag_document_fragment_id IS NOT NULL) THEN 1 ELSE 0 END) as indexed,`
			`SUM(CASE WHEN (rdf.upload_id IS NOT NULL AND eft.rag_document_fragment_id IS NULL) THEN 1 ELSE 0 END) as left`
			`FROM uploads`
FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`LEFT OUTER JOIN rag_document_fragments rdf ON uploads.id = rdf.upload_id AND rdf.target_id = :target_id`
			`AND rdf.target_type = :target_type`
UX: Display the indexing progress for RAG uploads (#557) 2024-04-09 10:03:07 -04:00			`LEFT OUTER JOIN #{embeddings_table} eft ON rdf.id = eft.rag_document_fragment_id`
			`WHERE uploads.id IN (:upload_ids)`
			`GROUP BY uploads.id`
			`SQL`
FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`target_id: persona.id,`
			`target_type: persona.class.to_s,`
			`upload_ids: uploads.map(&:id),`
			`)`
UX: Display the indexing progress for RAG uploads (#557) 2024-04-09 10:03:07 -04:00
			`results.reduce({}) do \|acc, r\|`
			`acc[r.id] = { total: r.total, indexed: r.indexed, left: r.left }`
			`acc`
			`end`
			`end`

			`def publish_status(upload, status)`
FEATURE: RAG search within tools (#802) This allows custom tools access to uploads and sophisticated searches using embedding. It introduces: - A shared front end for listing and uploading files (shared with personas) - Backend implementation of index.search function within a custom tool. Custom tools now may search through uploaded files function invoke(params) { return index.search(params.query) } This means that RAG implementers now may preload tools with knowledge and have high fidelity over the search. The search function support specifying max results specifying a subset of files to search (from uploads) Also - Improved documentation for tools (when creating a tool a preamble explains all the functionality) - uploads were a bit finicky, fixed an edge case where the UI would not show them as updated 2024-09-30 03:27:50 -04:00			`MessageBus.publish("/discourse-ai/rag/#{upload.id}", status, user_ids: [upload.user_id])`
UX: Display the indexing progress for RAG uploads (#557) 2024-04-09 10:03:07 -04:00			`end`
FEATURE: AI Bot RAG support. (#537) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations 2024-04-01 12:43:34 -04:00			`end`
			`end`

			`# == Schema Information`
			`#`
			`# Table name: rag_document_fragments`
			`#`
			`# id :bigint not null, primary key`
			`# fragment :text not null`
			`# upload_id :integer not null`
			`# fragment_number :integer not null`
			`# created_at :datetime not null`
			`# updated_at :datetime not null`
FEATURE: Add metadata support for RAG (#553) * FEATURE: Add metadata support for RAG You may include non indexed metadata in the RAG document by using [[metadata ....]] This information is attached to all the text below and provided to the retriever. This allows for RAG to operate within a rich amount of contexts without getting lost Also: - re-implemented chunking algorithm so it streams - moved indexing to background low priority queue * Baran gem no longer required. * tokenizers is on 4.4 ... upgrade it ... 2024-04-04 10:02:16 -04:00			`# metadata :text`
DEV: Update plugin annotations (#871) 2024-10-28 10:07:09 -04:00			`# target_id :bigint not null`
			`# target_type :string(800) not null`
FEATURE: Make tool support polymorphic (#798) Polymorphic RAG means that we will be able to access RAG fragments both from AiPersona and AiCustomTool In turn this gives us support for richer RAG implementations. 2024-09-15 18:17:17 -04:00			`#`
			`# Indexes`
			`#`
			`# index_rag_document_fragments_on_target_type_and_target_id (target_type,target_id)`
FEATURE: AI Bot RAG support. (#537) This PR lets you associate uploads to an AI persona, which we'll split and generate embeddings from. When building the system prompt to get a bot reply, we'll do a similarity search followed by a re-ranking (if available). This will let us find the most relevant fragments from the body of knowledge you associated with the persona, resulting in better, more informed responses. For now, we'll only allow plain-text files, but this will change in the future. Commits: * FEATURE: RAG embeddings for the AI Bot This first commit introduces a UI where admins can upload text files, which we'll store, split into fragments, and generate embeddings of. In a next commit, we'll use those to give the bot additional information during conversations. * Basic asymmetric similarity search to provide guidance in system prompt * Fix tests and lint * Apply reranker to fragments * Uploads filter, css adjustments and file validations * Add placeholder for rag fragments * Update annotations 2024-04-01 12:43:34 -04:00			`#`