FIX: Add migration to reindex invalid indexes (#19858)

In Discourse, there are many migration files where we CREATE INDEX CONCURRENTLY which requires us to set disable_ddl_transaction!. Setting disable_ddl_transaction! in a migration file runs the SQL statements outside of a transaction. The implication of this is that there is no ROLLBACK should any of the SQL statements fail.

We have seen lock timeouts occuring when running CREATE INDEX CONCURRENTLY. When that happens, the index would still have been created but marked as invalid by Postgres.

Per the postgres documentation:

> If a problem arises while scanning the table, such as a deadlock or a uniqueness violation in a unique index, the CREATE INDEX command will fail but leave behind an “invalid” index. This index will be ignored for querying purposes because it might be incomplete; however it will still consume update overhead.
> The recommended recovery method in such cases is to drop the index and try again to perform CREATE INDEX CONCURRENTLY . (Another possibility is to rebuild the index with REINDEX INDEX CONCURRENTLY ).

When such scenarios happen, we are supposed to either drop and create the index again or run a REINDEX operation. However, I noticed today that we have not been doing so in Discourse. Instead, we’ve been incorrectly working around the problem by checking for the index existence before creating the index in order to make the migration idempotent. What this potentially mean is that we might have invalid indexes which are lying around in the database which PG will ignore for querying purposes.

This commits adds a migration which queries for all the
invalid indexes in the `public` namespace and reindexes them.
This commit is contained in:
Alan Guo Xiang Tan 2023-01-13 11:04:26 +08:00 committed by GitHub
parent 690e2f15ab
commit 8ee71d439b
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
1 changed files with 23 additions and 0 deletions

View File

@ -0,0 +1,23 @@
# frozen_string_literal: true
class ReindexInvalidIndexes < ActiveRecord::Migration[7.0]
disable_ddl_transaction!
def up
invalid_index_names = DB.query_single(<<~SQL)
SELECT
pg_class.relname
FROM pg_class, pg_index, pg_namespace
WHERE pg_index.indisvalid = false
AND pg_index.indexrelid = pg_class.oid
AND pg_namespace.nspname = 'public'
AND relnamespace = pg_namespace.oid;
SQL
invalid_index_names.each { |index_name| execute "REINDEX INDEX CONCURRENTLY #{index_name}" }
end
def down
raise ActiveRecord::IrreversibleMigration
end
end