DEV: make sure we don't load all data into memory when exporting chat messages (#22276)
This commit makes sure we don't load all data into memory when doing CSV exports.
The most important change here made to the recently introduced export of chat
messages (3ea31f4). We were loading all data into memory in the first version, with
this commit it's not the case anymore.
Speaking of old exports. Some of them already use find_each, and it worked as
expected, without loading all data into memory. And it will proceed working as
expected after this commit.
In general, I made sure this change didn't break other CSV exports, first manually, and
then by writing system specs for them. Sadly, I haven't managed yet to make those
specs stable, they work fine locally, but flaky in GitHub actions, so I've disabled them
for now.
I'll be making more changes to the CSV exports code soon, those system specs will be
very helpful. I'll be running them locally, and I hope I'll manage to make them stable
while doing that work.
2023-07-12 10:52:18 -04:00
|
|
|
# frozen_string_literal: true
|
|
|
|
|
|
|
|
RSpec.describe "Chat CSV exports", type: :system do
|
|
|
|
fab!(:admin) { Fabricate(:admin) }
|
|
|
|
let(:csv_export_pm_page) { PageObjects::Pages::CSVExportPM.new }
|
|
|
|
|
|
|
|
before do
|
|
|
|
Jobs.run_immediately!
|
|
|
|
sign_in(admin)
|
|
|
|
chat_system_bootstrap
|
|
|
|
end
|
|
|
|
|
|
|
|
xit "exports chat messages" do
|
2023-07-27 13:56:32 -04:00
|
|
|
message_1 = Fabricate(:chat_message, created_at: 12.months.ago)
|
|
|
|
message_2 = Fabricate(:chat_message, created_at: 6.months.ago)
|
|
|
|
message_3 = Fabricate(:chat_message, created_at: 1.months.ago)
|
|
|
|
message_4 = Fabricate(:chat_message, created_at: Time.now)
|
DEV: make sure we don't load all data into memory when exporting chat messages (#22276)
This commit makes sure we don't load all data into memory when doing CSV exports.
The most important change here made to the recently introduced export of chat
messages (3ea31f4). We were loading all data into memory in the first version, with
this commit it's not the case anymore.
Speaking of old exports. Some of them already use find_each, and it worked as
expected, without loading all data into memory. And it will proceed working as
expected after this commit.
In general, I made sure this change didn't break other CSV exports, first manually, and
then by writing system specs for them. Sadly, I haven't managed yet to make those
specs stable, they work fine locally, but flaky in GitHub actions, so I've disabled them
for now.
I'll be making more changes to the CSV exports code soon, those system specs will be
very helpful. I'll be running them locally, and I hope I'll manage to make them stable
while doing that work.
2023-07-12 10:52:18 -04:00
|
|
|
|
|
|
|
visit "/admin/plugins/chat"
|
|
|
|
click_button "Create export"
|
|
|
|
|
|
|
|
visit "/u/#{admin.username}/messages"
|
|
|
|
click_link "[Chat Message] Data export complete"
|
|
|
|
expect(csv_export_pm_page).to have_download_link
|
|
|
|
exported_data = csv_export_pm_page.download_and_extract
|
|
|
|
|
|
|
|
expect(exported_data.first).to eq(
|
|
|
|
%w[
|
|
|
|
id
|
|
|
|
chat_channel_id
|
|
|
|
chat_channel_name
|
|
|
|
user_id
|
|
|
|
username
|
|
|
|
message
|
|
|
|
cooked
|
|
|
|
created_at
|
|
|
|
updated_at
|
|
|
|
deleted_at
|
|
|
|
in_reply_to_id
|
|
|
|
last_editor_id
|
|
|
|
last_editor_username
|
|
|
|
],
|
|
|
|
)
|
|
|
|
|
2023-07-27 13:56:32 -04:00
|
|
|
assert_message(exported_data.second, message_1)
|
|
|
|
assert_message(exported_data.third, message_2)
|
|
|
|
assert_message(exported_data.fourth, message_3)
|
|
|
|
assert_message(exported_data.fifth, message_4)
|
|
|
|
ensure
|
|
|
|
csv_export_pm_page.clear_downloads
|
|
|
|
end
|
|
|
|
|
|
|
|
def assert_message(exported_message, message)
|
DEV: make sure we don't load all data into memory when exporting chat messages (#22276)
This commit makes sure we don't load all data into memory when doing CSV exports.
The most important change here made to the recently introduced export of chat
messages (3ea31f4). We were loading all data into memory in the first version, with
this commit it's not the case anymore.
Speaking of old exports. Some of them already use find_each, and it worked as
expected, without loading all data into memory. And it will proceed working as
expected after this commit.
In general, I made sure this change didn't break other CSV exports, first manually, and
then by writing system specs for them. Sadly, I haven't managed yet to make those
specs stable, they work fine locally, but flaky in GitHub actions, so I've disabled them
for now.
I'll be making more changes to the CSV exports code soon, those system specs will be
very helpful. I'll be running them locally, and I hope I'll manage to make them stable
while doing that work.
2023-07-12 10:52:18 -04:00
|
|
|
time_format = "%Y-%m-%d %H:%M:%S UTC"
|
2023-07-27 13:56:32 -04:00
|
|
|
expect(exported_message).to eq(
|
DEV: make sure we don't load all data into memory when exporting chat messages (#22276)
This commit makes sure we don't load all data into memory when doing CSV exports.
The most important change here made to the recently introduced export of chat
messages (3ea31f4). We were loading all data into memory in the first version, with
this commit it's not the case anymore.
Speaking of old exports. Some of them already use find_each, and it worked as
expected, without loading all data into memory. And it will proceed working as
expected after this commit.
In general, I made sure this change didn't break other CSV exports, first manually, and
then by writing system specs for them. Sadly, I haven't managed yet to make those
specs stable, they work fine locally, but flaky in GitHub actions, so I've disabled them
for now.
I'll be making more changes to the CSV exports code soon, those system specs will be
very helpful. I'll be running them locally, and I hope I'll manage to make them stable
while doing that work.
2023-07-12 10:52:18 -04:00
|
|
|
[
|
|
|
|
message.id.to_s,
|
|
|
|
message.chat_channel.id.to_s,
|
|
|
|
message.chat_channel.name,
|
|
|
|
message.user.id.to_s,
|
|
|
|
message.user.username,
|
|
|
|
message.message,
|
|
|
|
message.cooked,
|
|
|
|
message.created_at.strftime(time_format),
|
|
|
|
message.updated_at.strftime(time_format),
|
|
|
|
nil,
|
|
|
|
nil,
|
|
|
|
message.last_editor.id.to_s,
|
|
|
|
message.last_editor.username,
|
|
|
|
],
|
|
|
|
)
|
|
|
|
end
|
|
|
|
end
|