discourse

Commit Graph

Author	SHA1	Message	Date
GeckoLinux	d1e844841d	Fix occasional bug in order of imported comments (#20204 ) This bug is actually a Drupal issue where some edited posts have their `created` and `changed` timestamps set to the same value. But even when that happens in Drupal it still maintains the correct post order in an affected thread. This PR makes the Discourse importer also maintain the original Drupal comment order by sorting comments in the source DB by their `cid`, which is sequential and never changes. More details from this post onward: https://meta.discourse.org/t/large-drupal-forum-migration-importer-errors-and-limitations/246939/24?u=rahim123	2023-02-08 22:20:46 -05:00
Gerhard Schlager	c3e978ada9	DEV: Add import script for Yammer (#20074 ) Co-authored-by: Jay Pfaffman <jay@literatecomputing.com>	2023-01-31 10:12:01 +01:00
Michael Fitz-Payne	df4a9f96ae	DEV(cache_critical_dns): add additional service runtime variable We'd like to lean on the DNS caching service for more than the standard DB and Redis hosts, but without having to add additional code each time. Define a new environment variable DISCOURSE_DNS_CACHE_ADDITIONAL_SERVICE_NAMES (admittedly a mouthful) which is a list of service names to be added to the static list at process execution time. For example, plugin foo may reference two services that you want to cache the address of. By specifying the following two variables in the process environment, cache_critical_dns will perform the lookup alongside the DB and Redis host variables. ``` DISCOURSE_DNS_CACHE_ADDITIONAL_SERVICE_NAMES='FOO_SERVICE1,FOO_SERVICE2' FOO_SERVICE1='foo.service1.example.com' FOO_SERVICE1_SRV='foo._tcp.example.com' FOO_SERVICE2='foo.service2.example.com' ``` The behaviour when it comes to SRV record lookup is the same as previously implemented for the `DISCOURSE_DB_..` and `DISCOURSE_REDIS_..` variables. For the purposes of the health checks, services defined in the list _are always considered healthy_. This is a compromise for conveniences sake. Defining a dynamic method for health checks at runtime is not practical. See t/88457/32.	2023-01-20 10:03:08 +10:00
David Taylor	436b3b392b	DEV: Apply syntax_tree formatting to `script/*`	2023-01-09 11:13:22 +00:00
David Taylor	d5491b13f5	DEV: Fix syntax/formatting in xenforo import script (#19761 ) Followup to `7dfe85fc`	2023-01-05 12:47:05 +00:00
Alan Guo Xiang Tan	0da79561c3	DEV: Improve/Fix script/bench.rb (#19646 ) 1. Fix bug where we were not waiting for all unicorn workers to start up before running benchmarks. 2. Fix a bug where headers were not used when benchmarking. Admin benchmarks were basically running as anon user. 3. Disable rate limits when in profile env. We're pretty much going to hit the rate limit every time as a normal user. 4. Benchmark against topic with a fixed posts count of 100. Previously profiling script was just randomly creating posts and we would benchmark against a topic with a fixed posts count of 30. Sometimes, the script fails because no topics with a posts count of 30 exists. 5. Benchmarks are not run against a normal user on top of anon and admin. 6. Add script option to select tests that should be run.	2022-12-30 07:25:11 +08:00
Bianca Nenciu	c358151a6c	DEV: Promote historic post_deploy migrations (#19492 ) This commit promotes all post_deploy migrations which existed in Discourse v2.8.0 (timestamp <= 20220107014925). This commit includes a fix to the promote_migrations script to promote all migrations of the first version of the previous stable version. For example, if the current stable version is v2.8.13, the version used as a cutoff for promoting migrations is v2.8.0.	2022-12-16 13:36:30 +02:00
GeckoLinux	cc5b4cd49a	FIX: change drupal permalink creation to use /node/ Drupal URL scheme for nodes begins with `/node/` , not `/topic/` .	2022-12-02 16:03:00 +11:00
Alan Guo Xiang Tan	7c321d3aad	PERF: Update `Group#user_count` counter cache outside DB transaction (#19256 ) While load testing our user creation code path in production, we identified that executing the DB statement to update the `Group#user_count` column within a transaction is creating a bottleneck for us. This is because the creation of a user and addition of the user to the relevant groups are done in a transaction. When we execute the DB statement to update `Group#user_count` for the relevant group, a row level lock is held until the transaction completes. This row level lock acts like a global lock when the server is creating users that will be added to the same group in quick succession. Instead of updating the counter cache within a transaction which the default ActiveRecord `counter_cache` option does, we simply update the counter cache outside of the committing transaction. Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com> Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com>	2022-11-30 11:52:08 -03:00
Leonardo Mosquera	bfecbde837	Fixes for vBulletin bulk importer (#17618 ) * Allow taking table prefix from env var * FIX: remove unused column references The columns `filedata` and `extension` are not present in a v4.2.4 database, and they aren't used in the method anyways. * FIX: report progress for tables without imported_id * FIX: effectively check for AR validation errors NOTE: other migration scripts also have this problem; see /t/58202 * FIX: properly count Posts when importing attachments * FIX: improve logging * Remove leftover comment * FIX: show progress when exporting Permalink file * PERF: stream Permalink file The current way results in tons of memory usage; write once per line instead * Document fixes needed * WIP - deduplicate category names * Ignore non alphanumeric chars for grouping * FIX: properly deduplicate user emails by merging accounts * FIX: don't merge empty UserEmails * Improve logging * Merge users AFTER fixing primary key sequences * Parallelize user merging * Save duplicated users structure for debugging purposes * Add progress logging for the (multiple hour) user merging step	2022-11-28 16:30:19 -03:00
communiteq	7dfe85fcc7	DEV: Xenforo importer improvements (#18457 ) * Fix: make expressions non-greedy * Feature: import Xenforo avatars * Feature: import Xenforo likes * Feature: import Xenforo private messages * Feature: Xenforo create permalinks * Feature: Xenforo migrate view counts * Fix: Xenforo list regexes * Fix: Xenforo import all attachments	2022-11-28 16:42:39 +01:00
Pierre Ozoux	9e9235ca62	FEATURE: Add import script for Elgg (#19140 )	2022-11-28 16:28:08 +01:00
David Taylor	84bec1cbae	DEV: Cleanup legacy asset compilation gems and code (#19177 ) We now use Ember CLI (core/plugins) and DiscourseJSProcessor (themes) for all Ember and template compilation. This commit removes the remnants of the legacy Sprockets-based Ember compilation system. Sprockets, and its DiscourseJSProcess-based Babel transformations, is still in use for a few assets. Ideally that will be removed/replaced in the near future.	2022-11-24 12:13:59 +00:00
Jarek Radosz	bc22fe4fdf	DEV: Convert the downsizing script to a rake task (#18976 ) …to make it testable!	2022-11-11 13:00:44 +01:00
Michael Fitz-Payne	5fdbbe3045	DEV(cache_critical_dns): add caching for MessageBus Redis hostname We are already caching any DB_HOST and REDIS_HOST (and their accompanying replicas), we should also cache the resolved addresses for the MessageBus specific Redis. This is a noop if no MB redis is defined in config. A side effect is that the MB will also support SRV lookup and priorities, following the same convention as the other cached services. The port argument was added to redis_healthcheck so that the script supports a setup where Redis is running on a non-default port. Did some minor refactoring to improve readability when filtering out the CRITICAL_HOST_ENV_VARS. The `select` block was a bit confusing, so the sequence was made easier to follow. We were coercing an environment variable to an int in a few places, so the `env_as_int` method was introduced to do that coercion in one place and for convenience purposes default to a value if provided. See /t/68301/30.	2022-10-12 10:11:22 +10:00
Constanza	067c4deb4c	Fix comment to include phpbb 3.3, which is now supported (#18006 )	2022-08-19 16:42:32 -04:00
Constanza	ef842a4b29	FEATURE: Adding a simple CSV importer (#17993 )	2022-08-19 13:09:30 -04:00
Constanza	8836c8bcdf	FIX: the phpbbb import script was not parsing youtube tags (#17787 )	2022-08-05 15:20:32 -04:00
communiteq	603f36ca4a	DEV: Support phpBB 3.3 imports (#17641 ) * handle polls with duplicate items * handle polls with incorrect poll_option_total values * handle group IDs in personal messages * support for version 3.3	2022-07-25 22:07:03 +02:00
Jay Pfaffman	7ab5dcf82f	FEATURE: my_bb import supports avatars (#17617 )	2022-07-25 15:22:25 +02:00
Constanza	b9ac8e5748	Adding 3.2 to the versions of phpbb supported by the migration script (#17483 )	2022-07-14 18:06:47 +05:30
Michael Fitz-Payne	1867202a4d	DEV(cache_critical_dns): add option to run once and exit There are situations where a container running Discourse may want to cache the critical DNS services without running the cache_critical_dns service, for example running migrations prior to running a full bore application container. Add a `--once` argument for the cache_critical_dns script that will only execute the main loop once, and return the status code for the script to use when exiting. 0 indicates no errors occured during SRV resolution, and 1 indicates a failure during the SRV lookup. Nothing is reported to prometheus in run_once mode. Generally this mode of operation would be a part of a unix pipeline, in which the exit status is a more meaningful and immediate signal than a prometheus metric. The reporting has been moved into it's own method that can be called only when the script is running as a service. See /t/69597.	2022-07-06 14:53:02 +10:00
Michael Fitz-Payne	aabbc9e63e	DOC(cache_critical_dns): add program description Describes the behaviour and configuration of the cache_critical_dns script, mainly cribbed from commit messages. Tries to make this program a bit less of an enigma.	2022-05-26 14:26:57 +10:00
Michael Fitz-Payne	0553788d3b	DEV(cache_critical_dns): improve postgres_healthcheck The `PG::Connection#ping` method is only reliable for checking if the given host is accepting connections, and not if the authentication details are valid. This extends the healthcheck to confirm that the auth details are able to both create a connection and execute queries against the database. We expect the empty query to return an empty result set, so we can assert on that. If a failure occurs for any reason, the healthcheck will return false.	2022-05-24 08:20:10 +10:00
Martin Brennan	fcc2e7ebbf	FEATURE: Promote polymorphic bookmarks to default and migrate (#16729 ) This commit migrates all bookmarks to be polymorphic (using the bookmarkable_id and bookmarkable_type) columns. It also deletes all the old code guarded behind the use_polymorphic_bookmarks setting and changes that setting to true for all sites and by default for the sake of plugins. No data is deleted in the migrations, the old post_id and for_topic columns for bookmarks will be dropped later on.	2022-05-23 10:07:15 +10:00
Gabe Pacuilla	4284ba9c27	FIX(cache_critical_dns): use correct DISCOURSE_DB_USERNAME envvar (#16862 )	2022-05-18 13:01:18 -04:00
Gabe Pacuilla	9f246e6969	FIX(cache_critical_dns): use discourse database name and user by default (#16856 )	2022-05-17 16:09:32 -04:00
Michael Fitz-Payne	35d5c29e10	DEV(cache_critical_dns): add SRV priority tunables An SRV RR contains a priority value for each of the SRV targets that are present, ranging from 0 - 65535. When caching SRV records we may want to filter out any targets above or below a particular threshold. This change adds support for specifying a lower and/or upper bound on target priorities for any SRV RRs. Any targets returned when resolving the SRV RR whose priority does not fall between the lower and upper thresholds are ignored. For example: Let's say we are running two Redis servers, a primary and cold server as a backup (but not a replica). Both servers would pass health checks, but clearly the primary should be preferred over the backup server. In this case, we could configure our SRV RR with the primary target as priority 1 and backup target as priority 10. The `DISCOURSE_REDIS_HOST_SRV_LE` could then be set to 1 and the target with priority 10 would be ignored. See /t/66045.	2022-05-12 08:08:56 +10:00
Loïc Guitaut	ab6ca78486	FIX: Use proper ActiveRecord method in import scripts `ActiveRecord::Base.connection_config` has been deprecated since Rails 6.1 and was completely removed from Rails 7. Instead we need to use `ActiveRecord::Base.connection_db_config.configuration_hash`. Import scripts were forgotten when we did the Rails 7 upgrade, this patch fixes them.	2022-05-09 11:09:27 +02:00
Martin Brennan	222c8d9b6a	FEATURE: Polymorphic bookmarks pt. 3 (reminders, imports, exports, refactors) (#16591 ) A bit of a mixed bag, this addresses several edge areas of bookmarks and makes them compatible with polymorphic bookmarks (hidden behind the `use_polymorphic_bookmarks` site setting). The main ones are: * ExportUserArchive compatibility * SyncTopicUserBookmarked job compatibility * Sending different notifications for the bookmark reminders based on the bookmarkable type * Import scripts compatibility * BookmarkReminderNotificationHandler compatibility This PR also refactors the `register_bookmarkable` API so it accepts a class descended from a `BaseBookmarkable` class instead. This was done because we kept having to add more and more lambdas/properties inline and it was very messy, so a factory pattern is cleaner. The classes can be tested independently as well. Some later PRs will address some other areas like the discourse narrative bot, advanced search, reports, and the .ics endpoint for bookmarks.	2022-05-09 09:37:23 +10:00
Leonardo Mosquera	3e5faffb0d	DEV: mbox importer improvements (#16557 ) * FIX: support specifying parent_category_id in mbox import metadata * FIX: elide tabs from topic titles * FIX: optionally fix Mailman from: addresses * DEV: optionally elide anything up to the last = in email addresses * Fix Mailmain broken from: detection	2022-04-29 13:24:29 -03:00
Michael Fitz-Payne	1acc4751ff	FIX: remove refresh seconds override on cache_critical_dns (#16572 ) This removes the option to override the sleep time between caching of DNS records. The override was invalid because `''.to_i` is 0 in Ruby, causing a tight loop calling the `run` method.	2022-04-27 12:42:35 +08:00
Michael Fitz-Payne	0784c28702	FIX: cache_critical_dns - add TLS support for Redis healthcheck For Redis connections that operate over TLS, we need to ensure that we are setting the correct arguments for the Redis client. We can utilise the existing environment variable `DISCOURSE_REDIS_USE_SSL` to toggle this behaviour. No SSL verification is performed for two reasons: - the Discourse application will perform a verification against any FQDN as specified for the Redis host - the healthcheck is run against the _resolved_ IP address for the Redis hostname, and any SSL verification will always fail against a direct IP address If no SSL arguments are provided, the IP address is never cached against the hostname as no healthy address is ever found in the HealthyCache.	2022-04-27 12:27:58 +10:00
Michael Fitz-Payne	c4ea439cc3	DEV: refactor cache_critical_dns for SRV RR awareness Modify the cache_critical_dns script for SRV RR awareness. The new behaviour is only enabled when one or more of the following environment variables are present (and only for a host where the `DISCOURSE__HOST_SRV` variable is present): - `DISCOURSE_DB_HOST_SRV` - `DISCOURSE_DB_REPLICA_HOST_SRV` - `DISCOURSE_REDIS_HOST_SRV` - `DISCOURSE_REDIS_REPLICA_HOST_SRV` Some minor changes in refactor to original script behaviour: - add Name and SRVName classes for storing resolved addresses for a hostname - pass DNS client into main run loop instead of creating inside the loop - ensure all times are UTC - add environment override for system hosts file path and time between DNS checks mainly for testing purposes The environment variable for `BUNDLE_GEMFILE` is set to enables Ruby to load gems that are installed and vendored via the project's Gemfile. This script is usually not run from the project directory as it is configured as a system service (see `71ba9fb7b5/templates/cache-dns.template.yml (L19)`) and therefore cannot load gems like `pg` or `redis` from the default load paths. Setting this environment variable configures bundler to look in the correct project directory during it's setup phase. When a `DISCOURSE__HOST_SRV` environment variable is present, the decision for which target to cache is as follows: - resolve the SRV targets for the provided hostname - lookup the addresses for all of the resolved SRV targets via the A and AAAA RRs for the target's hostname - perform a protocol-aware healthcheck (PostgreSQL or Redis pings) - pick the newest target that passes the healthcheck From there, the resolved address for the SRV target is cached against the hostname as specified by the original form of the environment variable. For example: The hostname specified by the `DISCOURSE_DB_HOST` record is `database.example.com`, and the `DISCOURSE_DB_HOST_SRV` record is `database._postgresql._tcp.sd.example.com`. An SRV RR lookup will return zero or more targets. Each of the targets will be queried for A and AAAA RRs. For each of the addresses returned, the newest address that passes a protocol-aware healthcheck will be cached. This address is cached so that if any newer address for the SRV target appears we can perform a health check and prefer the newer address if the check passes. All resolved SRV targets are cached for a minimum of 30 minutes in memory so that we can prefer newer hosts over older hosts when more than one target is returned. Any host in the cache that hasn't been seen for more than 30 minutes is purged. See /t/61485.	2022-04-27 10:14:33 +10:00
David Taylor	d81359246a	DEV: Be more lenient in CLI confirmation (#16290 ) If someone types `yes` rather than `YES`, continue anyway. The chance of typing `yes`, when you actually want to stop, is non-existent. The chance of typing `yes` when you meant `YES` is high, and it's very frustrating when the script quite because you got the case wrong!	2022-03-25 20:14:41 +00:00
David Taylor	f3aab19829	DEV: Promote historic post_deploy migrations (#16288 ) This commit promotes all post_deploy migrations which existed in Discourse v2.7.13 (timestamp <= 20210328233843) This reduces the likelihood of issues relating to migration run order Also fixes a couple of typos in `script/promote_migrations`	2022-03-25 15:48:20 +00:00
Jarek Radosz	2fc70c5572	DEV: Correctly tag heredocs (#16061 ) This allows text editors to use correct syntax coloring for the heredoc sections. Heredoc tag names we use: languages: SQL, JS, RUBY, LUA, HTML, CSS, SCSS, SH, HBS, XML, YAML/YML, MF, ICS other: MD, TEXT/TXT, RAW, EMAIL	2022-02-28 20:50:55 +01:00
Jarek Radosz	6f6406ea03	DEV: Fix random typos (#16066 )	2022-02-28 10:20:58 +08:00
David Taylor	5374e587a3	DEV: Add message-bus analysis script (#15979 ) This will count how many messages are published per-channel and produce a table of channels ordered by 'most messages'	2022-02-18 20:21:17 +00:00
Michael Brown	3bf3b9a4a5	DEV: pull email address validation out to a new EmailAddressValidator We validate the format of email addresses in many places with a match against a regex, often with very slightly different syntax. Adding a separate EmailAddressValidator simplifies the code in a few spots and feels cleaner. Deprecated the old location in case someone is using it in a plugin. No functionality change is in this commit. Note: the regex used at the moment does not support using address literals, e.g.: * localpart@[192.168.0.1] * localpart@[2001:db8::1]	2022-02-17 21:49:22 -05:00
Gerhard Schlager	6394d7cddf	DEV: Improve phpBB3 import script (#15956 ) * Optional import of custom user fields from phpBB 3.1+ * Optional import of likes from phpBB3 Requires the phpBB "Thanks for posts" extension * Fix import of bookmarks from phpBB3 * Update `created_at` of existing user * Support mapping of phpBB forums to existing Discourse categories This is in addition to the ability of merging phpBB forums and importing into newly created Discourse categories.	2022-02-16 13:04:31 +01:00
Gerhard Schlager	33d6ed60a4	DEV: Don't import year of birth (#15937 ) The cakeday plugin doesn't use the year.	2022-02-14 18:10:35 +01:00
Gerhard Schlager	6a41ec179c	FIX: Default settings for phpBB3 import were broken (#15913 )	2022-02-11 18:18:54 +01:00
David Taylor	9e43f0303d	DEV: Include DISCOURSE_REDIS_REPLICA_HOST in cache_critical_dns (#15877 ) This is the replacement for DISCOURSE_REDIS_SLAVE_HOST	2022-02-09 14:41:26 +00:00
Canapin	ea2fd75d10	DEV: Fix some regexes in phpBB3 import script (#15829 ) 1. bbcode hashes don't always have exactly 8 characters. 2. colors aren't always hex values, it can be a color string ("red", "blue", etc). 3. The closing tag of smileys doesn't always include a `:` character (the start of the regex was already right for this particular issue)	2022-02-07 16:16:46 +01:00
David Taylor	ed2f700440	DEV: Wait for initdb to complete in docker.rake (#15614 ) On slower hardware it can take a while to init the database. If we don't wait, the `rake db:create` step will fail.	2022-01-17 17:45:39 +00:00
Peter Zhu	c5fd8c42db	DEV: Fix methods removed in Ruby 3.2 (#15459 ) * File.exists? is deprecated and removed in Ruby 3.2 in favor of File.exist? * Dir.exists? is deprecated and removed in Ruby 3.2 in favor of Dir.exist?	2022-01-05 18:45:08 +01:00
David Taylor	0e87f882a7	DEV: Use discourse image for postgres in GitHub Actions (#15291 ) The discourse base image already contains a postgres installation, so pulling a separate postgres image is a little wasteful. Using the copy of Postgres in the discourse image saves about 20 seconds on every GitHub actions run. This commit sets up Postgres with a few performance-improving flags, which we were already using for the `rake docker:test` task (used on our internal CI system).	2021-12-14 17:20:06 +00:00
Jarek Radosz	cfabdb72bc	FIX: Ambiguous column in `downsize_uploads` (#14972 )	2021-11-16 16:23:32 +01:00
Leonardo Mosquera	48a08cc397	FIX: Vanilla importer fixes (#14699 ) Import script was out of date	2021-10-27 14:22:37 +02:00

1 2 3 4 5 ...

1055 Commits