discourse

Commit Graph

Author	SHA1	Message	Date
Blake Erickson	4b7b586861	DEV: Remove unused rswag script We no longer use this script for helping generate rswag formatted docs because we can use json schema now.	2021-05-07 09:36:55 +08:00
Justin DiRose	c1517e428e	DEV: Add vBulletin5 bulk importer (#12904 ) This is a pretty straightforward bulk importer, just tailored to the vBulletin 5 database structure. Also made a few minor improvements to the base importer -- should be self explanatory in the code.	2021-04-30 11:03:33 -05:00
David Taylor	8b87cb07c3	DEV: Return a non-zero exit code when `discourse remap` is aborted (#12873 ) This makes it much easier to integrate the script with external tools	2021-04-28 14:10:35 +01:00
Pilou	e7892df10d	FIX: handle charset=windows-1252 in mbox import script (#12832 ) Co-authored-by: Loïc Dachary <loic@dachary.org>	2021-04-27 15:43:31 +02:00
Michael Maroszek	5bec0e5763	fix vbulletin importer to import unreferenced attachments (#12187 )	2021-04-19 21:05:16 +02:00
Justin DiRose	e302e32a3f	DEV: Add Higher Logic import script (#12623 ) Wrote up a new script to import from Higher Logic. Nothing too crazy going on here. Two major things about this script: It requires you to convert a Microsoft SQL file to a format MySQL can read. Higher Logic stores posts (at least in the case of the import I ran) with the email thread shown in the post body. The script does its best to truncate this out, but the logic may need to be improved on future imports. For the import I ran, it worked just fine as is. 🤷‍♂️	2021-04-06 16:53:55 -05:00
Justin DiRose	1aaf588fb7	DEV: Improvements to vanilla_mysql importer (#12308 ) Made some improvements to the Vanilla MySQL script -- mainly because not all SQL imports require use of the VanillaBodyParser. Still left it as an option to turn on and use if so desired. Also added subcategory support, importing of likes, and solve status.	2021-03-11 10:21:56 -06:00
Jarek Radosz	a60e26e799	DEV: Clean up and refactor CI workflow(s) (#12144 ) Includes: * DEV: Remove external plugin linting (that's covered by CI in their repositories) * DEV: Move lint stages to a separate workflow (partial de-`if`-ication of workflows) * DEV: Run CI on `main` branch too * DEV: Update postgres to 13 * DEV: Update redis to 6.x Other changes: * DEV: Remove matrix.os * DEV: Remove env.BUILD_TYPE * DEV: Remove env.TARGET * DEV: Rename `build_types` config option to `build_type` * DEV: Lowercase `target` and `build_type` names * DEV: Rename `ci` to `tests` * DEV: Rename `lint` to `linting` * DEV: Lower the wizard qunit timeout (30 min -> 10) * DEV: Ruby version is no longer configurable * DEV: Run plugin tests only in the `plugins` target * DEV: Use binstubs where applicable * DEV: We don't open PRs to `tests-passed`	2021-02-22 10:28:32 +01:00
Blake Erickson	dbcda617b3	DEV: Add a CSV importer for restoring deleted users (#12147 ) This is an importer I wrote to restore some users that were accidentally deleted for being purged as old staged users or old unactivated users. It reads from CSV files exported from a discourse sql backup.	2021-02-19 13:46:54 -07:00
Blake Erickson	ed0e4582a1	DEV: If disabled do not change setting after import (#12142 ) When running an import script there are many site settings that are changed but we reset them back to where they were originally before the import. However, there are two settings that we don't roll back: ``` purge_unactivated_users_grace_period_days purge_deleted_uploads_grace_period_days ``` which could have some unintended consequences. My first question is do we really have to change these settings? I'm not a huge fan of changing someones settings without them really knowing they were changed. If we really do have to change these settings here is my proposed PR where we don't alter the `purge_unactivated_users_grace_period_days` if it has been disabled. As I'm writing this another change we could make is that we don't change either of these site settings if we detect that they aren't set to the default values. The drive behind this PR is that there is a discourse instance which relies on staged users as part of their workflow and this setting was changed by accident via the import script causing users to be deleted that shouldn't have been.	2021-02-19 09:33:35 -07:00
Michael Maroszek	144584aacb	fix vbulletin importer to hide soft-deleted posts (#12057 ) equal to theads posts can be soft-deleted which results in a visibile = 2 state. at the moment those posts will be imported fully visible.	2021-02-12 14:29:05 +01:00
Gerhard Schlager	4d719725c8	FEATURE: Allow overriding the backup location when restoring via CLI (#12015 ) You can use `discourse restore --location=local FILENAME` if you want to restore a backup that is stored locally even though the `backup_location` has the value `s3`.	2021-02-09 16:02:44 +01:00
David Taylor	821bb1e8cb	FEATURE: Rename 'Discourse SSO' to DiscourseConnect (#11978 ) The 'Discourse SSO' protocol is being rebranded to DiscourseConnect. This should help to reduce confusion when 'SSO' is used in the generic sense. This commit aims to: - Rename `sso_` site settings. DiscourseConnect specific ones are prefixed `discourse_connect_`. Generic settings are prefixed `auth_` - Add (server-side-only) backwards compatibility for the old setting names, with deprecation notices - Copy `site_settings` database records to the new names - Rename relevant translation keys - Update relevant translations This commit does not aim to: - Rename any Ruby classes or methods. This might be done in a future commit - Change any URLs. This would break existing integrations - Make any changes to the protocol. This would break existing integrations - Change any functionality. Further normalization across DiscourseConnect and other auth methods will be done separately The risks are: - There is no backwards compatibility for site settings on the client-side. Accessing auth-related site settings in Javascript is fairly rare, and an error on the client side would not be security-critical. - If a plugin is monkey-patching parts of the auth process, changes to locale keys could cause broken error messages. This should also be unlikely. The old site setting names remain functional, so security-related overrides will remain working. A follow-up commit will be made with a post-deploy migration to delete the old `site_settings` rows.	2021-02-08 10:04:33 +00:00
Bianca Nenciu	a71b219c9a	Improvements to phpBB3 import script (#10999 ) * FEATURE: Import attachments * FEATURE: Add support for importing multiple forums in one * FEATURE: Add support for category and tag mapping * FEATURE: Import groups * FIX: Add spaces around images * FEATURE: Custom mapping of user rank to trust levels * FIX: Do not fail import if it cannot import polls * FIX: Optimize existing records lookup Co-authored-by: Gerhard Schlager <mail@gerhard-schlager.at> Co-authored-by: Jarek Radosz <jradosz@gmail.com>	2021-01-14 21:44:43 +02:00
Arpit Jalan	bd7cbcd8f8	Improve Vanilla import script. (#11701 ) - import groups and group users - import uploads/attachments - improved code tag parsing - improved text formatting - mark topics as solved	2021-01-13 23:10:00 +05:30
Justin DiRose	f6e87e1e5e	DEV: Improvements to Discourse Merger script (#11660 ) After running the Discourse merge script, it was pretty evident it held up well after all these years ;) Made a few fixes: Included an environment variable for DB_PASS as likely the password will need to be changed if running the import in an official Docker container (recommended) Set a hard order for imported categories, otherwise sometimes they'd be imported in a weird order making things unpredictable for parent/child category imports Fixed a couple of instances where we added unique indexes (such as on category slugs) Set up upload regex to handle AWS URLs better Fixed the script to work with frozen string literals	2021-01-08 09:31:39 -06:00
Gerhard Schlager	fc9155f3ee	DEV: Lint MessageFormat strings to prevent usage of "one {foo 1 bar}" (#11608 ) Follow-up to `6b53f26f`	2021-01-04 12:29:20 +01:00
Gerhard Schlager	6b53f26fc0	DEV: Lint MessageFormat strings to prevent usage of "one {1 foo}" (#11605 )	2020-12-29 21:42:47 +01:00
Régis Hanol	a85d5edbf1	DEV: set digest_attempted_at during migrations (#11369 )	2020-12-14 10:58:14 +11:00
David Taylor	cf21de0e7a	DEV: Migrate Github authentication to ManagedAuthenticator (#11170 ) This commit adds an additional find_user_by_email hook to ManagedAuthenticator so that GitHub login can continue to support secondary email addresses The github_user_infos table will be dropped in a follow-up commit. This is the last core authenticator to be migrated to ManagedAuthenticator 🎉	2020-11-10 10:09:15 +00:00
Jarek Radosz	2f4a1ff61b	DEV: Update rubocop-discourse from 2.3.2 to 2.4.0 (#11079 ) Also fixes whitespace related issues raised by rubocop.	2020-10-30 15:04:29 +01:00
Gerhard Schlager	57095f0bb7	FIX: Killing a Unicorn worker shouldn't kill a running backup or restore process By spawning and forking the backup and restore, the process owner changes from 🦄 to the init process.	2020-10-13 19:48:53 +02:00
Shane Dela Rosa	8b678b5dbe	Update vanilla_body_parser for stability	2020-09-23 09:54:26 -04:00
Gerhard Schlager	76477a1c8b	FIX: Forking prevented notifications from being sent after backup This is a workaround for https://github.com/rubyjs/mini_racer/issues/175	2020-09-18 17:35:17 +02:00
Rachel Carvalho	812e0d6b5e	FIX: improve Vanilla importing (#10478 ) * ensure emails don't have spaces * import banned users as suspended for 1k yrs * upgrade users to TL2 if they have comments * topic: import views, closed and pinned info * import messages * encode vanilla usernames for permalinks. Vanilla usernames can contain spaces and special characters. * parse Vanilla's new rich body format	2020-08-24 16:19:57 -04:00
Neil Lalonde	530058918e	DEV: Support env var for prometheus port in cache_critical_dns	2020-08-17 15:48:14 -04:00
Robin Ward	7df57b35da	REFACTOR: Remove `Discourse.__widget_helpers` It's now a variable in the context where the templates are created.	2020-08-06 14:35:46 -04:00
Justin DiRose	d266baff2b	DEV: Improve q2a import script (#10346 ) The changes here are largely pulled from the below gist by RGJ on meta.discourse.org. Thanks much for the changes! https://gist.github.com/discoursehosting/769eff2014d5482f0ab776de03dc3349	2020-07-31 12:04:03 -05:00
Krzysztof Kotlarek	e0d9232259	FIX: use allowlist and blocklist terminology (#10209 ) This is a PR of the renaming whitelist to allowlist and blacklist to the blocklist.	2020-07-27 10:23:54 +10:00
Discourse Translator Bot	29788f2c26	DEV: Switch from Transifex to Crowdin	2020-07-16 14:00:02 +02:00
Krzysztof Kotlarek	93ff54e184	FIX: improvements for vanilla bulk import (#10212 ) Adjustments to the base: 1. PG connection doesn't require host - it was broken on import droplet 2. Drop `topic_reply_count` - it was removed here - https://github.com/discourse/discourse/blob/master/db/post_migrate/20200513185052_drop_topic_reply_count.rb 3. Error with `backtrace.join("\n")` -> `e.backtrace.join("\n")` 4. Correctly link the user and avatar to quote block Adjustments to vanilla: 1. Top-level Vanilla categories are valid categories 2. Posts have `format` column which should be used to decide if the format is HTML or Markdown 3. Remove no UTF8 characters 4. Remove not supported HTML elements like `font` `span` `sub` `u`	2020-07-14 15:58:27 +10:00
Régis Hanol	7a6d772ad2	DEV: couple bug fixes in getsatisfaction importer - Ensure we don't modify a frozen string - Ensure we have a slug before trying to create a permalink	2020-07-06 17:41:28 +02:00
Gerhard Schlager	1f053173a4	FEATURE: Import script for jForum	2020-06-21 12:12:42 +02:00
Régis Hanol	47a1157458	DEV: various bugfixes in bulk importer	2020-06-19 17:53:06 +02:00
Régis Hanol	5143309014	DEV: ensure values are converted to integers in bulk importer	2020-06-18 17:42:14 +02:00
Bernhard Suttner	e31471585a	DEV: allow to have duplicate topic titles if categegory is different (#10034 ) Co-authored-by: Robin Ward <robin.ward@gmail.com> Co-authored-by: Robin Ward <robin.ward@gmail.com>	2020-06-18 11:19:47 -04:00
Régis Hanol	823b940b9d	PERF: improve loading of indexes in bulk import Similar strategy as for `c52191d` in which we stream the results from the database into an automatically growing array instead of using a hash.	2020-06-18 16:32:27 +02:00
Régis Hanol	c52191d49e	PERF: improve loading a imported_ids in bulk imports - Stream the queries that load the imported_ids - Use an array instead of a hash for keeping the mapping between imported_ids and new ids - Ensure we always treat the imported_ids as integers instead of strings	2020-06-16 19:55:08 +02:00
Jarek Radosz	669c940ec3	Revert "DEV: Remove the remaining ENV["TRAVIS"] usage (#10041 )" This reverts commit `78aff841e3`. See https://review.discourse.org/t/dev-remove-the-remaining-env-travis-usage-10041/12737/4?u=cvx	2020-06-16 19:42:00 +02:00
Jarek Radosz	78aff841e3	DEV: Remove the remaining ENV["TRAVIS"] usage (#10041 )	2020-06-16 17:41:15 +02:00
Jarek Radosz	3d55f2e3b7	FIX: Improvements and fixes to the image downsizing script (#9950 ) Fixed bugs, added specs, extracted the upload downsizing code to a class, added support for non-S3 setups, changed it so that images aren't downloaded twice. This code has been tested on production and successfully resized ~180k uploads. Includes: * DEV: Extract upload downsizing logic * DEV: Add support for non-S3 uploads * DEV: Process only images uploaded by users * FIX: Incorrect usage of `count` and `exist?` typo * DEV: Spec S3 image downsizing * DEV: Avoid downloading images twice * DEV: Update filesizes earlier in the process * DEV: Return false on invalid upload * FIX: Download images that currently above the limit (If the image size limit is decreased, then there was no way to resize those images that now fall outside the allowed size range) * Update script/downsize_uploads.rb (Co-authored-by: Régis Hanol <regis@hanol.fr>)	2020-06-11 14:47:59 +02:00
Justin DiRose	be28fc73a0	DEV: Improvements to Drupal script (#10016 ) Refactors script to follow conventions of other importers and adds some features including like import, processing of post raw text, and, if needed, SSO import.	2020-06-10 10:59:17 -05:00
Arpit Jalan	a93d24501c	FIX: base import script was not updating first_post_created_at column FEATURE: new rake task to update first_post_created_at column The not-equal operator (`<>`) in PostgreSQL does not compare values with NULL. We should instead use `IS DISTINCT FROM` when comparing values with NULL.	2020-06-04 11:26:40 +05:30
Gerhard Schlager	f683c5d0e0	DEV: Check English locale for errors in CI Moves the most important checks into a linter. It gets executed by Lefthook as well as the docker rake task and Github actions. Doing those checks in rspec takes too long and it produces errors when the discourse:test Docker image contains old, invalid locale files.	2020-06-03 21:54:58 +02:00
Jarek Radosz	4babdf510b	DEV: Update facter usage `Facter.reset` (`65d167eac9/lib/facter.rb (L126-L137)`) clears `Facter::Options[:external_dir]` which seems to be the 4.x equivalent of `Facter::Util::Config.external_facts_dirs`. This commit also makes sure that version 4.0 or higher is installed.	2020-06-01 05:50:49 +02:00
Jarek Radosz	8c8391c742	DEV: Don't require wget in script/bench	2020-06-01 05:50:48 +02:00
Jarek Radosz	eb462bfb3d	FIX: Improve image downsizing script (#9549 ) Correctly handles more upload formats in posts, updates post custom fields, fixes more edge cases, adds debugging capabilities. (VERBOSE=1 and INTERACTIVE=1 flags) Includes these commits and some more: * DEV: Show the fixed image dimensions * FIX: Support more upload url formats * DEV: Remove the old upload after updating posts * FIX: Use the `process_post_#{id}` mutex * FIX: Avoid rebaking twice * DEV: Print out the link to the post * DEV: Process posts chronologically * DEV: Do a dry-run before saving, pause on any issue * FIX: Also process deleted posts * DEV: Make matchers case-insensitive * DEV: Pause on "detached" uploads, add more debug info * DEV: Print out time when finished * DEV: Add support for WORKER_ID/WORKER_COUNT * DEV: Fix the onebox in cooked text heuristic * DEV: Don't report already processed posts * DEV: Beep when done! * DEV: Ignore issues with deleted posts * DEV: Ignore issues with deleted topics * DEV: Multiline SQL * DEV: Use the bulk attribute assignment * DEV: Add ENV["INTERACTIVE"] mode * DEV: Handle post custom fields * DEV: Bail on non-S3 sites * DEV: Allow sizes smaller than 1 mpix	2020-05-26 15:38:23 +02:00
Kane York	869f9b20a2	PERF: Dematerialize topic_reply_count (#9769 ) * PERF: Dematerialize topic_reply_count It's only ever used for trust level promotions that run daily, or compared to 0. We don't need to track it on every post creation. * UX: Add symbol in TL3 report if topic reply count is capped * DEV: Drop user_stats.topic_reply_count column	2020-05-14 15:42:00 -07:00
Blake Erickson	ab919332dc	DEV: api documentation updates (#9612 ) * DEV: api documentation updates - Created a script to convert json responses to rswag - Documented several api endpoints - Switched rswag to use header based auth * Update script, fix some schema missmatches	2020-05-11 13:06:49 -06:00
Francesco Frassinelli	c6f68e4006	Fix syntax error in fluxbb.rb (#9727 )	2020-05-11 11:07:57 -04:00
Kyle E. Mitchell	6c968b7945	Add script for compiling copyright deposits (#9646 ) * Add script for compiling copyright deposits * git mv copyright-deposit script/	2020-05-06 12:51:45 -04:00
Krzysztof Kotlarek	9bff0882c3	FEATURE: Nokogumbo (#9577 ) * FEATURE: Nokogumbo Use Nokogumbo HTML parser.	2020-05-05 13:46:57 +10:00
Martin Brennan	867bc3b48e	FIX: Change base importer to create new Bookmark records (#9603 ) Also add a spec and fixture with a mock importer that we can use to test the create_X methods of the base importer	2020-05-01 11:34:55 +10:00
Sam Saffron	d0d5a138c3	DEV: stop freezing frozen strings We have the `# frozen_string_literal: true` comment on all our files. This means all string literals are frozen. There is no need to call #freeze on any literals. For files with `# frozen_string_literal: true` ``` puts %w{a b}[0].frozen? => true puts "hi".frozen? => true puts "a #{1} b".frozen? => true puts ("a " + "b").frozen? => false puts (-("a " + "b")).frozen? => true ``` For more details see: https://samsaffron.com/archive/2018/02/16/reducing-string-duplication-in-ruby	2020-04-30 16:48:53 +10:00
discoursehosting	094ddb1c1f	VBulletin5 importer improvements (#9477 ) - no more hard coded contenttypes - permalinks for topics, categories, subcategories - better uploads handling - tag support	2020-04-22 22:04:59 +02:00
Jarek Radosz	28c706bd09	DEV: Do less work in docker_test (#9470 ) * DEV: Update the working tree just once. `git pull` was effectively doing `git fetch` and `git merge FETCH_HEAD`, and only then we were checking out the desired branch/commit. This change will skip the the merge step. * DEV: Don't run lefthook in docker_test	2020-04-21 03:48:58 +02:00
Jay Pfaffman	3c72cbc5de	FIX: Google groups import changed login URL (#9432 ) I'm not clear why changing only the `wait_for_url` address was necessary and not also the `get` a few lines above, but this change seems to work for me on both literatecomputing.com Groups and a public group.	2020-04-15 16:45:14 -04:00
Gerhard Schlager	2b2584912a	Improve Telligent import script * Imports private messages * Replaces internal links for topics and replies * Allows incremental import of accepted answers	2020-04-03 18:10:52 +02:00
Robin Ward	b2b7afd310	Rename the server side widget hbs compiler	2020-03-27 12:06:14 -04:00
Gerhard Schlager	739430c01e	FIX: mbox import failed if no tags were configured	2020-03-26 16:41:11 +01:00
Gerhard Schlager	d216483c53	FIX: Importing with pgbouncer failed Checking if all records have been imported uses a temp table in PostgreSQL. This fails when pgbouncer is used unless the temp table is created inside a transaction.	2020-03-26 16:41:09 +01:00
Gerhard Schlager	c94b63bc75	DEV: Improve import of attachments from Telligent	2020-03-26 16:37:55 +01:00
Jarek Radosz	d21d80198c	DEV: Update rubocop-discourse (#9270 ) Includes: * DEV: Use `eq_time` matcher	2020-03-26 16:32:41 +01:00
Robin Ward	eaa324ecbd	Revert "Move the widget-hbs compiler to js from es6" This reverts commit `5d66a2c16e`.	2020-03-25 16:13:26 -04:00
Robin Ward	5d66a2c16e	Move the widget-hbs compiler to js from es6	2020-03-25 15:03:21 -04:00
Gerhard Schlager	5b2b769eb7	DEV: Ensure uploads aren't deleted during imports Sidekiq might delete uploads if you, for some reason, create upload records before using them in posts.	2020-03-24 17:14:16 +01:00
Gerhard Schlager	445b35381d	Improve Telligent import script * Detects mostly all attachments and it's a lot faster * Parses user properties in Ruby instead of the DB, because that's less errorprone * Imports user avatars * Imports topic views by users * Better handling of quotes and YouTube links	2020-03-23 09:18:12 +01:00
Gerhard Schlager	d27ece9ded	FIX: Method from Telligent import script was deleted by accident	2020-03-14 22:10:40 +01:00
Gerhard Schlager	e825f47daa	DEV: Better handling of incremental scrapes for Google Groups	2020-03-14 00:00:36 +01:00
Gerhard Schlager	0a88232e87	DEV: Improve mbox import script * Better documentation of settings * Add option to exclude trimmed parts of emails (enabled by default) to not revail email addresses	2020-03-14 00:00:36 +01:00
Gerhard Schlager	36062f43c8	DEV: Improve Telligent import script * Adds ability to map forums to categories and tags as well as ignore forums. * Fixes regular expression for detecting attachments in posts. * Handles "remote attachments" 😮 by inserting a link. * Imports view counts for topics. * Handles incorrect references of parent posts. * Better handling of quotes. * Finds a lot more attachments by trying to replace various Unicode characters in filenames.	2020-03-14 00:00:36 +01:00
Gerhard Schlager	ba1b840816	DEV: Don't deactivate suspended users during import Otherwise a cleanup job might delete those deactivated users.	2020-03-14 00:00:36 +01:00
Justin DiRose	6c948f27ea	FIX: Missing constant in SMF2 importer (#9178 )	2020-03-11 10:19:59 -05:00
Gerhard Schlager	0e752db411	DEV: Improve mbox import script * Customizable email subject prefixes to remove "Re" and "Fwd" as well as localized prefixes. * Configuration option for prefixes like [FOO] or (BAR) which can be replaced with tags during import. * Bugfix: Import script might have skipped some users due to missing ORDER BY.	2020-03-09 10:26:45 +01:00
Gerhard Schlager	edc8d58ac3	FEATURE: Add site setting to disable staged user cleanup ... and disabled the cleanup during imports, otherwise a running Sidekiq might delete users before posts are created	2020-03-09 10:26:41 +01:00
Jarek Radosz	f7ea2fdea5	FIX: Import posts of missing users from phpbb3 (#9085 ) Posts without a user probably shouldn't happen unless there was some direct database tampering, but data like that has been seen in the wild. The importer will assign those posts to the "system" user.	2020-03-06 22:54:40 +01:00
Gerhard Schlager	d7ccb58559	FIX: Google Groups scraper failed to login	2020-03-02 17:24:48 +01:00
Brad Morrical	ff5ff8d0d2	fix invalid byte sequence in UTF-8 (ArgumentError) (#9077 )	2020-02-28 10:26:18 -05:00
Justin DiRose	f35ee5e887	DEV: Improvements to SMF2 script (#9006 )	2020-02-24 12:51:45 -06:00
Sam Saffron	28292d2759	PERF: avoid shelling to get hostname aggressively Previously we had many places in the app that called `hostname` to get hostname of a server. This commit replaces the pattern in 2 ways 1. We cache the result in `Discourse.os_hostname` so it is only ever called once 2. We prefer to use Socket.gethostname which avoids making a shell command This improves performance as we are not spawning hostname processes throughout the app lifetime	2020-02-18 15:13:19 +11:00
David Taylor	5919618a87	DEV: Drop legacy OpenID 2.0 support (#8894 ) This is not used in core or official plugins, and has been printing a deprecation notice since v2.3.0beta4. All OpenID 2.0 code and dependencies have been dropped. The user_open_ids table remains for now, in case anyone has missed the deprecation notice, and needs to migrate their data. Context at https://meta.discourse.org/t/-/113249	2020-02-07 17:32:35 +00:00
Régis Hanol	0843e3e6ce	FIX: add support for sub-sub-categories in base_importer Also delegates 'post_already_imported?' and 'user_already_imported?' to the base importer.	2020-02-05 10:40:28 +01:00
Jarek Radosz	8a82ceb3bc	FIX: Improve downsize_uploads (#8409 ) With this change the script: * Actually removes original large-sized images * Doesn't save processed files if their size has increased * Prevents inconsistent state	2020-01-27 03:31:11 +01:00
Gerhard Schlager	ab07b945c2	Merge pull request #8736 from gschlager/rename_reply_id_column REFACTOR: Rename `post_replies.reply_id` column to `post_replies.reply_post_id`	2020-01-17 17:24:49 +01:00
Gerhard Schlager	e474cda321	REFACTOR: Restoring of backups and migration of uploads to S3	2020-01-14 11:41:35 +01:00
Sam Saffron	710eafdd35	FIX: ensure we consistently pick the same topic for bench We pick the first topic with 30 responses as our bench topic. Previously we simply picked the last topic, but hand no guarantee on ordering. This also attempts to correct previous runs of the bench.	2020-01-08 16:33:45 +11:00
David Taylor	fd6fbaa4ae	DEV: Update bench.rb for core changes (#8670 ) - Use new api key rake task - Switch to header-based API auth - Stop hard-coding topic id	2020-01-08 16:23:29 +11:00
Michael Brown	7200653e16	FIX: cache_critical_dns was erroring without IPAddr * sometimes cache_critical_dns would error out since "IPAddr" was undefined * sometimes it autoloaded, so no error	2019-12-27 12:39:08 -05:00
AlexP11223	1e4a83cc2a	FEATURE: add mybb.ru import script (#8609 )	2019-12-20 11:10:18 -05:00
Martin Brennan	edbc356593	FIX: Replace deprecated URI.encode, URI.escape, URI.unescape and URI.unencode (#8528 ) The following methods have long been deprecated in ruby due to flaws in their implementation per http://blade.nagaokaut.ac.jp/cgi-bin/vframe.rb/ruby/ruby-core/29293?29179-31097: URI.escape URI.unescape URI.encode URI.unencode escape/encode are just aliases for one another. This PR uses the Addressable gem to replace these methods with its own encode, unencode, and encode_component methods where appropriate. I have put all references to Addressable::URI here into the UrlHelper to keep them corralled in one place to make changes to this implementation easier. Addressable is now also an explicit gem dependency.	2019-12-12 12:49:21 +10:00
David Taylor	e5ce2d97f6	DEV: Simplify Rubocop runner for GitHub actions Once we are happy with basic behavior, we can try adding annotations again	2019-12-11 11:49:27 +00:00
Joffrey JAFFEUX	bd17a3a8e7	DEV: introduces Github Actions for CI (#8441 ) Co-Authored-By: David Taylor <david@taylorhq.com>	2019-12-10 14:45:47 +01:00
Sam Saffron	0c52537f10	DEV: update rubocop to version 0.77 We like to stay as close as possible to latest with rubocop cause the cops get better. This update required some code changes, specifically the default is to avoid explicit returns where implicit is done Also this renames a few rules	2019-12-10 11:48:39 +11:00
Joffrey JAFFEUX	0d3d2c43a0	DEV: s/\$redis/Discourse\.redis (#8431 ) This commit also adds a rubocop rule to prevent global variables.	2019-12-03 10:05:53 +01:00
Gerhard Schlager	c218036107	FIX: Make Google Groups scraper work for G Suite users	2019-11-28 02:09:51 +01:00
Sam Saffron	88ecb650a9	DEV: Implement a faster Discourse.cache This is a bottom up rewrite of Discourse cache to support faster performance and a limited surface area. ActiveSupport::Cache::Store accepts many options we do not use, this partial implementation only picks the bits out that we do use and want to support. Additionally params are named which avoids typos such as "expires_at" vs "expires_in" This also moves a few spots in Discourse to use Discourse.cache over setex Performance of setex and Discourse.cache.write is similar.	2019-11-27 16:11:49 +11:00
Sam Saffron	0fb497eb23	DEV: use Discourse.cache over Rails.cache Discourse.cache is a more consistent method to use and offers clean fallback if you are skipping redis This is part of a larger change that both optimizes Discoruse.cache and omits use of setex on $redis in favor of consistently using discourse cache Bench does reveal that use of Rails.cache and Discourse.cache is 1.25x slower than redis.setex / get so a re-implementation will follow prior to porting	2019-11-27 12:36:19 +11:00
Vinoth Kannan	3bb7ad4be1	FEATURE: remove support for 'suppress_from_latest' category setting. (#8308 )	2019-11-18 12:28:35 +05:30
Penar Musaraj	067696df8f	DEV: Apply Rubocop redundant return style	2019-11-14 15:10:51 -05:00
David Taylor	9fea43e46a	DEV: Remove use of `cd` in the app (#8337 ) `FileUtils.cd` and `Dir.chdir` cause the working directory to change for the entire process. We run sidekiq jobs, hijacked requests and deferred jobs in threads, which can make working directory changes have unintended side-effects. - Add a rubocop rule to warn about usage of Dir.chdir and FileUtils.cd - Added rubocop:disable for scripts used outside the app - Refactored code using cd to use alternative methods - Temporarily skipped the rubocop check for lib/backup_restore. This will require more complex refactoring, so I will create a separate PR for review	2019-11-13 09:57:39 +00:00
Sam Saffron	bf0ef73286	DEV: correct rake task used to grab admin key We amended it so "api_key:get" is no longer supported and instead we are more explicit. This matches that change and fixes the bench.	2019-11-11 10:23:14 +11:00
David Taylor	54fe887c44	DEV: Remove prototype theme-watcher script This has been superseded by the Theme CLI: https://meta.discourse.org/t/82950	2019-11-07 17:22:54 +00:00
romanrizzi	d76d0e75ec	DEV: Move warmup inside docker rake task	2019-10-25 16:31:05 -03:00
romanrizzi	4f452f0205	DEV: Add variable to warmup tmp folder and obtain accurate results when profiling specs	2019-10-25 10:52:23 -03:00
Krzysztof Kotlarek	f34a0141c7	FIX: Correct path to ImportExport module (#8227 ) During the move from Classic autoloader to Zeitwerk import_export module was moved to correct file name convention. `427d54b2b0 (diff-d896ec33b95afb7fae9f8bfe73d0580b)` Problem is that export/import is still using old path to require that module Meta: https://meta.discourse.org/t/topic-and-category-export-import/38930/40	2019-10-23 17:27:14 +11:00
Daniel Waterworth	1352a5b5fa	DEV: undo pluck_first changes to micro benchmark and add pluck_first benchmark	2019-10-21 12:21:24 +01:00
Daniel Waterworth	55a1394342	DEV: pluck_first Doing .pluck(:column).first is a very common pattern in Discourse and in most cases, a limit cause isn't being added. Instead of adding a limit clause to all these callsites, this commit adds two new methods to ActiveRecord::Relation: pluck_first, equivalent to limit(1).pluck(*columns).first and pluck_first! which, like other finder methods, raises an exception when no record is found	2019-10-21 12:08:20 +01:00
Régis Hanol	e1998ef244	FIX: downsize_uploads script The script will now correct all width/height and thumbnail_width/thumbnail_height properties of all the uploaded images. The script now uses width * height to filter out all unaffected images. Also handled the case where a downsized image was already an uploaded record.	2019-10-10 16:37:55 +02:00
Régis Hanol	4fdad12998	FIX: downsize_uploads script to support external storage Also ensured we update the sha1 property of the upload record to match the actual file.	2019-10-08 17:54:39 +02:00
jelle van der Waa	2d4c9bbaac	import_scripts: add fluxbb prefix to missing query (#8163 ) Signed-off-by: Jelle van der Waa <jelle@archlinux.org>	2019-10-08 11:46:00 +11:00
Sam Saffron	1d5c2b36f6	DEV: improve diagnostics on mem leak checker This adds mwrap logging to each iteration so we can see how much leaks per iteration and where it is coming from	2019-10-04 09:47:33 +10:00
Sam Saffron	038a38ae1c	DEV: add debugging scripts for memory leaks These scripts are somewhat rough but I needed them to help debug a memory leak we have noticed in rails 6. The biggest object script finds all the biggest objects we have in memory after boot. The test memory leak runs a very simple iteration through all multisites and observed memory.	2019-10-03 16:36:31 +10:00
Krzysztof Kotlarek	35b1185a08	FIX: Revert Demon::DemonBase back to Demon::Base (#8132 ) I introduced DemonBase because I had got some conflict between `demon/base.rb` and `jobs/base.rb`, however, to not rename base class, it is possible to use regex on absolute path in Zeitwerk custom inflector.	2019-10-02 14:54:08 +10:00
Krzysztof Kotlarek	427d54b2b0	DEV: Upgrading Discourse to Zeitwerk (#8098 ) Zeitwerk simplifies working with dependencies in dev and makes it easier reloading class chains. We no longer need to use Rails "require_dependency" anywhere and instead can just use standard Ruby patterns to require files. This is a far reaching change and we expect some followups here.	2019-10-02 14:01:53 +10:00
Gerhard Schlager	b48ca9dee9	DEV: Simplify username validation in base importer The `UsernameValidator` does already all the hard work. No need to do any additional checks in the import script. The checks were out-of-date anyway.	2019-10-01 20:33:09 +02:00
Gerhard Schlager	ed1e5ef6cc	FIX: By default, don't abort Google Groups crawling on error	2019-09-18 18:14:04 +02:00
Gerhard Schlager	ab96239f2a	FIX: Google Groups crawler failed to login Trying to automate the login into a Google account is quite hard. This makes the crawler use the content of a cookies.txt file instead. It also removes a couple of deprecation warnings and adds some color to the output.	2019-09-18 13:09:20 +02:00
Sam Saffron	cd1ab206d9	DEV: add missing ultra low queue to mwrap sidekiq note: mwrap is used for analysis of memory bloat and leaks of processes	2019-09-18 11:18:35 +10:00
Sam Saffron	015051ecaf	PERF: avoid spinning a thread each time we close a connection This is a temporary workaround for the issue in https://github.com/rails/rails/pull/36949 Discussing a proper fix in Rails with the Rails team. Prior to this fix we were spinning up a thread every time we closed a connection to the db.	2019-09-12 17:34:04 +10:00
Krzysztof Kotlarek	1d73754e84	FIX: Modify frozen String and profile_db_generator uses category id (#8080 )	2019-09-09 17:38:37 +10:00
Dan Ungureanu	ab7038bfc2	DEV: User simulator tried to modify frozen string.	2019-08-16 17:32:17 +03:00
Gerhard Schlager	888b635cfc	Import avatars and likes in the Zendesk AP importer Co-authored-by: Justin DiRose <justin@justindirose.com>	2019-08-14 10:42:52 +02:00
Gerhard Schlager	4ed517a344	DEV: Make Rubocop happy Follow-up to `6cc9fe42`	2019-08-12 23:10:58 +02:00
Mohamad Abras	6cc9fe42ce	add mongo adapter to nodebb importer (#8000 )	2019-08-12 14:15:11 -04:00
Régis Hanol	19dda59932	FIX: add back verbose option to DbHelper.remap	2019-07-31 17:30:08 +02:00
Rishabh	dcb47d902b	REFACTOR: Rename SiteSetting.disable_edit_notifications to disable_system_edit_notifications (#7958 ) * REFACTOR: Rename SiteSetting.disable_edit_notifications to disable_system_edit_notifications - The older name could cause some confusion because the setting does not disable all edit notifications, only system ones. * FIX: Add frozen_string_literal: true in the migration * DEV: Deprecate 'disable_edit_notifications'	2019-07-31 20:20:41 +05:30
Régis Hanol	89fce2ce71	DEV: remove duplicate Remap class and use DbHelper.remap instead Follow-up to `9cd3f96dee`	2019-07-29 18:43:40 +02:00
Gerhard Schlager	fd12c414e7	DEV: Refactor helper methods for upload markdown Follow-up to `a61ff167`	2019-07-25 16:36:35 +02:00
Gerhard Schlager	a61ff16740	DEV: Make attachment markdown reusable	2019-07-25 14:04:18 +02:00
Joffrey JAFFEUX	cc46de8f46	s/discourse-staff-notes/discourse-user-notes (#7936 )	2019-07-24 20:04:27 +02:00
Gerhard Schlager	f0fea5991f	FIX: Latest Selenium gem broke Google Groups import script Selenium uses Keep-Alive since version 3.141, so the net-http-persistent gem shouldn't be needed anymore.	2019-07-10 09:45:33 +02:00
Dan Ungureanu	ab6ad220c7	DEV: Fix user simulator script.	2019-07-09 18:52:08 +03:00
Arpit Jalan	6d30be1f94	Improve XenForo import script. - ensure only active, unbanned users are imported. - ensure only visible threads/posts are imported.	2019-06-18 15:52:34 +05:30
Arpit Jalan	77f5577e30	DEV: Improvements to AnswerHub import script.	2019-06-13 11:46:17 +05:30
Guo Xiang Tan	36c0cfa890	FIX: Use new attachment markdown format in `ImportScripts::Uploader`.	2019-06-11 14:49:28 +08:00
Blake Erickson	0955d9ece9	create answerhub importer (#7671 )	2019-06-03 12:17:22 +10:00
Gerhard Schlager	0f3c3bc309	Make import scripts work with frozen strings	2019-05-30 22:22:24 +02:00
Gerhard Schlager	c70d0c6659	Use an invalid domain for fake email addresses in importers	2019-05-30 22:22:24 +02:00
Gerhard Schlager	d3ba338144	Make Telligent import script more generic	2019-05-30 22:22:24 +02:00
Joffrey JAFFEUX	630e9814bc	datetime is not available at this point (#7630 )	2019-05-29 14:06:32 +02:00
Joffrey JAFFEUX	6439004161	DEV: do not use STDERR to print tests timestamps (#7629 )	2019-05-29 13:28:02 +02:00
Joffrey JAFFEUX	6be9a6eb2e	DEV: adds time logging to docker_test script (#7627 )	2019-05-29 12:06:43 +02:00
Sam Saffron	7429700389	FIX: ensure we can download maxmind without redis or db config This also corrects FileHelper.download so it supports "follow_redirect" correctly (it used to always follow 1 redirect) and adds a `validate_url` param that will bypass all uri validation if set to false (default is true)	2019-05-28 10:28:57 +10:00
Sam Saffron	2bcc3ef46b	correct type	2019-05-22 12:28:17 +10:00
Sam Saffron	12264747f7	DEV: script to analyze status of sidekiq queue This returns a proper count of all queued jobs and finds potential dupes	2019-05-22 12:27:11 +10:00
Gerhard Schlager	b788948985	FEATURE: English locale with international date formats Makes en_US the new default locale	2019-05-20 13:47:20 +02:00
Sam Saffron	678a9a61c4	DEV: lint importer commit #f490ed3b introduced a few linting issues, resolved now	2019-05-17 16:37:08 +10:00
Edmond Lepedus	f490ed3bbc	FEATURE: Add attachment support to xenforo importer (#7548 ) * FEATURE: Add attachment support to XenForo importer If `ATTACHMENT_DIR` is provided, importer will scan each imported post for `[GALLERY]` and `[ATTACH]` tags, attempt to import the referenced files as Discourse uploads and replace the tags with Discourse markup. References to files which cannot be imported are stripped. NOTE: This only imports attachments which are referenced in imported posts. Any XenForo media or files which are not referenced in any post using `[ATTACH]` or `[GALLERY]` tags will not be imported. The goal is to ensure that we don't have posts with missing images and unsightly markup, NOT to ensure that all attachments are migrated. * FEATURE: Add attachment support to XenForo importer If `ATTACHMENT_DIR` is provided, importer will scan each imported post for `[GALLERY]` and `[ATTACH]` tags, attempt to import the referenced files as Discourse uploads and replace the tags with Discourse markup. References to files which cannot be imported are stripped. NOTE: This only imports attachments which are referenced in imported posts. Any XenForo media or files which are not referenced in any post using `[ATTACH]` or `[GALLERY]` tags will not be imported. The goal is to ensure that we don't have posts with missing images and unsightly markup, NOT to ensure that all attachments are migrated. * FEATURE: Add attachment support to XenForo importer If `ATTACHMENT_DIR` is provided, importer will scan each imported post for `[GALLERY]` and `[ATTACH]` tags, attempt to import the referenced files as Discourse uploads and replace the tags with Discourse markup. References to files which cannot be imported are stripped. NOTE: This only imports attachments which are referenced in imported posts. Any XenForo media or files which are not referenced in any post using `[ATTACH]` or `[GALLERY]` tags will not be imported. The goal is to ensure that we don't have posts with missing images and unsightly markup, NOT to ensure that all attachments are migrated.	2019-05-17 16:18:28 +10:00
Sam Saffron	b3d42b3f18	DEV: remove unmaintained script osxdev script has not been maintained for a while, keeping it around is only causing confusion	2019-05-17 11:47:48 +10:00
Sam Saffron	30990006a9	DEV: enable frozen string literal on all files This reduces chances of errors where consumers of strings mutate inputs and reduces memory usage of the app. Test suite passes now, but there may be some stuff left, so we will run a few sites on a branch prior to merging	2019-05-13 09:31:32 +08:00

1 2 3 4 5 ...

1076 Commits