discourse

Commit Graph

Author	SHA1	Message	Date
Martin Brennan	e4350bb966	FEATURE: Direct S3 multipart uploads for backups (#14736) This PR introduces a new `enable_experimental_backup_uploads` site setting (default false and hidden), which when enabled alongside `enable_direct_s3_uploads` will allow for direct S3 multipart uploads of backup .tar.gz files. To make multipart external uploads work with both the S3BackupStore and the S3Store, I've had to move several methods out of S3Store and into S3Helper, including: * presigned_url * create_multipart * abort_multipart * complete_multipart * presign_multipart_part * list_multipart_parts Then, S3Store and S3BackupStore either delegate directly to S3Helper or have their own special methods to call S3Helper for these methods. FileStore.temporary_upload_path has also removed its dependence on upload_path, and can now be used interchangeably between the stores. A similar change was made in the frontend as well, moving the multipart related JS code out of ComposerUppyUpload and into a mixin of its own, so it can also be used by UppyUploadMixin. Some changes to ExternalUploadManager had to be made here as well. The backup direct uploads do not need an Upload record made for them in the database, so they can be moved to their final S3 resting place when completing the multipart upload. This changeset is not perfect; it introduces some special cases in UploadController to handle backups that were previously in BackupController, because UploadController is where the multipart routes are located. A subsequent pull request will pull these routes into a module or some other sharing pattern, along with hooks, so the backup controller and the upload controller (and any future controllers that may need them) can include these routes in a nicer way.	2021-11-11 08:25:31 +10:00
Martin Brennan	0d809197aa	FIX: Make sure S3 object headers are preserved on copy (#14302) When copying an existing upload stub temporary object on S3 to its final destination we were not copying across its additional headers such as content-disposition and cache-control, which led to issues like attachments not downloading with their original filename when clicking the download links in posts. This is because the metadata_directive = REPLACE option was not being passed to object.copy_from(), so only the source object's headers were being used. Added an option for apply_metadata_to_destination to apply this option conditionally, because we may not always want to replace this metadata, but we definitely do when copying a temporary upload.	2021-09-10 12:59:51 +10:00
Martin Brennan	841e054907	FIX: Do not prefix temp/ S3 keys with s3_bucket_folder_path in S3Helper (#14145) This is unnecessary, as when the temporary key is created in S3Store we already include the s3_bucket_folder_path, and the key will always start with temp/ to assist with lifecycle rules for multipart uploads. This was affecting Discourse.store.object_from_path, Discourse.store.signed_url_for_path, and possibly others. See also: `e0102a5`	2021-08-26 08:50:49 +10:00
Martin Brennan	b500949ef6	FEATURE: Initial implementation of direct S3 uploads with uppy and stubs (#13787) This adds a few different things to allow for direct S3 uploads using uppy. These changes are still not the default. There are hidden `enable_experimental_image_uploader` and `enable_direct_s3_uploads` settings that must be turned on for any of this code to be used, and even if they are turned on only the User Card Background for the user profile actually uses uppy-image-uploader. A new `ExternalUploadStub` model and database table is introduced in this pull request. This is used to keep track of uploads that are uploaded to a temporary location in S3 with the direct to S3 code, and they are eventually deleted a) when the direct upload is completed and b) after a certain time period of not being used. ### Starting a direct S3 upload When an S3 direct upload is initiated with uppy, we first request a presigned PUT URL from the new `generate-presigned-put` endpoint in `UploadsController`. This generates an S3 key in the `temp` folder inside the correct bucket path, along with any metadata from the clientside (e.g. the SHA1 checksum described below). This will also create an `ExternalUploadStub` and store the details of the temp object key and the file being uploaded. Once the clientside has this URL, uppy will upload the file direct to S3 using the presigned URL. Once the upload is complete we go to the next stage. ### Completing a direct S3 upload Once the upload to S3 is done we call the new `complete-external-upload` route with the unique identifier of the `ExternalUploadStub` created earlier. Only the user who made the stub can complete the external upload. One of two paths is followed via the `ExternalUploadManager`. 1. If the object in S3 is too large (currently 100mb defined by `ExternalUploadManager::DOWNLOAD_LIMIT`) we do not download and generate the SHA1 for that file. Instead we create the `Upload` record via `UploadCreator` and simply copy it to its final destination on S3 then delete the initial temp file. Several modifications to `UploadCreator` have been made to accommodate this. 2. If the object in S3 is small enough, we download it. When the temporary S3 file is downloaded, we compare the SHA1 checksum generated by the browser with the actual SHA1 checksum of the file generated by ruby. The browser SHA1 checksum is stored on the object in S3 with metadata, and is generated via the `UppyChecksum` plugin. Keep in mind that some browsers will not generate this due to compatibility or other issues. We then follow the normal `UploadCreator` path with one exception. To cut down on having to re-upload the file again, if there are no changes (such as resizing etc) to the file in `UploadCreator` we follow the same copy + delete temp path that we do for files that are too large. 3. Finally we return the serialized upload record back to the client. There are several errors that could happen that are handled by `UploadsController` as well. Also in this PR is some refactoring of `displayErrorForUpload` to handle both uppy and jquery file uploader errors.	2021-07-28 08:42:25 +10:00
Gerhard Schlager	157f10db4c	FEATURE: Use path from existing URL of uploads and optimized images (#13177) Discourse shouldn't dynamically calculate the path of uploads and optimized images after a file has been stored on disk or S3. Otherwise it might calculate the wrong path if the SHA1 or extension stored in the database doesn't match the actual file path.	2021-05-27 17:42:25 +02:00
Josh Soref	59097b207f	DEV: Correct typos and spelling mistakes (#12812) Over the years we accrued many spelling mistakes in the code base. This PR attempts to fix spelling mistakes and typos in all areas of the code that are extremely safe to change - comments - test descriptions - other low risk areas	2021-05-21 11:43:47 +10:00
David Taylor	13e39d8b9f	PERF: Improve cook_url performance for topic thumbnails (#11609) - Only initialize the S3Helper when needed - Skip initializing the S3Helper for S3Store#cdn_url - Allow cook_url to be passed a `local` hint to skip unnecessary checks	2020-12-30 18:13:13 +00:00
Jarek Radosz	e00abbe1b7	DEV: Clean up S3 specs, stubs, and helpers Extracted commonly used spec helpers into spec/support/uploads_helpers.rb, removed unused stubs and let definitions. Makes it easier to write new S3-related specs without copy and pasting setup steps from other specs.	2020-09-28 12:02:25 +01:00
Martin Brennan	e92909aa77	FIX: Use ActionDispatch::Http::ContentDisposition for uploads content-disposition (#10108) See https://meta.discourse.org/t/broken-pipe-error-when-uploading-to-a-s3-clone-a-pdf-with-a-name-containing-e-i-etc/155414 When setting content-disposition for attachment, use the ContentDisposition class to format it. This handles filenames with weird characters and localization (accented characters) correctly.	2020-06-23 17:10:56 +10:00
Guo Xiang Tan	04a291ceea	DEV: Fix race conditions due to directory removal for uploads spec.	2020-06-03 12:28:39 +08:00
Michael Brown	d9a02d1336	Revert "Revert "Merge branch 'master' of https://github.com/discourse/discourse"" This reverts commit `20780a1eee`. * SECURITY: re-adds accidentally reverted commit: 03d26cd6: ensure embed_url contains valid http(s) uri * when the merge commit `e62a85cf` was reverted, git chose the `2660c2e2` parent to land on instead of the `03d26cd6` parent (which contains security fixes)	2020-05-23 00:56:13 -04:00
Jeff Atwood	20780a1eee	Revert "Merge branch 'master' of https://github.com/discourse/discourse" This reverts commit `e62a85cf6f`, reversing changes made to `2660c2e21d`.	2020-05-22 20:25:56 -07:00
Osama Sayegh	02f44def56	FIX: Don't blow up when trying to parse invalid or non-ASCII URLs (#9838) * FIX: Don't blow up when trying to parse invalid or non-ASCII URLs Follow-up to `72f139191e`	2020-05-20 12:46:27 +03:00
Martin Brennan	72f139191e	FIX: S3 store has_been_uploaded? was not taking into account s3 bucket path (#9810) In some cases, between Discourse forums the hostname of a URL could match if they are hosting S3 files on the same bucket but the S3 bucket path might not. So e.g. https://testbucket.somesite.com/testpath/some/file/url.png vs https://testbucket.somesite.com/prodpath/some/file/url.png. So has_been_uploaded? was returning true for the second URL, even though it may have been uploaded on a different Discourse forum. This is a very rare case but must be accounted for, because this impacts UrlHelper.is_local which mistakenly thinks the file has already been downloaded and thus allows the URL to be cooked, where we want to return the full URL to be downloaded using PullHotlinkedImages.	2020-05-20 10:40:38 +10:00
Vinoth Kannan	3b7f5db5ba	FIX: parallel spec system needs a dedicated upload folder for each worker. (#8547)	2019-12-18 11:21:57 +05:30
Mark VanLandingham	09d9baa6d7	FIX: Update S3 stubs for more aws-sdk API changes (#8534)	2019-12-11 11:26:52 -08:00
dependabot-preview[bot]	b90a592146	DEV: Bump aws-sdk-sns from 1.13.0 to 1.21.0 (#8490) Bumps [aws-sdk-sns](https://github.com/aws/aws-sdk-ruby) from 1.13.0 to 1.21.0. - [Release notes](https://github.com/aws/aws-sdk-ruby/releases) - [Changelog](https://github.com/aws/aws-sdk-ruby/blob/master/gems/aws-sdk-sns/CHANGELOG.md) - [Commits](https://github.com/aws/aws-sdk-ruby/compare/1.13.0...1.21.0) Signed-off-by: dependabot-preview[bot] <support@dependabot.com>	2019-12-11 06:13:17 -08:00
Penar Musaraj	102909edb3	FEATURE: Add support for secure media (#7888) This PR introduces a new secure media setting. When enabled, it prevents unauthorized access to media uploads (files of type image, video and audio). When the `login_required` setting is enabled, then all media uploads will be protected from unauthorized (anonymous) access. When `login_required` is disabled, only media in private messages will be protected from unauthorized access. A few notes: - the `prevent_anons_from_downloading_files` setting no longer applies to audio and video uploads - the `secure_media` setting can only be enabled if S3 uploads are already enabled and configured - upload records have a new column, `secure`, which is a boolean `true/false` of the upload's secure status - when creating a public post with an upload that has already been uploaded and is marked as secure, the post creator will raise an error - when enabling or disabling the setting on a site with existing uploads, the rake task `uploads:ensure_correct_acl` should be used to update all uploads' secure status and their ACL on S3	2019-11-18 11:25:42 +10:00
Sam Saffron	e7cf4579a8	DEV: improve usability of subfolder specs Previously people were not consistent about mocking which left internals in a fragile state when running subfolder specs. This introduces a simple helper `set_subfolder` which you can use to set the subfolder for the spec. It takes care of proper configuration of subfolder and teardown. ``` # usage set_subfolder "/my_amazing_subfolder" ``` You should no longer stub base_uri or global_settings	2019-11-15 16:48:24 +11:00
Krzysztof Kotlarek	427d54b2b0	DEV: Upgrading Discourse to Zeitwerk (#8098) Zeitwerk simplifies working with dependencies in dev and makes it easier to reload class chains. We no longer need to use Rails "require_dependency" anywhere and instead can just use standard Ruby patterns to require files. This is a far reaching change and we expect some followups here.	2019-10-02 14:01:53 +10:00
Gerhard Schlager	24877a7b8c	FIX: Correctly encode non-ASCII filenames in HTTP header Backport of fix from Rails 6: `890485cfce`	2019-08-07 19:10:50 +02:00
Penar Musaraj	03805e5a76	FIX: Ensure lightbox image download has correct content disposition in S3 (#7845)	2019-07-04 11:32:51 -04:00
Vinoth Kannan	b7830680b6	DEV: use cdn url to download the external uploads to local.	2019-06-06 19:17:19 +05:30
Penar Musaraj	f00275ded3	FEATURE: Support private attachments when using S3 storage (#7677) * Support private uploads in S3 * Use localStore for local avatars * Add job to update private upload ACL on S3 * Test multisite paths * update ACL for private uploads in migrate_to_s3 task	2019-06-06 13:27:24 +10:00
Guo Xiang Tan	8d1b0224ac	Fix the build `a3938f98f8`.	2019-05-29 18:53:31 +08:00
Guo Xiang Tan	f0620e7118	FEATURE: Support `[description|attachment](upload://<short-sha>)` in MD take 2. Previous attempt was missing `post_uploads` records.	2019-05-29 09:26:32 +08:00
Penar Musaraj	7c9fb95c15	Temporarily revert "FEATURE: Support `[description|attachment](upload://<short-sha>)` in MD. (#7603)" This reverts commit `b1d3c678ca`. We need to make sure post_upload records are correctly stored.	2019-05-28 16:37:01 -04:00
Guo Xiang Tan	b1d3c678ca	FEATURE: Support `[description|attachment](upload://<short-sha>)` in MD. (#7603)	2019-05-28 11:18:21 -04:00
David Taylor	ef660d5a3e	FIX: Return consistent character encodings when downloading S3 uploads Net::HTTP always returns ASCII-8BIT encoding. File.read auto-detects the encoding. This leads to an encoding inconsistency between a fresh download, and a cached download. This commit ensures all downloaded files are treated equally, by always returning the cached version from the filesystem, even during initial download. One symptom of this problem is during theme exports: https://meta.discourse.org/t/116907 Related ruby ticket: https://bugs.ruby-lang.org/issues/2567	2019-05-17 11:27:00 +01:00
Daniel Waterworth	e219588142	DEV: Prefabrication (test optimization) (#7414) * Introduced fab!, a helper that creates database state for a group It's almost identical to let_it_be, except: 1. It creates a new object for each test by default, 2. You can disable it using PREFABRICATION=0	2019-05-07 13:12:20 +10:00
Sam Saffron	4ea21fa2d0	DEV: use #frozen_string_literal: true on all spec This change both speeds up specs (fewer strings to allocate) and helps catch cases where methods in Discourse are mutating inputs. Overall we will be migrating everything to use #frozen_string_literal: true. It will take a while, but this is the first and safest move in this direction.	2019-04-30 10:27:42 +10:00
Sam Saffron	45285f1477	DEV: remove update_attributes which is deprecated in Rails 6 See: https://github.com/rails/rails/pull/31998 update_attributes is a relic of the past, it should no longer be used.	2019-04-29 17:32:25 +10:00
Guo Xiang Tan	b0c8fdd7da	FIX: Properly support defaults for upload site settings.	2019-03-13 16:36:57 +08:00
Vinoth Kannan	cc496de10e	FIX: Remove double quotes from etag value in API response https://github.com/aws/aws-sdk-ruby/issues/1134	2019-02-08 14:31:19 +05:30
Robin Ward	bee68bba2e	FIX: Heisentest We use the `id` of the upload to calculate a `depth` partition in the filename. This test would fail if your database had a higher seed because the depth it was looking for was hard coded to 1. The solution was to not save the records (which is faster anyway) and specify the `id` of the upload to make the hash deterministic.	2019-01-16 15:01:50 -05:00
Vinoth Kannan	f94c0283b2	FIX: Use correct version when generating file path for optimized image (#6871)	2019-01-11 18:35:38 +05:30
Vinoth Kannan	75dbb98cca	FEATURE: Add S3 etag value to uploads table (#6795)	2019-01-04 14:16:22 +08:00
Guo Xiang Tan	c666ef556d	Fix the build. Ref `570877da3c`	2019-01-03 15:34:39 +08:00
Guo Xiang Tan	ce6a0a5e9e	FIX: Moving upload to tombstone should update modification time. An upload created a long time ago will be nuked from the tombstone immediately if it gets deleted.	2018-09-18 10:48:29 +08:00
Raul Tambre	2271918be2	FEATURE: Use S3 dualstack endpoints Allows S3 without a CDN to serve images from dualstack domains that also support ipv6	2018-08-27 11:22:46 +10:00
Sam	5d96809abd	FIX: improve support for subfolder S3 CDN	2018-08-22 12:31:13 +10:00
Sam	f5142861e5	Revert "Revert "FIX: upload URLs from S3 on subfolder installs"" This reverts commit `26c96e97e5`. We have no choice but to run this code	2018-08-22 11:31:33 +10:00
Sam	26c96e97e5	Revert "FIX: upload URLs from S3 on subfolder installs" This reverts commit `357df2ff4f`.	2018-08-22 10:51:40 +10:00
Neil Lalonde	357df2ff4f	FIX: upload URLs from S3 on subfolder installs	2018-08-21 14:58:55 -04:00
Guo Xiang Tan	1ea23b1eae	FIX: Wrong order for `S3Helper#copy_file`.	2018-08-08 15:58:54 +08:00
Guo Xiang Tan	aafff740d2	Add `FileStore::S3Store#copy_file`.	2018-08-08 11:30:34 +08:00
Andrew Schleifer	dba22bbde2	rollback changes This reverts: * 1baba84c438e "fix s3 subfolders harder" * ea5e57938edf "fix test for absolute_base_url change"	2018-07-06 17:16:40 -05:00
Andrew Schleifer	52e9f49ec1	fix s3 subfolders harder specifically, include the folder in absolute_base_url	2018-07-06 16:28:40 -05:00
Sam	89ad2b5900	DEV: Rails 5.2 upgrade and global gem upgrade This updates tests to use latest rails 5 practice and updates ALL dependencies that could be updated. Performance testing shows that performance has not regressed; if anything it is marginally faster now.	2018-06-07 14:21:33 +10:00
Sam	70bb2aa426	FEATURE: allow specifying s3 config via globals This refactors handling of s3 so it can be specified via GlobalSetting This means that in a multisite environment you can configure s3 uploads without actual sites knowing credentials in s3 It is a critical setting for situations where assets are mirrored to s3.	2017-10-06 16:20:01 +11:00

83 Commits