discourse

Commit Graph

Author	SHA1	Message	Date
Gerhard Schlager	157f10db4c	FEATURE: Use path from existing URL of uploads and optimized images (#13177) Discourse shouldn't dynamically calculate the path of uploads and optimized images after a file has been stored on disk or S3. Otherwise it might calculate the wrong path if the SHA1 or extension stored in the database doesn't match the actual file path.	2021-05-27 17:42:25 +02:00
Josh Soref	59097b207f	DEV: Correct typos and spelling mistakes (#12812) Over the years we accrued many spelling mistakes in the code base. This PR attempts to fix spelling mistakes and typos in all areas of the code that are extremely safe to change: comments, test descriptions, and other low-risk areas.	2021-05-21 11:43:47 +10:00
David Taylor	35e1e009fa	FIX: Allow restoring non-subfolder backup to subfolder site (#12537) `GlobalSetting.relative_url_root` comes from the destination site. We can't be sure whether it was the same on the original site. It's safer to use a wildcard here, so we can backup/restore sites with different relative_url_root values.	2021-04-12 14:00:52 +10:00
David Taylor	13e39d8b9f	PERF: Improve cook_url performance for topic thumbnails (#11609) - Only initialize the S3Helper when needed - Skip initializing the S3Helper for S3Store#cdn_url - Allow cook_url to be passed a `local` hint to skip unnecessary checks	2020-12-30 18:13:13 +00:00
Daniel Waterworth	721ee36425	Replace `base_uri` with `base_path` (#10879) DEV: Replace instances of Discourse.base_uri with Discourse.base_path This is clearer because the base_uri is actually just a path prefix. This continues the work started in `555f467`.	2020-10-09 12:51:24 +01:00
Martin Brennan	4193eb0419	FIX: Respect force download when downloading secure media via lightbox (#10769) The download link on the lightbox for images was not downloading the image if the upload was marked secure, because the code in the upload controller route was not respecting the dl=1 param for force download. This PR fixes this so the download link works for secure images as well as regular lightboxed images.	2020-09-29 12:12:03 +10:00
Martin Brennan	31e31ef449	SECURITY: Add content-disposition: attachment for SVG uploads * strip out the href and xlink:href attributes from use element that are _not_ anchors in svgs which can be used for XSS * adding the content-disposition: attachment ensures that uploaded SVGs cannot be opened and executed using the XSS exploit. svgs embedded using an img tag do not suffer from the same exploit	2020-07-09 13:31:48 +10:00
Jarek Radosz	64ce12a758	FIX: `OptimizedImage#filesize` (#10095) `OptimizedImage#filesize` calls `Discourse.store.download` with an OptimizedImage as an argument. It would in turn attempt to call `#original_filename` and `#secure?` on that object. Both would fail as these methods do not exist on OptimizedImage, only on Upload. We didn't know about these issues because: 1. `#calculate_filesize` is not called often, because the filesize is saved on OptimizedImage creation, so it's used mostly for manual filesize recalculation 2. we were using `rescue nil` which swallows all errors	2020-07-06 17:01:29 +02:00
Martin Brennan	8ef782bdbd	FIX: Increase time of DOWNLOAD_URL_EXPIRES_AFTER_SECONDS to 5 minutes (#10160) * Change S3Helper::DOWNLOAD_URL_EXPIRES_AFTER_SECONDS to 5 minutes, which controls presigned URL expiry and secure-media route cache time. * This is done because of the composer preview refreshing while typing causes a lot of requests sent to our server because of the short URL expiry. If this ends up being not enough we can always increase the time or explore other avenues (e.g. GitHub has a 7 day validity for secure URLs)	2020-07-03 13:42:36 +10:00
Sam Saffron	689568c216	FIX: invalid urls should not break store.has_been_uploaded? Breaking this method has wide ramifications, including breaking search indexing.	2020-06-25 15:00:15 +10:00
Martin Brennan	e92909aa77	FIX: Use ActionDispatch::Http::ContentDisposition for uploads content-disposition (#10108) See https://meta.discourse.org/t/broken-pipe-error-when-uploading-to-a-s3-clone-a-pdf-with-a-name-containing-e-i-etc/155414 When setting content-disposition for attachment, use the ContentDisposition class to format it. This handles filenames with weird characters and localization (accented characters) correctly.	2020-06-23 17:10:56 +10:00
Guo Xiang Tan	828ceab64b	DEV: Make rubocop happy.	2020-06-17 15:47:05 +08:00
Martin Brennan	e5da2d24e5	FIX: Add attachment content-disposition for all non-image files (#10058) This will make it so the original filename is used when downloading all non-image files, bringing S3Store into line with the to_s3 migration and local storage. Video and audio files will still stream correctly in HTML players as well. See https://meta.discourse.org/t/cannot-download-non-image-media-files-original-filenames-lost-when-uploaded-to-s3/152797 for a lot of extra context.	2020-06-17 11:16:37 +10:00
Jarek Radosz	3d55f2e3b7	FIX: Improvements and fixes to the image downsizing script (#9950) Fixed bugs, added specs, extracted the upload downsizing code to a class, added support for non-S3 setups, changed it so that images aren't downloaded twice. This code has been tested on production and successfully resized ~180k uploads. Includes: * DEV: Extract upload downsizing logic * DEV: Add support for non-S3 uploads * DEV: Process only images uploaded by users * FIX: Incorrect usage of `count` and `exist?` typo * DEV: Spec S3 image downsizing * DEV: Avoid downloading images twice * DEV: Update filesizes earlier in the process * DEV: Return false on invalid upload * FIX: Download images that are currently above the limit (If the image size limit is decreased, then there was no way to resize those images that now fall outside the allowed size range) * Update script/downsize_uploads.rb (Co-authored-by: Régis Hanol <regis@hanol.fr>)	2020-06-11 14:47:59 +02:00
Jarek Radosz	27ad562ff5	DEV: Rubocop fix	2020-06-01 06:07:07 +02:00
Jarek Radosz	7df688d108	FIX: Handle files removed between `glob` and `mtime`	2020-06-01 05:50:50 +02:00
Roman Rizzi	b61a291cf3	FIX: returns false if the upload url is an invalid mailto link (#9877)	2020-05-26 10:32:48 -03:00
Michael Brown	d9a02d1336	Revert "Revert "Merge branch 'master' of https://github.com/discourse/discourse"" This reverts commit `20780a1eee`. * SECURITY: re-adds accidentally reverted commit: 03d26cd6: ensure embed_url contains valid http(s) uri * when the merge commit `e62a85cf` was reverted, git chose the `2660c2e2` parent to land on instead of the `03d26cd6` parent (which contains security fixes)	2020-05-23 00:56:13 -04:00
Jeff Atwood	20780a1eee	Revert "Merge branch 'master' of https://github.com/discourse/discourse" This reverts commit `e62a85cf6f`, reversing changes made to `2660c2e21d`.	2020-05-22 20:25:56 -07:00
Osama Sayegh	02f44def56	FIX: Don't blow up when trying to parse invalid or non-ASCII URLs (#9838) * FIX: Don't blow up when trying to parse invalid or non-ASCII URLs Follow-up to `72f139191e`	2020-05-20 12:46:27 +03:00
Martin Brennan	72f139191e	FIX: S3 store has_been_uploaded? was not taking into account s3 bucket path (#9810) In some cases, between Discourse forums the hostname of a URL could match if they are hosting S3 files on the same bucket but the S3 bucket path might not. So e.g. https://testbucket.somesite.com/testpath/some/file/url.png vs https://testbucket.somesite.com/prodpath/some/file/url.png. So has_been_uploaded? was returning true for the second URL, even though it may have been uploaded on a different Discourse forum. This is a very rare case but must be accounted for, because this impacts UrlHelper.is_local which mistakenly thinks the file has already been downloaded and thus allows the URL to be cooked, where we want to return the full URL to be downloaded using PullHotlinkedImages.	2020-05-20 10:40:38 +10:00
Sam Saffron	0cbaa8d813	FEATURE: extend duration allowed for download Previously we would raise a warning in the logs if downloading a file (from s3) takes longer than 60 seconds. At scale this happens reasonably frequently. 1. Raised the duration to 3 minutes 2. Pulled the resizing mutex out of the downloading mutex so we have fewer and clearer error logs	2020-05-15 12:45:47 +10:00
Sam Saffron	d0d5a138c3	DEV: stop freezing frozen strings We have the `# frozen_string_literal: true` comment on all our files. This means all string literals are frozen. There is no need to call #freeze on any literals. For files with `# frozen_string_literal: true` ``` puts %w{a b}[0].frozen? => true puts "hi".frozen? => true puts "a #{1} b".frozen? => true puts ("a " + "b").frozen? => false puts (-("a " + "b")).frozen? => true ``` For more details see: https://samsaffron.com/archive/2018/02/16/reducing-string-duplication-in-ruby	2020-04-30 16:48:53 +10:00
Jarek Radosz	c1c211365a	FIX: Improve clearing store cache (#9568) 1. Shorter 2. Simpler 3. Doesn't depend on external binaries 4. Doesn't fail on large amounts of files 5. Hopefully eliminates flaky spec errors	2020-04-28 17:24:04 +02:00
David Taylor	ba616ffb50	DEV: Use a tmp directory for storing uploads in tests (#9554) This avoids development-mode upload files from polluting the test environment	2020-04-28 14:03:04 +01:00
Gerhard Schlager	c6b411f6c1	FIX: Restore to S3 didn't work without env variables The `uploads:migrate_to_s3` rake task should always use the environment variables, because you usually don't want to break your site's uploads during the migration. But restoring a backup should work with site settings as well as environment variables, otherwise you can't restore uploads to S3 from the web interface.	2020-04-19 20:24:40 +02:00
Gerhard Schlager	baae0e7446	FIX: Infinite loop in migrate_to_s3 rake task	2020-04-19 20:24:40 +02:00
Gerhard Schlager	5bffb033df	FIX: The migrate_to_s3 rake task couldn't find the AWS SDK	2020-03-26 16:41:10 +01:00
Gerhard Schlager	93b8b04b06	FIX: Migrating uploads to S3 could miss files The rake task aborted the migration with "Already migrated" when all upload URLs linked to the correct S3 bucket even though the files didn't exist on S3. By removing the first check we force the rake task to check for the existence of uploads on S3.	2020-03-04 12:50:48 +01:00
Gerhard Schlager	0adab26e45	FIX: Don't count ignored, missing uploads in migration to S3	2020-02-12 16:18:52 +01:00
Jarek Radosz	63a4aa65ff	DEV: Ignore `ls` errors when clearing FileStore cache (#8780) A race condition issue is possible when multiple threads/processes are calling this method. `ls` prints out to stderr "cannot access '...': No such file or directory" if any of the files it's currently trying to list are being removed by the `xargs rm -rf` in another process. That doesn't affect the result, but it did raise an error before this change. Tested on a production instance where the original issue was observed. Co-Authored-By: Régis Hanol <regis@hanol.fr>	2020-01-27 02:59:54 +01:00
Martin Brennan	7c32411881	FEATURE: Secure media allowing duplicated uploads with category-level privacy and post-based access rules (#8664) ### General Changes and Duplication * We now consider a post `with_secure_media?` if it is in a read-restricted category. * When uploading we now set an upload's secure status straight away. * When uploading if `SiteSetting.secure_media` is enabled, we do not check to see if the upload already exists using the `sha1` digest of the upload. The `sha1` column of the upload is filled with a `SecureRandom.hex(20)` value which is the same length as `Upload::SHA1_LENGTH`. The `original_sha1` column is filled with the _real_ sha1 digest of the file. * Whether an upload `should_be_secure?` is now determined by whether the `access_control_post` is `with_secure_media?` (if there is no access control post then we leave the secure status as is). * When serializing the upload, we now cook the URL if the upload is secure. This is so it shows up correctly in the composer preview, because we set secure status on upload. ### Viewing Secure Media * The secure-media-upload URL will take the post that the upload is attached to into account via `Guardian.can_see?` for access permissions * If there is no `access_control_post` then we just deliver the media. This should be a rare occurrence and shouldn't cause issues as the `access_control_post` is set when `link_post_uploads` is called via `CookedPostProcessor` ### Removed We no longer do any of these because we do not reuse uploads by sha1 if secure media is enabled. * We no longer have a way to prevent cross-posting of a secure upload from a private context to a public context. * We no longer have to set `secure: false` for uploads when uploading for a theme component.	2020-01-16 13:50:27 +10:00
Gerhard Schlager	e474cda321	REFACTOR: Restoring of backups and migration of uploads to S3	2020-01-14 11:41:35 +01:00
Vinoth Kannan	3b7f5db5ba	FIX: parallel spec system needs a dedicated upload folder for each worker. (#8547)	2019-12-18 11:21:57 +05:30
Vinoth Kannan	d3e7768ea8	Revert "FIX: parallel spec system needs a dedicated upload folder for each worker. (#8372)" This reverts commit `42e5176bc3`.	2019-11-19 15:02:18 +05:30
Vinoth Kannan	42e5176bc3	FIX: parallel spec system needs a dedicated upload folder for each worker. (#8372)	2019-11-19 13:16:20 +05:30
Penar Musaraj	102909edb3	FEATURE: Add support for secure media (#7888) This PR introduces a new secure media setting. When enabled, it prevents unauthorized access to media uploads (files of type image, video and audio). When the `login_required` setting is enabled, then all media uploads will be protected from unauthorized (anonymous) access. When `login_required` is disabled, only media in private messages will be protected from unauthorized access. A few notes: - the `prevent_anons_from_downloading_files` setting no longer applies to audio and video uploads - the `secure_media` setting can only be enabled if S3 uploads are already enabled and configured - upload records have a new column, `secure`, which is a boolean `true/false` of the upload's secure status - when creating a public post with an upload that has already been uploaded and is marked as secure, the post creator will raise an error - when enabling or disabling the setting on a site with existing uploads, the rake task `uploads:ensure_correct_acl` should be used to update all uploads' secure status and their ACL on S3	2019-11-18 11:25:42 +10:00
Penar Musaraj	067696df8f	DEV: Apply Rubocop redundant return style	2019-11-14 15:10:51 -05:00
David Taylor	1998be3b27	DEV: Raise errors when cleaning the download cache, and fix for macOS (#8319) POSIX's `head` specification states: "The application shall ensure that the number option-argument is a positive decimal integer" Negative values are supported on GNU `head`, so this works in the discourse docker image. However, in some environments (e.g. macOS), the system `head` version fails with a negative `n` parameter. This commit does two things: 1. Checks the status at each stage of the pipe, so it cannot fail silently 2. Flips the `ls` command to list in descending time order, and uses `tail -n +501` instead of `head -n -500`. The visible result is that macOS users no longer see `head: illegal line count -- -500` printed throughout the test suite.	2019-11-08 15:34:03 +00:00
Daniel Waterworth	55a1394342	DEV: pluck_first Doing .pluck(:column).first is a very common pattern in Discourse and in most cases, a limit clause isn't being added. Instead of adding a limit clause to all these callsites, this commit adds two new methods to ActiveRecord::Relation: pluck_first, equivalent to limit(1).pluck(*columns).first and pluck_first! which, like other finder methods, raises an exception when no record is found	2019-10-21 12:08:20 +01:00
Gerhard Schlager	24877a7b8c	FIX: Correctly encode non-ASCII filenames in HTTP header Backport of fix from Rails 6: `890485cfce`	2019-08-07 19:10:50 +02:00
Rafael dos Santos Silva	606c0ed14d	FIX: S3 uploads were missing a cache-control header (#7902) Admins still need to run the rake task to fix the files that were uploaded previously.	2019-08-06 14:55:17 -03:00
Gerhard Schlager	f2dc59d61f	FEATURE: Add hidden setting to include S3 uploads in backups	2019-07-09 14:04:16 +02:00
Penar Musaraj	03805e5a76	FIX: Ensure lightbox image download has correct content disposition in S3 (#7845)	2019-07-04 11:32:51 -04:00
Vinoth Kannan	b7830680b6	DEV: use cdn url to download the external uploads to local.	2019-06-06 19:17:19 +05:30
Penar Musaraj	f00275ded3	FEATURE: Support private attachments when using S3 storage (#7677) * Support private uploads in S3 * Use localStore for local avatars * Add job to update private upload ACL on S3 * Test multisite paths * update ACL for private uploads in migrate_to_s3 task	2019-06-06 13:27:24 +10:00
Guo Xiang Tan	a3938f98f8	Revert changes to `FileStore::S3Store#path_for` in `f0620e7118`. There are some places in the code base that assumes the method should return nil.	2019-05-29 18:39:07 +08:00
Guo Xiang Tan	f0620e7118	FEATURE: Support `[description|attachment](upload://<short-sha>)` in MD take 2. Previous attempt was missing `post_uploads` records.	2019-05-29 09:26:32 +08:00
Penar Musaraj	7c9fb95c15	Temporarily revert "FEATURE: Support `[description|attachment](upload://<short-sha>)` in MD. (#7603)" This reverts commit `b1d3c678ca`. We need to make sure post_upload records are correctly stored.	2019-05-28 16:37:01 -04:00
Guo Xiang Tan	b1d3c678ca	FEATURE: Support `[description|attachment](upload://<short-sha>)` in MD. (#7603)	2019-05-28 11:18:21 -04:00

144 Commits