discourse

Commit Graph

Author	SHA1	Message	Date
Ghassan Maslamani	96a6bb69b5	FIX: vimeo iframe url when data-original-href is missing (#18894 )	2023-01-31 12:00:27 +01:00
Martin Brennan	63fdb6dd65	FIX: Do not add empty use/svg tags in ExcerptParser (#19969 ) There was an issue where if hashtag-cooked HTML was sent to the ExcerptParser without the keep_svg option, we would end up with empty </use> and </svg> tags on the parts of the excerpt where the hashtag was, in this case when a post push notification was sent. Fixed this, and also added a way to only display a plaintext version of the hashtag for cases like this via PrettyText#excerpt.	2023-01-24 14:40:24 +10:00
Daniel Waterworth	666536cbd1	DEV: Prefer \A and \z over ^ and $ in regexes (#19936 )	2023-01-20 12:52:49 -06:00
David Taylor	6417173082	DEV: Apply syntax_tree formatting to `lib/*`	2023-01-09 12:10:19 +00:00
Martin Brennan	b2acc416e7	FIX: Server-side hashtag lookups of secure categories for a user (#19377 ) * FIX: Use Category.secured(guardian) for hashtag datasource Follow up to comments in #19219, changing the category hashtag datasource to use Category.secured(guardian) instead of Site.new(guardian).categories here since the latter does more work for not much benefit, and the query time is the same. Also eliminates some Hash -> Model back and forth busywork. Add some more specs too. * FIX: Server-side hashtag lookup cooking user loading When we were using the PrettyText.options.currentUser and parsing back and forth with JSON for the hashtag lookups server-side, we had a bug where the user's secure categories were not loaded since we never actually loaded a User model from the database, only parsed it from JSON. This commit fixes the issue by instead using the PretyText.options.userId and looking up the user directly from the database when calling hashtag_lookup via the PrettyText::Helpers code when cooking server-side. Added the missing spec to check for this as well.	2022-12-09 10:34:25 +10:00
Andrei Prigorshnev	a76d864c51	FEATURE: Show live user status on inline mentions on posts (#18683 ) Note that we don't have a database table and a model for post mentions yet, and I decided to implement it without adding one to avoid heavy data migrations. Still, we may want to add such a model later, that would be convenient, we have such a model for mentions in chat. Note that status appears on all mentions on all posts in a topic except of the case when you just posted a new post, and it appeared on the bottom of the topic. On such posts, status won't be shown immediately for now (you'll need to reload the page to see the status). I'll take care of it in one of the following PRs.	2022-12-06 19:10:36 +04:00
Martin Brennan	d3f02a1270	FEATURE: Generic hashtag autocomplete lookup and markdown cooking (#18937 ) This commit fleshes out and adds functionality for the new `#hashtag` search and lookup system, still hidden behind the `enable_experimental_hashtag_autocomplete` feature flag. Serverside We have two plugin API registration methods that are used to define data sources (`register_hashtag_data_source`) and hashtag result type priorities depending on the context (`register_hashtag_type_in_context`). Reading the comments in plugin.rb should make it clear what these are doing. Reading the `HashtagAutocompleteService` in full will likely help a lot as well. Each data source is responsible for providing its own lookup and search method that returns hashtag results based on the arguments provided. For example, the category hashtag data source has to take into account parent categories and how they relate, and each data source has to define their own icon to use for the hashtag, and so on. The `Site` serializer has two new attributes that source data from `HashtagAutocompleteService`. There is `hashtag_icons` that is just a simple array of all the different icons that can be used for allowlisting in our markdown pipeline, and there is `hashtag_context_configurations` that is used to store the type priority orders for each registered context. When sending emails, we cannot render the SVG icons for hashtags, so we need to change the HTML hashtags to the normal `#hashtag` text. Markdown The `hashtag-autocomplete.js` file is where I have added the new `hashtag-autocomplete` markdown rule, and like all of our rules this is used to cook the raw text on both the clientside and on the serverside using MiniRacer. Only on the server side do we actually reach out to the database with the `hashtagLookup` function, on the clientside we just render a plainer version of the hashtag HTML. Only in the composer preview do we do further lookups based on this. This rule is the first one (that I can find) that uses the `currentUser` based on a passed in `user_id` for guardian checks in markdown rendering code. This is the `last_editor_id` for both the post and chat message. In some cases we need to cook without a user present, so the `Discourse.system_user` is used in this case. Chat Channels This also contains the changes required for chat so that chat channels can be used as a data source for hashtag searches and lookups. This data source will only be used when `enable_experimental_hashtag_autocomplete` is `true`, so we don't have to worry about channel results suddenly turning up. ------ Known Rough Edges - Onebox excerpts will not render the icon svg/use tags, I plan to address that in a follow up PR - Selecting a hashtag + pressing the Quote button will result in weird behaviour, I plan to address that in a follow up PR - Mixed hashtag contexts for hashtags without a type suffix will not work correctly, e.g. #ux which is both a category and a channel slug will resolve to a category when used inside a post or within a [chat] transcript in that post. Users can get around this manually by adding the correct suffix, for example ::channel. We may get to this at some point in future - Icons will not show for the hashtags in emails since SVG support is so terrible in email (this is not likely to be resolved, but still noting for posterity) - Additional refinements and review fixes wil	2022-11-21 08:37:06 +10:00
David Taylor	f7fdaa97d1	DEV: Replace mini-loader with Ember's `loader.js` library (#18643 ) We already have this as a dependency, so it makes sense to use it rather than using our own not-quite-spec-compliant implementation	2022-10-18 14:53:06 +01:00
Martin Brennan	8ebd5edd1e	DEV: Rename secure_media to secure_uploads (#18376 ) This commit renames all secure_media related settings to secure_uploads_* along with the associated functionality. This is being done because "media" does not really cover it, we aren't just doing this for images and videos etc. but for all uploads in the site. Additionally, in future we want to secure more types of uploads, and enable a kind of "mixed mode" where some uploads are secure and some are not, so keeping media in the name is just confusing. This also keeps compatibility with the `secure-media-uploads` path, and changes new secure URLs to be `secure-uploads`. Deprecated settings: * secure_media -> secure_uploads * secure_media_allow_embed_images_in_emails -> secure_uploads_allow_embed_images_in_emails * secure_media_max_email_embed_image_size_kb -> secure_uploads_max_email_embed_image_size_kb	2022-09-29 09:24:33 +10:00
David Taylor	1bd1664ae0	DEV: Compile markdown-it-bundle with ember-cli (#18104 ) We were already compiling the markdown bundle via ember-cli, but that version was only being used in the test environment. This commit improves the implementation, and updates the filename so it's also used in production. This commit also - Removes the vendored copy of `markdown-it.js` and fetches from node_modules instead - Updates `pretty_text.rb` to remove the custom sprockets-manifest-parsing - Removes `pretty-text-bundle.js`, which was only being used by `pretty_text.rb`	2022-08-29 19:11:59 +01:00
David Taylor	3c81683955	DEV: Rename `UriHelper.escape_uri` to `.normalized_encode` This is a much better description of its function. It performs idempotent normalization of a URL. If consumers truly need to `encode` a URL (including double-encoding of existing encoded entities), they can use the existing `.encode` method.	2022-08-09 11:55:25 +01:00
Selase Krakani	862007fb18	FEATURE: Add support for case-sensitive Watched Words (#17445 ) * FEATURE: Add case-sensitivity flag to watched_words Currently, all watched words are matched case-insensitively. This flag allows a watched word to be flagged for case-sensitive matching. To allow allow for backwards compatibility the flag is set to false by default. * FEATURE: Support case-sensitive creation of Watched Words via API Extend admin creation and upload of Watched Words to support case sensitive flag. This lays the ground work for supporting case-insensitive matching of Watched Words. Support for an extra column has also been introduced for the Watched Words upload CSV file. The new column structure is as follows: word,replacement,case_sentive * FEATURE: Enable case-sensitive matching of Watched Words WordWatcher's word_matcher_regexp now returns a list of regular expressions instead of one case-insensitive regular expression. With the ability to flag a Watched Word as case-sensitive, an action can have words of both sensitivities.This makes the use of the global Regexp::IGNORECASE flag added to all words problematic. To get around platform limitations around the use of subexpression level switches/flags, a list of regular expressions is returned instead, one for each case sensitivity. Word matching has also been updated to use this list of regular expressions instead of one. * FEATURE: Use case-sensitive regular expressions for Watched Words Update Watched Words regular expressions matching and processing to handle the extra metadata which comes along with the introduction of case-sensitive Watched Words. This allows case-sensitive Watched Words to matched as such. * DEV: Simplify type casting of case-sensitive flag from uploads Use builtin semantics instead of a custom method for converting string case flags in uploaded Watched Words to boolean. * UX: Add case-sensitivity details to Admin Watched Words UI Update Watched Word form to include a toggle for case-sensitivity. This also adds support for, case-sensitive testing and matching of Watched Word in the admin UI. * DEV: Code improvements from review feedback - Extract watched word regex creation out to a utility function - Make JS array presence check more explicit and readable * DEV: Extract Watched Word regex creation to utility function Clean-up work from review feedback. Reduce code duplication. * DEV: Rename word_matcher_regexp to word_matcher_regexp_list Since a list is returned now instead of a single regular expression, change `word_matcher_regexp` to `word_matcher_regexp_list` to better communicate this change. * DEV: Incorporate WordWatcher updates from upstream Resolve conflicts and ensure apply_to_text does not remove non-word characters in matches that aren't at the beginning of the line.	2022-08-02 10:06:03 +02:00
Jarek Radosz	d1d6868325	DEV: Update xss.js (#17216 ) * DEV: Update xss.js * Fix our hack	2022-06-24 10:28:05 +08:00
Jarek Radosz	c9d3c45ba9	DEV: Remove obsolete parts of our custom loader (#17157 )	2022-06-20 18:40:25 +02:00
David Taylor	5238f6788c	FEATURE: Allow hotlinked media to be blocked (#16940 ) This commit introduces a new site setting: `block_hotlinked_media`. When enabled, all attempts to hotlink media (images, videos, and audio) will fail, and be replaced with a linked placeholder. Exceptions to the rule can be added via `block_hotlinked_media_exceptions`. `download_remote_image_to_local` can be used alongside this feature. In that case, hotlinked images will be blocked immediately when the post is created, but will then be replaced with the downloaded version a few seconds later. This implementation is purely server-side, and does not impact the composer preview. Technically, there are two stages to this feature: 1. `PrettyText.sanitize_hotlinked_media` is called during `PrettyText.cook`, and whenever new images are introduced by Onebox. It will iterate over all src/srcset attributes in the post HTML and check if they're allowed. If not, the attributes will be removed and replaced with a `data-blocked-hotlinked-src(set)` attribute 2. In the `CookedPostProcessor`, we iterate over all `data-blocked-hotlinked-src(set)` attributes and check whether we have a downloaded version of the media. If yes, we update the src to use the downloaded version. If not, the entire media element is replaced with a placeholder. The placeholder is labelled 'external media', and is a link to the offsite media.	2022-06-07 15:23:04 +01:00
Jarek Radosz	3db4b3bc5e	DEV: Drop lodash (#16110 )	2022-03-06 18:15:25 +01:00
Martin Brennan	599a72768c	DEV: Add force_quote_link option to PrettyText (#16034 ) This option will make it so the [quote] bbcode will always include the HTML link to the quoted post, even if a topic_id is not provided in the PrettyText#cook options. This is so [quote] bbcode can be used in other places, like chat messages, that always need the link and do not have an "off-topic" ID to use.	2022-02-23 16:13:46 +10:00
David Taylor	a54a06fc41	DEV: Connect pretty-text console to the Rails logger (#15909 ) This will allow pretty text deprecations / errors / warnings to appear in the Rails logs, rather than disappearing silently. (implementation adapted from `discourse_js_processor.rb`)	2022-02-11 17:16:27 +00:00
Martin Brennan	0b8d0a14d7	DEV: Add markdown_additional_options to Site (#15738 ) Sometimes plugins need to have additional data or options available when rendering custom markdown features/rules that are not available on the default opts.discourse object. These additional options should be namespaced to the plugin adding them. ``` Site.markdown_additional_options["chat"] = { limited_pretty_text_markdown_rules: [] } ``` These are passed down to markdown rules on opts.discourse.additionalOptions. The main motivation for adding this is the chat plugin, which currently stores chat_pretty_text_features and chat_pretty_text_markdown_rules on the Site object via additions to the serializer, and the Site object is not accessible to import via markdown rules (either through Site.current() or through container.lookup). So, to have this working for both front + backend code, we need to attach these additional options from the Site object onto the markdown options object.	2022-01-28 13:02:02 +10:00
Bianca Nenciu	48e5d1af03	FIX: Improve top links section from user summary (#15675 ) * Do not extract links for hotlinked images * Include only links that have been clicked at least once in user summary	2022-01-24 11:33:23 +11:00
Alan Guo Xiang Tan	c2afc3915b	FEATURE: Customizable rules and plugins for `PrettyText.markdown`. This commit extends the options which can be passed to `PrettyText.markdown` so that which Markdown-it rules and Discourse Markdown plugins to be used when rendering a text can be customizable. Currently, this extension is mainly used by plugins.	2022-01-11 10:39:40 +08:00
Martin Brennan	fa6b87a1bf	SECURITY: Strip unrendered unicode bidirectional chars in code blocks (#15032 ) When rendering the markdown code blocks we replace the offending characters in the output string with spans highlighting a textual representation of the character, along with a title attribute with information about why the character was highlighted. The list of characters stripped by this fix, which are the bidirectional characters considered relevant, are: U+202A U+202B U+202C U+202D U+202E U+2066 U+2067 U+2068 U+2069	2021-11-22 10:43:03 +10:00
Bianca Nenciu	1e992d9193	FIX: Do not check for duplicate links in Onebox (#13345 ) If a user posted a URL that appeared inside a Onebox, then the user got a duplicate link notice. This was fixed by skipping those links in Ruby. If a user posted a URL that was Oneboxes and contained other links that appeared in previous posts, then the user got a duplicate link notice. This was fixed by skipping those links in JavaScript.	2021-06-18 18:55:24 +03:00
Bianca Nenciu	d9484db718	FIX: Split link watched words from replace (#13196 ) It was not clear that replace watched words can be used to replace text with URLs. This introduces a new watched word type that makes it easier to understand.	2021-06-02 15:36:49 +10:00
Bianca Nenciu	c1dfd76658	FIX: Make replace watched words work with wildcard (#13084 ) Watched words are always regular expressions, despite watched_words_ _regular_expressions being enabled or not. Internally, wildcard characters are replaced with a regular expression that matches any non whitespace character.	2021-05-18 12:09:47 +03:00
Osama Sayegh	d56b2e85aa	FIX: Escape Font Awesome icons (#12421 ) This is not a security issue because regular users are not allowed to insert FA icons anywhere in the app. Admins can insert icons via custom badges, but they do have the ability to create themes with JS.	2021-03-17 16:11:40 +03:00
Rafael dos Santos Silva	e06076268a	FIX: Missing quote in emoji cdn setting caused by `83f332b` (#12280 )	2021-03-03 17:39:00 -03:00
Rafael dos Santos Silva	83f332b5a5	FEATURE: Add a site setting to allow emojis to come from an external URL (#12180 )	2021-03-02 16:04:16 -03:00
Bianca Nenciu	533800a87b	Add watched words of type "replace" (#12020 ) This commit includes other various improvements to watched words. auto_silence_first_post_regex site setting was removed because it overlapped with 'require approval' watched words.	2021-02-25 14:00:58 +02:00
Daniel Waterworth	df8436cd7f	FIX: Don't add a slug to constructed quote urls (#12052 ) A topic with the slug 'topic' might exist and may end up being linked to by mistake when malformed (i.e. cross-site) quotes are posted.	2021-02-11 12:21:13 -06:00
Bianca Nenciu	8ff9cdf390	FIX: Replace Vimeo iframes with a link in emails (#11443 ) This was implemented before, but it was not tested and broke at some point (probably Nokogiri update).	2020-12-09 14:58:36 +02:00
Martin Brennan	879e4a9e29	FIX: Inline avatar style for onebox when embedding secure images (#11229 ) When embedding secure images that are inline-avatars for oneboxes we weren't applying the correct sizing/style.	2020-11-16 09:58:40 +10:00
Martin Brennan	27e94f2f98	FIX: Make secure image onebox check more robust (#11179 ) When embedding secure images which have been oneboxed, we checked to see if the image's parent's parent had the class onebox-body. This was not always effective as if the image does not get resized/optimized then it does not have the aspect-image div wrapping it. This would cause the image to embed in the email but be huge. This PR changes the check to see if any of the image's ancestors have the class onebox-body, or if the image has the onebox-avatar class to account for variations in HTML structure.	2020-11-10 12:55:18 +10:00
Penar Musaraj	c1f3bd6a1c	FIX: secure_media stripping on lightboxes, non-image links (#11121 ) - Fixes stripping of lightboxes with empty srcset attribute - Does not fail when email has links with secure media URLs but no child image elements	2020-11-04 15:45:50 -05:00
Rafael dos Santos Silva	b70b3e867c	FIX: Safely skip secure_media steps when it's not enabled (#11110 ) * FIX: Safely skip secure_media steps when it's not enabled * DEV: Secure media tests should enable secure media	2020-11-03 13:53:15 -03:00
Martin Brennan	3655062c60	FIX: Ensure oneboxed secure images which are optimized and also lightboxed optimized images are embedded in email (#11061 ) We had an issue where onebox thumbnail was too large and thus was optimized, and we are using the image URLs in post to redact and re-embed, based on the sha1 in the URL. Optimized image URLs have extra stuff on the end like _99x99 so we were not parsing out the sha1 correctly. Another issue I found was for posts that have giant images, the original was being used to embed in the email and thus would basically never get included because it is huge. For example the URL `787b17ea61`_2_690x335.jpeg was not parsed correctly; we would end up with 787b17ea6140f4f022eb7f1509a692f2873cfe35_2_690x335.jpeg as the sha1 which would not find the image to re-embed that was already attached to the email. This fix will use the first optimized image of the detected upload when we are redacting and then re-embedding to make sure we are not sending giant things in email. Also, I detect if it is a onebox thumbnail or the site icon and force appropriate sizes and styles.	2020-11-02 09:52:21 +10:00
Krzysztof Kotlarek	dbec3792b7	FIX: pretty text allow list (#10977 ) Reword whitelist to allowlist in pretty-text. This library is used by plugins so we need deprecation notice.	2020-10-28 13:22:06 +11:00
Krzysztof Kotlarek	b2481adb40	FIX: persist secure image width and height if is given (#10994 ) `max-width: 50%; max-height: 400px;` is a good fallback, however, if width and height are given and are smaller than fallback - we should persist that smaller size.	2020-10-22 13:25:09 +11:00
Daniel Waterworth	721ee36425	Replace `base_uri` with `base_path` (#10879 ) DEV: Replace instances of Discourse.base_uri with Discourse.base_path This is clearer because the base_uri is actually just a path prefix. This continues the work started in `555f467`.	2020-10-09 12:51:24 +01:00
Robin Ward	e7abba9bbe	FIX: Missing files from previous commit	2020-09-15 10:59:41 -04:00
Roman Rizzi	efb9fd6ac0	FIX: Make sure rel attributes are correctly set. (#10645 ) We must guarantee that "rel=noopener" was set if "target=_blank" is present, which is not always the case for trusted users. Also, if the link contains the "nofollow" attribute, it has to have the "ugc" attribute as well.	2020-09-10 12:59:51 -03:00
Martin Brennan	dede942007	FEATURE: Allow email image embed with secure media (#10563 ) This PR introduces a few important changes to secure media redaction in emails. First of all, two new site settings have been introduced: * `secure_media_allow_embed_images_in_emails`: If enabled we will embed secure images in emails instead of redacting them. * `secure_media_max_email_embed_image_size_kb`: The cap to the size of the secure image we will embed, defaulting to 1mb, so the email does not become too big. Max is 10mb. Works in tandem with `email_total_attachment_size_limit_kb`. `Email::Sender` will now attach images to the email based on these settings. The sender will also call `inline_secure_images` in `Email::Styles` after secure media is redacted and attachments are added to replace redaction messages with attached images. I went with attachment and `cid` URLs because base64 image support is _still_ flaky in email clients. All redaction of secure media is now handled in `Email::Styles` and calls out to `PrettyText.strip_secure_media` to do the actual stripping and replacing with placeholders. `app/mailers/group_smtp_mailer.rb` and `app/mailers/user_notifications.rb` no longer do any stripping because they are earlier in the pipeline than `Email::Styles`. Finally the redaction notice has been restyled and includes a link to the media that the user can click, which will show it to them if they have the necessary permissions. ![image](https://user-images.githubusercontent.com/920448/92341012-b9a2c380-f0ff-11ea-860e-b376b4528357.png)	2020-09-10 09:50:16 +10:00
Robin Ward	c4079780be	REFACTOR: Remove `_.merge`	2020-09-03 10:03:33 -04:00
Krzysztof Kotlarek	e0d9232259	FIX: use allowlist and blocklist terminology (#10209 ) This is a PR of the renaming whitelist to allowlist and blacklist to the blocklist.	2020-07-27 10:23:54 +10:00
Robin Ward	ba00cc8ec4	DEV: Move `Discourse.getURL` and related functions to a module (#9966 ) * DEV: Move `Discourse.getURL` and related functions to a module * DEV: Remove `Discourse.getURL` and `Discourse.getURLWithCDN` * FIX: `get-url` is required for server side code * DEV: Deprecate `BaseUri` too.	2020-06-03 12:45:26 -04:00
Joffrey JAFFEUX	77801aa9be	FIX: allows to have custom emoji translation without static file (#9893 )	2020-05-27 20:11:52 +02:00
Sam Saffron	4601833e4e	PERF: ensure we run full GC on contexts Prior to this change we would never clear memory from contexts and rely on V8 reacting to pressure This could lead to bloating of PrettyText and Transpiler contexts This optimisations ensures that we will clear memory 2 seconds after the last eval on the context	2020-05-15 14:01:54 +10:00
Robin Ward	eab560fe2a	DEV: import I18n instead of global usage (#9768 ) Co-authored-by: Mark VanLandingham <markvanlan@gmail.com> Co-authored-by: Robin Ward <robin.ward@gmail.com> Co-authored-by: Mark VanLandingham <markvanlan@gmail.com>	2020-05-13 16:23:41 -04:00
Krzysztof Kotlarek	a3e1e9ef16	FIX: remove deprecated URI.escape (#9697 ) During Nokogumbo changes I introduced back URI.escape which is deprecated.	2020-05-08 11:14:59 +10:00
Krzysztof Kotlarek	9bff0882c3	FEATURE: Nokogumbo (#9577 ) * FEATURE: Nokogumbo Use Nokogumbo HTML parser.	2020-05-05 13:46:57 +10:00

1 2 3 4 5 ...

275 Commits