discourse-ai/lib
Sam 47f5da7e42
FEATURE: Add AI-powered spam detection for new user posts (#1004)
This introduces a comprehensive spam detection system that uses LLM models
to automatically identify and flag potential spam posts. The system is
designed to be both powerful and configurable while preventing false positives.

Key Features:
* Automatically scans first 3 posts from new users (TL0/TL1)
* Creates dedicated AI flagging user to distinguish from system flags
* Tracks false positives/negatives for quality monitoring
* Supports custom instructions to fine-tune detection
* Includes test interface for trying detection on any post

Technical Implementation:
* New database tables:
  - ai_spam_logs: Stores scan history and results
  - ai_moderation_settings: Stores LLM config and custom instructions
* Rate limiting and safeguards:
  - Minimum 10-minute delay between rescans
  - Only scans significant edits (>10 char difference)
  - Maximum 3 scans per post
  - 24-hour maximum age for scannable posts
* Admin UI features:
  - Real-time testing capabilities
  - 7-day statistics dashboard
  - Configurable LLM model selection
  - Custom instruction support

Security and Performance:
* Respects trust levels - only scans TL0/TL1 users
* Skips private messages entirely
* Stops scanning users after 3 successful public posts
* Includes comprehensive test coverage
* Maintains audit log of all scan attempts


---------

Co-authored-by: Keegan George <kgeorge13@gmail.com>
Co-authored-by: Martin Brennan <martin@discourse.org>
2024-12-12 09:17:25 +11:00
..
ai_bot FEATURE: first class support for OpenRouter (#1011) 2024-12-10 05:59:19 +11:00
ai_helper FIX: Tag suggester is suggesting already assigned tags (#990) 2024-12-03 07:25:04 +11:00
ai_moderation FEATURE: Add AI-powered spam detection for new user posts (#1004) 2024-12-12 09:17:25 +11:00
automation FIX: Triage rule should append selected tags instead of replacing them (#1022) 2024-12-11 11:19:44 -03:00
completions UX: Improve rough edges of AI usage page (#1014) 2024-12-12 08:55:24 +11:00
configuration FEATURE: Add AI-powered spam detection for new user posts (#1004) 2024-12-12 09:17:25 +11:00
database DEV: port directory structure to Zeitwerk (#319) 2023-11-29 15:17:46 +11:00
discord/bot FEATURE: Discord Bot integration (#831) 2024-10-16 12:41:18 -03:00
embeddings FIX: Gemini inference client was missing #instance (#1019) 2024-12-10 15:42:31 -03:00
inference FIX: Gemini inference client was missing #instance (#1019) 2024-12-10 15:42:31 -03:00
sentiment UX: Make sentiment trends more readable (#1018) 2024-12-11 09:13:18 -08:00
summarization FEATURE: first class support for OpenRouter (#1011) 2024-12-10 05:59:19 +11:00
tasks/modules FEATURE: Backfill posts sentiment. (#982) 2024-12-03 10:27:03 -03:00
tokenizer FIX/REFACTOR: FoldContent revamp (#866) 2024-10-25 11:51:17 -03:00
utils FEATURE: allow artifacts to be updated (#980) 2024-12-03 07:23:31 +11:00
automation.rb FIX: AI Automation scripts were broken when using seeded models (#991) 2024-12-02 19:07:05 -03:00
engine.rb DEV: port directory structure to Zeitwerk (#319) 2023-11-29 15:17:46 +11:00
guardian_extensions.rb FEATURE: Calculate gists from non hot topics too (#958) 2024-11-26 13:44:12 -03:00
multisite_hash.rb FIX: properly cache user locale (#593) 2024-04-26 09:28:35 -03:00
post_extensions.rb FEATURE: Backfill posts sentiment. (#982) 2024-12-03 10:27:03 -03:00
summarization.rb FEATURE: Generate topic gists for the hot topics list. (#837) 2024-10-18 18:01:39 -03:00
topic_extensions.rb PERF: Preload only gists when including summaries in topic list (#948) 2024-11-25 12:24:02 -03:00