discourse-ai/lib
Sam fb81307c59
FEATURE: web browsing tool (#548)
This pull request makes several improvements and additions to the GitHub-related tools and personas in the `discourse-ai` repository:

1. It adds the `WebBrowser` tool to the  `Researcher` persona, allowing the AI to visit web pages, retrieve HTML content, extract the main content, and convert it to plain text.

2. It updates the `GithubFileContent`, `GithubPullRequestDiff`, and `GithubSearchCode` tools to handle HTTP responses more robustly (introducing size limits). 

3. It refactors the `send_http_request` method in the `Tool` class to follow redirects when specified, and to read the response body in chunks to avoid memory issues with large responses. (only for WebBrowser)

4. It updates the system prompt for the `Researcher` persona to provide more detailed guidance on when to use Google search vs web browsing, and how to optimize tool usage and reduce redundant requests.

5. It adds a new `web_browser_spec.rb` file with tests for the `WebBrowser` tool, covering various scenarios like handling different HTML structures and following redirects.
2024-03-28 16:01:58 +11:00
..
ai_bot FEATURE: web browsing tool (#548) 2024-03-28 16:01:58 +11:00
ai_helper FIX: Prevent AI chat thread titles from being created before replies are posted (#517) 2024-03-07 16:14:17 -03:00
automation FIX: unify automation model translation (#540) 2024-03-21 11:32:35 +11:00
completions FEATURE: Add vision support to AI personas (Claude 3) (#546) 2024-03-27 14:30:11 +11:00
configuration UX: Re-introduce embedding settings validations (#457) 2024-02-01 16:54:09 -03:00
database DEV: port directory structure to Zeitwerk (#319) 2023-11-29 15:17:46 +11:00
embeddings DEV: Fix module namespace breaking reloads (#530) 2024-03-14 15:19:28 -03:00
inference FEATURE: AI Quick Semantic Search (#501) 2024-03-08 13:02:50 -03:00
nsfw DEV: Fix various typos (#434) 2024-01-19 12:51:26 +01:00
sentiment FIX: Truncate content for sentiment/toxicity classification (#431) 2024-01-17 15:17:58 -03:00
summarization FIX: prompt engineering for summary prompt (#539) 2024-03-20 16:33:05 +11:00
tasks/modules FIX: Filter soft-deleted topics when backfilling sentiment (#527) 2024-03-12 21:01:24 -03:00
tokenizer FIX: Handle unicode on tokenizer (#515) 2024-03-14 17:33:30 -03:00
toxicity FIX: Truncate content for sentiment/toxicity classification (#431) 2024-01-17 15:17:58 -03:00
utils FEATURE: DNS SRV support for TEI (#363) 2023-12-18 13:21:21 -03:00
automation.rb FIX: unify automation model translation (#540) 2024-03-21 11:32:35 +11:00
chat_message_classificator.rb DEV: port directory structure to Zeitwerk (#319) 2023-11-29 15:17:46 +11:00
classificator.rb DEV: port directory structure to Zeitwerk (#319) 2023-11-29 15:17:46 +11:00
engine.rb DEV: port directory structure to Zeitwerk (#319) 2023-11-29 15:17:46 +11:00
guardian_extensions.rb FIX: don't show share conversation incorrectly (#526) 2024-03-13 11:24:22 +11:00
post_classificator.rb DEV: port directory structure to Zeitwerk (#319) 2023-11-29 15:17:46 +11:00