FIX: never mention the word JSON in tool preamble (#572)

Just having the word JSON can confuse models when we expect them
to deal solely in XML

Instead provide an example of how string arrays should be returned

Technically the tool framework supports int arrays and more, but
our current implementation only does string arrays.

Also tune the prompt construction not to give any tips about arrays
if none exist
This commit is contained in:
Sam 2024-04-11 11:24:22 +10:00 committed by GitHub
parent bab5e52e38
commit 0cbbf130b9
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194
2 changed files with 57 additions and 16 deletions

View File

@ -33,7 +33,17 @@ module DiscourseAi
raise NotImplemented raise NotImplemented
end end
def tool_preamble def tool_preamble(include_array_tip: true)
array_tip =
if include_array_tip
<<~TEXT
If a parameter type is an array, return an array of values. For example:
<$PARAMETER_NAME>["one","two","three"]</$PARAMETER_NAME>
TEXT
else
""
end
<<~TEXT <<~TEXT
In this environment you have access to a set of tools you can use to answer the user's question. In this environment you have access to a set of tools you can use to answer the user's question.
You may call them like this. You may call them like this.
@ -47,16 +57,12 @@ module DiscourseAi
</parameters> </parameters>
</invoke> </invoke>
</function_calls> </function_calls>
#{array_tip}
If a parameter type is an array, return a JSON array of values. For example:
[1,"two",3.0]
If you wish to call multiple function in one reply, wrap multiple <invoke> If you wish to call multiple function in one reply, wrap multiple <invoke>
block in a single <function_calls> block. block in a single <function_calls> block.
Always prefer to lead with tool calls, if you need to execute any. Always prefer to lead with tool calls, if you need to execute any.
Avoid all niceties prior to tool calls, Eg: "Let me look this up for you.." etc. Avoid all niceties prior to tool calls, Eg: "Let me look this up for you.." etc.
Here are the complete list of tools available: Here are the complete list of tools available:
TEXT TEXT
end end
@ -148,6 +154,19 @@ module DiscourseAi
attr_reader :prompt attr_reader :prompt
def build_tools_prompt
return "" if prompt.tools.blank?
has_arrays =
prompt.tools.any? { |tool| tool[:parameters].any? { |p| p[:type] == "array" } }
(<<~TEXT).strip
#{self.class.tool_preamble(include_array_tip: has_arrays)}
<tools>
#{tools}</tools>
TEXT
end
private private
attr_reader :model_name, :opts attr_reader :model_name, :opts
@ -211,16 +230,6 @@ module DiscourseAi
def calculate_message_token(msg) def calculate_message_token(msg)
self.class.tokenizer.size(msg[:content].to_s) self.class.tokenizer.size(msg[:content].to_s)
end end
def build_tools_prompt
return "" if prompt.tools.blank?
(<<~TEXT).strip
#{self.class.tool_preamble}
<tools>
#{tools}</tools>
TEXT
end
end end
end end
end end

View File

@ -17,6 +17,38 @@ class TestDialect < DiscourseAi::Completions::Dialects::Dialect
end end
RSpec.describe DiscourseAi::Completions::Dialects::Dialect do RSpec.describe DiscourseAi::Completions::Dialects::Dialect do
describe "#build_tools_prompt" do
it "can exclude array instructions" do
prompt = DiscourseAi::Completions::Prompt.new("12345")
prompt.tools = [
{
name: "weather",
description: "lookup weather in a city",
parameters: [{ name: "city", type: "string", description: "city name", required: true }],
},
]
dialect = TestDialect.new(prompt, "test")
expect(dialect.build_tools_prompt).not_to include("array")
end
it "can include array instructions" do
prompt = DiscourseAi::Completions::Prompt.new("12345")
prompt.tools = [
{
name: "weather",
description: "lookup weather in a city",
parameters: [{ name: "city", type: "array", description: "city names", required: true }],
},
]
dialect = TestDialect.new(prompt, "test")
expect(dialect.build_tools_prompt).to include("array")
end
end
describe "#trim_messages" do describe "#trim_messages" do
it "should trim tool messages if tool_calls are trimmed" do it "should trim tool messages if tool_calls are trimmed" do
prompt = DiscourseAi::Completions::Prompt.new("12345") prompt = DiscourseAi::Completions::Prompt.new("12345")