6623928b95
A recent change meant that the LLM instance got cached internally; repeated calls to inference would accumulate data in the Endpoint object, leading to model failures. Both Gemini and OpenAI expect a clean endpoint object because they set data on it. This amends internals so that llm.generate always operates on clean objects.
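For context, a minimal sketch of the failure mode and the shape of the fix. This assumes the cache was a `||=` memoization of the endpoint; all class and method names here (`Endpoint`, `Llm`, `perform_completion!`) are illustrative, not the plugin's actual API.

```ruby
# Sketch only: names are hypothetical; the point is the memoization bug.

class Endpoint
  def initialize(model)
    @model = model
    @payload = {} # providers like Gemini/OpenAI mutate per-request state here
  end

  def perform_completion!(prompt)
    @payload[:prompt] = prompt
    "response for #{@model}: #{prompt}"
  end
end

class Llm
  def initialize(endpoint_class, model)
    @endpoint_class = endpoint_class
    @model = model
  end

  # Buggy shape: memoization reuses one Endpoint across calls, so state
  # written during the first request leaks into every later one.
  #
  #   def endpoint
  #     @endpoint ||= @endpoint_class.new(@model)
  #   end

  # Fixed shape: construct a fresh Endpoint per generate call so each
  # provider starts from a clean object.
  def generate(prompt)
    @endpoint_class.new(@model).perform_completion!(prompt)
  end
end

llm = Llm.new(Endpoint, "gemini-pro")
puts llm.generate("hello") # each call gets its own clean Endpoint
```

Building a fresh endpoint per call trades a trivial allocation cost for the guarantee that no request state survives between generations.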
Changed files:

- anthropic.rb
- aws_bedrock.rb
- base.rb
- canned_response.rb
- cohere.rb
- fake.rb
- gemini.rb
- hugging_face.rb
- open_ai.rb
- vllm.rb