mirror of
https://github.com/discourse/discourse-ai.git
synced 2025-02-19 18:04:51 +00:00
A recent change meant that llm instance got cached internally, repeat calls to inference would cache data in Endpoint object leading model to failures. Both Gemini and Open AI expect a clean endpoint object cause they set data. This amends internals to make sure llm.generate will always operate on clean objects