discourse-ai/tokenizers
Rafael dos Santos Silva 5db7bf6e68
Mixtral (#376)
Add both Mistral and Mixtral support. Also includes vLLM-openAI inference support.

Co-authored-by: Roman Rizzi <rizziromanalejandro@gmail.com>
2023-12-26 14:49:55 -03:00
..
Apache License Refinements to embeddings and tokenizers (#61) 2023-05-15 15:10:42 -03:00
MIT License Refinements to embeddings and tokenizers (#61) 2023-05-15 15:10:42 -03:00
README.md Mixtral (#376) 2023-12-26 14:49:55 -03:00
all-mpnet-base-v2.json FIX: Disable truncation and padding in all-mpnet-base-v2 tokenizer (#105) 2023-07-13 21:09:46 -03:00
bert-base-uncased.json FEATURE: Add a basic tokenizer API (#37) 2023-04-19 11:55:59 -03:00
bge-large-en.json FEATURE: Bge-large-en embeddings via Cloudflare Workers AI API (#241) 2023-10-04 13:47:51 -03:00
claude-v1-tokenization.json Refinements to embeddings and tokenizers (#61) 2023-05-15 15:10:42 -03:00
llama-2-70b-chat-hf.json FEATURE: Llama2 for summarization (#116) 2023-07-27 13:55:32 -03:00
mixtral.json Mixtral (#376) 2023-12-26 14:49:55 -03:00
multilingual-e5-large.json FEATURE: Support for locally infered embeddings in 100 languages (#115) 2023-07-27 15:50:03 -03:00

README.md

bert-base-uncased.json

Licensed under Apache License

claude-v1-tokenization.json

Licensed under MIT License

all-mpnet-base-v2.json

Licensed under Apache License

llama-2-70b-chat-hf

Licensed under LLAMA 2 COMMUNITY LICENSE AGREEMENT

multilingual-e5-large

Licensed under MIT License

bge-large-en

Licensed under MIT License

mixtral

Licensed under Apache 2.0 License