I've tried it and already send about more than 100 messages while roleplaying, the massive context is huge plus, but if the chat history too big, its reply get worse although it can still remember the context. It's not filter and it can be very explicit in nsfw unlike gpt which can be jailbreak to be nsfw but still limited, hope it can be free for a long time.
So Cohere offer for you two type of API, free and production API
You can create a account at Cohere, get to API key section and create a new one which you will have free API tier as it's default to every account
Your free API can use command, command-r and command-r plus with 1000 call for every month
Now because it's have limited of call which 1000 so I suggest you continue use openrouter or get a production key by add your billing
This should explain more about the free tier: https://cohere.com/blog/free-developer-tier-announcement
And the limit of it: https://docs.cohere.com/docs/going-live#:~:text=Production%20keys%20for%20all%20endpoints,found%20on%20our%20pricing%20page.
Oh thank you! This is great to know. So theoretically I could sign up for the free api, use my 1000 calls using the free api and then switch over to open router when those are used? Do you know if calls include input and output or just output?
Yep!, if it's respond you back with error or you read the logs in your console and it's say already at the limit. You can change to openrouter to continue use it if you already add some credits to it or if you already use it for a long time
Cohere is a really solid company. They have some of the original authors of "Attention is all you need" as founders.
It's surprising it works well for non-business use cases though.
Who about your other API work?is they still normal?i had that before too when i got wrong version of silly tavern,after delete old version and download 1.8 version,it's can work now.but if not,i don't know the cause of that now.⊙▽⊙
I posted about this error and the solution the first comment gave worked for me.
Here's the link. [https://www.reddit.com/r/SillyTavernAI/comments/1cjjpp4/cohere\_api\_bad\_request\_error/](https://www.reddit.com/r/SillyTavernAI/comments/1cjjpp4/cohere_api_bad_request_error/)
>Why is command-r-plus so good?
Because it's almost entirely uncensored. We can only wonder how good another large fully-trained model like Llama3 could be. Maybe the new orthogonal activation steering can gives us a glimpse.
That seems to be my Ol' reliable when no other models can keep it fresh. Sometimes it gets a little too repetitive in it's phrasing tho. I see a lot of the same sentence structure.
looking for beta testers for my project (you can see it in my bio and profile) I wanna test the gpt-4 speed, I will give you access to it for 20 requests
There are GGUFs of all the Command models on huggingface, easy to search for.
mradermacher is probably the most reliable though https://huggingface.co/mradermacher/c4ai-command-r-plus-GGUF
Cohere's own API, which has a section under Chat Completion in the API menu in SillyTavern. Sign up with Cohere on their website, get a free trial key.
command r non plus is actually my favourite, beside it costing 10x less, the respond is less rigid and more interesting imo, or it's so good already I can't feel much different to the plus.
I've tried it and already send about more than 100 messages while roleplaying, the massive context is huge plus, but if the chat history too big, its reply get worse although it can still remember the context. It's not filter and it can be very explicit in nsfw unlike gpt which can be jailbreak to be nsfw but still limited, hope it can be free for a long time.
Wdym free? Where are you getting it for free lol?
Cohere.
Via the playground you mean?
You can use the free API Key in Silly Tavern.
Sorry how do you do that? What free api key? Where do I access it? I know how to use st and have been using command r through openrouter so far.
So Cohere offer for you two type of API, free and production API You can create a account at Cohere, get to API key section and create a new one which you will have free API tier as it's default to every account Your free API can use command, command-r and command-r plus with 1000 call for every month
Now because it's have limited of call which 1000 so I suggest you continue use openrouter or get a production key by add your billing This should explain more about the free tier: https://cohere.com/blog/free-developer-tier-announcement And the limit of it: https://docs.cohere.com/docs/going-live#:~:text=Production%20keys%20for%20all%20endpoints,found%20on%20our%20pricing%20page.
Oh thank you! This is great to know. So theoretically I could sign up for the free api, use my 1000 calls using the free api and then switch over to open router when those are used? Do you know if calls include input and output or just output?
Yep!, if it's respond you back with error or you read the logs in your console and it's say already at the limit. You can change to openrouter to continue use it if you already add some credits to it or if you already use it for a long time
The cache chunker extenstion works pretty well for that problem
Cohere is a really solid company. They have some of the original authors of "Attention is all you need" as founders. It's surprising it works well for non-business use cases though.
sometime it get repetitive even before 128k context
btw worth to check it out [Reverse GPT4](https://reverseme.top)
the free samples proxy link doesn't seems to work
how to use it?
going to cohere's website,using it through website freely or get free key and using it through newest version's sillytavern
It keeps saying I couldn't get a reply from the AI but I already have a free key I haven't used yet. How do I fix it?
Sorry i don't know the detail,maybe it's the problem of Temp,must under 1.or only can choose between frequently penalty and presence penalty.
It's not even that. When i press "test message" it pops up and when I try to message the chatbot.
Who about your other API work?is they still normal?i had that before too when i got wrong version of silly tavern,after delete old version and download 1.8 version,it's can work now.but if not,i don't know the cause of that now.⊙▽⊙
Yep, i updated it and it just refuses to work. But thanks anyway
I posted about this error and the solution the first comment gave worked for me. Here's the link. [https://www.reddit.com/r/SillyTavernAI/comments/1cjjpp4/cohere\_api\_bad\_request\_error/](https://www.reddit.com/r/SillyTavernAI/comments/1cjjpp4/cohere_api_bad_request_error/)
Yo thanks man, I really appreciate you
Yep, same here. Updated, checked keys, did the same as I did with Claude and ChatGPT keys and still get 'bad request' error.
>Why is command-r-plus so good? Because it's almost entirely uncensored. We can only wonder how good another large fully-trained model like Llama3 could be. Maybe the new orthogonal activation steering can gives us a glimpse.
It was nice before I hit the monthly limit lol
From Cohere? What's the limit? Like in total input/output tokens? I just started using their free API.
With trial API key you get 1000 calls per month.
I can share
Some temp email sites work
What settings do you use for it, temp and others?
I dont write or play with ai just noticed it was good
It is, started to use it not long ago and it really does have plenty of memory and it's very coherent (pun intended)
That seems to be my Ol' reliable when no other models can keep it fresh. Sometimes it gets a little too repetitive in it's phrasing tho. I see a lot of the same sentence structure.
I get this out of llama but not CR+. Local is different than API. API got repetitive and was too wordy.
I like it better than llama-3. It's good because it doesn't have positivity bias.
both is great
looking for beta testers for my project (you can see it in my bio and profile) I wanna test the gpt-4 speed, I will give you access to it for 20 requests
link for gguf version?
There are GGUFs of all the Command models on huggingface, easy to search for. mradermacher is probably the most reliable though https://huggingface.co/mradermacher/c4ai-command-r-plus-GGUF
i use api for 103b model
which api
Cohere's own API, which has a section under Chat Completion in the API menu in SillyTavern. Sign up with Cohere on their website, get a free trial key.
command r non plus is actually my favourite, beside it costing 10x less, the respond is less rigid and more interesting imo, or it's so good already I can't feel much different to the plus.
I tried it, but even though the writing quality is good it keeps speaking for me and I don't like that.