More accurately... I'm long on a viable alternative to the current monopoly. We have two OSes for phones (Android and iOS); there is no reason why we shouldn't have the same for all AI hardware and software. The only one even close is AMD.
Checked out this company about a year ago and they only offered small models. Now I see they have GLM-fp8/Kimi and DeepSeek V4 Pro. Since workloads are predominantly cached input, I'm surprised to see no separate price for cached input vs uncached.
I hope the prices drop significantly; at these prices you'll quickly end up with thousands in monthly costs.
Hopefully more hardware companies will be on the market in the coming years. If the Chinese eventually start competing with the current memory makers, maybe that will help.
It’s just weird that DeepSeek released a model that was not compatible with any of the usual engines. Without derez’s new project just to support DSv4, how long would it be until it’s actually viable in llama? :(
benlm | 21 days ago
[OP] kkm | 21 days ago
mezark | 21 days ago
brcmthrowaway | 21 days ago
latchkey | 21 days ago
brcmthrowaway | 20 days ago
latchkey | 20 days ago
boxking | 20 days ago
latchkey | 20 days ago
boxking | 20 days ago
latchkey | 20 days ago
I'm super curious what you would use them for.
boxking | 20 days ago
boxking | 20 days ago
maCDzP | 21 days ago
[OP] kkm | 21 days ago
latchkey | 21 days ago
(CEO Hot Aisle)
erichocean | 20 days ago
latchkey | 20 days ago
edg5000 | 20 days ago
alfiedotwtf | 20 days ago