AI萌娘
DenzelWashingMachine 2024 年 9 月 25 日 上午 6:09
What is the performance on this?
How much cpu/gpu/ram does this use I know it will be based on model but whats a decent llm to run with this and not eat up 100% resources so I can still play games and stream?
< >
正在显示第 1 - 6 条,共 6 条留言
Riftwind 2024 年 9 月 26 日 下午 12:33 
72b version consumes about 6gb of ram on my laptop, cant play games with it loaded because it takes forever to reply. The lower versions work better, but then they arent as smart. Seems its mostly GPU power and memory that are important.
DenzelWashingMachine 2024 年 9 月 26 日 下午 5:19 
Damn it would have been nice to have something that can just pull chat from twitch/youtube and then read it out or respond..But it looks like the new llama models can do this off even smartphones or raspberry pi's. So I might end up hosting my own on vps or from raspberry pi then connecting it to chat. Having this app be able to be used on low sys requirements and work with twitch or youtube would be a gread addon and increase sales im sure.
aYYbsYYa 2024 年 9 月 27 日 下午 9:07 
引用自 Riftwind
72b version consumes about 6gb of ram on my laptop, cant play games with it loaded because it takes forever to reply. The lower versions work better, but then they arent as smart. Seems its mostly GPU power and memory that are important.
you can use online api to slove this problem
Riftwind 2024 年 9 月 28 日 上午 3:02 
Ive heard you can, but theres little to no documentation or instructions on how, where and what to use to get this api sort of thing working. this program works out of the box, requires hardly any setup and doesnt require connecting to an external site to use, which i really like.
gtamike_TSGK 3 月 2 日 下午 1:20 
最后由 gtamike_TSGK 编辑于; 3 月 2 日 下午 1:22
yumri 7 月 10 日 上午 9:42 
In my mind ChatWaifu performance for the LLM is comparatively the same to when i optimize KoboldCPP to run locally on my machine. I can get more out of KoboldCPP but at the cost of worse performance. What ChatWaifu seems to have done is optimize for performance.

As it is AI compute it is normal for it to take 100% of your CPU and GPU resources to finish as soon as it can.
< >
正在显示第 1 - 6 条,共 6 条留言
每页显示数: 1530 50