What is the performance on this? :: AI萌娘综合讨论

商店页面

AI萌娘

全部讨论截图艺术作品实况直播视频创意工坊新闻指南评测

AI萌娘 > 综合讨论 > 主题详情

DenzelWashingMachine 2024 年 9 月 25 日上午 6:09

What is the performance on this?

How much cpu/gpu/ram does this use I know it will be based on model but whats a decent llm to run with this and not eat up 100% resources so I can still play games and stream?

< >

正在显示第 1 - 6 条，共 6 条留言

Riftwind

2024 年 9 月 26 日下午 12:33

72b version consumes about 6gb of ram on my laptop, cant play games with it loaded because it takes forever to reply. The lower versions work better, but then they arent as smart. Seems its mostly GPU power and memory that are important.

DenzelWashingMachine 2024 年 9 月 26 日下午 5:19

Damn it would have been nice to have something that can just pull chat from twitch/youtube and then read it out or respond..But it looks like the new llama models can do this off even smartphones or raspberry pi's. So I might end up hosting my own on vps or from raspberry pi then connecting it to chat. Having this app be able to be used on low sys requirements and work with twitch or youtube would be a gread addon and increase sales im sure.

aYYbsYYa

2024 年 9 月 27 日下午 9:07

引用自 Riftwind：
72b version consumes about 6gb of ram on my laptop, cant play games with it loaded because it takes forever to reply. The lower versions work better, but then they arent as smart. Seems its mostly GPU power and memory that are important.

you can use online api to slove this problem

Riftwind

2024 年 9 月 28 日上午 3:02

Ive heard you can, but theres little to no documentation or instructions on how, where and what to use to get this api sort of thing working. this program works out of the box, requires hardly any setup and doesnt require connecting to an external site to use, which i really like.

gtamike_TSGK 拥有 AI萌娘

3 月 2 日下午 1:20

Some are 6GB Vram

https://psteamcommunity.yuanyoumao.com/workshop/browse/?appid=2331610&requiredtags[]=chat_service

最后由 gtamike_TSGK 编辑于; 3 月 2 日下午 1:22

yumri

7 月 10 日上午 9:42

In my mind ChatWaifu performance for the LLM is comparatively the same to when i optimize KoboldCPP to run locally on my machine. I can get more out of KoboldCPP but at the cost of worse performance. What ChatWaifu seems to have done is optimize for performance.

As it is AI compute it is normal for it to take 100% of your CPU and GPU resources to finish as soon as it can.

< >

正在显示第 1 - 6 条，共 6 条留言

每页显示数： 1530 50

AI萌娘 > 综合讨论 > 主题详情

发帖日期： 2024 年 9 月 25 日上午 6:09

回复数： 6