What is wrong with LLM benchmarks, and why are we still using them? - sh.itjust.works

micheal65536@lemmy.micheal65536.duckdns.org · 11 months ago

In that case ChatGPT is correct: it cannot work with links. You will need to download the video transcript (subtitles) yourself and ask it to summarise that. This definitely works; people have been doing it for months.
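If the subtitles come down as an SRT or WebVTT file, it helps to strip the cue numbers and timestamps before pasting them in. A rough sketch of that step, assuming a standard subtitle layout (the function names `subtitles_to_text` and `build_summary_prompt` are just made up for this example, and the prompt wording is only a suggestion):

```python
import re

# Matches cue timing lines such as "00:00:01,000 --> 00:00:04,000" (SRT)
# or "00:01.000 --> 00:04.000" (WebVTT).
TIMESTAMP = re.compile(r"^\d{1,2}:\d{2}(?::\d{2})?[.,]\d{3}\s+-->\s+")


def subtitles_to_text(raw: str) -> str:
    """Reduce SRT/WebVTT-style subtitles to plain spoken text."""
    kept = []
    for line in raw.splitlines():
        line = line.strip()
        # Skip blanks, cue numbers, the WebVTT header and timing lines.
        # Caveat: a caption consisting only of digits is dropped as well.
        if (not line or line.isdigit() or line.startswith("WEBVTT")
                or TIMESTAMP.match(line)):
            continue
        line = re.sub(r"<[^>]+>", "", line).strip()  # drop inline tags like <i>
        # Auto-generated captions often repeat a line in consecutive cues.
        if line and (not kept or kept[-1] != line):
            kept.append(line)
    return " ".join(kept)


def build_summary_prompt(transcript: str) -> str:
    """Wrap the cleaned transcript in a plain summarisation request."""
    return "Summarise the following video transcript:\n\n" + transcript
```

Paste the output of `build_summary_prompt(subtitles_to_text(raw))` into the chat. Very long transcripts may still need to be split into several messages.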

micheal65536@lemmy.micheal65536.duckdns.org · 1 year ago

Probably another case of “I don’t want people training AI on my posts/images so I’m nuking my entire online existence”.

micheal65536@lemmy.micheal65536.duckdns.org · 1 year ago

Without knowing anything about this model or what it was trained on or how it was trained, it’s impossible to say exactly why it displays this behavior. But there is no “hidden layer” in llama.cpp that allows for “hardcoded”/“built-in” content.

It is absolutely possible for the model to “override pretty much anything in the system context”. Consider any regular “censored” model, and how any attempt at adding system instructions to change/disable this behavior is mostly ignored. This model is probably doing much the same thing except with a “built-in story” rather than a message that says “As an AI assistant, I am not able to …”.

As I said, without knowing anything more about what model this is or what the training data looked like, it’s impossible to say exactly why or how it has learned this behavior, or even whether it’s intentional (this could just be a side-effect of the model being trained on a small selection of specific stories, or perhaps those stories were over-represented in the training data).

micheal65536@lemmy.micheal65536.duckdns.org · 1 year ago

AMD GPU support appears to be included in GGML. I don’t see any reason why you wouldn’t be able to split between multiple GPUs as the splitting is handled within GGML itself and not tied to any particular library/driver/backend.
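To illustrate why the split does not need to care about the backend: dividing a model between devices is basically just arithmetic on layer counts. This is a toy sketch, not GGML's actual code, and `split_layers` is a name I made up. It assigns whole layers to each GPU in proportion to a list of weights (for example, each card's VRAM):

```python
def split_layers(n_layers: int, proportions: list[float]) -> list[int]:
    """Assign n_layers to devices in proportion to the given weights.

    Illustrative only: real implementations may split by tensor rows or
    account for per-device overhead, which this sketch ignores.
    """
    if not proportions or any(p < 0 for p in proportions):
        raise ValueError("proportions must be a non-empty list of non-negative numbers")
    total = sum(proportions)
    if total <= 0:
        raise ValueError("at least one proportion must be positive")

    counts = []
    assigned = 0
    cumulative = 0.0
    # Use cumulative boundaries so rounding errors do not accumulate.
    for p in proportions[:-1]:
        cumulative += p
        boundary = round(n_layers * cumulative / total)
        counts.append(boundary - assigned)
        assigned = boundary
    # The last device takes whatever is left, so the counts always sum up.
    counts.append(n_layers - assigned)
    return counts
```

For example, `split_layers(32, [3, 1])` puts 24 layers on the first device and 8 on the second. Nothing here depends on which vendor's driver runs each device.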

micheal65536@lemmy.micheal65536.duckdns.org · 1 year ago
