Dran

Dran@lemmy.world · 10 days ago

Yes they were, so I’m offering you an actual theory as to why this may actually be true, yet difficult to “prove”.

Smoking was bad for your health long before anyone sat down and took the time to prove it. Autoregressive LLM tokenizer are a very new field of computer science and it’s going to take a while for the community to collectively understand everything we’re currently doing by trial and error.

Dran@lemmy.world · 10 days ago

Anecdotally, I use it a lot and I feel like my responses are better when I’m polite. I have a couple of theories as to why.

More tokens in the context window of your question, and a clear separator between ideas in a conversation make it easier for the inference tokenizer to recognize disparate ideas.
Higher quality datasets contain american boomer/millennial notions of “politeness” and when responses are structured in kind, they’re more likely to contain tokens from those higher quality datasets.

I haven’t mathematically proven any of this within the llama.cpp tokenizer, but I strongly suspect that I could at least prove a correlation between polite token input and dataset representation output tokens

Dran@lemmy.world · 28 days ago

Thank you for letting me know what software not to use; good bot

Dran@lemmy.world · 29 days ago

Crossfading and normalization would both independently be dealbreakers for me. I can’t go back

Dran@lemmy.world · 30 days ago

agree in principal, but in practice:

parents who live across the state
plexamp for music

Dran@lemmy.world · 30 days ago

They are indeed just that keen on our data.

They know they can’t get rid of it for all of their customers, but they do want to make it as hard as possible for random users to do so.

Dran@lemmy.world · 30 days ago

For people with “that one game” there is a middle ground. Mine is Destiny 2 and they use a version of easy anticheat that refuses to run on Linux. My solution was to buy a $150 used Dell on eBay, a $180 GPU to be able to output to my 4 high-res displays, and install Debian + moonlight on it. I moved my gaming PC downstairs and a combination of wake-on-lan + sunshine means that I can game at functionally native performance, streaming from the basement. In my setup, windows only exists to play games on.

The added bonus here is now I can also stream games to my phone, or other ~thin clients~ in the house, saving me upgrade costs if I want to play something in the living room or upstairs. All you need is the bare minimum for native-framerate, native-res decoding, which you can find in just about anything made in the last 5-10 years.

Dran@lemmy.world · 8 months ago

Fail2ban and containers can be tricky, because under the hood, you’ll often have container policies automatically inserting themselves above host policies in iptables. The docker documentation has a good write-up on how to solve it for their implementation

https://docs.docker.com/engine/network/packet-filtering-firewalls/

For your usecase specifically: If you’re using VMs only, you could run it within any VM that is exposing traffic, but for containers you’ll have to run fail2ban on the host itself. I’m not sure how LXC handles this, but I assume it’s probably similar to docker.

The simplest solution would be to just put something between your hypervisor and the Internet physically (a raspberry-pi-based firewall, etc)