Need some nginx magic

Behohippy@lemmy.world · 1 year ago

The advancements in this space have moved so fast, it’s hard to extract a predictive model on where we’ll end up and how fast it’ll get there.

Meta releasing LLaMA produced a ton of innovation from open source that showed you could run models that were nearly the same level as ChatGPT with less parameters, on smaller and smaller hardware. At the same time, almost every large company you can think of has prioritized integrating generative AI as a high strategic priority with blank cheque budgets. Whole industries (also deeply funded) are popping up around solving the context window memory deficiencies, prompt stuffing for better steerability, better summarization and embedding of your personal or corporate data.

We’re going to see LLM tech everywhere in everything, even if it makes no sense and becomes annoying. After a few years, maybe it’ll seem normal to have a conversation with your shoes?

Behohippy@lemmy.world · 1 year ago

I’m not sure either, Win 10/11 are pretty quick to get going and Ubuntu is not much longer than that. If I have to hard reset the mbp for work, it’s a nice block of slacker time :)

Behohippy@lemmy.world · 1 year ago

For the really old stuff, I used to do NetBSD. I’m sure their 32bit x86 support is still top notch.

Behohippy@lemmy.world · 1 year ago

Halls of Torment. $5 game on steam that is like a Vampire Survivors clone, but with more rpg elements to it.

Behohippy@lemmy.world · 1 year ago

These are amazing. Dell, Lenovo and I think HP made these tiny things and they were so much easier to get than Pi’s during the shortage. Plus they’re incredibly fast in comparison.

Behohippy@lemmy.world · 1 year ago

I’ve got a background in deep learning and I still struggle to understand the attention mechanism. I know it’s a key/value store but I’m not sure what it’s doing to the tensor when it passes through different layers.

Behohippy@lemmy.world · 1 year ago

Subscribed. That last episode of AAA was heartbreaking.

Behohippy@lemmy.world · 1 year ago

I’m on lemmy.world and the sidebar shows 401 subscribers. Is that just a sub count from the local instance or global?

Behohippy@lemmy.world · 1 year ago

Also not sure how that would be helpful. If every prompt needs to rip through those tokens first, before predicting a response, it’ll be stupid slow. Even now with llama.cpp, it’s annoying when it pauses to do the context window shuffle thing.

Behohippy@lemmy.world · 1 year ago

Still had some reasoning issues, but looking forward to the fine tunes!

Behohippy@lemmy.world · 1 year ago

Bad article title. This is the “Textbooks are all you need” paper from a few days ago. It’s programming focused and I think Python only. For general purpose LLM use, LLaMA is still better.

Behohippy@lemmy.world · 1 year ago

Need some nginx magic

Behohippy@lemmy.world · 1 year ago

Any data sets produced before 2022 will be very valuable compared to anything after. Maybe the only way we avoid this is to stick to training LLMs on older data and prompt inject anything newer, rather than training for it.

Behohippy@lemmy.world · 1 year ago

Happy Barkday

Behohippy@lemmy.world · 1 year ago

My home server setup

Behohippy@lemmy.world · 1 year ago

Baby robins in my wood shed

Behohippy@lemmy.world · 1 year ago

I hate these filthy neutrals…

Behohippy