Microsoft launches ‘vibe working’ in Excel and Word

This is fine🔥🐶☕🔥@lemmy.world · 6 months ago

Microsoft launches ‘vibe working’ in Excel and Word

InEnduringGrowStrong@sh.itjust.works · 6 months ago

Microsoft says its Agent Mode in Excel has an accuracy rate of 57.2 percent in SpreadsheetBench, a benchmark for evaluating an AI model’s ability to edit real world spreadsheets.

It generates 42.8% bullshit.

jubilationtcornpone@sh.itjust.works · 6 months ago

They probably view that as a statistic worth bragging about. It’s not. If Excel got calculations right 57.2% of the time it would be completely worthless.

PerogiBoi@lemmy.ca · 6 months ago

I asked copilot to look through my every spreadsheet and find how many instances of a category occurred. I was curious to see if it was any good. Gave me 2 different numbers. Neither were correct.

jubilationtcornpone@sh.itjust.works · 6 months ago

Copilot: Putting the “Artificial” in Artificial Intelligence.

PerogiBoi@lemmy.ca · 6 months ago

The tech behind LLMs could have just been Clippy and everyone would be happy.

bobalot@lemmy.world · 6 months ago

Fartificial Intelligence

FreedomAdvocate@lemmy.net.au · edit-2 6 months ago

Removed by mod

MountingSuspicion@reddthat.com · 6 months ago

I wonder where that “human accuracy” statistic is coming from. Plenty of people don’t know how to read and interpret data, much less use excel in the first place. There’s a difference between 1/4 of people in the workforce not being able to complete a task, and a specialized AI not being able to complete a task. Additionally, this is how you get into the KPI as a goal rather than a proxy issue. AI will never understand context isn’t directly provided in the workbook. If you introduced a new drink at your restaurant in 2020 AI will tell you that the introduction of the drink caused a 100% decrease in foot traffic since there’s no line item for “global pandemic”. I’m not saying AI will never be there, but people using this version of AI instead of actual analysis don’t care about the facts and just want an answer and for that answer to be cheap.

FreedomAdvocate@lemmy.net.au · edit-2 6 months ago

Removed by mod

FutileRecipe@lemmy.world · 6 months ago

Depending on where you go to school, 70% is passing while 50% is not. While “not far off,” one is a C, the other a F.

FreedomAdvocate@lemmy.net.au · edit-2 6 months ago

Removed by mod

MadMadBunny@lemmy.ca · 6 months ago

So it achieved the actual proficiency of a middle manager…

MonkderVierte@lemmy.zip · 6 months ago

Decades ago. The company that replaced it’s CEO with a LLM thrives.

potoo22@programming.dev · 6 months ago

Just keep regenerating data until it’s something the stock holders like. Doesn’t matter if it’s BS. They’re already accustomed to that.

SkaveRat@discuss.tchncs.de · 6 months ago

Nice. Basically a coin flip

GasMaskedLunatic@lemmy.dbzer0.com · 6 months ago

Slightly better than Vegas. Unfortunately, plenty of people are okay with Vegas odds.

sparky@lemmy.federate.cc@lemmy.federate.cc · 6 months ago

https://defector.com/it-took-many-years-and-billions-of-dollars-but-microsoft-finally-invented-a-calculator-that-is-wrong-sometimes

Imgonnatrythis@sh.itjust.works · 6 months ago

Not enough accuracy to be useful. Not enough bullshit for politics.

Fyrnyx@kbin.melroy.org · 6 months ago

Oh it’s going to do it for Word too?

Prompt: Termination letter telling my boss and bosses to kindly go fuck themselves and make it professional

calliope@retrolemmy.com · 6 months ago

The best you can do in any job is to care as little about them as they care about you.

They will barely read it, and they won’t care nearly as much as you do.

I resign my position as a [position], effective [DATE].

MonkderVierte@lemmy.zip · 6 months ago

The best cancers of both worlds.

supersquirrel@sopuli.xyz · 6 months ago

So let me fast forward a bit, ->underpaid stressed out techworkers in the global south pretending to be AI for incompetent upper management in wealthy countries?

Fizz@lemmy.nz · 6 months ago

Not related but does global south refer to south of the equator or just everything south of north America?

supersquirrel@sopuli.xyz · 6 months ago

I don’t know if it is a perfect term, but it doesn’t literally refer to any specific “South”, rather I think it is a reference to the coincidence that many of the heavily industrialized empires of the 18th, 19th and 20th centuries have been in the northern hemisphere, and the general colonial power dynamic therein set up has lead to the term “Global South” meaning pretty much anywhere that has gotten the short end of the colonialism stick, vs the long end.

https://en.m.wikipedia.org/wiki/Global_North_and_Global_South

kramer@slevin.horse · 6 months ago

They’re out smarting the sheet that’s for sure.

FreedomAdvocate@lemmy.net.au · edit-2 6 months ago

Removed by mod

Olap@lemmy.world · 6 months ago

LLMs can’t count. Can’t add. Can’t deal with actually large datasets

How is excel a good fit for vibe-coding?

FreedomAdvocate@lemmy.net.au · edit-2 6 months ago

Removed by mod

Olap@lemmy.world · 6 months ago

LLMs lose context over a short session. They all have input limits. Very small input limits usually. Best it can probably do is suggest formulas for you based on your natural language, maybe some copy/paste. Which means it can beat a 9 year old, great news everyone! Or show a help article on pivot tables (which the help function already does!)

Excel is very simple to work with, hence its ubiquity. LLMs also get shit wrong about half the time, way more than half with difficult things ime. Meaning they cost experienced operators time, a few studies are showing this now with coding. And are expensive as fuck. And slow as fuck. And reduce capacity for learning. Meaning they actually cap what excel can achieve, as the user won’t grow at the same rate, renoving the one advantage excel actually has: the learning rate is phenomenal

The C-Suite which insisted on this integration is basically an subservient idiot themselves at this stage who doesn’t understand their product, their market fit, or their userbase. They should replace thenselves with an LLM

FreedomAdvocate@lemmy.net.au · edit-2 6 months ago

Removed by mod

Olap@lemmy.world · 6 months ago

And it will fuck up around half of even the simple formulas. This is really bad, and the idiots in charge should feel bad. Excel basically runs the world and they are about to fuck it up

FreedomAdvocate@lemmy.net.au · edit-2 6 months ago

Removed by mod

bigfondue@lemmy.world · edit-2 3 months ago

deleted by creator

FreedomAdvocate@lemmy.net.au · edit-2 6 months ago

Removed by mod

Zos_Kia@lemmynsfw.com · 6 months ago

The problem with being a pragmatic LLM user is that you have on one side corporate America shoe-horning the tech in mediocre products none wants, and on the other side a large portion of the internet who loathe it but don’t use it and don’t even know what it does. Those conversations never go anywhere man. You’re talking to someone who thinks accuracy of 57% on SpreadsheetBench means the model gives wrong answers 42% of the time.

Hate to agree with Microsoft but yeah, Excel is probably a great place to introduce an LLM. It’s in that sweet spot between natural language and light programming, in an environment with math baked in so you don’t really care about the model’s accuracy or exact recall. All the data is here, and the model only has to manipulates cell numbers and writes formulas in this dumbed down language.

I’m sure you can get away with pretty small models too. It doesn’t need super human knowledge to implement 90% of common Excel use cases, and i suspect in real world scenarios the accuracy must be pretty interesting.

jacksilver@lemmy.world · 6 months ago

It could be good to layer in standard machine learning (ML), and it already does have some features (like line of best fit).

However, in today’s context AI means LLMs, and that is not a good fit due to its unpredictability.