• gamer@lemm.ee · 4 points · 2 hours ago

    Why wouldn’t you want a dog in your static? Why are you a horrible person?

  • adr1an@programming.dev · 21 up / 1 down · 6 hours ago

    That’s human-like intelligence at its finest. I am not being sarcastic, hear me out. If you ask a person to give you 10 numbers at random, they can’t do it well. Everyone thinks randomness is easy, but it isn’t (see: random.org).

    So, of course a GPT model would fail at this task, I love that they do fail and the dog looks so cute!!
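
    The random.org point can be made concrete with a quick sketch: genuinely uniform draws repeat digits far more often than people intuitively produce when asked for "random" numbers. A minimal Python illustration (the seed is arbitrary, just for reproducibility):

```python
import random

# Fixed seed so the demonstration is reproducible.
rng = random.Random(42)

# Probability that 10 uniform draws from 0-9 are all distinct is
# 10!/10^10, about 0.036% - so true randomness almost always repeats
# a digit, while humans asked for "random" numbers tend to avoid repeats.
trials = 1000
with_repeats = 0
for _ in range(trials):
    draws = [rng.randint(0, 9) for _ in range(10)]
    if len(set(draws)) < 10:
        with_repeats += 1

fraction = with_repeats / trials
print(f"{fraction:.1%} of trials contained a repeated digit")
```

    Nearly every trial repeats at least one digit, which is exactly the pattern people fail to reproduce by hand.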

  • SkunkWorkz@lemmy.world · 15 points · 6 hours ago

    ChatGPT: “don’t generate a dog, don’t generate a dog, don’t generate a dog”

    Generates a dog.

  • Underwaterbob@lemm.ee · 28 points · 8 hours ago

    I used to use Google assistant to spell words I couldn’t remember the spelling of in my English classes (without looking at my phone) so the students could also hear the spelling out loud in a voice other than mine.

    Me: “Hey Google, how do you spell millennium?” GA: “Millennium is spelled M-I-L-L-E-N-N-I-U-M.”

    Now, I ask Gemini: “Hey Google, how do you spell millennium?” Gemini: “Millennium.”

    Utterly useless.

    • Gloomy@mander.xyz · 37 points · edited · 2 hours ago

      Wow. I ABSOLUTELY saw an image of a dog in the middle. Our brain sure is fascinating sometimes.

    • festnt@sh.itjust.works · 28 points · 10 hours ago

      “want me to try again with even more randomized noise?” literally makes no sense if it had generated what you asked (which the chatbot thinks it did)

      • joshchandra@midwest.social · 2 points · 51 minutes ago

        Remember, “AI” (autocomplete idiocy) doesn’t know what sense is; it just continues words, displaying whatever seems to address at least some of the topic, with no innate understanding of accuracy or truth.

        Never forget that GPT-2 can literally be run in a giant Excel spreadsheet with no other program needed. It’s not “smart”; it’s ultimately millions of formulae at work.
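
        The "millions of formulae" point is easy to make concrete: the core of a transformer layer is plain arithmetic that a spreadsheet can express. A toy sketch of the attention formula in pure Python (illustrative only; these are not GPT-2's actual weights or dimensions):

```python
import math

def softmax(xs):
    # Subtract the max before exponentiating, for numerical stability.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(query, keys, values):
    # Score each key against the query with a dot product...
    scores = [sum(q * k for q, k in zip(query, key)) for key in keys]
    # ...turn the scores into weights that sum to 1...
    weights = softmax(scores)
    # ...and return the weighted average of the value vectors.
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]

# A query that matches the first key almost entirely selects the first value.
out = attention([1.0, 0.0],
                keys=[[10.0, 0.0], [0.0, 10.0]],
                values=[[1.0, 0.0], [0.0, 1.0]])
```

        Every step is a multiply, an add, or an exponential: exactly the kind of cell formula a spreadsheet evaluates, just repeated millions of times.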

    • Trainguyrom@reddthat.com · 2 points · 2 hours ago

      Fellow human, you seem to be beeping like a robot. Might you need to consider visiting the human repair shop for some bench time?

    • uuldika@lemmy.ml · 24 points · 17 hours ago

      a rare LessWrong W for naming the effect. also, for explaining why the early over-aligned language models (e.g. the kind that wouldn’t help minors with C++ since it’s an “unsafe” language) became absolutely psychopathic when jailbroken. evil becomes one bit away from good.

    • Pofski@lemmy.world · 1 point · 7 hours ago

      Ask it to generate a room full of clocks, each with its hands at a different time. You’ll see that all (or almost all) of the clocks say 10:10.
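
      A toy sketch of the likely mechanism (an assumption, not a confirmed account of any one model): stock photos of watches are conventionally set to 10:10, so a generator that simply follows its training distribution keeps reproducing that time. The hypothetical data below stands in for such a skewed training set:

```python
import random
from collections import Counter

# Hypothetical stand-in for the training data: watch-ad photos are
# conventionally posed at 10:10, so that time dominates the corpus.
training_times = ["10:10"] * 95 + ["3:25", "7:40", "12:00", "6:15", "9:05"]

rng = random.Random(0)
# A sampler that just follows the empirical distribution of its data:
samples = [rng.choice(training_times) for _ in range(1000)]

most_common_time, count = Counter(samples).most_common(1)[0]
print(most_common_time, count)
```

      No instruction-following failure is needed; a faithful sampler of biased data produces 10:10 almost every time.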

    • Lvxferre [he/him]@mander.xyz · 32 points · 19 hours ago

      It gets even worse, but I’ll need to translate this one.

      • [Input 1] Generate a picture containing a copo completely full of wine. The copo must be completely full, with no space to add more wine.
      • [Output 1] Sure! (Gemini provides a picture containing a taça [stemmed glass] only partially full of wine.)
      • [Input 2] The picture provided does not fulfill the request. Generate a picture of a copo (not a taça) completely full of wine, with no available space for more wine.
      • [Output 2] Sure! (Gemini provides yet another half-full taça)

      For context, Portuguese uses different words for what English calls a drinking glass:

      • copo ['kɔ.po]~['kɔ.pu] - non-stemmed drinking glass. The one you likely use everyday.
      • taça ['tä.sɐ] - stemmed drinking glass, like the ones you’d use with wine.

      Both requests demand a full copo but Gemini is rather insistent on outputting half-full taças.

      The reason for that is as @will_steal_your_username@lemmy.blahaj.zone pointed out: just like there’s practically no training data containing full glasses, there’s none for non-stemmed glasses with wine.

      • brucethemoose@lemmy.world · 1 up / 1 down · edited · 39 minutes ago

        This is a misconception. Sort of.

        I think the problem is misguided attention. The phrase “glass of wine” and all the previous context are so strong that they “blow out” the actual intent, “full glass of wine.” Also, LLMs are still pretty bad at multi-turn multimedia understanding; they are especially prone to repeating the previous conversation.

        It should be better if you word it like “an overflowing glass with wine splashing out.” And clear the history.

        I hate to ramble, but this is what I hate most about the way big corpos present “AI.” These are narrow tools the user needs to learn how to operate, like Photoshop or something, not the magic genie lamps they are trying to sell.

        • Draconic NEO@lemmy.dbzer0.com · 12 points · 17 hours ago

          Yup, Horde still suffers from this issue, though it seems more promising than the others, considering its second glass is way closer to full than anything I’ve seen from OpenAI or Gemini demonstrations. Maybe there’s hope of fixing the issue here.

          I only tried one model, so if you know of a different Horde model that works better for this and actually gives a full glass, please reply below and let me know; maybe even ask the Horde bot to generate it right here.

          • Lvxferre [he/him]@mander.xyz · 4 points · 16 hours ago

            I have considerably less experience with image generation than text generators, but I kind of expect the issue to be only truly fixed if people train the model with a bunch of pictures of glasses full of wine.

            I’ll run a test using a local tree, that is supposed to look like this:

            @aihorde@lemmy.dbzer0.com draw for me a picture of three Araucaria angustifolia trees style:flux

              • Lvxferre [he/him]@mander.xyz · 8 points · edited · 16 hours ago

                Bingo - this tree doesn’t exist outside my homeland, so people barely speak about it in English - and odds are the model was trained with almost no pictures of it. However, one of its English names is Paraná pine, so the model renders it after images of European pines - odds are those are plentiful in its training set.

      • Focal@pawb.social · 1 point · 16 hours ago

        Wait, this seems incredible. Do you have to be in the same instance or does it work anywhere? @aihorde@lemmy.dbzer0.com Can you draw a smart phone without a rotary phone dial?

      • Lvxferre [he/him]@mander.xyz · 3 points · 19 hours ago

        It has for a while already. Frankly, it’s the only reason I’d use Gemini in the first place (DDG’s version of GPT-4o mini doesn’t have a built-in image generator).

      • Lvxferre [he/him]@mander.xyz · 13 points · 19 hours ago

        It is not a completely full glass.

        it’s not supposed to be filled all the way

        What I requested is not what you’re “supposed” to do, indeed. You aren’t supposed to drink wine from glasses that are completely full. Except when really drunk. But then might as well drink straight from the bottle.

        …fuck, I played myself now. I really want some booze.

        • UnhingedFridge@lemmy.world · 1 point · 3 hours ago

          What you’re really supposed to do is - open up the box, slap the bag, and drink directly from your adult Capri Sun.

      • NOT_RICK@lemmy.world · 10 points · 20 hours ago

        Probably why it won’t put more in it. How much training data of wine in a glass shows it filled to the brim? Probably next to none.

  • sarcophagus@lemmy.world · 13 points · 16 hours ago

    The only thing I have in common with this piece of shit software is we both can’t stop thinking about silly dogs

  • stebo@lemmy.dbzer0.com · 8 up / 6 down · 11 hours ago

    I asked mistral to “generate an image with no dog” and it did

    The fact that it chose something else to generate instead makes me wonder if this is some sort of free will?

    • brucethemoose@lemmy.world · 2 points · 48 minutes ago

      Mistral likely does “prompt enhancement,” aka feeding your prompt to an LLM first and asking it to expand it with more words.

      So internally, a Mistral text LLM is probably writing out “sure! Here’s a long prompt with no dog: …” and then that part is fed to the image generator.

      Other “LLMs” are truly multimodal and generate image output, hence they still get the word “dog” in the input.
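
      A minimal sketch of that two-stage pipeline. All names here are illustrative, not Mistral's actual API; the point is only that the image model sees the rewritten prompt, so the word "dog" can genuinely never reach it:

```python
def enhance_prompt(user_prompt: str) -> str:
    # Stand-in for the text LLM's rewriting step. A real system would call
    # a language model here; we simulate it honoring the negation by
    # expanding the request into a scene description without the word.
    if "no dog" in user_prompt:
        return "a sunlit empty meadow, photorealistic, detailed grass"
    return user_prompt + ", highly detailed, photorealistic"

def generate_image(prompt: str) -> str:
    # Stand-in for the image model: it only ever sees its conditioning text.
    return f"<image conditioned on: {prompt}>"

result = generate_image(enhance_prompt("generate an image with no dog"))
print(result)
```

      A truly multimodal model skips the rewriting stage, so the token "dog" stays in its input and keeps pulling generations toward dogs.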

    • Hoimo@ani.social · 3 points · 2 hours ago

      I think all the big image generators support negative prompts by now, so if it interpreted “no dog” as a negative for “dog”, then it will check its outputs for things resembling dogs and discard those. No free will, just a much more useful system than whatever OP is using.
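
      A toy sketch of one way "no dog" could be honored, via rejection sampling against a negative term. (Real diffusion systems instead steer the denoiser away from the negative prompt's embedding at each step, but the visible effect is similar: outputs matching the negative term are suppressed.)

```python
import random

# Hypothetical candidate outputs the generator might produce.
CANDIDATES = [
    "a dog on a beach", "a cat on a sofa", "a mountain lake",
    "a dog in a park", "a city street at night",
]

def generate_with_negative(rng, negative, max_tries=100):
    # Keep sampling until an output does not contain the negative term.
    for _ in range(max_tries):
        candidate = rng.choice(CANDIDATES)
        if negative not in candidate:
            return candidate
    raise RuntimeError("could not avoid the negative term")

image = generate_with_negative(random.Random(1), "dog")
print(image)
```

      Either mechanism makes "no dog" a constraint the system actually checks, rather than a token the image model sees and latches onto.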

    • festnt@sh.itjust.works · 17 points · 10 hours ago

      it just did what you wanted, since you asked for an image. free will would be if you asked it not to generate an image but it still did, if it just generated an image without you prompting it to, or if you asked for an image and it just didn’t respond

  • Lemminary@lemmy.world · 12 points · edited · 17 hours ago

    AI: Hmm, yeah, they said “dog” and “without”. I got the dog so lemme draw a without real quick…