

It’s definitely not indicative of the region, it’s a weird jumble of ESL stereotypes, much like the content.
The patois affecting the response is expected, it was basically part of the hypothesis, but the question itself is phrased fluently, and neither bio nor question is unclear. The repetition about bar charts with weird “da?” ending is… something.
Sure, some of it is fixable but the point remains that gross assumptions about people are amplified in LLM data and then reflected back at vulnerable demographics.
The whole paper is worth a read, and it’s very short. This is just one example, the task refusal rates are possibly even more problematic.
Edit: thought this was a response to a different thread. Sorry. Larger point stands though.






Probably referencing the 23andme kit mail out Epstein did. Yeah. It’s all pretty dire.