Wired has retracted its article "How Google alters search queries"

Skull giver@popplesburger.hilciferous.nl · 1 year ago

Wired has retracted its article "How Google alters search queries"

Skull giver@popplesburger.hilciferous.nl · edit-2 1 year ago

deleted by creator

30mag@lemmy.world · edit-2 1 year ago

deleted by creator

Skull giver@popplesburger.hilciferous.nl · 1 year ago

deleted by creator

30mag@lemmy.world · edit-2 1 year ago

deleted by creator

Skull giver@popplesburger.hilciferous.nl · edit-2 1 year ago

deleted by creator

mindbleach@sh.itjust.works · 1 year ago

They fucked it up completely in a way that raises questions of competence.

HTML has ways to display angle brackets specifically intended to never be interpreted as tags. “Entity names” will never be code. There’s not even a sensible way to do it deliberately, like %20 nonsense.

30mag@lemmy.world · edit-2 1 year ago

deleted by creator

0xD@infosec.pub · 1 year ago

Could have done it with proper encoding, don’t need to remove it lol o.O

MotoAsh@lemmy.world · 1 year ago

Allowing tainted data in to the dataset means every single client has to do every single spot of content rendering correctly or else be vulnerable to easy hacking. Keeping it out of the dataset means not all clients have to be perfect for Lemmy to be a secure place.

0xD@infosec.pub · edit-2 1 year ago

The point of encoding, the process of representing data in a different way, is to have the data set not be tainted. :)

Here, for example: https://www.w3schools.com/html/html_entities.asp

MotoAsh@lemmy.world · 1 year ago

I’m a fan of the swiss cheese model of safety. While blindly blocking arbitrary characters is a bit silly, not filtering/encoding the data even on the output from web services can end up in disaster.

It’s an open API that serves publicly-sourced data. I’d not want to serve up anything more than markup content even if every single API call had perfect handling. At least not without a lot more sophisticated filtering in front of it. Even certain totally valid arrangements of HTML can be vulnerable as all hell.

Even certain markup systems have problems, but I doubt this one has huge vulnerabilities to exploit. Certain wiki systems in the past had to be completely retired over such things.

sbv@sh.itjust.works · 1 year ago

we’re having a stroke together

apis@kbin.social · 1 year ago

Am all in on this stroke as well.

xionzui@sh.itjust.works · 1 year ago

The slide shows neither. It shows that they use synonyms to get more results. They take a search for “kids clothing” and add results for “children’s clothing” and “kidswear”

aksdb@feddit.de · 1 year ago

I hope they also add results for “kids swear”.

stifle867@programming.dev · 1 year ago

My interpretation was this + in terms of the actual “sponsored” results work by matching “kids clothing” with advertisers who match for that term, and Google “changing” it into “$brand_name kids clothing” which seems entirely obvious when spelling it out.

I haven’t used Google as my primary search engine for many years but occasionally I do run a search on it. While the quality of results is extremely low, I never noticed anything obvious like a generic search term only returning results for a specific brand + that search term like the original article implied.

It seemed like a giant misunderstanding of how it all works from the start but made for a great headline.

Wired has retracted its article "How Google alters search queries"

Wired has retracted its article "How Google alters search queries"

A Note From WIRED Leadership