FRESH

Hacker News

Home

Apparently Google hates us now

489 points by zeitg3ist

by f4stjack

10 subcomments

Google does not hate us... it is worse than that - it is indifferent to us. Hate requires some sort of recognition. I mean this single incident may not mean anything but overall google is heading to an _interesting_ place. In short, it was state of the art but in 20 years it became just another conglomerate sacrificing quality for shareholder gain, I think?
As a search engine, it does not work for me. I see promoted links above the thing I actually search for. Moved to Kagi and didn't look back.
As an AI it does not work for me. I am seeing an arbitrary usage limit, refreshing in 5 hours and a weekly quota given in a percentage. That is as opaque as it gets. Again, to give Kagi as an example I look at my usage details and I see how much is remaining in a clear way. Not working for Kagi by the way, I am just a happy customer.
As a cloud storage, it does not work for me. Probably some shared folder I am working with others has a spam user and/or a hacked account and they periodically spam x-rated notifications. And that's not only me (https://www.reddit.com/r/techsupport/comments/1azf25v/myster...). Moved to apple iCloud and done with it.
Mail is fine. After 22 years of usage, I kind of delegated it to a non-important stage in my life. The important bits have relocated to European providers anyway.

by hungryhobbit

10 subcomments

They're a wiki. Wiki spammers are relentless now.
Source: a small wiki I help manage, for an obscure game with <10k players, recently had to disable new signups, because the spam was so bad (and it was stuck on an old version of MediaWiki, which didn't have CAPTCHA-support).
On a popular wiki, and it sounds like this one was fairly popular, I imagine even CAPTCHA's won't be enough to stop wiki spammers. If those spammers were posting more than just "buy my penis pill" garbage (e.g. they were putting links to malware sites), Google probably, and somewhat legitimately, saw them as a source of such malware.
I imagine the fix for the OP is a thorough audit/cleansing of all malicious content on the wiki, followed by some sort of appeal to Google (which will no doubt take months, if they even respond at all, because ... Google).
Really OP's only hope is that the Google team responsible for this has an Italian Pokemon fan; otherwise they are probably screwed.

by marginalia_nu

2 subcomments

To be honest it's probably just jank on Google's end.
There's a lot of delayed cause and effect in search, and it's much easier to make a minor mistake that excludes 0.1% of websites from crawling or indexing than it is to detect that it's happened except from affected websites telling you about it.
Like in marginalia I've had a bug that affected websites in the condition that if the root path didn't support HEAD, but did support GET with a `Range` header, and it correctly responded with a HTTP 206, then the website wouldn't be indexed because some code that was testing the root document for issues as an initial probe handled that as an error state. Most websites that support range requests also support HEAD (as this usually means the document isn't generated). Except a handful of Caddy-based configurations, about 0.3% of servers.

by phyzix5761

4 subcomments

Why would Google need to direct traffic to the website when they've already scraped and trained their models on the data? Content creators and legitimate websites were wham-bammed and thank-you-ma’amed.

by p4bl0

2 subcomments

The same thing happened with my blog a few weeks ago. It was well referenced for years and suddenly almost all of my entries are not indexed anymore. The Search Console indicates that the URLs were crawled but are currently not indexed, and contrary to technical problems, there nothing I can do to fix it, I just have to accept that most of my articles cannot be found via Google anymore.
EDIT: I don't actually think it is related, but now that I think of it, the timing corresponds with when I started setting up TDMRep to forbid using my content to train LLMs.

by paol_taja

1 subcomments

You guys made the classic SEO mistake of building a real community site instead of a Reddit thread, a coupon subfolder, or an AI summary.
Scherzi a parte, spero che possiate recuperare presto…

by frouge

2 subcomments

I can even tell you that Google hates us all

by tomp

2 subcomments

FYI Google also hates OpenCV
What used to be easily searchable (e.g. "opencv orb") now brings up pages and pages of spam sites (basically "learn opencv here!" blogspam).
Literally the first result on "docs.opencv.org" is on page 4, and points to version 3.4 (9 years old!).
The page that I want https://docs.opencv.org/4.13.0/dc/dc3/tutorial_py_matcher.ht... is nowhere to be found.

by arjie

0 subcomment

Wikis are just high-risk for SEO. Getting my own personal wiki to be indexed was such a challenge that I'd just about given up when a friend who is more acquainted with the whole thing helped me make sure I had all the bits and bobs in the right place. If you're not careful, people can easily put spam all over your site and then it'll really ruin your presence on a search engine.
Google is really big, though. Really really big. They're so big that not even all the people inside Google are trustworthy to them on a subject like this.
But they don't universally hate wikis and so on. It's just you have to do a lot of work and make sure you don't have spam on your wiki, and then fill in all of the information in your meta tags, and have a sitemap.xml, and all that. Here's my wiki for example: https://wiki.roshangeorge.dev/w/images/8/89/Screenshot_-_Goo...

by chakintosh

1 subcomments

After yesterday's keynote and the changes to Search, it became clear in the near future, Google will cease to direct any traffic to websites and the search results will just become a footnote in Gemini's response.

by _alphageek

0 subcomment

I still have 42k page indexed, but previously I had 20k impressions per day, past week impressions started to change. And now I have 399 impressions per 24 hours :/

by rglover

0 subcomment

Tangential: this would be an excellent time for the Kagi folks to do a Mullvad style campaign around "Remember Google before the apocalypse? Meet Kagi."

by Mickelby

0 subcomment

It's hard to over-estimate the "crawled but not currently indexed" apocalypse that's happening at the moment.
Google is DRASTICALLY reducing the size of their search index. The reasons can be debated but the outcome is clear. A much smaller index of pages they consider to be the primary authority. Anything else they are not interested in and do not need.

by computomatic

1 subcomments

A wiki with only 11 pages?
Perhaps they will investigate why 541,000 pages aren’t being indexed. In my experience, Google provides adequate tools for identifying and resolving indexing issues.
Google won’t serve pages it hasn’t indexed. Seems they left a lot of relevant details out of that tweet.
Edit: and the most likely answer would be that their current robots.txt disallows virtually all indexing. I’m no SEO expert but entries like this seem like footguns:
```
   User-agent: Google-Extended
   Disallow: /
```
Edit 2: there’s more info in the full thread but that was only viewable via the xcancel link someone else shared (despite having the X app installed - deeplinks don’t work today). A helpful example of why X is not the best platform for sharing multi-post threads. Seems robots.txt was considered but ruled out.

by atleastoptimal

0 subcomment

I think people were fooled by the 2000s-2010s tech propaganda that the tech/internet companies in general were some sort of utopian benign benefactors of humanity.
All businesses seek to survive, and will use human goodwill until it is not needed anymore. Everyone who thought that Google was opening up the web out of the generosity of their hearts will be shocked when they "feel" nothing when that is taken, because ultimately a company cannot "feel" anything at all, so the OP headline is a silly proposition.

by clacker-o-matic

1 subcomments

oof that sucks; i really wish there was more info on why google decides to crawl or not crawl a page

by ZeWaka

1 subcomments

Interesting. My small game wiki was also affected ~3 weeks ago. It doesn't even show up on Google anymore even if you directly search for the URL.
We don't get any spam since there's no public signups for editing access.

by OsrsNeedsf2P

0 subcomment

This happened to an open source project I used to maintain[0]. After years of healthy growth, traffic dropped 95%. No explanation. It then restarted to organically grow again. It's been about 3 years, and we're still get roughly 40% of the hits we used to, but we were set back about 6 years of natural growth.
[0] https://2009scape.org

by declan_roberts

0 subcomment

I suspect this is a cloudflare thing since the other search engines are doing fine. I'd look closer into your cloudflare settings and see what you can relax.

by hackerbeat

0 subcomment

They sucked out all the content and then pulled the plug.

by astkl

0 subcomment

My guess is that the combination of Wiki and Pokemon is highly suspect for Google.
The Pokemon Industrial Complex has advanced astroturfing especially on YouTube/Twitch, where streamers mention the damn things in any second episode, they "accidentally" meet people going to Pokemon conventions in live streams and so on.
Try to audit the Wiki if anyone abused it.

by Aboutplants

1 subcomments

Google does not hate you, they simply do not care about you at all. There is a very minor difference

by drcongo

1 subcomments

https://xcancel.com/pokemoncentral/status/205712380740463825...

by stronglikedan

2 subcomments

Content creators need to accept that traditional SEO is a thing of the past, because traditional search engines are a thing of the past. None of my normie friends use search engines any longer. They just ask the AI - anything and everything - the AI has all the answers they need. The best that any content creators can hope for in terms of engagement is that they were the quoted source and the user cares enough to check sources. Content creators just need to find new ways of driving engagement now, and we're never going back so there's no use crying about it.

by kokojambo

2 subcomments

It appears for me when I search for it. Even Gemini is cool with looking for it.
Here is a part of the Gemini result I got which was directly above the regular result link.
"Pokémon Central is a major community network and independent Italian encyclopedia for everything Pokémon-related"
Honestly, the title is super clickbait and it doesn't even reflect reality. Its so easy picking some giant entity far away and create some drama about it. Dont get me wrong, I am not a google fan, but I also dislike clickbaits and whiney dramatic claims, moreover if unverified.

by sitebolts

2 subcomments

Google's always adjusting its search rankings, but it's rare for a legitimate site to suffer such a sudden massive hit without reason.
My first thought would be that they accidentally blocked Google's crawler (maybe through some kind of anti-AI setting?) or that Google believes that the site is serving malware or spam. Either scenario can have that kind of effect. I can see that their forum at least appears to have strong Cloudflare anti-bot rules in place, so that might be the case.
They're also using a subdomain for both their wiki and forum, which Google has been observed to punish. They might consider moving each of those to their own separate .com domain.
But aside from that usual stuff, there's one more possible reason that's specific to this site. In November of last year, the Pokemon Company rebranded their "Pokemon Trainer Club" to "Pokemon Trainer Central", which is the first result that comes up when you search for "Pokemon Central".
That change was made a few months before the sudden drop in traffic, but could still be a viable explanation here. Google does routine re-ranking on a daily basis along with occasional major re-ranking, which happens maybe a few times a year, so the delayed hit that they saw could have come from Google finally recognizing that most people who search for "Pokemon Central" are no longer looking for the wiki like was once true in the past.
https://gonintendo.com/contents/54863-pokemon-trainer-club-r...

by opengrass

0 subcomment

Easily a legal request.

by startpage_com

1 subcomments

Why not use startpage.com which is doing anonymous searches in google?

by cynicalsecurity

5 subcomments

Can someone start a new Google, please? Just search, nothing more. I'm willing to pay 10 USD a month for that. API access included.

by echelon

3 subcomments

Pokemon Central runs ads (Google AdSense at that!), which is probably how they pay for everything.
Google is likely their biggest inbound source of traffic, so they're probably experiencing a marked revenue drop as well.
It's unfortunate that so many livelihoods are subject to the capricious whims of a single company. A company that is increasingly seeking to keep users on their engine without sending eyeballs or revenue to any third parties at all.
We're watching Google's "embrace-extend-extinguish" arc for the web. It's not over by a long shot, but they absolutely intend to finish the job.

by gverrilla

0 subcomment

I hope other news/content groups start to wake up. It will be indeed a war.

by PLenz

0 subcomment

Now? Google has hated us since at least the DoubleClick aquistion in 2008. That's when people became the product

by anigbrowl

1 subcomments

One of the sad things about this story is that everyone has to read tea leaves to guess what reasoning might be going on at Google's end. Tech companies have normalized the practice of cutting people off with no explanation, or saying 'we investigated and found a violation' without articulating what the violation is. Naturally, they want to secure themselves against abuse and people trying to game their system, but refusing to provide any information does not achieve that.
It does infuriate legitimate users, enables other kind of abuse and scamming (eg immunize yourself against delisting with this one weird trick!', link farming etc), and act as a fig leaf for abusive behavior by platform operators. Effectively, we've allowed large teach companies to act as digital dictatorships with no accountability to their customers. Yes I consider users to be 'customers' even if they're uploading content or doing searches 'for free'. If you're monetizing their activity on your platform, they are your customers whether or not you call them that to avoid legal liability.

by phendrenad2

0 subcomment

> PCW is part of the EP network of international wikis that includes bulbapedia. A lot of our content is translated from it (with permission) and is the work of thousands of human volunteers.
Oh there we go. Google's internal AI marked it as "can be 99% derived from an English source" and stopped crawling it, preferring to synthesize translated Italian responses as needed.

by cess11

2 subcomments

Perhaps they're decommissioning search in favor of LLM:s.

by jdw64

0 subcomment

[dead]

by etgpao

0 subcomment

[flagged]

by Serhii-Set

0 subcomment

[dead]

0 subcomment

by esseph

0 subcomment

[dead]

by spiderfarmer

0 subcomment

[flagged]

by ChrisArchitect

1 subcomments

Title could be: Apparently Google hates Pokémon Central Wiki now
(to be clearer what the source of the post is)