this post was submitted on 16 Jun 2023
35 points (100.0% liked)

Asklemmy

43818 readers
1274 users here now

A loosely moderated place to ask open-ended questions

Search asklemmy 🔍

If your post meets the following criteria, it's welcome here!

  1. Open-ended question
  2. Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
  3. Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
  4. Not ad nauseam inducing: please make sure it is a question that would be new to most members
  5. An actual topic of discussion

Looking for support?

Looking for a community?

~Icon~ ~by~ ~@Double_[email protected]~

founded 5 years ago
MODERATORS
 

I was wondering if the nature of decentralization would negatively affect SEO, since people can access the same post from many different instance

top 19 comments
sorted by: hot top controversial new old
[–] [email protected] 8 points 1 year ago (2 children)

https://lemmy.ml/robots.txt , https://lemmy.world/robots.txt , etc don't seem to disallow posts, so the text-based content should be easy to index, at least for these instances.

related news: Google is getting a lot worse because of the Reddit blackouts.

[–] [email protected] 3 points 1 year ago (1 children)

An earlier post pointed out: federated sites seem like they will suffer against central content in a SEO world - regardless of whether they are technically indexable.

I wonder if lemmy should have a SEO friendly federated site... .com domain, robots.txt and everything else...

[–] [email protected] 2 points 1 year ago* (last edited 1 year ago)

right, though the SEO game is changing drastically with AI. People are using GPT-like models often in place of searches and likewise, expecting search results to hit their answers rather than being vague pointers. Following this reasoning, the search engines will need direct users to where the valuable information is, not always, but often enough to not lose users to competitors.

So the thing about SEO is that it's often an attention game that advertisers and smaller websites compete with each other. The information in public forums and threads is invaluable for the success of the search engine itself, so they're the ones that will eventually have to adapt to the new federated reality, should it become mainstream - and I do hope so.

[–] [email protected] 2 points 1 year ago (1 children)

How long does it usually take for google to index websites? Because I tried the string lemmy site:lemmy.ml after:2023-06-15 and only one post turned up for me and it was Memes... the current state of affairs does not seem promising 😔 And if I tried with another instance with the same keywords lemmy site:kbin.social after:2023-06-15 nothing even turned up.

I wonder though, will search engines adapt to Lemmy and its fediverse system? Or will search engines die? Or will we see dedicated search engines to search through the fediverse?

[–] [email protected] 3 points 1 year ago

How long does it usually take for google to index websites?

Anything between a couple of hours to more than a week, I don't think having a "real-time feed" through Google is important though. Other than world cup scores, their results were never about speed.

[–] [email protected] 7 points 1 year ago

Google brings it up

[–] [email protected] 4 points 1 year ago (1 children)

The second link for me when searching for Lemmy on Google is the link to the "Join Lemmy" website. Surprisingly, Brave Search, which has seemingly no search bubbles or accounts, shows the same.

[–] [email protected] 13 points 1 year ago (1 children)

Sorry I didnt make the post very clear. I was referring to an individual posts when people search for a specific issue/discussion in Lemmy.

[–] [email protected] 3 points 1 year ago

I was googling about Lemmy instances and got several front page results from the Self Hosted community.

[–] [email protected] 4 points 1 year ago (1 children)

I tried searching for the title of this post verbatim and it isn't in google results period.

[–] [email protected] 2 points 1 year ago

That could just be because of lag between when the post is created and when the Google crawler finds/indexes the page

[–] [email protected] 2 points 1 year ago

I believe there are engines that specifically index (search) the Fediverse. Searx is one that can do this but it might depend on the instance.

[–] [email protected] 2 points 1 year ago

Fair enough for atleast lemmy webiste and github page shows in the search.

[–] [email protected] 2 points 1 year ago (1 children)

Haven't checked if they do this, but you can tell Google which one is supposed to be the "real" post. So you shouldn't get duplicate content.

[–] [email protected] 2 points 1 year ago

The current version of the frontend is not doing that yet, though.

[–] [email protected] 1 points 1 year ago (1 children)

Oh, good point. Yes, probably? We can not simply assume search engines know that all of these point to the same content:

Or even worse, due to defederation, they may not all point to the exact same content.

Without further investment either from lemmy or the search engine's side, they are probably seen as distinct sources, not aggregated. Which makes each individually less relevant and less likely to show up .

Also note none of the adresses above contain 'lemmy'. How would users search for content on lemmy in these cases? Can't do "technology site:lemmy", or?

But I can say, lemmy content is visible. Haven't seen it on the first page of ecosia yet, but on page 2 or 3.

[–] [email protected] 3 points 1 year ago

This is relatively simple to solve from a technology perspective. You just incorporate the canonical URL meta tag on federated sites that reference the source URL. It'd be trivial to implement, provided the authoritative URL is known.

[–] [email protected] 1 points 1 year ago (1 children)

In my opinion, pushing lemmy to be a Reddit replacement isn’t the right move. Instead, Lemmy should be pushed as something that previously bulletin board forums dominated: the hobbyist, niche, discussion forum.

We don’t need hundreds of millions of users. Hell, most people stopped posting on places on Reddit that had too many users. What you want are communities with thousands of passionate users.

My point is that Google indexing would also be great, but more important is the ability to search these communicators for archivable posts. Reddit’s search functionality was dogshit so we had to rely on Google.

[–] [email protected] 1 points 1 year ago

“communicators” = “communities”. I can’t seem to find an edit button on Mlem