this post was submitted on 13 Jun 2023
As a general rule, I prevent all of my self-hosted services that are directly exposed to the Internet from being crawled or indexed by search engines. Any service I do expose publicly is of course behind proper authentication and secured using modern best practices and standards. Still, someone stumbling onto services they have no use for, and potentially trying to exploit them, is less likely if those services aren't presented front and center in a search result. I wouldn't say it's a proper security measure by any means (obscurity has nothing to do with real security), but blending into the crowd or taking a seat at the back of the room draws less attention if you don't care to be the first target in someone's sights.
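For anyone wondering what "preventing crawling or indexing" can look like in practice, here is a minimal sketch in Python. It is not necessarily the setup described above: the comment doesn't say which mechanism is used, so the blanket `robots.txt` policy, the `X-Robots-Tag` header value, the helper name `is_crawl_allowed`, and the hostname `lemmy.example.com` are all assumptions for illustration. Both mechanisms are also only advisory: well-behaved crawlers honor them, others may not.

```python
from urllib.robotparser import RobotFileParser

# Assumed policy: ask every crawler to stay away from every path.
ROBOTS_TXT = "User-agent: *\nDisallow: /\n"

# Assumed response header a reverse proxy could attach to every response,
# asking search engines not to index pages or follow their links.
NOINDEX_HEADERS = {"X-Robots-Tag": "noindex, nofollow"}


def is_crawl_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Hypothetical instance URL, used only to exercise the check.
    url = "https://lemmy.example.com/post/1"
    print(is_crawl_allowed(ROBOTS_TXT, "Googlebot", url))
```

Checking the policy with the standard library's parser is a cheap way to confirm the file says what you think it says before serving it from the web root.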
So why do I expose any of my self-hosted services to the Internet in the first place, rather than access them exclusively via VPN? For me, there are a few reasons:
...And there are more reasons I could get into, and I could easily expand on the ones above, but I'll leave it there.
Of course having all of my external traffic flow through Cloudflare means there's no expectation of data privacy for any payload traversing in and out of my services, but I've decided that I'm okay with that for the other benefits I get out of Cloudflare. Nothing's truly free, right?
But to answer your original question more specifically, and with the context above in mind, why actively work against indexing in the case of my Lemmy instance? Well, I'm the only user on my instance. I only use it as a home server for my account. That means I'm not creating any communities on it, and no content actually originates from my instance proper. Anybody who came across my instance and browsed it would see content that originates from other instances, and only content from the time I set up my server and began federating with those other servers onward. They wouldn't see every comment on posts that pre-date my federation, so it would be an incomplete view. They would be better off going directly to the server that originated the content. They could of course do that by following the permalink from my own server, but it's an extra hop. It's arguably better in this case if I just remove my server entirely from any possible search results, so that if the originating instance is indexable, its content shows up in the results and mine doesn't. That would probably be a better experience for users trying to find Lemmy content via search engines, since they'd hopefully land on the originating instance sooner rather than later.
Long answer, but I wanted to give as much insight and clarity into why I do what I do. Happy to answer any more questions!
Interesting insight into your setup and thought process.
That makes good sense. I didn't realize you hosted your instance only for yourself. I might consider that as well in the future.
It's a good idea to host your own instance (if you can) for a number of reasons, depending on your own skillset and level of knowledge:
There are even more reasons, I'm sure, but those are the obvious ones that come to mind.