Briskfall

joined 1 year ago
[โ€“] [email protected] 4 points 1 year ago

I've had better results searching through the instances themselves because Google doesn't always index the keywords on time. On caveat of this method is that if the instance doesn't have the syncing out the instance where the info is from being propagated, then this trick would not work

[โ€“] [email protected] 6 points 1 year ago

Oh no, not a Pay to Win x HostileArchitecure collaboration!!

This is giving me peak ABoringDystopia vibes...

[โ€“] [email protected] 2 points 1 year ago (1 children)

I used to not do it, but I've been influenced by Bing after talking to it for so long during the closed beta (I guess that this is an effect of subconsciously mirroring it so that I don't get kicked out before the 5 turn limit back in the days haha ๐Ÿ˜‚)

Then again, in diverse online communities, there are various styles and voices that are eventually formed to be what's "acceptable" be the general consensus. As Lemmy is very new, it has yet to find its voice yet... I think ๐Ÿค”.

 

AI-TRIGGER WARNING: I've asked ChatGPT to revise my writing because it was ass (writing a stream of coherent looking text is not my forte). Proceed at your own discretion.

Yes the emoji 's all on me, I've been too much influenced by Bing Chat lately---even ChatGPT took it out but then I pestered it to move it back.

Below this line it's all text that has been retouched by AI ๐Ÿ˜ฑ:


Title: Archiving Reddit Threads During Protests: Suggestions Needed

Body:

Hello everyone,

As many of you are aware, numerous Reddit subreddits are temporarily closed due to the ongoing protest. While I completely support this action, it is causing some issues with my hobby research. Many posts are being deleted or replaced with placeholder scripts, leading to a loss of valuable information. Source: https://lemmy.ml/post/1259772

In an effort to address this, I have been using a script to save Reddit threads that I find interesting to my Personal Knowledge Management system: https://www.reddit.com/r/ObsidianMD/comments/104k0om/script_save_reddit_posts_to_obsidian/ . I have managed to successfully use it, but since I don't have a strong understanding of Ruby code ๐Ÿ˜…, I'm worried about its future functionality, especially if it depends on the Reddit API.

I recently discovered a thread discussing Reddit dumps: https://lemmy.nz/post/52092 . This discovery made me curious if it would be possible to modify the Ruby script to work with a local version of Reddit or even directly with the Reddit logs. To my understanding, these logs are in JSON format, but I haven't downloaded them yet.

Additionally, I've come across the concept of vector embeddings and a tool called Pinecone. Would it be more straightforward to use this tool to extract the necessary information, as opposed to manually searching through the data? Ideally, I would like to create a local search function, similar to Google, specifically for this dataset dump. However, I'm unsure of how to search a local database of Reddit submissions. I have found potential solutions such as Semantra and Qdrant, but I'm uncertain if these are the best tools for this task. Perhaps there is a more suitable option?

I will be honest, I don't have a strong background in technology, and this problem is proving to be quite complex. But I'm willing to tackle it. I would greatly appreciate any input or suggestions that you could provide.

Thank you in advance, everyone! ๐Ÿ˜Š

[โ€“] [email protected] 2 points 1 year ago* (last edited 1 year ago)

I think that giving too many choices to users who are already confused by the concept of federation and instances will enhance their paralysis of making choice due to cognitive overload (See https://en.m.wikipedia.org/wiki/Overchoice ).

I've found out kbin.social the easiest to get used to (end-user wise).

[โ€“] [email protected] 2 points 1 year ago (1 children)

How long does it usually take for google to index websites? Because I tried the string lemmy site:lemmy.ml after:2023-06-15 and only one post turned up for me and it was Memes... the current state of affairs does not seem promising ๐Ÿ˜” And if I tried with another instance with the same keywords lemmy site:kbin.social after:2023-06-15 nothing even turned up.

I wonder though, will search engines adapt to Lemmy and its fediverse system? Or will search engines die? Or will we see dedicated search engines to search through the fediverse?

[โ€“] [email protected] 2 points 1 year ago (1 children)

Unfortunately, I've been getting some 404 not found of some communities/magazines of some instances that are not from the instance I'm using, e.g. I'm using kbin.social at the current posting account, but let's say that I tried to access something like https://sh.itjust.works/c/skincareaddiction there's no issues whatsoever (since it's the main instance where that community spawned off) but if I tried https://kbin.social/m/[email protected] then I would get the aforementioned error code. I find it pretty inconvenient that caching/indexing of certain less popular (which I assume is what is happening) community working clunkily, it feels not as reliable than using a centralized service, but I guess that this is the price to pay for a decentralized system.

[โ€“] [email protected] 0 points 1 year ago (1 children)

Two, but only because I can't log into the other fediverse instance that I've registered (sh.itjust.works).

[โ€“] [email protected] 4 points 1 year ago (5 children)

My list

Funsies & Weird brainstorming

  • /r/competitiveoverwatch aka /r/cow (I don't play the game but I sure enjoy the juice)
  • /r/hobbydrama (very diverse type of juicy dramas, miam!)
  • /r/overwatchTMZ (/r/cow but extended)
  • /r/valorantcompetitive (same reasoning for the OW one)
  • /r/livestreamfail (don't post there; but is fun to occasionally lurk and see funny stuffs and be up to date with the latest online juicers)
  • /r/anarchychess
  • /r/singularity
  • /r/BestofRedditorUpdates
  • and 20 more cute animal pictures/videos subreddits like /r/partyparrots, /r/happycowgifs, etc.

Stuffs I use for er..... productivity! yeah yeah productivity, that's right!

  • /r/obsidianmd I enjoy seeing other people's workflow and new tools being developed
  • /r/chatgpt (Recently the main sub went to shit with the influx of new users so /r/chatgptcoding or /r/chatgptpro might be better lol)

Subreddits that I often get led by Google search engine and it would be sad if they were to go down perpetually since I would have a very hard time without them...

  • /r/homelab
  • /r/automation
  • /r/selfhosted
  • /r/datahoarder
  • /r/android
  • /r/sysadmin
  • /r/kitchenconfidential
  • /r/appliancerepair/

I'm also very interested in how some different jobs work so I subbed out to these to check on them occasionally... and they sometimes would provide interesting workflows/insights that I can a-hem, take inspiration from...

  • /r/ExperiencedDevs/
  • /r/accounting
  • /r/uxdesign

There's way more but I visit those a bit less, the problem is, I'm not sure if Lemmy can fill the void in my heart but if it does for those main ones (all above) then I think that I can permanently migrate from Reddit.