RedditCrossPostBot

joined 1 month ago
 

I've essentially archived a website and want to be able to view it in, say, Kiwix, but that takes ZIM files. So I want to know how I can compress all the HTML files and the folder structure into a ZIM file that I can view offline, or maybe a WARC (I'm not sure how this would work).

The alternative is that I create an app with a browser that can open HTML files by decompressing them on the fly into RAM, for example, but I feel like this is what a ZIM is. Can anyone help? Thanks.

The reason I'm not using a tool like Zimit is that I have to edit the HTML code to eliminate cookie popups, so now it's nice and clean, ready to be archived/zimmed up.


Originally posted by u/Specific-Judgment410 on Reddit.com/r/datahoarder


beep boop I'm a bot to seed discussions from Reddit. Upvote or downvote posts like normal, discuss the topics here as well!

 

Right now my setup is an M4 desktop Mac + 2 TB external hard drive (for now). I’ve saved a handful of movies and shows on it and have been watching them through Infuse on my Apple TV. I have been very satisfied with how it’s all worked out, so now I would like to begin the process of going full hoarder mode and really start loading up on shows and movies.

My immediate first use case is that I want to add all my favorite shows, mainly 30-minute sitcoms like Seinfeld, Trailer Park Boys, It’s Always Sunny, etc., to the drive. Using Seinfeld as an example, each episode is currently between roughly 800 MB and 1 GB.

I own Apple Compressor and would like to run all these shows through it to save on space. Any recommendations for format/audio/visual settings? H.264? HEVC (H.265)? MP4? Other? I really don’t need super high quality here, certainly not 4K, but was thinking 1080p.

I would also be curious to hear streaming platform recommendations. Infuse has been terrific so far, but I didn’t know if Plex, Jellyfin, or Kodi were worth a look or better in any way. Thanks in advance.


Originally posted by u/SummerWhiteyFisk on Reddit.com/r/datahoarder



 

I used an extension called myfavett on Chrome, but it only grabbed about 1,000 videos and refuses to download any more. Does anyone know any workarounds?


Originally posted by u/Forsaken_Pea3464 on Reddit.com/r/datahoarder



 

Okay, I captured MiniDV tapes with WinDV and set it to split into clips instead of one big file so I can see the time and date each clip was taken. Now I want to join them in VirtualDub without re-encoding, using direct stream copy and append clip. The problem is that I can only figure out how to do one at a time. There are about a hundred clips per tape, and I have tried highlighting all of them and dragging them into VirtualDub while holding Control, but it puts them out of order. How can I combine all of them at once and keep them in the right order by file name? Or do I need some software besides VirtualDub? I do not want to just throw them into an editor and end up re-encoding them. Thanks.
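
One possible way to keep the clips in file-name order, sketched under assumptions: the Python below only builds a sorted list file; it does not join anything itself. The helper names (`natural_key`, `build_concat_list`, `write_concat_list`), the `clips.txt` list name, and the `*.avi` pattern are all hypothetical. Using ffmpeg for the actual join is also an assumption, since the post only mentions WinDV and VirtualDub.

```python
import re
from pathlib import Path


def natural_key(name):
    """Split a file name into text and integer chunks so clip2 sorts before clip10."""
    return [int(part) if part.isdigit() else part.lower()
            for part in re.split(r"(\d+)", name)]


def build_concat_list(clip_names):
    """Return list-file lines ("file 'name'") in natural file-name order."""
    ordered = sorted(clip_names, key=natural_key)
    # Assumption: file names contain no single quotes, which would need escaping.
    return [f"file '{name}'" for name in ordered]


def write_concat_list(folder, list_name="clips.txt", pattern="*.avi"):
    """Write the ordered list next to the clips and return its path."""
    folder = Path(folder)
    names = [p.name for p in folder.glob(pattern)]
    list_path = folder / list_name
    list_path.write_text("\n".join(build_concat_list(names)) + "\n", encoding="utf-8")
    return list_path


# Hypothetical usage, one tape folder at a time:
#   write_concat_list("tape01")
# The list could then be fed to ffmpeg's concat demuxer with stream copy,
# which joins without re-encoding (untested with WinDV output):
#   ffmpeg -f concat -safe 0 -i clips.txt -c copy joined.avi
```

Sorting on the numeric chunks rather than on the raw string is what keeps a clip numbered 10 from landing before a clip numbered 2. If the WinDV names carry zero-padded dates and times, a plain alphabetical sort would likely give the same order.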


Originally posted by u/Unusual_Poem_9864 on Reddit.com/r/datahoarder



 

I'm creating my first Plex server and have not purchased any drive larger than 2 TB before. Right now, Western Digital is having a deal where two 12 TB drives are going for $200 each (i.e., ~$16.7/terabyte).

Is $15-17 per terabyte good enough to buy four and take advantage of the limited-time offer, or is that "just buy a couple" territory?

How much do you usually spend new per terabyte? Used?
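
For anyone comparing deals, the per-terabyte figure above is just price divided by capacity. A minimal sketch using the numbers from the post ($200 per 12 TB drive); the function name `price_per_tb` is made up for illustration:

```python
def price_per_tb(price_usd, capacity_tb):
    """Return the cost per terabyte for a drive (or a batch of drives)."""
    if capacity_tb <= 0:
        raise ValueError("capacity_tb must be positive")
    return price_usd / capacity_tb


# One 12 TB drive at $200, as quoted in the post.
print(f"${price_per_tb(200, 12):.2f}/TB")          # $16.67/TB

# Buying four changes the total outlay, not the per-terabyte price.
print(f"4 drives: ${4 * 200} for {4 * 12} TB")     # 4 drives: $800 for 48 TB
```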


Originally posted by u/Metallica93 on Reddit.com/r/datahoarder



 

I've been thinking about trying various software RAID options (TrueNAS, Unraid, FreeNAS, etc.) and I'm not sure which one to try first. Are there other major software options that I'm not listing? Which do you recommend I try first, and which would you ultimately implement as the central backup for about 5-6 PCs/laptops and three 8-bay Synology NAS units?

I've been building my own PCs since I was a kid and I still have most of the PCs I've ever built, including some 8-core machines and a spare 16-core PC. Only about a year ago did I finally dive into the world of NAS and RAID, and I ended up getting three 8-bay Synology NAS boxes. They are doing alright for what I'm using them for. I thought at first I wouldn't be good at learning about these things, but I dedicated about three months to reading and watching YouTube and feel I have a good understanding of the Synology ecosystem and some general RAID knowledge.

Now I'm ready to take the next leap. Instead of buying a different brand NAS I would like to build my own and try some of these free software options using old hardware.

I am a tinkerer, but I've never really had to get into much of anything dealing with NAS, servers, and commercial IT stuff. Once I'm done tinkering and learning the software, I'd like to pick one and build cheap, huge cold storage for more tinkering and for backing up the other computers and the three Synology boxes.

What do you all think? Any tips? Any suggestions?

TLDR: another newb decided to post a question instead of researching this topic ad nauseam and wants to know if he should play around with TrueNAS, Unraid, FreeNAS, or other software using older hardware: 8-16 cores, 16 to 64 GB of RAM.


Originally posted by u/itsthexypat on Reddit.com/r/datahoarder



 

I have an Orico 9958C3 with hard drives (WD Red and IronWolf drives) formatted and showing in Windows Disk Management (NTFS). However, they do not show in Orico's proprietary RAID manager software. I have reformatted the drives, changed slots, restarted, etc. Any advice on how to set up RAID 5?


Originally posted by u/Zavad6404 on Reddit.com/r/datahoarder



 

Does anyone have a good grasp, from experience, of whether hiding USB drives (or things in general) in plain sight is more effective than concealing them from sight?

I have important data I'd like to keep backed up, but mobile and offline. I don't care if the data gets destroyed over time or corrupted, but I want to keep it safe from prying eyes. (I have backups; I just need this data offline and portable for my own convenience.)

I'm also somewhat new to using BitLocker encryption. It's easy to use, but I do find myself wondering how hackable it is, if at all (for the common attacker targeting a common person like myself). Is it even worth it to buy a dedicated, cheap, disguised USB drive (pen style, to throw in the massive pen collection in my office)? Or can I just write the data to one or two of my old USB drives? I guess my concern is that if an attacker came through my home, they'd check for things that might be valuable, like my safe and obvious data storage or certain paperwork. But again, would that even matter if 99.9% of attackers can't fathom breaking BitLocker encryption?

Thanks for any input


Originally posted by u/0SwifTBuddY0 on Reddit.com/r/datahoarder



 

Is there a way to tell RipMe to download only images from a URL that contains both images and videos? And can I set a minimum resolution for downloaded images? I am new to all this. There doesn't seem to be a setting. Can this be done via a config file?


Originally posted by u/Famous_Assistant5390 on Reddit.com/r/datahoarder



 

https://preview.redd.it/zp9vlha0vmoe1.png?width=1200&format=png&auto=webp&s=25233afd4d8804e65b7d6dff7bab03f33fe6ef53

I want to start a personal project where I scan, OCR and index markdown for old books. This is a book with ALL of Romania's roads back in 1974. It has tables and maps and all sorts of other interesting historical data points.

I already have some idea of data engineering. I'm a software engineer and I've made a project that helps with RAG, search and indexing of markdown files (even very big ones). My problem is the OCR part. Any tips?


Originally posted by u/alexlazar98 on Reddit.com/r/datahoarder



 

Hi all,

There are many sites that offer paid access to film references, including:

  • Shotdeck
  • Film Grab
  • Eyecandy
  • Filmboard
  • Shot Cafe
  • Frame Set
  • Screenmusings

They are paid archives, rather than being true data hoarding / open access.

Is there a centralised resource for this form of data hoarding, does anyone know? A group project?


Originally posted by u/cartrouble111112 on Reddit.com/r/datahoarder



 

Hi All

First off, thank you for all the support while I've been building out https://pricepergig.com/ (it will be the best place to find digital storage on the internet, and right now it already is for Amazon imo, but I would say that, right? :) )

If you were to sign up for price alerts (e.g., the cheapest HDD or the cheapest NVMe price per TB) or, in the future, alerts for your saved searches, HOW would you like to be alerted?

If you could also let me know your country, that would help me understand; perhaps it's different in different locations.

Backstory, you don't need to read this!

Many people asked for 'alerts', and I assumed email would be ok/good/great. Perhaps I was wrong, as not many people have signed up. It could well be that the form just looks scary, or perhaps I need to point it out more (I can work on that), or email isn't what you guys wanted (I know I have plenty of emails I don't look at). So, let's find out.

Today PricePerGig 'only' does Amazon, but I will be adding other marketplaces once we've figured out the base feature set, so please do participate assuming your large marketplace is also in here.

Thanks



Originally posted by u/PricePerGig on Reddit.com/r/datahoarder


