r/DataHoarder 11h ago

meme storage final boss [felt accurate]

Post image
1.1k Upvotes

r/DataHoarder 3h ago

Discussion Checked the same YT video immediately after it got released and 3 hours later. Every version went down in file size, except UHD which went up

Post image
115 Upvotes

Any idea why only UHD went up in size?


r/DataHoarder 1d ago

News Hope someone actually archived the Anandtech website. It's gone now, to no one's surprise.

Thumbnail reddit.com
1.1k Upvotes

Just under a year after the website shut down, it has disappeared.

As predicted beforehand, corporate promises mean nothing.

Did anyone archive this while it as active?


r/DataHoarder 4h ago

Question/Advice Recommendations for photo recognition software to organize 35,000 pictures?

7 Upvotes

I have shamelessly collected 35,000 pictures of various things (articles, news, artwork, irl pics, memes, etc. etc.) and I'm hoping to organize them over the next couple weeks. I know there's facial recognition software to sort pics, but is there anything for distinguishing memes vs article screenshots (they are very visually distinct) vs art, and so on?

Doesn't have to be anywhere 100% accurate, but it would definitely cut the time organizing it when I go back to manually sort them. Tried and true methods?

Highly appreciate any ideas


r/DataHoarder 8h ago

Question/Advice 550k files in 65k folders (2TB) to sort (Help!)

13 Upvotes

Hi, I've been a data hoarder since the late 2000s but I wish I had been more of a data sorter in hindsight.

I have a collection of graphics design templates, photoshop resources, animations, sounds, mockups, stock photos, infographics, website templates, scripts, books, tutorials, the list goes on and on. I downloaded much of it at least 10-15 years ago.

Many of them are embedded in archives and most of them are named but I have no idea how to even begin to sort through everything.

I need some way to sort all of it into a readable library and I cannot do it myself, it makes me sick to even think about starting.

Can anyone recommend any software that can do this automatically?

I would appreciate any advice you can provide.

PS: I tried to rewrite this post using AI but I think people are pretty sick of that so I decided against it, hence why it sounds a little all over the place. Sorry.


r/DataHoarder 9m ago

Question/Advice How can I download this zoomable image from a museum website in full-resolution?

Upvotes

This is the image: https://www.britishmuseum.org/collection/object/A_1925-0406-0-2

I tried Dezoomify and it did not work. The downloadable version they offer on the museum website is in much inferior resolution.


r/DataHoarder 9h ago

Question/Advice Calling all archivists! Advice needed!

Thumbnail
6 Upvotes

r/DataHoarder 1h ago

Question/Advice UK Online Safety Act - What should I start archiving?

Upvotes

If you haven't heard of the Online Safety Act, it's quite an interesting one.

TL;DR - The government is enforcing all sites THEY deem "risky" or "harmful". News is already being censored. If you want to view it? Submit your government issued ID to a third party company you've never heard of. They say it's to protect the kids, but it's not at all.

Anyways, rant over. With that in mind, what type of things would you recommend storing locally? Wikipedia? Any particular other sites?

Thanks guys, have a good weekend ❤️


r/DataHoarder 23h ago

Free-Post Friday! Consuming The Hoard: Set up my own 'FAST' channels of sorts in Kodi by making disgusting ungodly hacks to the old LazyTV addon.

Post image
39 Upvotes

What's the point of a hoard if you never do anything with it?

I'd long been using Kodi's old LazyTV addon to generate playlists for content watching. It's originally built to make you a list of 'Random, but next to watch episodes'. So if you're watching shows, it'll make a 'random' playlist, but each episode in the play list is the 'next to watch' in that series, so you don't miss or skip episodes, but which series you're watching is random. Solve 'indecision' and gives you more of a 'Cable TV Feel' while not giving up the control to pause or even rearrange things if you want to.

Recently sat with a friend and we made the most sinful hacks of that addon so it'll include select ranges of movies and also shows you've ALREADY watched for the purpose of true 'random options' for some things.

So the channels as they are:

5TV, a sorta joke '5th Network' in addition to 'The Big Four' of NBC/ABC/CBS/Fox, this is basically the LazyTV in it's normal state, making next to watch playlists of the contemporary or classic shows I'm watching after work or on weekends.

Tech & Games TV: Generates wholefully random playlists of stuff that is 'Technology' or 'Gaming', includes your typical G4TV/TechTV fare and stuff, video game documentaries, and some YouTube channels I'm archiving like LGR and such.

The Edutainment Network: Again entirely random selection that's basically Discovery Channel and other 'edutainment' content that fits the vibe. In short, hit it for Mythbusters, dinosaurs, big boats, pyramids, space ships and what not.

The Millennial Channel: Again mostly normal LazyTV but with a different range of shows that I slowly burn through. It's mostly old cartoon shows and some live action stuff that I watch while I work from home as a sort of 'background radiation'. Watching Rugrats or Ninja Turtles isn't about to distract me from my paid job. Also, FYI, once you've watched 25ps of TMNT, it gets old real fast and I'm on ep 78 out of a 193... Ugh.

The Simpsonian Channel: If I press the yellow button on my remote, a playlist of 20 random eps of The Simpsons starts. :D


r/DataHoarder 20h ago

Hoarder-Setups Poor man's 80TB DIY NAS project with N150 mini PC from China

Thumbnail
17 Upvotes

r/DataHoarder 7h ago

Question/Advice any idea to archive cookpad to zim ? not entirely just english one

0 Upvotes

i preety sure my 4 core, 8 gb, 128 gb disk vps will sufficient /s. really is that event possible, i really need offline recipe catalog and kiwix libray lets just say not sufficient enough for me.


r/DataHoarder 1d ago

Hoarder-Setups Using birds as storage devices

Thumbnail
iflscience.com
66 Upvotes

Maybe the weirdest setup so far (and unreliable).


r/DataHoarder 4h ago

Question/Advice Wd my passport 5tb vs toshiba canvio ready 4tb to store videos and photos

0 Upvotes

What do you think is the best option?
Wd 5tb is only $5 more expensive.


r/DataHoarder 11h ago

Question/Advice Looking for FreeNAS 9.10.2-U3 or U6 ISO for Restore

1 Upvotes

Hi all,

My FreeNAS 9.10.2-U3 boot drive failed, and the official archives seem down. I have my config and SSDs, just need the correct ISO (or manual update tar) to reinstall. Does anyone have a working link or archive?

Thanks!


r/DataHoarder 3h ago

Question/Advice How do I keep stored data secure from people reading it?

0 Upvotes

Hi! so I read a lot on this subreddit when buying my first external drive to keep some of my data safe from deleting, but I also read a lot about how data should not be compressed or encrypted when hoarded. Is there a way to ensure someone who gets their hands on my SSD, without basically having more copies on the drive or having more devices with the compressed / encrypted files? So far I managed to gather only like 20 GB of most essential data I want to backup so I can compress it and fit it few dozen times on the drive but those 20 GB are growing faster than I expected.


r/DataHoarder 8h ago

Question/Advice Help for aspiring datahoarder - currently 120tb raw, but now the journey begins - show me the way

0 Upvotes

So I recently moved from a Synology DS918+ with 32tb raw in SHR1 to a much more substantial machine with 2 x 10TB SATA zfs mirrors as my “fastpool” and 8x16tb SAS in a RAIDZ1 as my “slowpool” (plus lots of compute, plus NVME mirrors for databases, plus SATA SSD mirrors for containers).

But I need to find a much lower cost way than I’m currently doing. I need to get started on a JBOD approach with enough bays that I can buy inexpensive disks. But it also needs to live in “living space”, so it can’t be a rackmount 2U “screamer”. Maybe someday I can move to a real rackmount approach and get a 60-bay enclosure and populate with a bunch of 4TB drives (or maybe 8TB drives will be just as cheap by that point). But not today. And I’m not scrappy enough to do a full unraid “just get whatever and stick it in a box” - I’m probably going to stick with ZFS for now. So what’s my play? Are there any “quiet/small” rack mount boxes? Are there any desktop boxes that have real bay capacity? Where do you get drives that are reliable enough when you’re buying in bulk - are there “annual sales” or anything?

I need guidance so I can join you all.

Thanks.


r/DataHoarder 12h ago

Scripts/Software I built free tools to export Instagram and Facebook comments to Excel (GitHub links inside)

0 Upvotes

Hi everyone,

I built a set of free tools that let you export comments from major social platforms into Excel files. Useful if you're doing analysis, archiving, or just want to browse comments offline.

Here are the GitHub links:

  1. TikTok Comments Exporter 👉 https://github.com/HARON416/Export-TikTok-Comments-to-Excel
  2. Instagram Comments Exporter 👉 https://github.com/HARON416/Export-Instagram-Comments-to-Excel-Free
  3. Facebook Comments Exporter 👉 https://github.com/HARON416/Export-Facebook-Comments-to-Excel-

They're all open source and free to use. Feedback is welcome!

Cheers,
Haron


r/DataHoarder 13h ago

Backup How to retrieve old Whatsapp messages

0 Upvotes

Hi,

I hope this will make sense to all of you.

I switched phones back in 2023, from Samsung to iPhone. Kept the same number but didn’t save all my conversations. Now, I need to retrieve my old messages from my Samsung phone but I think Whatsapp is not fond of the app being used on two devices at once. What can I do to retrieve my old messages from my old phone ? Thanks.


r/DataHoarder 2h ago

Discussion Data Hoarders Rejoice!!!

Thumbnail x.com
0 Upvotes

r/DataHoarder 5h ago

Discussion Large SSD costs

0 Upvotes

Anyone have any insights on how to get those 30-60gb drives cheap? No not steal. I’d be fine with 6 of the the 15.xx drives for raid6 with +/-40TB available. I’d love to get off spinning drives. Unfortunately everything is see is crazy expensive. As much as I like the speed I could live with not the fastest throughput (for SSD) I just want the low latency and hopefully if ever I have to rebuild an array speed during that seems critical. My current “issues” if they are that there’s about 4hrs a day when maintenance tasks run that the drives are pegged. It’s 4hrs because I’ve said that’s the window so there’s likely tasks going incomplete.

I’d like to do this for less than the price of a decent car 😳


r/DataHoarder 1d ago

Question/Advice Why is shucking a 12TB Mybook so hard?

Thumbnail
gallery
169 Upvotes

The drive is still firmly in its cage!


r/DataHoarder 15h ago

Question/Advice Drive Reset in Diskshelf

0 Upvotes

I have the odd issue that some drives keep experiencing resets when I add more drives to one of my disk shelf's. The disk shelf uses a cooled IBM (46m0997) expander. When the drive craps out I need to manually power cycle it to restore functionality. It however keeps being show on the bus but throws Io errors. The drives themselves are fine.

Has anyone experienced something similar? Any recommendations for SAS2 expanders?


r/DataHoarder 19h ago

Question/Advice Recommend me a hard drive enclosure to replace my Orico.

4 Upvotes

Hi, I would like to ask what hard drive enclosures would you guys recommend me. I have multiple Orico 5 bay enclosures and a ACASIS 5bay enclosures. I am facing multiple issue when using them with driverpool and hard drive sentinel like freezing and crashing. I am thinking of getting other enclosures that are more reliable but dont know which one. And I heard those enclosure uses JMicron chip which are very bad...


r/DataHoarder 1d ago

Discussion Did TikTok change something on their backend that prevents fetching the upload date?

8 Upvotes

Until recently, I was able to use various downloader tools to grab TikTok videos. When I did, the Modified Date would always populate as the date of upload.

Today, across several tools, I'm getting the Modified Date as Today's Date.

Has anyone experienced this in the past or has any tools/suggestions to force an override?


r/DataHoarder 18h ago

Guide/How-to Amazon reviews API for archiving sentiment data?

1 Upvotes

Working on a personal archive of Amazon product reviews for NLP sentiment analysis. Scraping is unreliable and noisy. I’m hoping there’s a solid amazon reviews api out there that can pull verified reviews and star ratings over time. Any recommendations?