Programming

20404 readers
169 users here now

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Rules

  • Follow the programming.dev instance rules
  • Keep content related to programming in some way
  • If you're posting long videos try to add in some form of tldr for those who don't want to watch videos

Wormhole

Follow the wormhole through a path of communities [email protected]



founded 2 years ago
MODERATORS
26
27
 
 

I came across Nepenthes today in the comments under a post about AI mazes. It has an option to purposefully generate not just an endless pit of links and pages, but also to deterministically generate random, human-like text for those pages to poison the LLM scrapers as they sink into the tarpit.

After reading that, I thought, could you do something similar to poison image scrapers too?

Like if you have an art hosting site, as long as you can get an AI to fall into the tarpit, you could replace all the art it thinks should be there with distorted images from a dataset.

Or just send it to a kind of “parallel” version of the site that replaces (or heavily distorts) all the images but leaves the text descriptions and tags the same.

I realize there’s probably some sort of filter for any automated image scraper that attempts to sort out low quality images, but if one used similar images to the expected content, that might be enough to get through the filter.

I guess if someone really wanted to poison a model, generating AI replacement images would probably be the most effective way to speed up model decay, but that has much higher energy and processing power overhead.

Anyway, I’m definitely not skilled/knowledgeable enough to make this a thing myself even just as an experiment. But I thought you all might know if someone’s already done it, or you might find the idea fascinating.

What do you think? Any better ideas / suggestions for poisoning art scraping AI?

28
29
30
31
32
 
 
33
34
35
 
 

If you ever wondered how well does it actually work.

tl;dr: good news, you are not loosing your job for now

To be fair, the last message in those PRs state:

The agent was blocked by configuration issues from accessing the necessary dependencies to successfully build and test. Those are being fixed and we'll continue experimenting.

So let's wait and see 😄

More links on Reddit

36
 
 

I need to create a website that holds some events data as well as other content, it will also maybe need to grab some data from APIs.

Since I'm skilled with Hugo (static site generator) I thought I could use that but it's turning out it's a total mess actually.

In Hugo I can have contacts (like events organizers) as taxonomy, but that is a different format (yaml) than CSV or vcard, and it's also static, meaning that if I edit a contact it will only change in Hugo. So I found myself having to manage contacts in 4 different places, in 4 different ways: Hugo yaml, Thunderbird, google contacts, CSV (from earlier days)... And I will add mailchimp once I'll also add a newsletter. This ensures my contacts are kinda becoming a mess.

Same goes with events, it's okay if I generate events in Hugo, but if I grab events from APIs and then the API content changes I will have to modify it on Hugo as well.

Everything it's turning out to be a total mess essentially and I think I tried to use something simple to build something quite complex, I realized the complexity later.

Now ideally I would like to be able to have my contacts, my newsletter, my content in one single place ato have everything nicely synced and not having to deal with 30 different lists or formats.

What should I do?

I know about the jamstack and headless CMS like ghosts and I was wondering if they could be a good solution, or if I should opt for a full CMS. Obvious solution would be WordPress but I wouldn't really want to mess with all the plugins.

Also I spent quite a but of time in building my templates for the Hugo website and throwing everything away would feel awful, if there's a way to reuse them (?).

I know some JavaScript basics but I would avoid it if possible.

37
-4
submitted 1 week ago* (last edited 1 week ago) by [email protected] to c/programming
 
 

I'm writing to follow up on the QP tool here- because I posted before, and it was incredibly helpful. Thanks to those who helped out with all the directions.

RECAP:

Been building a tiny tool to share structured thoughts fast- no tabs, just type and share.

It’s called Quickpoint → https://quickpoint.me/

Curious what people think it's for. Some use it like a notepad, some for demos or quick specs.

If you gave it 30 seconds:

**- What’s your gut sense of what this is?

  • Would you actually use it for anything?
  • What would stop you from using it twice?**

Thanks — Trying to see who might use this and for what and adapt accordingly.

Really appreciate any feedback that you can give.

38
 
 

I use vscode for my personal projects (c++ and a fully open source stack, compiling for both Linux and Windows).

I'm using the proprietary version of vscode (via the aur) for the plugin repository, but I've always envied the open source version...

Are there any tools that have made you excited?

Bonus points if they have some support for compiling with MSVC (or if you can convince me to ditch it for something else).

39
156
Stack overflow is almost dead (blog.pragmaticengineer.com)
submitted 1 week ago by [email protected] to c/programming
40
41
 
 

I am experiencing a bit of a whiplash here at work. To be fair, I am getting paid very well (and got a significant raise last year, that was good), but we have had positions be open for years now with no hiring. And we are having more and more projects being proposed without finishing the projects themselves. Its the classic they want to do more with less scenario. The top brass just announced a hiring freeze so new devs for a while.

The execs are floating the idea that AI can be used to replace or supplement the people leaving. On all of our propitiatory code-base/solutions....yeah that will go well.

So yeah anyone else dealing with this?

42
 
 

I have been familiar with Monero for many years now, so I have to address this problem urgently:

I have noticed a fatal issue with Bitmain's “I am not an ASIC” ASIC, the Antminer X5: Even though the device is all about CPUs and RAM sticks, it is still the most efficient device. It beats all miners that rely on conventional CPUs (EPYC or RYZEN) in terms of price and efficiency. As a result, Monero mining has become completely dependent on a single company within just a few years. Entrepreneurs who want to mine Monero are faced with one of the toughest competitions there is. They will therefore use the most efficient equipment. The X5. Am I the only one who seems to have noticed this so far?

Monero's ASIC resistance was meant to not be dependent on one company. Now we are. How can we wriggle out of this noose?

I would also like to ask you not to fall for assumptions that someone at home is simply mining for the good Monero with their own computer. The percentage is probably the smallest. In truth, it is an advanced industry.

I see the recent hashrate growth as particularly critical when looking at the hashrate chart as a whole. Often these increases came before a prominent ASIC came out of the shadows and was presented to the world, while it had already been mining for years. (Like in 2018 and 2019)

If the hashrate drops dramatically again, this could lead to a significant vulnerability of Monero.

Moneros Hashrate Chart: https://bitinfocharts.com/comparison/monero-hashrate.html#alltime

I work a lot with Monero, so I wanted to share these thoughts with you.

I usually address problems for which I also have solutions. But frankly, I don't have any ideas here. Because adjusting the mining algorithm would not throw Bitmain out of the race. After all, they have only built a very efficient CPU miner.

As if they were spitting at the developers' feet, even based on an open source chipset (RISC-V).

What do you think?

43
 
 

I'm trying to wrap my head around how media player websites work.

This might not be fully programming, but idk where else to post this question

Some video sharing sites uses blob objects to send the video in chunks, instead of letting the browser download in full. Usually there's mpd or m3u8 files that we can use to download the whole video

But some sites don't use those files. They send mp4 chunks, but those mp4s can't be played and seem encrypted.

How is the webpage's video player able to play the video? Is there a decrypter or something in the webpage/video player? Am i able to download all mp4 chunks and run them through some decrypter in the webpage served to me and create the whole video from it?

44
 
 

I’m a retired coder, These days, I mostly scroll around Lemmy to kill time, but something caught my eye recently, a post about this protocol called Plebbit.

Honestly? At first, the name put me off. Sounded like a joke, plebbit word was a sluar used against redditors…But I was curious enough to check it out and I’m glad I did. Plebbit is basically a protocol that combines the structure of Reddit and all other social media with the tech behind torrents. Every sub is its own instance, and everything is peer-to-peer. No central servers, no single point of failure. It’s simple, but it hits at something big : real decentralized social media.

After digging into it (and chatting with the dev a bit : an anonymous dev named Esteban Abaroa)I started thinking… what if Lemmy could run on top of this? No more relying on any one instance just pure P2P. So yeah, I have decided to build a Lemmy client on top of Plebbit.

Still not a fan of the name but the idea is solid. And honestly, I think we might need this more than we realize.

If anyone has name suggestions for the client, I’d love to hear them. I’ll leave the GitHub link here so you can check out the code yourself.

https://github.com/plebbit

Need your honest opinions, guys.

45
46
47
48
49
50
13
Lisp. But Why? (www.youtube.com)
submitted 1 week ago by [email protected] to c/programming
view more: ‹ prev next ›