this post was submitted on 15 Jun 2023
87 points (95.8% liked)

top 8 comments
[–] flynnguy 22 points 2 years ago* (last edited 2 years ago)

Oh boy, this article is 20 years old. Still relevant though, so go read it if you still aren't sure about Unicode!

[–] lasagna 6 points 2 years ago (3 children)

I would take this with a grain of salt. None of the programming languages I have used require this level of in-depth knowledge. Certainly not modern C++.

[–] [email protected] 6 points 2 years ago (1 children)

Did you see the date on the article?

[–] lasagna 3 points 2 years ago

No. I can only read dates in the YYYY/MM/DD format.

[–] [email protected] 5 points 2 years ago (1 children)

It's not so much about the programming language you use, it's about what data you're taking in, what you're doing with it, and where you're passing the data off to next.

If everything uses the same encoding, or all your data is ANSI, you never have to think about it. It's only when your program runs across systems or regions that things get screwed up.
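To make that concrete, here is a minimal Python sketch of what can go wrong at a system boundary. It assumes one system writes UTF-8 and the other reads Windows-1252 (one of the code pages loosely called "ANSI"); the sample strings are made up for illustration.

```python
# Hypothetical sample text; any string with a non-ASCII character would do.
original = "café"

# System A writes the text out as UTF-8.
wire_bytes = original.encode("utf-8")

# System B wrongly assumes Windows-1252 when reading the same bytes.
misread = wire_bytes.decode("cp1252")

# Pure-ASCII text survives the same mistake, which is why the problem
# can stay hidden until non-ASCII data shows up.
ascii_roundtrip = "cafe".encode("utf-8").decode("cp1252")

print(misread)          # cafÃ©
print(ascii_roundtrip)  # cafe
```

Nothing raises an error here. The bytes are valid in both encodings, so the only symptom is garbled text further down the line.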

[–] lasagna 2 points 2 years ago

You make a good point. Data and programming can be seen as separate entities. Though I'd disagree with this slightly because some programming languages are friendlier to some data types than others.

With that said, my main disagreement is with the claim that every programmer must know what is being pitched here. That is what I would take with a block of salt.

[–] Buttons 5 points 2 years ago (1 children)

It's still true that if you have a bunch of bytes and don't know their encoding, you don't really know what characters you have.
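A quick way to see this is to decode the same bytes several ways. The sketch below uses Python; the helper name `decode_candidates` and the two sample bytes are only illustrative, and the encodings listed are just a few plausible guesses.

```python
def decode_candidates(data: bytes, encodings: list[str]) -> dict[str, str]:
    """Decode the same bytes under each candidate encoding.

    Encodings that cannot decode the bytes are skipped.
    """
    results = {}
    for name in encodings:
        try:
            results[name] = data.decode(name)
        except UnicodeDecodeError:
            continue
    return results


# Two bytes with no encoding label attached.
mystery = b"\xc3\xa9"

guesses = decode_candidates(mystery, ["utf-8", "latin-1", "utf-16-le"])

for name, text in guesses.items():
    # Show the code points so the difference is visible in any terminal.
    print(name, [hex(ord(ch)) for ch in text])
```

All three decodes succeed, and each gives different characters: one "é" under UTF-8, the two characters "Ã©" under Latin-1, and a single unrelated code point under UTF-16. The bytes alone don't say which reading is the right one.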

[–] lasagna 2 points 2 years ago* (last edited 2 years ago)

Good thing print debugging is still going strong!

I code across multiple systems, particularly Windows and Linux, with a lot of data involved. These encoding issues are rare. Granted, I'm more on the backend of things and more numerical-data driven. I'd expect a completely different set of headaches for web dev and such. But that just highlights the issue I have with the claim that every programmer must know this. Hence the comment about taking it with a grain of salt.

I have seen a lot of people who were interested in learning get turned away by this level of complexity, which imo is unnecessary. Programming is first and foremost about logic; this level of in-depth knowledge is exactly the kind of requirement we have been doing away with in higher-level programming languages.