this post was submitted on 17 Feb 2024

14 points (100.0% liked)

Free and Open Source Software

18232 readers

123 users here now

If it's free and open source and it's also software, it can be discussed here. Subcommunity of Technology.

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago

MODERATORS

[email protected]

Audio Upscaler (lemmy.tf)

submitted 1 year ago by [email protected] to c/[email protected]

27 comments fedilink hide all child comments

Does anyone know of a local audio upscaler? Preferably Android based.

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 2 points 1 year ago (3 children)

Basically something to improve quality. So from 192kbs to 320kbs

[–] [email protected] 15 points 1 year ago (1 children)

That's not really possible, once compressed the original audio is just missing entirely.

[–] [email protected] 2 points 1 year ago (2 children)

Can AI not do in the same way that it does with pictures?

[–] [email protected] 11 points 1 year ago (1 children)

Hello, Audio Engineer with some little knowledge regarding AI here.

What you think of is restoring frequencies, this is possible, and commonly used in plugins for audio restaurization. I might be mistaken, but this does not improve the bitrate, but the perceived quality (which is still lossy).

I don't think that there is a real interest to upscale quality (not perceived quality), especially for longer (> 1 minute) material.

[–] [email protected] 1 points 1 year ago (1 children)

It's funny you say that. I, as most people bought a bunch of CDs back in the day and ripped a bunch before I gave up my CD drive. At the time, storage was expensive and so I did what I could at the time with MP3. As storage gets cheaper (though not cheap enough for me to go lossless), I'd like to be able to upscale my music while keeping a similar file size and have my collection mature with me until storage becomes cheap enough for me to go lossless.

I can't be the only person who's thought of this.

[–] [email protected] 11 points 1 year ago* (last edited 1 year ago) (2 children)

You're better off buying a cheap USB optical drive, re-ripping those CDs, and transcoding the files to something like Opus, which offers comparable quality to 320kbps MP3 files at lower bitrates (which also means smaller file sizes).

Or you can just "download" the FLAC versions, transcode those, and delete them after.

Also, kind of funny how this was posted just after someone complained about the same thing in the audio engineering subreddit.

[–] [email protected] 2 points 1 year ago

That is comedy gold! £1000 Ethernet cables? WTF?

[–] [email protected] 2 points 1 year ago* (last edited 1 year ago) (1 children)

wow, so now reddit won't let you see the post without logging in even if you open it through the old.reddit domain?

[–] [email protected] 2 points 1 year ago* (last edited 1 year ago) (1 children)

This link should work.

I didn't realize that reddit formats the link completely differently when you "share" from its shitty app.

My bad, and sorry about that. Should work now.

[–] [email protected] 2 points 1 year ago

sorry about that

https://yewtu.be/watch?v=brLNcJeSAhw

[–] [email protected] 6 points 1 year ago (1 children)

It wouldn't be the original audio, the AI would just be making up new content to fill in the blanks like it does with a photo.

[–] [email protected] 2 points 1 year ago (1 children)

That's not a bad thing though, right?

[–] [email protected] 3 points 1 year ago (1 children)

If you're OK listening to a derivative work of your input. Otherwise, it's bad.

[–] [email protected] 2 points 1 year ago

Hmmm. I feel like this is one of those long-term studies that would be quite exciting? Am I wrong to be a little bit excited about programs learning how to guess correctly what should be where and subsequently how things should sound?

[–] [email protected] 9 points 1 year ago (1 children)

you won't magically restore the parts that have been removed during compression.

[–] [email protected] 2 points 1 year ago (1 children)

Can AI or machine learning not do in the same way that it does with pictures?

[–] [email protected] 9 points 1 year ago (1 children)

It cannot bring back lost data. It can hallucinate something that is statistically likely given the context but I'm not aware of any tool which can do that to a useful degree.

What's the context? Why can't you just get a better encode where the data isn't lost?

[–] [email protected] 5 points 1 year ago (1 children)

Things like old mixtapes are impossible to get better encodes of

[–] [email protected] 6 points 1 year ago

Old mixtapes and such can be noisy with hizzes, pops and such. It is possible to filter out those artefacts but thats removing stuff, just as digitally compressing audio is removing stuff. You can't create data from nowhere for digitally compressed files and you can't simply add back the hizz, noise, and pops to the mixtapes if you remove that.

[–] [email protected] 5 points 1 year ago* (last edited 1 year ago) (1 children)

Going from 192kbps to 320kbps would be audibly negligible unless you used a really bad codec to begin with, in which case adding AI into the mix would likely just compound the problem.

Probably not even worth it, tbh.

[–] [email protected] 1 points 1 year ago

Ah, thank you for the info