overview for hok

Can you fine-tune on localized steering of an LLM? in c/[email protected]

[–] [email protected] 1 points 1 month ago* (last edited 1 month ago) (15 children)

Can SFT be used on partial generations? What I mean by a "steer" is a correction to only a portion, and not even the end, of model output.

For example, a "bad" partial output might be:

<assistant> Here are four examples:
1. High-quality example 1
2. Low-quality example 2

and the "steer" might be:

<assistant> Here are four examples:
1. High-quality example 1
2. High-quality example 2

but the full response will eventually be:

<assistant> Here are four examples:
1. High-quality example 1
2. High-quality example 2
3. High-quality example 3
4. High-quality example 4

The corrections don't include the full output.

Fish Speech 1.5, an open source voice cloning TTS that's actually good in c/[email protected]

[–] [email protected] 3 points 1 month ago

You are right. Their description of "SOTA Open Source TTS" caused me to assume it was open source, but it's clear that

This codebase and all models are released under CC-BY-NC-SA-4.0 License.

So, it's "source available" and not released under a permissive licence.

Llama 3.3 70b - End of open-weight pretrained models from Meta or just a better Llama 3.1 405b finetune? in c/[email protected]

[–] [email protected] 2 points 1 month ago

Thank you so much, that exactly answers my question with the official response (that guy works at Meta) that confirms it's the same base model!

I was concerned primarily because in the release notes it strangely didn't mention it anywhere, and I thought it would have been important enough to mention.

Fish Speech 1.5, an open source voice cloning TTS that's actually good in c/[email protected]

[–] [email protected] 4 points 1 month ago* (last edited 1 month ago)

I followed their instructions here: https://speech.fish.audio/

I am using the locally-run API server to do inference: https://speech.fish.audio/inference/#http-api-inference

I don't know about other ways. To be clear, this is not (necessarily) an LLM, it's just for speech synthesis, so you don't run it on ollama. That said I think it does technically use Llama under the hood since there are two models, one for encoding text and the other for decoding to audio. Honestly the paper is terrible but it explains the architecture somewhat: https://arxiv.org/pdf/2411.01156

Discussing jury nullification is against the ToS of lemmy.world and will get you banned, according to worldnews mod in c/[email protected]

[–] [email protected] 15 points 1 month ago

On Lemmy, everything is a bit leftist at the moment.

Is there anything that makes training a translation task easy? in c/[email protected]

[–] [email protected] 4 points 2 years ago

Thanks for the tips. After doing a bunch of searching, I found that what I needed was BPE, or byte-pair encoding. This allows the token set to contain sub-word sequences, which lets the tokenizer represent a unique constant like 0x0373 as ['__sow', '0x', '03', '73', '__eow'].

Is there anything that makes training a translation task easy? in c/[email protected]

[–] [email protected] 2 points 2 years ago (2 children)

Thanks, the quickstart guide was straightforward to follow. Do you have any suggestions on how to do word splitting with code, if any? For example, on a test run, I found that the model was not able to synthesize unique constants correctly even though this test run consisted only of obvious "a to b" relationships.