this post was submitted on 12 Sep 2023
16 points (94.4% liked)

Free Open-Source Artificial Intelligence


Hi,

I'd like to explore the possibilities of training an LM on a specific programming language so it can be used as a co-pilot in that context. The language is a niche one (http://pharo.org), and as far as I know no existing model knows it (also, I want to make some extra tweaks once I have it).
Thing is... I have no idea where to start! :)

Any hints on where I can learn the ropes?

Thanks!

[–] [email protected] 4 points 1 year ago (1 children)

It's going to seem so wrong, but... stable diffusion and porn is the easiest way to figure out the basics of modifying a model. Mistakes are easier to spot and the number of examples to try is enormous. This space is packed with easy-to-follow examples that run on basic, low-end hardware. The bar is much higher when it comes to training examples for text or code. Most examples are based on proprietary toolchains or are very basic getting-started guides.

[–] [email protected] 1 points 1 year ago* (last edited 1 year ago) (1 children)

... stable diffusion is a diffusion model - OP wants a language model. They don't work even remotely the same way.

[–] [email protected] 1 points 1 year ago* (last edited 1 year ago)

I'd agree. And the toolchains aren't proprietary. But they are different, and you can transfer only a small amount of knowledge from one to the other. The concept 'training an AI model' is the same. PyTorch, Linux and the graphics card are the same. So you can learn something. But it's another kind of AI model, and the software stack you need to use also differs.
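
To make the "same concept" part a bit more concrete, here is a rough sketch of the loop that every kind of model training shares: forward pass, loss, gradient, update. It is only an illustration under loud assumptions: it is plain Python instead of PyTorch so it runs anywhere, the "model" is a toy character-level bigram table rather than a real LLM, and `train_bigram` and the one-line Pharo snippet used as a corpus are made up for the example.

```python
import math
import random


def train_bigram(text, epochs=200, lr=1.0, seed=0):
    """Fit a toy character-level bigram model with full-batch gradient descent.

    Hypothetical helper for illustration only; a real code model would be a
    transformer trained with a framework such as PyTorch.
    """
    vocab = sorted(set(text))
    index = {ch: i for i, ch in enumerate(vocab)}
    n = len(vocab)
    rng = random.Random(seed)
    # logits[a][b]: unnormalised score that character b follows character a
    logits = [[rng.uniform(-0.01, 0.01) for _ in range(n)] for _ in range(n)]
    # Training examples: (current character, next character) index pairs
    pairs = [(index[a], index[b]) for a, b in zip(text, text[1:])]
    losses = []
    for _ in range(epochs):
        grads = [[0.0] * n for _ in range(n)]
        total = 0.0
        for a, b in pairs:
            # Forward pass: softmax over the row for the current character
            peak = max(logits[a])
            exps = [math.exp(x - peak) for x in logits[a]]
            norm = sum(exps)
            probs = [e / norm for e in exps]
            # Loss: cross-entropy of the true next character
            total += -math.log(probs[b])
            # Backward pass: d(loss)/d(logits) = probs - one_hot(target)
            for j in range(n):
                grads[a][j] += probs[j] - (1.0 if j == b else 0.0)
        # Update: one gradient-descent step on the averaged gradient
        for a in range(n):
            for j in range(n):
                logits[a][j] -= lr * grads[a][j] / len(pairs)
        losses.append(total / len(pairs))
    return vocab, logits, losses


if __name__ == "__main__":
    # Toy corpus: a single Pharo statement, far too small for anything real
    corpus = "Transcript show: 'Hello Pharo'; cr."
    vocab, logits, losses = train_bigram(corpus, epochs=50)
    print(f"loss went from {losses[0]:.3f} to {losses[-1]:.3f}")
```

A diffusion model and a language model both run a loop shaped like this. What differs is everything around it: the data format, the model architecture, the loss and the tooling.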