LocalLLaMA

A community for discussing LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

Has anyone applied tree of thought prompting to r1 yet? (self.localllama)

submitted 1 month ago by artificialfish to c/[email protected]

Generate 5 thoughts, prune 3, branch, repeat. I think that’s what o1 pro and o3 do.

[–] [email protected] 1 points 1 month ago* (last edited 1 month ago) (2 children)

Can't you feed that back into the same model? I believe most agentic pipelines just use a regular LLM to assess and review the answers from the previous step. At least that's what I've seen in these CoT examples. I believe training a model on rationality tests would be quite hard, as this requires understanding the reasoning and context, and having the domain-specific knowledge available... Wouldn't that require a very smart LLM? Or just the original one (R1), since that was trained on... well... reasoning? I'd just run the same R1 as "distillation" and tell it to come up with a critique and give a final rating of the previous idea in a machine-readable format (JSON). After that you can feed it back again and have the LLM decide on two promising ideas to keep and follow. That'd implement the tree search. Though I'd argue this isn't Monte Carlo.
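
A rough sketch of that loop, in case it helps: the same model proposes thoughts, critiques each path with a JSON rating, and only the top two paths survive each round. Everything here is an assumption for illustration: `llm` is a hypothetical callable (prompt in, text out) that you would wire to R1 yourself, the prompts are placeholders, and I sort by the parsed rating instead of asking the model to pick the two survivors. It is a plain beam-style search, not Monte Carlo.

```python
import json
from typing import Callable, List

# Hypothetical type: any function that maps a prompt string to a completion string.
LLM = Callable[[str], str]


def propose(llm: LLM, problem: str, path: List[str], n: int) -> List[str]:
    """Ask the model for n candidate next thoughts that extend the current path."""
    prompt = f"Problem: {problem}\nThoughts so far: {path}\nPropose the next thought."
    return [llm(prompt) for _ in range(n)]


def rate(llm: LLM, problem: str, path: List[str]) -> float:
    """Have the same model critique a path and return the rating from its JSON reply."""
    prompt = (
        f"Problem: {problem}\nReasoning: {path}\n"
        "Critique this reasoning, then answer only with JSON like "
        '{"critique": "...", "rating": 0-10}.'
    )
    try:
        return float(json.loads(llm(prompt))["rating"])
    except (ValueError, KeyError, TypeError):
        return 0.0  # unparseable reply: treat it as the worst rating


def tree_search(llm: LLM, problem: str, depth: int = 3,
                branch: int = 5, keep: int = 2) -> List[str]:
    """Generate `branch` thoughts per path, keep the `keep` best paths, repeat."""
    frontier: List[List[str]] = [[]]
    for _ in range(depth):
        scored = []
        for path in frontier:
            for thought in propose(llm, problem, path, branch):
                candidate = path + [thought]
                scored.append((rate(llm, problem, candidate), candidate))
        # Prune: sort by rating, best first, and keep only the top paths.
        scored.sort(key=lambda item: item[0], reverse=True)
        frontier = [candidate for _, candidate in scored[:keep]]
    return frontier[0]
```

With `branch=5` and `keep=2` this is the "generate 5, prune 3" scheme from the post. Note the cost: every round makes one proposal call and one rating call per candidate, so it adds up quickly with a reasoning model.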

[–] artificialfish 2 points 1 month ago

Actually, now that I think about it, LLMs are decoder-only these days. But decoders and encoders are architecturally very similar. You could probably cut off the "head" of the decoder, add a few fully connected layers, and fine-tune them to provide a score.
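
To make the idea concrete, here is a toy sketch of a score head, with loud caveats. It uses plain Python so it runs anywhere. The "hidden states" are made-up vectors standing in for what the decoder would output once its language-model head is removed. And I use a single linear layer with a sigmoid instead of "a few fully connected layers" to keep it short. The names (`mean_pool`, `ScoreHead`) are hypothetical. A real version would do the same thing in a deep learning framework on actual model activations.

```python
import math
import random
from typing import List

Vector = List[float]


def mean_pool(hidden_states: List[Vector]) -> Vector:
    """Average per-token hidden states into one fixed-size vector."""
    n = len(hidden_states)
    return [sum(column) / n for column in zip(*hidden_states)]


class ScoreHead:
    """One linear layer plus sigmoid: maps a pooled vector to a score in (0, 1)."""

    def __init__(self, dim: int, seed: int = 0) -> None:
        rng = random.Random(seed)
        self.w = [rng.uniform(-0.1, 0.1) for _ in range(dim)]
        self.b = 0.0

    def score(self, pooled: Vector) -> float:
        z = sum(w * x for w, x in zip(self.w, pooled)) + self.b
        return 1.0 / (1.0 + math.exp(-z))

    def sgd_step(self, pooled: Vector, label: float, lr: float = 0.5) -> None:
        """One gradient step on binary cross-entropy; only the head is updated."""
        grad = self.score(pooled) - label  # d(loss)/dz for sigmoid + cross-entropy
        self.w = [w - lr * grad * x for w, x in zip(self.w, pooled)]
        self.b -= lr * grad


# Made-up activations: two tokens each, hidden size 2 (assumption, not real data).
sound = mean_pool([[1.0, 0.0], [1.0, 0.2]])   # labelled as rational (1.0)
flawed = mean_pool([[0.0, 1.0], [0.2, 1.0]])  # labelled as irrational (0.0)

head = ScoreHead(dim=2)
for _ in range(200):
    head.sgd_step(sound, 1.0)
    head.sgd_step(flawed, 0.0)
```

After training, `head.score(sound)` should come out above 0.5 and `head.score(flawed)` below it. The backbone stays frozen here; whether you would also need to fine-tune the upper decoder layers to get a useful rationality score is an open question.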

[–] artificialfish 2 points 1 month ago

All theoretical, but I would cut the decoder off a very smart chat model, then fine-tune the encoder to provide a score on the rationality test dataset under CoT prompting.