this post was submitted on 03 Jun 2024
1471 points (97.9% liked)

People Twitter

[–] [email protected] 1 points 6 months ago (1 children)

Which is one area where ML models might (with the right investment) actually be useful. A model trained to look at web pages and read information off them visually, the way we do, would be very powerful. The newer ChatGPT models have visual capabilities; I wonder if you could give one a screen capture of a website and ask it for the prices.

[–] [email protected] 2 points 6 months ago (1 children)

Why would you want a model trained on outdated prices? This is not really something LLMs are particularly suited for.
Maybe to crunch historical data, but not for daily comparisons.

[–] [email protected] 2 points 6 months ago (1 children)

Why would the model be trained on outdated prices? I'm not talking about LLMs, but about a separate model designed to parse visual information (specifically websites) and extract particular elements like prices. My comment about ChatGPT was in reference to the newer models, which can relay visual information; I'm not suggesting that would be the right approach for training a new model.

The applications would be broader than just prices - this would allow you to scrape any human-readable website without needing to do bespoke development.

[–] [email protected] 1 points 6 months ago

I am not sure that would work. You could train a model that analyzes data and then feed it the data you want to transform. That data wouldn't be training data then, but part of your request.
It's like how you can feed a book into GPT4/5 and then ask questions about it.

For what you describe, you wouldn't really need AI, just a more or less fuzzy parser (like the "scan a receipt, get the prices" OCR tools). Unless I misunderstood you.
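
To make the "fuzzy parser" idea concrete, here is a rough Python sketch of what such a thing might look like once you already have text from a page or an OCR pass. The regex, the supported currency symbols, and the function names (`normalize_amount`, `extract_prices`) are all just assumptions for illustration, not anything from a real tool, and it would certainly miss many real-world formats.

```python
import re
from decimal import Decimal

# Hypothetical pattern: a currency symbol, optional space, then digits with
# either "." or "," used as thousands and decimal separators.
PRICE_RE = re.compile(r"([$€£])\s?(\d+(?:[.,]\d{3})*(?:[.,]\d{2})?)")


def normalize_amount(raw: str) -> Decimal:
    """Turn '1,299.00' or '1.299,00' into Decimal('1299.00')."""
    # Assumption: a separator followed by exactly two trailing digits
    # is the decimal separator; every other separator is a thousands mark.
    m = re.search(r"[.,](\d{2})$", raw)
    if m:
        whole = re.sub(r"[.,]", "", raw[: m.start()])
        return Decimal(f"{whole}.{m.group(1)}")
    return Decimal(re.sub(r"[.,]", "", raw))


def extract_prices(text: str) -> list[tuple[str, Decimal]]:
    """Return (currency symbol, amount) pairs found anywhere in the text."""
    return [(cur, normalize_amount(amt)) for cur, amt in PRICE_RE.findall(text)]


if __name__ == "__main__":
    sample = "Milk  $4.99\nLaptop € 1.299,00\nTotal: £12,50"
    for currency, amount in extract_prices(sample):
        print(currency, amount)
```

The catch is that this only works on text that already looks like a price. Deciding which of the numbers on a page is the price you care about is the part where a visual model could plausibly add something.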