this post was submitted on 29 Jun 2023
On giving AI eyes and ears (www.oneusefulthing.org)
submitted 1 year ago* (last edited 1 year ago) by sisyphean to c/auai
 

TL;DR (by GPT-4 🤖)

The article discusses the evolution of AI beyond text-based chatbots, highlighting the emergence of multimodal AI, which can process different kinds of input, including images. This development allows AI to "see" and understand images, significantly enhancing its capabilities and enabling it to interact with the world in new ways. The article also mentions the integration of OpenAI's Whisper, a highly effective voice-to-text system, into the ChatGPT app, which opens up new uses for AI, such as serving as an intelligent assistant. The author emphasizes that AI's growing capabilities, including internet connectivity, code execution, and the ability to watch and listen, have profound implications that call for thoughtful consideration of both the benefits and the concerns.

Notes (by GPT-4 🤖)

AI Evolution Beyond Text

  • AI has evolved beyond being just chatbots. New modes of AI usage have emerged, such as the write-it-for-me buttons in Google Docs, which seamlessly integrate AI into work processes.
  • These changes have significant implications for work and the meaning of writing.

Multimodal AI

  • GPT-4, the most advanced AI at the time of writing, is multimodal: it can process different kinds of input, including images.
  • Multimodality lets the AI "see" images and "understand" what it is seeing. This capability significantly enhances what AI can do, despite occasional errors and hallucinations.

AI Interaction with the World

  • Because AI can now "see," it can interact with the world in an entirely new way, with significant implications.
  • For instance, AI can now build and refine prototypes using vision, a substantial increase in capabilities.

AI Voice Recognition

  • OpenAI's Whisper is a highly effective voice-to-text system that is now part of the ChatGPT app on mobile phones.
  • This integration changes how AI can be used: it can serve as an intelligent assistant that understands intent rather than just taking dictation.

AI in Education

  • Voice recognition can be useful in education, providing real-time presentation feedback.
  • For example, GPT-4 can act as a real-time virtual VC, providing feedback on startup pitches.

AI's Growing Capabilities

  • AI's knowledge and capabilities have expanded beyond just text and include internet connectivity, code execution, and now, the ability to watch and listen.
  • These advancements mean that jobs requiring visual or audio interactions are no longer insulated from AI.
  • The implications of these capabilities are profound, and there is a need to start considering both the benefits and concerns today.