Continuing with the exercise of sharing an easily digestible and smaller version of the main updates of the day in the world of AI.
ChatGPT is getting major update, It can now see, hear, and speak
- OpenAI is introducing voice and image capabilities in ChatGPT, allowing users to have voice conversations and show images to ChatGPT.
- Users can have live conversations about landmarks, get recipe suggestions by showing pictures of their fridge, and even receive math problem hints by sharing a photo.
- The voice and image capabilities will be rolled out to Plus and Enterprise users over the next two weeks, with voice available on iOS and Android and images available on all platforms.
Getty Images introduced an art tool called Generative AI, powered by Nvidia
- It will render images from text descriptions. The tool is designed to be “commercially safer” than rival solutions, with safeguards in place to prevent disinformation and copyright infringement.
- Getty Images will compensate contributors whose works are used to train the AI generator and share revenues generated from the tool.
- The tool can be accessed on Getty’s website or integrated into apps and websites through an API, with pricing based on prompt volume.
Colossal-AI released Colossal-LLaMA-2, an open-source and commercial-free domain-specific LLM
- It uses a relatively small amount of data and training time, resulting in lower costs.
- Includes improvements such as vocabulary expansion, data cleaning system, and a multi-stage pre-training scheme to enhance both Chinese and English abilities.
- Allows for cost-effective training of lightweight domain-specific LLMs, enabling fine-tuning for specific business applications.
Tesla’s humanoid robot Optimus can now sort objects autonomously
- Using its end-to-end trained neural network. The robot is able to calibrate itself using joint position encoders and vision to locate its limbs precisely. It can then sort colored blocks into their respective trays, even adapting to dynamic changes in the environment. – The robot also uses corrective action to turn blocks right-side-up if they are placed on their side.
Snapchat has partnered with Microsoft to insert ads into its AI chatbot feature, My AI
- The chatbot, introduced earlier this year, offers link suggestions related to user conversations. For example, if a user asks for dinner recommendations, the chatbot could reply with a link sponsored by a local restaurant.
- The partnership is a win for Microsoft’s ads business and could position Snapchat as a platform for Gen Z users to search for products and services through AI chats.
Spotify is testing a voice translation feature for podcasts, using AI to translate content into different languages
- By offering translated podcasts from popular hosts like Dax Shepard and Lex Fridman, Spotify hopes to expand its global reach and cater to a wider audience.
Google's AI tool, Bard, has now new capabilities to help travelers plan their vacations
- By connecting with various Google applications like Gmail, Google Flights, and Google Maps, Bard can provide personalized assistance throughout the trip.
- Users can ask Bard to find flight and hotel information, get directions, watch YouTube videos, and even check dates that work for everyone involved.
- Additionally, recent updates to Google Flights offer tools to predict the best time to find cheap airline deals.
Correcto has raised $7M in seed funding to expand its language writing tool for Spanish speakers
- While AI tools like ChatGPT can generate text in Spanish, Correcto believes its tool offers better quality and provides opportunities for individual learning. The company plans to target enterprise customers while also offering a freemium version for individual users.