Elon Musk’s Grok AI has introduced a new vision feature, endowing the chatbot with the ability to ‘see’ the world in real-time using a smartphone’s camera. This is similar to OpenAI’s ChatGPT and Google Gemini, which can analyse images and visuals in real-time.
On Tuesday, April 22, xAI announced Grok Vision, the feature that allows users to point their smartphone cameras at objects, signs, documents, etc., and ask questions about them. The new feature can be accessed using the Grok app for iOS. However, the feature is yet to be introduced in Grok’s Android app.
xAI also introduced capabilities such as multilingual audio and real-time search in Grok’s voice mode. The feature is available to those subscribed to the SuperGrok plan.
Grok’s memory feature
Last week, a memory feature was added to Grok 3, allowing it to remember conversations that users have had with it, allowing it to come up with more personalised responses. In simple words, if a user mentions their health routine, Grok can later suggest a diet plan personalised to their historical habits.
xAI claimed Grok’s memory feature is distinct compared to other chatbots, as its memories are ‘transparent’. This means the user can see exactly what Grok knows and chooses to forget. Transparency and user control set Grok’s memory feature apart from other chatbots. xAI reportedly plans to introduce a ‘forget’ button for Grok users on Android OS that will let them exclude specific chats from its memory.
Earlier this month, Grok gained a Canvas-like feature for editing and creating documents as well as developing basic AI apps in Grok Studio. The features are available for free and paying users on Grok.com. “Grok can now generate documents, code, reports, and browser games. Grok Studio will open your content in a separate window, allowing both you and Grok to collaborate on the content together,” the company said on its official X handle.
© IE Online Media Services Pvt Ltd