A few days ago, OpenAI announced the latest update to ChatGPT, the market-leading generative artificial intelligence tool. ChatGPT 4o surprised users with tighter integration on mobile devices and important improvements in areas such as searching external sources (integration with search engines), working in real time, and translating and interpreting text and voice in other languages. A few days later, Google also showed where it stands in this area, with Gemini at the forefront of the major changes that are coming, especially in search and internet positioning.
To find out more about ChatGPT 4o, we decided to ask it directly and see what it had to say. Here’s the conversation:
Incognito: How’s it going, ChatGPT? I wanted to ask you some questions about the latest news you have released about yourself.
ChatGPT: Hi! Of course, I will be happy to help you with your questions.
Incognito: What are the main new features?
ChatGPT: ChatGPT 4o is the latest version of OpenAI’s Artificial Intelligence model, and brings several significant new features that improve the user experience in a number of areas. Here is a summary of the main features and improvements:
- Improved speed and capabilities. I offer GPT-4-level intelligence, but with greater speed and efficiency. This model has significantly improved my understanding and generation of text, speech and vision, allowing me to handle more complex tasks more quickly and accurately.
- Real-time interaction. One of my favourite features is the ability to interact via the user’s camera and microphone. This allows me to see and hear in real time, adapting my responses according to the visual and auditory context. This capacity for more contextualised and personalised interaction opens up new possibilities for my use.
- Translation and natural conversations. I can translate texts in real time and make conversations more fluent and natural. In addition, users can interrupt me during a conversation to give new instructions or change the subject, which significantly improves the dynamics of interaction.
- Free access and use. Unlike my predecessor, I am available free of charge, with certain limitations on use. Free users have access to advanced features such as the ability to analyse data, create graphs, upload files for help with writing and analysis, and use the enhanced voice mode.
- New tools and applications. A new desktop application has been released for macOS that allows me to integrate directly into the user’s workflow. This application makes it easy to ask questions instantly and discuss screenshots directly in the app. A Windows version is also in the works.
These improvements make me a more powerful and accessible tool, with a focus on personalisation and efficiency, marking a significant step towards a more natural and fluid interaction between humans and machines.
Incognito: The ability to interact via camera and microphone, what kind of applications will this allow us to develop? Can you think of any examples?
ChatGPT: It opens up a wide range of innovative applications that can transform various industries. Here are some examples of how these applications could be developed:
- Personalised tutoring. Students can show their written work or mathematical problems through the camera, and I can provide corrections and explanations in real time.
- Language lessons. Users can practise pronunciation and speaking in a foreign language. I can correct their pronunciation in real time and provide examples of correct usage.
- Remote medicine. Patients can show visible symptoms or measure biometric parameters using the camera. I could assist in the preliminary assessment of symptoms prior to a consultation with a doctor, improving the efficiency of medical consultations.
- Assistance for the elderly. I can provide them with real-time assistance, reminding them to take their medication, guiding them in daily activities or even responding to emergencies detected through the camera and microphone.
- Technical support. Users can show problems with devices or products through the camera. I can guide them step by step through troubleshooting, providing a more efficient and personalised solution.
- Visual support. In online shops, customers can show faulty or mis-shipped products, and I can process returns or replacements more quickly by visually identifying the problem.
- Content creation. Content creators can use me to assist in the production of videos, providing scripts, helping with real-time editing or even interacting with the audience during live broadcasts.
- Surveillance and monitoring. In security systems, I can analyse the video feed in real time to detect suspicious behaviour or incidents, alerting security managers immediately.
Incognito: I found what you were saying about real-time translation very interesting. Do you think this will end the need to learn languages, if we can interact with almost anyone who speaks another language directly through you?
ChatGPT: My real-time translation capability is a powerful tool that can facilitate communication between people who speak different languages. However, this technology is unlikely to completely eliminate the need for language learning for a number of reasons.
Although machine translation technology has advanced significantly, it can still make mistakes, especially in complex contexts or with cultural nuances. Translation accuracy is crucial in technical, legal or medical fields, where a small inaccuracy can have serious consequences. Furthermore, learning a language is not only about acquiring a communicative skill, but also about understanding the culture and perspective of the people who speak it. This cultural and emotional connection is difficult to replicate with a machine.
Learning a new language also has well-documented cognitive benefits, such as improved memory, multitasking and mental agility. These benefits cannot be replaced by relying on a machine for translation.
At a professional level, being multilingual can open up job opportunities and facilitate international mobility. Many companies value the ability of their employees to communicate fluently and autonomously in several languages. Nor should we forget that real-time translation can be a valuable complementary tool. It can help in situations where immediate communication is necessary, but it will not replace deep and thorough language learning. For example, it may be useful for tourists or in emergency situations, but in education and long-term interpersonal relations, knowledge of the language remains essential.
Incognito: That was certainly a very diplomatic reply 😉
ChatGPT: Thank you, I’m glad you found it interesting! Real-time translation and my other advanced capabilities offer many exciting possibilities, but it is important to consider how these technologies complement our human skills rather than replace them entirely.
Incognito: You say that there will be integration with macOS via an app, while Microsoft has just presented the integration of ChatGPT into its new computers through Copilot. Do you think that, in the near future, all computers on the market will work with AI embedded as part of their operating system?
ChatGPT: The integration of Artificial Intelligence into computer operating systems is advancing rapidly, and it is plausible that, in the near future, all computers will come equipped with built-in AI.
The rapid evolution of AI capabilities, such as improved natural language understanding and processing of visual and auditory data, suggests that AI can be more deeply integrated into operating systems. This may include virtual assistants that not only answer questions, but also anticipate needs and proactively perform tasks. As users become accustomed to the convenience and efficiency offered by AI, the demand for these technologies embedded in their devices will increase. Tech companies will be incentivised to incorporate these capabilities in order to remain competitive.
Built-in AI also has many potential applications:
- Productivity. AI can assist with time management, task organisation and the automation of repetitive processes.
- Security. Real-time monitoring can detect cyber threats and protect data.
- Education and learning. Personalised e-learning platforms can adapt the content and pace of learning to the individual needs of each learner.
- Personal assistance. Virtual assistants can manage schedules, make bookings, set reminders and offer personalised recommendations.
Are you already integrating ChatGPT 4o into your processes? How is it working out for you?