GPT-4o set to disrupt conversational AI, edtech and call centre business (2024)

OpenAI’s latest GPT-4o (“omni”) model has not just ‘wowed’ people across the globe but is set to disrupt everything from conversational AI and edtech to content creation and customer services, experts said.

GPT-4o, launched on Monday, will also majorly impact contact centres business, propelling towards fully autonomous call centres without any dependency on human agents, they said.


Elevate Your Tech Prowess with High-Value Skill Courses

Offering CollegeCourseWebsite
Indian School of BusinessISB Product ManagementVisit
IIT DelhiCertificate Programme in Data Science & Machine LearningVisit
Indian School of BusinessProfessional Certificate in Product ManagementVisit

“It seems OpenAI has built an audio foundation model ground up – the model can sing, change tones, speak slow or fast, understand your emotions and respond appropriately,” said Hemant Mohapatra, partner at venture capital firm Lightspeed.


GPT-4o set to disrupt conversational AI, edtech and call centre business (1)ETtech

“A full stack voice and workflows that GPT offers out-of-the box very cheap will be very attractive to developers,” he said, adding that business applications built on caller APIs will find it tough to justify applications.

Voice-first AI startups such as ElevenLabs, play.ht, Respeecher, Hume and Duolingo are bracing up for increased competition from GPT’s multimodal capabilities and real-time language translations.


In fact, after the GPT-4o demo showed real-time language translation, Duolingo lost $340 million in market cap on Monday.

“We are moving closer to AGI (artificial general intelligence) faster than we are willing to believe,” said Alok Goyal, partner at venture capital firm Stellaris Venture Partners. “The only bad news of course is that humans will soon need to worry about their own role and expertise!”

With GPT4o, he said, many complex, cognitive tasks can be done “off the shelf” with no fine-tuning/training in areas such as education, healthcare, financial services, and sales.

Kunal Bahl, cofounder of venture capital firm Titan Capital, said GPT-4o is “the ultimate edtech utility” that could teach millions of students in their native language who wouldn’t need teachers or schools.

The GPT-4o model will power its free-to-use chat assistant ChatGPT with multimodal capabilities across text, vision and audio.

The ‘omni’ model can reason across formats in real time and perform tasks such as debugging code via voice commands, make jokes, sing a lullaby, synthesise 3D objects, resolve customer service complaints, and even talk to another AI.

Raghu Ravinutala, CEO and cofounder of conversational AI startup Yellow.ai, said GPT-4o brings us closer to AI agents very close to humans. “Imagine an AI-powered voice that understands and adapts to your frustration in a conversation, changing its tonality accordingly,” he said.

“The previous voice mode used multiple steps: converting audio to text, processing it through a language model, and then converting the text back to speech,” Ravinutala said. “GPT-4o simplifies this by managing everything within a single model, significantly reducing latency and enabling real-time interpretation for conversations in distinct languages.”

Further, GPT-4o is poised to enhance accessibility for visually impaired individuals, as demonstrated in OpenAI’s videos.

However, the transformation it is expected to bring will not be easy and won’t happen overnight, experts said.

“Just think for a second when you want to change a flight… There are so many variables in play such as price, time, airline, duration, etc.,” said Aakrit Vaish, co-founder and CEO of Indian conversational AI platform Haptik. “Often, these are better done through GUI or a hybrid interface. AI will get some things wrong, just like humans will. But, with AI, we blame the person who built the AI (brand) and not say GPT,” he added.

Amitabh Nag, chief executive of Bhashini, a Digital India initiative to make digital services accessible in all Indian languages, said, “Our aim is to reach India’s last mile, i.e., people living on the other side of not only language divide but also digital divide and literacy divide. To achieve that, AI knowledge, glossary, etc. need to be Indianised to a large extent, and it remains to be seen if global AI models can achieve that.

Experts said AI-native startups will expand their margins as OpenAI has cut the price of business API into half.

Stellaris’ Goyal also reckoned that hyperscalers will witness a boom as “media-rich applications continue to require even more compute and storage”.

AI video analytics company Staqu Technologies said cost of computation is going to be a major factor in full-scale adoption. “For a street camera installed in a smart city and monitoring large crowds 24/7, AI will run a huge bill in terms of both computational power and cost,” Atul Rai, founder of Staqu, said.

Pawan Prabhat, cofounder of generative AI firm Shorthills AI, said Indians are mostly voice-first technology consumers and GPT-4o will bring a quantum jump in AI consumption among India.

“Till now, most of the interaction has been through keyboard and typing. As humans, we are wired to ‘show and tell’. GPT-4o gets AI closer to the way humans converse and interact,” he said.

GPT-4o set to disrupt conversational AI, edtech and call centre business (2024)

FAQs

What can I do with GPT-4o? ›

Multimodal Capabilities: GPT-4o is a multimodal AI model that simultaneously understands and generates content across text, images, and audio. This allows for seamless and natural interactions, whether you type, speak, or share visuals with the model. You can have conversations mixing different modalities fluidly.

Is GPT-4o free? ›

GPT-4o represents OpenAI's latest flagship model, integrating advanced reasoning capabilities across audio, vision, and text modalities in real time. It has been made freely accessible to all users.

What are the benefits of GPT-4o? ›

  • Authentication and access control.
  • Compliance, risk and governance.
  • Network security.
  • Security Admin.
  • Threat management.
May 15, 2024

What are the disadvantages of GPT-4? ›

The disadvantages of ChatGPT should be known in detail. They are said that GPT 4 turbo has up to 120k context while the output output is only around 4k words. Instead of expanding the amount of output, output in the context is deliberately withheld without any valid reason.

What is the difference between GPT-4 and GPT-4o? ›

Performance and efficiency

GPT-4o is also designed to be quicker and more computationally efficient than GPT-4 across the board, not just for multimodal queries. According to OpenAI, GPT-4o is twice as fast as the most recent version of GPT-4.

What is the difference between AI and conversational AI? ›

Basically, the difference between generative AI (GAI) and conversational AI (CAI) is that generative AI produces original content and creations when prompted, while conversational AI specialises in holding authentic and useful two-way interactions with humans by understanding and responding in text or speech.

How is conversational AI used in business? ›

Conversational AI has become an invaluable tool for data collection. It assists customers and gathers crucial customer data during interactions to convert potential customers into active ones. This data can be used to better understand customer preferences and tailor marketing strategies accordingly.

What are the 4 types of AI with example? ›

4 main types of artificial intelligence
  • Reactive machines. Reactive machines are AI systems that have no memory and are task specific, meaning that an input always delivers the same output. ...
  • Limited memory machines. The next type of AI in its evolution is limited memory. ...
  • Theory of mind. ...
  • Self-awareness.
Mar 26, 2024

What can GPT-4 not do? ›

GPT4 cannot really hear, and it cannot really talk. Voice input is transcribed into text by a separate model, 'Whisper,' and then fed to GPT4. The output is read by another model.

What is GPT-4 best at? ›

GPT-4 is a large multimodal model that can mimic prose, art, video or audio produced by a human. GPT-4 is able to solve written problems or generate original text or images.

How much does GPT-4 cost? ›

gpt-4 models cost $30.00 per 1M input tokens and $60.00 per 1M output tokens. gpt-4-1106-vision-preview (with GPT_VISION) costs $10.00 per 1M input tokens and $30.00 per 1M output tokens. text-embedding-ada-002 (with GPT_MATCH) costs $0.10 for 1M input tokens.

What can ChatGPT-4 help with? ›

ChatGPT-4 can help you create cool readmes that provide users with a clear understanding of your project's goals, installation instructions, and usage examples. Outline the details you want to point out, and ChatGPT-4 will translate them into a simple readme.

What can ChatGPT Plus do? ›

ChatGPT Plus is a subscription plan for ChatGPT. It offers availability even when demand is high, faster response speed, and priority access to new features.

What can ChatGPT-4 do that 3.5 can't? ›

The main distinction between GPT-3.5 and 4 resides in their scale and capabilities. While GPT-3.5 was trained on 175 billion parameters, GPT-4 likely surpasses 100 trillion parameters, indicating a substantial increase in size and sophistication.

Is GPT-4 usable? ›

It's unusable at the moment. Code blocks being messed up. Each time when you ask something, it gives a complete unneeded summary everytime instead of going to the point of what was asked.

Top Articles
Latest Posts
Article information

Author: Kelle Weber

Last Updated:

Views: 5931

Rating: 4.2 / 5 (73 voted)

Reviews: 80% of readers found this page helpful

Author information

Name: Kelle Weber

Birthday: 2000-08-05

Address: 6796 Juan Square, Markfort, MN 58988

Phone: +8215934114615

Job: Hospitality Director

Hobby: tabletop games, Foreign language learning, Leather crafting, Horseback riding, Swimming, Knapping, Handball

Introduction: My name is Kelle Weber, I am a magnificent, enchanting, fair, joyous, light, determined, joyous person who loves writing and wants to share my knowledge and understanding with you.