OpenAI Realtime API officially launched: supports emotion perception and multi-language switching

08.29.25

OpenAI recently announced that its "Realtime API" has officially exited beta and entered production. This new API, designed for businesses and developers, is powered by the gpt-realtime conversational speech model. It utilizes an end-to-end speech-to-speech architecture to directly generate and process speech, eliminating traditional text-to-text conversion steps. Compared to its predecessor, it offers faster response times, more natural speech, and significantly improved processing of complex commands, making it suitable for scenarios such as customer support, education, and personal productivity tools.

The model has added emotion-sensing capabilities, capturing nonverbal cues like laughter and enabling seamless language switching during conversations. Developers can also customize the voice tone, such as "friendly with a French accent" or "fast-paced professional voice." In terms of performance, gpt-realtime achieved impressive results across multiple benchmarks: Big Bench Audio accuracy increased from 65.6% to 82.8%, and ComplexFuncBench jumped from 49.7% to 66.5%.

This upgrade also optimizes the tool integration process, allowing models to more accurately select and trigger external tools. It also supports image input—users can send screenshots or photos, and the model will interact based on the image content, such as recognizing text or answering related questions. To address cost constraints, the API price has been reduced by 20%, with audio input/output tokens now priced at $32 and $64 per million, respectively. The ability to set a token usage cap has also been added.

In terms of security, the API automatically detects inappropriate content and terminates sessions, but OpenAI emphasizes that developers must implement custom security rules. For EU users, data localization options and special privacy rules have been implemented to comply with GDPR requirements.

Fujitsu and NVIDIA Build FugakuNEXT, the Most Powerful AI Supercomputer

Japan is preparing to launch a new national supercomputer, FugakuNEXT, developed by Fujitsu, NVIDIA, and the RIKEN Research Center. The system, expected to be

08.30.25 1

Huawei takes Apple's lead in smartwatch market

According to Counterpoint Research, global smartwatch shipments grew 8% in the second quarter of 2025 after a long period of decline. A key factor was the buoy

08.30.25 0

The iconic Apple iMac G3 is made from Lego

An Apple fan has recreated the iconic iMac G3 all-in-one computer from Lego. The project, hosted on the LEGO Ideas platform, has little chance of becoming an o

08.30.25 1

In Russia, a bandage has been invented to fight superbugs

MISIS University staff, led by Professor Dmitry Shtansky, have developed a new generation of medical bandages for treating complex wounds. The prototype is eff

08.30.25 0

Nissan Confirms Return of Legendary GT-R

Nissan has officially announced the resumption of production of its iconic GT-R sports car. After the final R35 model rolled off the assembly line at the autom

08.30.25 0

Researchers have figured out how to identify hacked accounts

Researchers at Cornell Tech have introduced a system called CSAL (Client-Side Encrypted Access Logging) that helps detect hacked accounts without compromising

08.30.25 0

The Chinese have learned to produce fuel and oxygen in space

For the first time, Chinese astronauts aboard the Tiangong orbiting station were able to use artificial photosynthesis technology to produce rocket fuel and ox

08.30.25 0

‌xAI Launches Grok Code Fast 1: A Low-Cost Intelligent Code Generation Model to Challenge the Industry Landscape

Elon Musk's artificial intelligence startup, xAI, officially released Grok Code Fast 1, an intelligent code generation model, on Thursday. Focusing on "spe

08.30.25 1

Alibaba releases Q1 2026 financial report: Net profit surges 76%, driven by cloud business and e-commerce

Alibaba Group recently released its financial results for the first quarter of fiscal year 2026 (ending June 2025). The report showed net profit reaching 42.38

08.30.25 0

TCL Technology's 2025 Semi-Annual Report: Net Profit Surges 89%, Display Business Leads the World

TCL Technology recently released its 2025 semi-annual report, showing operating revenue reaching 85.56 billion yuan, a year-on-year increase of 6.65%, and net

08.30.25 1

Xiaomi AI glasses launch internal testing of new features, recruiting 200 users to experience innovative features such as Alipay QR code scanning

Xiaomi has reportedly launched a closed beta program for new features on its AI glasses, recruiting 200 Mi Fans nationwide to participate. The beta program wi

08.30.25 1

Epic Games is giving away two free games: Machinarium and Make Way, and Monument Valley will be given away next week.

This week, Epic Games is giving away two free games: the puzzle masterpiece "Machinarium" and the creative racing game "Make Way." Next week, Epic Games will

08.29.25 2

OpenAI Realtime API officially launched: supports emotion perception and multi-language switching

OpenAI recently announced that its "Realtime API" has officially exited beta and entered production. This new API, designed for businesses and developers, is

08.29.25 1

‌Global smartwatch market rebounds: Huawei surpasses Apple to take the top spot, with Chinese brands becoming the growth engine

Market research firm Counterpoint Research recently released a report showing that global smartwatch shipments grew 8% year-on-year in the second quarter of 2

08.29.25 1

Dell releases Q2 results for fiscal year 2026: revenue reaches $29.8 billion, up 19% year-on-year

Yesterday, Dell Technologies announced its second-quarter financial results for fiscal year 2026 (ending August 1st). Dell's report showed record revenue

08.29.25 1