OpenAI Secretly Tested GPT-4o, Which Topped the Chatbot Arena Leaderboard

Author: IT Home 2024-05-14 18:15:00

IT Home reported on May 14 that OpenAI employee William Fedus confirmed on the social platform X on Monday that the mysterious "gpt2-chatbot" that has recently performed well on the LMSYS Chatbot Arena is GPT-4o, the new artificial intelligence model the company has just released. Fedus also revealed that GPT-4o topped the arena leaderboard in testing, achieving the highest score recorded to date.

"GPT-4o is our state-of-the-art, cutting-edge model," Fedus wrote on X, "and we've been testing a version of the model in the arena under the name 'im-also-a-good-gpt2-chatbot.'"

Chatbot Arena is a website where visitors talk to two randomly selected AI language models at the same time without knowing which is which, and then pick the model that gives the better response.

Starting in April of this year, OpenAI tested multiple versions of GPT-4o in the arena. The model first appeared under the name "gpt2-chatbot", then as "im-a-good-gpt2-chatbot", and finally as "im-also-a-good-gpt2-chatbot".

Since GPT-4o's release today, multiple sources have revealed that the model has topped LMSYS' internal leaderboard by a huge margin, surpassing the previous highest-ranked models, Claude 3 Opus and GPT-4 Turbo.

lmsys.org's official account shared a chart and wrote: "The 'GPT2-chatbot' family of models has just soared to the top of the list, surpassing all other models by a significant margin (around 50 Elo), and it has become the most powerful model in the arena. Here's an internal screenshot. The public version, 'GPT-4o', is now in the arena and will soon be on the public leaderboard!"

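As of IT Home's press time, "im-also-a-good-gpt2-chatbot" had an Elo score of 1309, ahead of GPT-4-Turbo-2023-04-09 at 1253 and Claude 3 Opus at 1246. Before the three "GPT2-chatbot" models appeared and shook things up, Claude 3 and GPT-4 Turbo had been vying for the top spot on the leaderboard.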
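To give a sense of what those Elo gaps mean in practice, here is a minimal Python sketch that turns a rating difference into an expected head-to-head win share. It assumes the classic Elo formula with a 400-point scale and ignores ties. The function name `expected_score` is made up for illustration, and the leaderboard's own rating method may differ from this textbook formula.

```python
def expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected share of head-to-head wins for A against B.

    Assumption: the textbook Elo expectation with a 400-point scale.
    This is not taken from the arena's own code.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


# Elo scores quoted in the article at press time.
ratings = {
    "im-also-a-good-gpt2-chatbot": 1309,
    "GPT-4-Turbo-2023-04-09": 1253,
    "Claude 3 Opus": 1246,
}

leader = "im-also-a-good-gpt2-chatbot"
for name, rating in ratings.items():
    if name != leader:
        share = expected_score(ratings[leader], rating)
        print(f"{leader} vs {name}: expected win share {share:.1%}")
```

Under these assumptions, a lead of roughly 50 to 60 points works out to the top model being preferred in about 58% to 59% of direct comparisons: a clear edge, but far from a sweep.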
