Wenxin Yiyan hands-on comparison: how far is it from the top AI models?

Author: Colorful peaks 2023-07-01 14:57:00

Wenxin Yiyan is a large language model released by Baidu in March this year, and is said to have "the largest parameter count in Asia". Three months on, the author finally got a coveted closed-beta slot and took the time to run a fairly comprehensive test:

Compared against: GPT-3.5 (the free version on the official website) and new Bing (which reportedly got the GPT-4 core early, but with some features stripped out).

First, a popular-science logic problem:


[Screenshots of the three answers: GPT, Wenxin Yiyan, new Bing]

All three models gave good answers. Wenxin Yiyan's answer was a little convoluted and came close to talking itself in circles.

Second, a moderately difficult math problem: a cubic equation in one variable.

(In testing, all three models could solve quadratic equations in one variable.)

GPT's approach was sound and it produced one correct root, but it seems to have miscalculated in the later steps:

When asked in Chinese, new Bing didn't seem to understand what I meant, no matter how I corrected it afterwards:

[Screenshot: new Bing, question asked in Chinese]

When asked in English, new Bing gave the correct answer:

[Screenshot: new Bing, question asked in English]

Wenxin Yiyan's answer was wrong, so wrong that I didn't know how to follow up:

[Screenshot: Wenxin Yiyan]
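A quick way to judge answers like the ones in this test is to substitute each claimed root back into the equation. The sketch below is only an illustration: the equation used in the test is not reproduced in the text, so the cubic x^3 - 6x^2 + 11x - 6 = 0 and the helper names `evaluate`, `is_root`, and `check_answer` are assumptions made up for this example.

```python
# Hypothetical example: the article's actual equation is not reproduced
# in the text, so this cubic is an assumption chosen for illustration.
# x^3 - 6x^2 + 11x - 6 = 0 has the roots 1, 2 and 3.
COEFFS = [1, -6, 11, -6]  # coefficients, highest degree first


def evaluate(coeffs, x):
    """Evaluate the polynomial at x using Horner's method."""
    result = 0
    for c in coeffs:
        result = result * x + c
    return result


def is_root(coeffs, x, tol=1e-9):
    """Return True if x satisfies the equation within the tolerance."""
    return abs(evaluate(coeffs, x)) <= tol


def check_answer(coeffs, claimed_roots):
    """Map each root a model claims to whether it really solves the equation."""
    return {x: is_root(coeffs, x) for x in claimed_roots}


if __name__ == "__main__":
    # Pretend a model answered "x = 1, 2, 4"; the last value is wrong.
    print(check_answer(COEFFS, [1, 2, 4]))
```

Substituting back only shows whether each claimed value is a root; it does not show whether the model found all of them.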

Third and last, a scheduling problem, to see how well they do as a personal butler:

[Screenshots of the three answers: Wenxin Yiyan, new Bing, GPT]

new Bing performed best. GPT lost points because it cannot go online to get up-to-date information. As for Wenxin Yiyan... it seems unable to understand some of my premises. The foreign AIs have a better grasp of Chinese than the Chinese AI does.

Fourth, the verdict:

Wenxin Yiyan does lag behind the top AI models in many respects. It is perhaps closer to a "plus plus" version of Siri, Xiaodu, or Xiaoai Tongxue with a knowledge base attached than it is to GPT.

[Tip for using AI: talk to it like a person instead of asking questions mechanically. When an answer is wrong or the model misunderstands you, correct it promptly and ask it to answer again; this often leads to a more accurate result.]
