हिन्दी ಕನ್ನಡ తెలుగు मराठी ગુજરાતી বাংলা ਪੰਜਾਬੀ தமிழ் অসমীয়া മലയാളം मनी9 TV9 UP
Bihar 2025 India Sports Tech World Business Career Religion Entertainment LifeStyle Photos Shorts Education Science Cities Videos

Grok 4.1: All you need to know about xAI’s biggest upgrade yet

xAI has launched Grok 4.1, now available to all users on web and mobile. The new update focuses on improving conversation quality, cutting down on factual errors, and scoring higher in empathy and writing tests. It ranks #1 on LMArena's blind AI benchmark.

Grok 4.1 now live for all users
| Updated on: Nov 18, 2025 | 01:55 PM

New Delhi: The last few months have been a busy stretch for large language model developers, and xAI has now pushed itself back into the spotlight with the release of Grok 4.1. The update went live on November 18 (IST) across grok.com, the X platform, and the mobile apps, with Auto mode switching to the new version by default. Even free users can try it, which is rare in the current AI race.

We spent most of today reading through xAI’s long blogpost and early benchmark reports, and the mood is pretty clear. Grok 4.1 is not trying to be louder or bigger. It is trying to feel better. Faster replies, fewer mistakes, and a conversation style that feels a little more human.

Also Read

Grok 4.1 gets a silent rollout and early wins

xAI said that it quietly shipped the new model between November 1 and November 14. Users did not know they were talking to test versions. During this period, xAI ran blind pairwise comparisons on real conversations. According to the company, Grok 4.1 was preferred 64.78 percent of the time over the previous production model.

Elon Musk also made a short comment saying users will "notice a significant improvement in speed and quality.” The update focuses on three simple sounding goals: faster response, better factual accuracy and more natural conversation.

The biggest structural change is inside the training system. xAI said Grok 4.1 uses frontier reasoning models as its reward models, letting the AI evaluate itself repeatedly at scale. This reduces the need for massive manual labeling and gives more control over style, tone and personality.

Top scores in global LLM battles

One of the most talked-about results comes from LMArena’s Text Arena, a popular platform where users can test models blindly.

Here is how Grok 4.1 performed:

  • Grok 4.1 Thinking scored 1483 Elo, ranking #1
  • Grok 4.1 (fast) scored 1465 Elo, ranking #2
  • The previous Grok 4 model was ranked #33

It is unusual to see a non-reasoning mode beat other models that are using full chains of thought. That alone has boosted the hype around the update.

Big jump in emotional intelligence and creative writing

xAI also tested the model on EQ-Bench, a benchmark that measures empathy, insight and interpersonal responses. Grok 4.1 scored 1586 Elo, an increase of more than 100 points.

In Creative Writing v3, the score climbed to 1722 Elo, nearly a 600 point jump. This test looks at story flow, language rhythm and character consistency. Early examples shared by xAI show the new version responding to emotional prompts with more warmth and less robotic phrasing.

Fewer hallucinations and smarter responses

Factual accuracy has been a weak point for many fast-response models. Grok 4.1 seems to have made progress here.

According to xAI:

  • Hallucination rate dropped from 12.09 percent to 4.22 percent
  • FActScore dropped from 9.89 percent to 2.97 percent

That is close to a three times reduction in incorrect answers for information queries. The company credits its new reward model system for this improvement.

Bigger context window for long conversations

Grok 4.1 now supports a context window of 256,000 tokens, and up to 2 million tokens in Fast mode. This allows long document analysis, larger stories and more continuous chats without losing track of earlier messages.

For creators, researchers and editors, this is a huge quality-of-life upgrade. No need to repeatedly paste parts of the same document.

How does it compare to rivals?

We still do not have complete data on its performance against GPT 5.1 or the soon-to-release Gemini 3.0. Benchmarks are scattered. Early reports show that Grok 4.1 is pushing hard, but the AI field is moving so quickly that comparisons become outdated within days.

Photo Gallery

Entertainment

World

Sports

Lifestyle

India

Technology

Business

Religion

Shorts

Career

Videos

Education

Science

Cities