Claude Opus 4.6

Anthropic

Claude Opus 4.6 (Non-thinking), the default high-effort version, already ranks among the top models for reasoning capability; the low-effort version thinks less but shows little difference in output. It is expensive, but it accepts multimodal input (text and images) and generates high-quality text output. A key highlight is a context window that can scale up to 1 million tokens, which lets it process large volumes of information in a single request.

Model Specifications

Technical information and release details.

Developer

Anthropic

Multimodal Support

Intelligence Score

Context Window

200k

Average Price (USD/1M tokens)

$10.00

Speed (tokens/s)

45.0

Latency (s)

1.91

Release Date

2/5/2026

Performance Statistics

The model's intelligence score is the average of these benchmark scores

Detailed Benchmarks

Compare Claude Opus 4.6 with other top models in specific domains.

Other models from Anthropic

Claude 4.5 Haiku (thinking)

Rank 49

Claude 4.5 Haiku (Thinking) by Anthropic is one of the strongest models in terms of intelligence and is reasonably priced compared to similar models. It also stands out for its speed, supports text and image input, text output, and has a 200k token context window with knowledge updated to July 2025.

Claude 4.5 Sonnet (thinking)

Rank 42

Claude Sonnet 4.5 is Anthropic's most advanced Sonnet model, optimized for AI agents and coding workflows. It delivers superior performance on coding benchmarks and introduces powerful agentic capabilities such as tool orchestration and speculative parallel execution. This model is suited for multi-context and long-horizon workflows, capable of operating autonomously for many hours.

2.0

Claude Fable 5 (max)

Rank 1

Claude Fable 5 (Adaptive Reasoning, Max Effort) is one of the most powerful AI models, regarded as a safety-optimized version of Mythos 5. It stands out for its adaptive reasoning and maximum effort capabilities — particularly when using Max effort mode, where Anthropic has implemented a mechanism that re-invokes Opus 4.8, though at a very high price of $10 per 1M input tokens (cache $1/1M tokens) and $50 per 1M output tokens.

3.0

Claude Opus 4.6 (max)

Rank 15

Claude Opus 4.6 (Thinking) is one of the models primarily focused on Adaptive Thinking. Despite being expensive, slow, and verbose, it excels with adaptive reasoning capabilities. This model supports text and image input, then outputs text.

3.0

Claude Opus 4.7

Rank 19

Claude Opus 4.7 (Non-reasoning, High Effort) is one of Anthropic's best models. Despite reduced reasoning, with high effort enabled the model remains very powerful. It supports text and image input, with text output. However, this model is quite expensive, slower than average, and tends to overthink.

Claude Opus 4.7 (max)

Rank 7

Claude Opus 4.7 (Adaptive Reasoning, Max Effort) is one of the leading intelligence models, supporting text and image input while outputting text. It features adaptive reasoning capabilities and is designed for tasks requiring maximum effort. This model is very detailed in its responses.

Is Claude 4.6 really worse than at launch?

On Reddit, Hacker News, and Anthropic's GitHub, hundreds of developers are reporting the same issue: Claude Opus 4.6 and Sonnet 4.6 perform significantly worse on real-world tasks than they did at launch. One GitHub user recorded their performance score dropping from 92/100 to 38/100 when using Opus 4.6. The question is whether this is due to ongoing business losses, a technical issue at Anthropic, or a more complex story.

What the Community is Reporting About Claude Opus 4.6

The Most Clearly Documented Complaints

Complaints on social media are easy to dismiss, but when they appear in Anthropic's own GitHub repository, where developers report bugs with Claude Code, they point to a real issue. These are professional users with measured processes, not subjective feelings.

A developer reported that a production automation pipeline, which had been running stably for over 2 weeks, suddenly produced chaotic results on March 6th with the same Opus 4.6 model. According to this person, when asked to self-evaluate the conversation quality, the model consistently rated itself at the level of Sonnet 4, not Opus 4.6. In other words, the model itself judged that it was performing below expectations.
(Source: GitHub Issue #31480, Anthropic/claude-code)

Another report gave a more specific real-world example: asked to generate 3 emails from a template for 3 insurance companies, Opus 4.6 produced only 1 email. When prompted again, the model generated all 3, but after the user made a minor edit, it went back to generating 1 email. This loop repeated without any consistent logic. The reporter noted that their performance score dropped from 92/100 to 38/100 after switching to Opus 4.6.
(Source: GitHub Issue #24991, Anthropic/claude-code)

In addition to the two reports above, a compiled thread on Hacker News shows many independent developers confirming similar behavior and saying they have reverted to Claude 4.5 while awaiting a response from Anthropic.
(Source: Hacker News thread)

Real-world Comparison Between Opus 4.6 at Launch and Recently

Below are some specific examples from the community. I have also had time to compare the behavior of the two versions myself.

Example 1: Instruction Adherence
Prompt: "Write an email to a customer. NEVER mention the price in this email."
Previous Opus 4.6: Complied correctly, with no mention of price.
Opus 4.6 (after some point in March 2026): Mentioned a "suitable pricing package" in the second paragraph despite the clear "NEVER" rule.

Example 2: Reading Reference Files
Prompt: Read a style guide file and apply it to the output.
Previous Opus 4.6: Read the file accurately and applied the specified style correctly.
Opus 4.6 (at the time of the report above): Skipped reading the file and produced a completely different format.

Example 3: Multi-part Task Handling
Prompt: "Create 3 scenarios for 3 different situations."
Previous Opus 4.6: Generated all 3 scenarios in one go, with a clear structure.
Opus 4.6 (according to the February 2026 report): Generated 1 scenario and, when prompted to continue, forgot the previous scenarios, leading to an endless loop.

Is Reverting to Opus 4.5 the Best Solution?

Reverting to Opus 4.5 Even Though Opus 4.6 is Still Quite Good

Many people have suggested reverting to Opus 4.5 as a temporary workaround. However, going by official benchmarks alone, Opus 4.6 outperforms Opus 4.5 on almost all important criteria, especially for those who need long contexts. Opus 4.5 currently has only a 200k context window, which cannot match Opus 4.6's ability to expand to 1M.

Regarding scores, on BrowseComp, a benchmark evaluating multi-step web research, Opus 4.6 achieved 84.0% while Opus 4.5 reached only 67.8%, an improvement of 16.2 percentage points. On SWE-bench Verified, which assesses real-world coding, Sonnet 4.6 achieved 79.6% compared to Sonnet 4.5's 77.2%. On ARC-AGI 2, a test of novel problem solving, Opus 4.6 nearly doubled the score of 4.5. There is one interesting exception: on the SWE-Bench Multi-Agent benchmark, which measures the ability to coordinate multiple tools simultaneously, Opus 4.5 achieved 62.3% while Opus 4.6 reached only 59.5%. That is a small but real decline of 2.8 percentage points, and it appears to be the scenario most users are complaining about.

Subjective and Objective Causes for Opus 4.6's Poor Experience?

This is the most important part for understanding the problem correctly. At least three different causes can produce the same symptom of a "model performing worse":

Temporary Technical Issues: Anthropic has confirmed multiple official incidents on its status page, including "Elevated errors on Claude Opus 4.6" on February 28, 2026, a similar incident on March 31, 2026, and "Opus 4.6 and Sonnet 4.6 error rate elevated" on the same day. These are not subjective complaints. They are officially recorded technical incidents, and many "regression" reports were filed precisely during these periods.

Default Behavior Changes: Opus 4.6 is designed to think more by default through "adaptive thinking", meaning it decides when to engage in deep reasoning and when not to. This makes it slower and sometimes more cumbersome on simple tasks, so users accustomed to 4.5 feel that the model is "overthinking" instead of responding quickly.

Anthropic is Still Profit-Oriented: (This is a personal opinion.) Anthropic's biggest goal still seems to be profit, so it may have reduced the compute allocated to Opus 4.6 to ease its cost burden, much as OpenAI shut down Sora to reduce costs.

So, Are People Mentioning Other Solutions?

First, Switching to Codex

Judging by Opus's track record, the current issues with Opus 4.6 appear temporary, but they inadvertently benefit OpenAI significantly as people flock to Codex with GPT-5.3 Codex. Codex currently offers more generous quotas than Claude Code, but I don't think this will seriously threaten Anthropic, because my experience with Opus 4.6 on both Antigravity and Claude Code has been much better than with Codex. For instance, when I only needed to modify one file, Opus 4.6 did so correctly and precisely, whereas Codex also modified other files and broke my entire website, which was truly frustrating.

Deep Edits in the Settings File

Someone has shown how to adjust Claude Code's handling of Opus 4.6's "thinking" by editing the ~/.claude/settings.json file. If you have tried it, please comment on your experience so others know.

Is This an Industry Standard?

Yes. OpenAI, Google, and Anthropic all have a history of releasing new models that score better on benchmarks but draw complaints about real-world experience, often because optimizing for a benchmark set does not reflect the full diversity of actual workflows. This is why large companies often do not upgrade immediately when a new version is released but first test it thoroughly on their own workloads.

If you are using Claude Opus 4.6 for research workflows, computer use, or long-horizon reasoning tasks, the best approach for now is still to revert to Opus 4.5 so you can continue your work without interruption.
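The workload-specific testing mentioned above can be sketched as a small regression check. This is only an illustration built from the article's examples (the "NEVER mention the price" rule and the "generate 3 emails" task). The names `check_output` and `pass_rate`, the `---` delimiter between emails, and the forbidden-word list are all hypothetical, and the model responses are passed in as plain strings rather than fetched from any API.

```python
import re

# Hypothetical rule: words that would violate a "NEVER mention the price" instruction.
FORBIDDEN = re.compile(r"\b(price|pricing|cost)\b", re.IGNORECASE)


def check_output(text: str, expected_parts: int, delimiter: str = "---") -> list[str]:
    """Return a list of rule violations for one model response."""
    problems = []
    if FORBIDDEN.search(text):
        problems.append("mentions price")
    # Count non-empty sections; the delimiter is an assumed output convention.
    parts = [p for p in text.split(delimiter) if p.strip()]
    if len(parts) != expected_parts:
        problems.append(f"expected {expected_parts} parts, got {len(parts)}")
    return problems


def pass_rate(responses: list[str], expected_parts: int) -> float:
    """Share of responses (0 to 100) that have no violations."""
    if not responses:
        return 0.0
    ok = sum(1 for r in responses if not check_output(r, expected_parts))
    return 100.0 * ok / len(responses)
```

Running the same saved prompts through such a check before and after a model change gives a repeatable number to compare, instead of relying on impressions from individual chats.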

An • 14 Apr, 2026

Claude upgrades to 1 million token context window for free

In a move considered a 'game-changer' in the AI industry, Anthropic has announced a major upgrade: a 1 million token context window for Claude Opus 4.6 and Sonnet 4.6 at standard pricing. Notably, there is no surcharge for long context, the opposite of most other AI providers, including Google and OpenAI, who typically raise prices for larger context limits.

How much does Claude with a one million token context window cost?

This is a massive shift from Anthropic. One million tokens is equivalent to about 750,000 words, enough to process 10 to 15 full novels in a single run. This has real significance for everyday workflows:

Instead of chunking codebases or documents into parts, users can load an entire project into a single session, allowing the AI to work with it as a unified whole.
Claude can retain all information from start to finish when analyzing thousands of pages of legal contracts, eliminating the risk of losing context halfway.
Long-context workarounds that used to be required, such as document chunking, lossy summarization, or clearing context to free up memory, are no longer necessary.

New pricing structure with no surcharge for one million tokens

The most surprising aspect is the new pricing structure, which has no long-context surcharge at all. The standard pricing applies to the entire range from 1 to 1 million tokens:

Opus 4.6: $5 input and $25 output per 1 million tokens.
Sonnet 4.6: $3 input and $15 output per 1 million tokens.

For perspective, many providers previously charged 2 to 4 times more once a request exceeded 200,000 tokens of context. In particular, Claude Code users on Pro ($20), Max ($100), Team, and Enterprise plans automatically receive the 1 million token context window when using Opus 4.6, without needing extra usage credits.

Does Claude Opus 4.6 really remember all 1 million tokens?

A common question when a context window grows is whether reasoning quality suffers. Anthropic addressed this concern with benchmark results.

Claude Opus 4.6 scores 78.3% on MRCR v2, a test measuring the ability to retrieve hidden information within a massive amount of text. This is the highest score among current advanced models at the same context length. For comparison:

GPT's accuracy drops significantly, reaching only 36% at 1 million tokens of context.
Gemini performs even worse, at just 26%.

At those accuracy levels, these models fail to retrieve roughly two-thirds or more of the hidden information at long context. Sonnet 4.6 is also reported to retain over ⅔ of a long context, which further supports Anthropic's leading position in processing complex information.

Claude's media limit increases 6x to 600 images per request

Alongside the context window, another upgrade received less attention but matters a great deal: the media limit. It has increased to 600 images or PDF pages per request, 6 times the previous 100. This is especially meaningful for tasks that involve many visual documents or PDFs, and for Claude Pro and Max users who constantly hit limits and have to wait.

Significantly reducing usage costs for businesses

The feature is available immediately on the Claude Platform, Microsoft Azure Foundry, and Google Cloud Vertex AI. For Claude Code users on Max, Team, and Enterprise plans using Opus 4.6, the 1 million token context window is enabled by default without additional setup. This not only improves performance but also significantly reduces costs for AI systems that frequently call Claude's API, bringing major economic benefits to businesses and developers.
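To make the flat pricing concrete, here is a minimal sketch of the arithmetic using the per-million-token rates quoted in this article. The function name `request_cost` and the dictionary keys `opus-4.6` and `sonnet-4.6` are illustrative labels, not official API model identifiers, and the rates are a snapshot that may change.

```python
# USD per 1 million tokens, as quoted in the article (a snapshot, not a live price list).
PRICES = {
    "opus-4.6": {"input": 5.00, "output": 25.00},
    "sonnet-4.6": {"input": 3.00, "output": 15.00},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request at flat pricing, with no long-context surcharge."""
    rates = PRICES[model]
    total = input_tokens * rates["input"] + output_tokens * rates["output"]
    return total / 1_000_000


# Example: an 800,000-token codebase as input and a 4,000-token answer.
opus_cost = request_cost("opus-4.6", 800_000, 4_000)      # about 4.10 USD
sonnet_cost = request_cost("sonnet-4.6", 800_000, 4_000)  # about 2.46 USD
```

Because the rate is the same across the whole range, the cost scales linearly with token count; there is no jump at 200,000 tokens under this pricing.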

Liên • 16 Mar, 2026

Claude Opus 4.6 launches, continuing to emphasize adaptive thinking

Some people may not even have had a chance to try Claude Opus 4.5, and Anthropic has already released Claude Opus 4.6; the pace is remarkably fast. As with its predecessor, Anthropic continues to emphasize the model's shift from a reactive assistant to a proactive collaborator. The major changes in how the AI understands and works alongside people in everyday tasks show most clearly in the Adaptive Thinking feature.

[VIDEO:dPn3GBI8lII|Claude Opus 4.6 introduction video|Anthropic's Claude Opus 4.6 introduction video]

When Claude starts thinking before acting

The most noticeable change in Claude Opus 4.6 is Adaptive Thinking. Previously, you often had to weigh how long to let the AI think in order to balance speed and quality.

Similar to GPT 5.x, Claude decides on its own how much reasoning to apply based on the difficulty of the request. For small chores such as renaming a file or formatting text, Claude responds immediately (Low level). When it meets a complex software architecture problem, it analyzes more deeply before giving a final answer, aiming for the highest accuracy. The difference from GPT 5.x is that users can still easily adjust the effort parameter, lowering it to save time and cost if Claude seems to be "overthinking" a simple task. The community is in fact complaining a lot that Claude Opus 4.6 overthinks, which burns a great many tokens and wastes time; hopefully Anthropic fixes this soon.

Continuing to top the leaderboards

Releasing Claude Opus 4.6 with the ability to process 1 million tokens (in beta) puts Claude on par with Gemini 3 and Grok 4.1. For ordinary users, however, this number probably matters little, since it is hard to use up even 200k tokens; the feature mainly targets specialized users. Note that for Claude Opus 4.6, requests exceeding 200k tokens are charged $10 per million input tokens.

Right after launch, Claude Opus 4.6 swept AI leaderboards worldwide. It repeatedly beat rivals such as Gemini 3, Grok 4.1, and GPT 5.2 to take first place, from agentic coding on Terminal-Bench 2.0 to complex multidisciplinary reasoning tests such as Humanity's Last Exam.

Agents continue toward autonomous operation

Anthropic adds Agent Teams, so you no longer have to work with a single AI. In coding especially, Claude Opus 4.5 earned a great deal of trust for writing code with fewer bugs than competitors, and Claude Opus 4.6 will surely do even better.

In large projects, Claude can split itself into small groups working in parallel: one handling the interface, one handling system logic, and one dedicated to testing for bugs.

A typical example is a team of 16 Claude agents that built a C compiler from scratch, producing more than 100,000 lines of source code with very little human intervention. Although fully autonomous projects like this can cost tens of thousands of USD, they open up a future in which AI can manage complex projects from start to finish.

Deep office integration: Excel and PowerPoint

Beyond programming, Claude Opus 4.6 now reaches deep into familiar office tools:

In Excel: Claude can plan before acting, automatically restructure unstructured data, and handle multi-step changes in a single pass.
In PowerPoint: Claude helps generate entire slide decks from a description and reads the company's layouts, fonts, and design style so presentations stay on brand.

Safety and reduced hallucination

Although smarter, Claude Opus 4.6 still maintains strict safety standards through the Constitutional AI v3 system. This system helps the model achieve its lowest rate of misaligned behavior to date, only about 1.8/10 points in tests of inappropriate behavior.

Notably, Opus 4.6 has addressed the weakness of mistakenly refusing legitimate requests (over-refusals), giving a smoother experience. With the new reasoning structure, logic drift in multi-step reasoning chains has also dropped significantly, giving more stable results in complex tasks such as financial modeling.

Conclusion: A worthwhile investment?

With pricing unchanged from version 4.5, Claude Opus 4.6 remains a real bargain on the road to agentic AI. Still, you should treat it as a smart companion at work rather than letting it do everything in place of a human.
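As a rough illustration of the beta long-context surcharge this article mentions (requests above 200k tokens charged $10 per million input tokens), the input cost can be sketched as below. Two things are assumptions, not statements from this article: the $5 base rate is taken from the standard Opus 4.6 pricing quoted elsewhere on this page, and the higher rate is assumed to apply to all input tokens of a request once it crosses the threshold. The function name `beta_input_cost` is hypothetical.

```python
LONG_CONTEXT_THRESHOLD = 200_000  # tokens, per the article
STANDARD_INPUT_RATE = 5.00        # USD per 1M input tokens (assumed base rate)
LONG_CONTEXT_INPUT_RATE = 10.00   # USD per 1M input tokens above the threshold, per the article


def beta_input_cost(input_tokens: int) -> float:
    """Input cost in USD under the beta long-context surcharge described above."""
    # Assumption: crossing the threshold switches the whole request to the higher rate.
    if input_tokens > LONG_CONTEXT_THRESHOLD:
        rate = LONG_CONTEXT_INPUT_RATE
    else:
        rate = STANDARD_INPUT_RATE
    return input_tokens * rate / 1_000_000
```

Under these assumptions a 200,000-token request costs about $1.00 of input while a 500,000-token request costs about $5.00, so staying under the threshold matters for cost-sensitive workloads during the beta.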

Nam • 11 Feb, 2026
