What is Codex? OpenAI's rising star tool

Published on 15 May, 2026

Quick Summary

OpenAI Codex is an AI agent that runs as a desktop app on Windows and macOS, letting anyone assign tasks in natural language and receive complete results without knowing how to code. From automating reports and building websites to generating design mockups and controlling your computer in the background, Codex is expanding the definition of who can build things with AI. This article breaks down what Codex can do for you, how to install and get started, and compares it directly with Claude Code, Antigravity, and Cursor to help you choose the right tool for your needs.

Three million people now use Codex every week, a sixfold increase in just the first three months of 2026. That number tells you something: Codex is the rising star. OpenAI is turning it into an all-in-one tool, which means Codex is no longer just a playground for developers.

What is Codex? A tool that's not just for developers

Think about this scenario: you want to build a spreadsheet that automatically updates every week, or a small website to let customers book appointments, or simply a tool that summarizes your email reports each morning without opening dozens of tabs. Previously, these things required a developer. With Codex, you just type your request in plain English and wait for the result.

Codex is OpenAI's AI agent, launched in May 2025 and deeply integrated into the ChatGPT ecosystem. Its core difference from regular ChatGPT is that Codex doesn't just answer — it actually does the work through a code execution environment. You assign a task, Codex plans it out, executes each step, checks the result, and returns a finished product ready to use. No need to understand what code is, no need to monitor every command.

What Codex can do for you

Build apps or small websites from a description

You don't need to know HTML or JavaScript. Just describe what you need: "Create a simple appointment booking page with fields for name, phone number, and date/time selection, and send an email notification whenever someone books." Codex will build the entire interface, handle the logic, and guide you through publishing it online. A startup team in the US once shared that they completed in one weekend what would have previously taken an entire quarter — and it wasn't a team full of developers.

Automate repetitive tasks

This is where non-developer users will find the most value. For example: every week you have to consolidate revenue data from three different Excel files, merge them, and send a report to your manager. Codex can build an automated workflow that does this for you on a schedule and delivers the result without you ever opening your laptop. With the Automations feature launched in the April 2026 update, Codex can take on long-horizon tasks, pause, resume, and complete them over multiple days without needing to be reminded.
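
For readers curious about what sits behind such a workflow, the weekly consolidation above might come down to a short script like the one below. This is only a sketch under stated assumptions: the three file names, the "week" and "revenue" columns, and the figures are made up for illustration, and the data is inlined as CSV text so the example runs without any files. It is not what Codex itself generates.

```python
import csv
import io
from collections import defaultdict

# Hypothetical stand-ins for three exported spreadsheets. The names, columns,
# and numbers are illustrative assumptions, not real data.
SOURCES = {
    "north.csv": "week,revenue\n2026-W18,1200\n2026-W19,1350\n",
    "south.csv": "week,revenue\n2026-W18,800\n2026-W19,950\n",
    "online.csv": "week,revenue\n2026-W18,2100\n2026-W19,1900\n",
}


def consolidate(sources):
    """Sum revenue per week across every source file."""
    totals = defaultdict(float)
    for text in sources.values():
        for row in csv.DictReader(io.StringIO(text)):
            totals[row["week"]] += float(row["revenue"])
    return dict(sorted(totals.items()))


def render_report(totals):
    """Turn the weekly totals into a short plain-text report."""
    lines = ["Weekly revenue summary"]
    for week, amount in totals.items():
        lines.append(f"{week}: {amount:,.2f}")
    best = max(totals, key=totals.get)
    lines.append(f"Best week: {best}")
    return "\n".join(lines)


weekly_totals = consolidate(SOURCES)
report = render_report(weekly_totals)
print(report)
```

The merge and the report are kept as separate functions so that the delivery step (email, chat message, saved file) can change without touching the arithmetic.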

Generate images and prototypes directly in the app

Codex integrates image generation powered by GPT Image 2.0 directly inside the app. You can ask Codex to create interface mockups, product banners, or illustration assets for a document — all within the same workflow, without switching to another tool. For content creators, marketers, and solo founders, this is a genuine advantage: the entire journey from idea to finished output can happen in a single window.

Control your computer to work in the background

Since April 2026, Codex has been able to operate Mac applications using its own cursor: it views the screen, clicks, and types to complete tasks while you continue using the machine normally. A simpler way to picture it: you're in an online meeting while Codex has Figma open, editing a design and saving the file according to instructions you set earlier. Two things happen in parallel, and neither gets in the other's way.

How to get started with Codex

Codex requires installing the desktop app on Windows or macOS — it does not run directly in a web browser. The setup process is straightforward and takes just a few minutes.

Step 1: Go to openai.com/codex and download the version for your operating system. On macOS, there are two separate builds for Apple Silicon (M1 and later) and Intel chips. On Windows, there is a single universal build.
Step 2: Install the app and sign in with your existing ChatGPT account or an OpenAI API key.
Step 3: Choose a project folder you want Codex to work within (you can also link it to a GitHub repository), or skip this step if you only want to assign standalone tasks like creating files, generating images, or automating a workflow.
Step 4: Type your request in natural language, and be as specific as possible. Instead of "make me something about a report," try: "Create an Excel file summarizing monthly revenue from the data I provide, add a bar chart comparing each month, and highlight the month with the highest revenue."

Codex vs Claude Code, Antigravity, and Cursor from a non-technical user's perspective

If you're not a developer, the real question isn't "which tool is technically more powerful" — it's "which tool can I use right now without learning anything new." From that angle, these four tools are clearly different from one another.

Codex and Claude Code

Claude Code from Anthropic is Codex's most direct and formidable competitor. In terms of raw technical output quality, Claude Code currently leads the pack — producing cleaner code, tighter logic, and handling large, complex codebases more effectively. However, Claude Code is explicitly designed for developers: it runs in the terminal, requires command-line installation, and notably has no image generation capability. If you're not comfortable in a terminal, Claude Code is a barrier from the very first step. Codex, by contrast, offers a more user-friendly desktop interface, integrates image generation within the same workflow, and is noticeably more accessible to non-technical users.

Codex and Antigravity

Both require a desktop app, but their underlying philosophies are completely different. Codex is built around a "hand off the task and wait for results" model: you describe what you need, the agent works in an isolated cloud sandbox, and it returns a finished product without touching your machine (the background computer control described earlier is a separate mode). It suits people who want to automate workflows, create files, or build something without monitoring every step.

Antigravity works in the opposite direction: the agent runs directly on your machine, watches your screen, opens applications, and collaborates with you in real time while you work. If you want an AI colleague working alongside you — observing and reacting to what's happening on your screen — Antigravity is the better fit.

Codex and Cursor

Cursor is built on VS Code and targets developers who want to keep their familiar working environment intact. For non-coders, Cursor is largely inaccessible because the entire experience revolves around editing code inside an editor. Cursor excels at understanding an entire codebase and offers flexibility in choosing AI models, but those advantages are for developers — not for general users who need to automate workflows or build something from scratch.

In summary, from a non-technical user's perspective:

Codex: Friendly desktop interface on Windows and macOS, capable of generating images, well-suited for users who want AI as an automated workflow tool.
Claude Code: Best technical output quality, but developer-oriented and cannot generate images.
Antigravity: Agent works directly on your machine in real time, suited for users who want to collaborate with AI while they work.
Cursor: Best for developers keeping their VS Code workflow intact; not suited for general users.

Who is Codex best for?

If you're a content creator who wants to build a landing page for a campaign without hiring a developer, Codex fits. If you're a marketer who needs to automate weekly reports pulling from multiple data sources, Codex fits. If you're a solo founder who needs to ship a product fast without a technical team, Codex fits. If you're a teacher who wants to build a small quiz app for students without learning to code, Codex fits.

On the other hand, if you're a developer who needs granular control over every line of code in a large, complex codebase, Claude Code will deliver better output quality. Codex is the right tool for people who want fast results without needing to understand what's happening under the hood.

One practical limitation worth knowing: Codex currently has full support for Python, JavaScript, TypeScript, and Ruby. For tasks that don't involve code — like generating images, automating workflows, or creating documents — this language limitation has no impact on you.

The line between "can code" and "can't code" is fading

The question "do you know how to program?" is losing its weight as tools like Codex continue to evolve. What matters more now is whether you can describe clearly what you want — because that's exactly the thinking skill required to work effectively with Codex and similar AI agent tools.

If you want to try it today, start with something small and specific: ask Codex to create an Excel file consolidating data you currently process manually every week. That's the fastest test to evaluate whether Codex genuinely saves you time or not.

How to combine Codex and Claude Code with one plugin

Is anyone else using Codex and Claude Code side by side? I only recently discovered the Codex plugin for Claude Code, published by OpenAI itself. The useful part is not simply having another AI available. It is being able to call Codex from the current Claude Code session for a code review, an adversarial design challenge, or a separate delegated task without constantly switching tabs and sessions.

What makes the Codex plugin for Claude Code useful?

The openai/codex-plugin-cc plugin is intended for developers who already work in Claude Code and want to add Codex to that workflow. Instead of allowing both agents to edit the same file at the same time, you can assign clear roles: Claude Code implements and Codex reviews, or Claude Code keeps the main thread while Codex investigates an independent problem in the background.

The official plugin provides review commands such as /codex:review and /codex:adversarial-review, delegation through /codex:rescue, and job or session management through /codex:transfer, /codex:status, /codex:result, and /codex:cancel. Codex therefore becomes a collaborator inside the Claude Code workflow rather than a separate window.

It does not create a separate Codex runtime

The plugin uses the Codex CLI and Codex app server installed on the same machine. It also reuses the local authentication state, current repository checkout, and existing config.toml settings. Integration is straightforward, but each request still contributes to the user's Codex usage limits.

Requirements before installation

You need Node.js 18.18 or later and either a ChatGPT subscription, including Free, or an OpenAI API key. If Codex CLI is missing, /codex:setup can offer installation guidance. You can also install it manually with npm install -g @openai/codex and sign in with !codex login.

How to install the Codex plugin in Claude Code

Run these commands in Claude Code:

/plugin marketplace add openai/codex-plugin-cc
/plugin install codex@openai-codex
/reload-plugins
/codex:setup

The last command checks whether Codex is installed and authenticated. Once setup is complete, the Codex slash commands should appear in Claude Code, along with the codex:codex-rescue agent under /agents.

Try a background review first

A low-risk first run is /codex:review --background. Use /codex:status to monitor it and /codex:result to retrieve the final review. Multi-file reviews can take time, so background mode keeps Claude Code available for other work.

Three effective Codex and Claude Code workflows

The value of the plugin comes from role design. If both agents modify the same area without boundaries, the result may be conflicting edits, repeated analysis, and wasted context. The following workflows make ownership clearer.

Let Claude implement and Codex review

After Claude Code completes a feature, run /codex:review for a read-only review. It can inspect current uncommitted changes or compare the branch against a base with /codex:review --base main. Because Codex does not edit files in this mode, the developer keeps control of what is accepted.

For example, after Claude adds a payment flow across several modules, Codex can inspect logic errors, edge cases, and cross-file side effects. Claude Code can then evaluate the findings and apply only the changes that make sense.

Delegate an entire task to Codex

Use /codex:rescue for a problem that can be isolated, such as /codex:rescue --background investigate why the integration test is flaky. Claude Code can continue working on the interface or documentation while Codex investigates in the background. Rescue supports --background, --wait, --resume, and --fresh. Define the expected output and file scope before delegating.
A vague instruction to fix everything while Claude Code is also editing the repository can still create collisions. A good task has a specific goal, completion criteria, and a clearly owned part of the codebase.

Use adversarial review to challenge the project direction

/codex:adversarial-review is designed to question implementation and design decisions rather than merely find bugs. For example, /codex:adversarial-review --base main challenge the caching and retry design asks Codex to inspect assumptions, trade-offs, alternatives, and risks such as data loss, race conditions, rollback, or reliability.

This is where the two agents may appear to argue, but the debate only helps when a human sets a narrow question, requests evidence, and defines a decision rule. Otherwise, the review can become a chain of opinions with no practical outcome.

Transfer sessions and manage background jobs

/codex:transfer creates a persistent Codex thread from the current Claude Code session and prints a codex resume <session-id> command. It is useful when a discussion has grown beyond a short review and you want to continue directly in the Codex App or TUI without manually rewriting the context.

Monitor, retrieve, and cancel work

For background tasks, /codex:status shows progress, /codex:result returns the stored output and session ID, and /codex:cancel stops an active job. These commands prevent multi-agent work from becoming a black box. When a task drifts from its goal, canceling early is usually cheaper than waiting and starting over.

Watch for review loops and usage limits

Important: OpenAI explicitly warns that the optional review gate can create a long-running Claude/Codex loop and drain usage limits quickly. When enabled with /codex:setup --enable-review-gate, the plugin uses a Stop hook, which is an automated trigger that runs when Claude is about to finish its response, to start a targeted Codex review.
If it finds an issue, Claude's response is blocked so Claude can address it first. This can be valuable before shipping, but it should not be left unattended.

A practical safety checklist

Assign roles before running: one agent implements while the other reviews, or each owns a separate task.
Limit the scope by naming the branch, files, risk area, and completion criteria.
Use background mode for large reviews and check progress periodically.
Enable the review gate only while actively monitoring it, then disable it with /codex:setup --disable-review-gate.
Do not let Claude review all Codex output and then ask Codex to review every Claude revision without a clear stopping rule.
Use /codex:cancel when a task moves in the wrong direction.

How can Codex and Claude Code work well together?

The official OpenAI plugin offers a cleaner alternative to keeping Codex and Claude Code open in separate tabs or letting both agents edit the same file. Claude Code can remain the coordinator while Codex reviews, challenges a design, or owns a separate task.

A sensible starting point is one small /codex:review --background run, followed by status, result, and cancel. Try rescue, transfer, and the review gate only after the basic workflow is familiar. The two systems can complement each other well, provided a person still sets the boundaries, budget, and stopping point.
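
The "clear stopping rule" from the checklist can be made concrete with a small sketch. The code below does not call the plugin or either agent. The review and revise callables are hypothetical stand-ins, and max_rounds is an assumed budget, so treat it as an illustration of the idea rather than how the review gate is implemented.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class LoopResult:
    rounds_used: int
    stopped_because: str
    open_findings: List[str] = field(default_factory=list)


def run_bounded_review(
    review: Callable[[int], List[str]],
    revise: Callable[[List[str]], None],
    max_rounds: int = 3,
) -> LoopResult:
    """Alternate review and revision, but never beyond max_rounds reviews."""
    findings: List[str] = []
    for round_number in range(1, max_rounds + 1):
        findings = review(round_number)
        if not findings:
            return LoopResult(round_number, "review came back clean")
        if round_number == max_rounds:
            break  # budget spent: leave the remaining findings to a human
        revise(findings)
    return LoopResult(max_rounds, "round budget exhausted", findings)


# Simulated agents (hypothetical): each revision clears one finding.
pending = ["missing null check", "retry without backoff"]


def fake_review(round_number: int) -> List[str]:
    return list(pending)


def fake_revise(findings: List[str]) -> None:
    pending.pop(0)  # pretend the implementer fixed the first finding


result = run_bounded_review(fake_review, fake_revise, max_rounds=3)
print(result)
```

The point of the design is that the loop has two exits, a clean review or an exhausted budget, and the second one returns the unresolved findings instead of silently starting another round.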

Nam•

14 Jul, 2026

GPT-5.6 vs Claude Fable 5: What Is New?

Sol, Terra, and Luna make GPT-5.6 look more like a product family than a single model. The naming also signals what OpenAI is trying to change: users no longer have to choose only between an expensive flagship and a much smaller model. Instead, they get three capability tiers designed for different workloads. The important caveat is that GPT-5.6 is currently in limited preview, and OpenAI says it is not available in ChatGPT during this preview period.

On the other side, Anthropic positions Claude Fable 5 as a frontier model for reasoning, software engineering, scientific research, and long horizon agentic work. The useful question is therefore not simply which model is smarter. It is which product architecture helps a team complete real work with predictable quality, latency, and cost.

What GPT-5.6 actually is

According to OpenAI's preview announcement, GPT-5.6 consists of Sol, Terra, and Luna. Sol is the flagship and most capable option, Terra is a strong lower cost model, and Luna is the fastest and most cost efficient member of the family.

The important change is how OpenAI divides demand into three tiers. A research team might use Sol for a difficult reasoning problem, a product team might run most daily work on Terra, and a high volume system might use Luna for thousands of short requests. This looks more like an infrastructure strategy than the launch of a single new chatbot.

Availability matters: OpenAI says GPT-5.6 is not available in ChatGPT during the preview. An experience in an API, developer tool, or partner platform should not be treated as the final ChatGPT experience.

Sol is designed for difficult, extended work

Sol is positioned as the strongest GPT-5.6 model for deep reasoning, complex coding, and long multi step tasks. A software team might ask it to understand a repository, identify the cause of a bug, propose a minimal patch, and write regression tests. Sol's value is not answering a short question quickly.
It is maintaining the objective while working through a longer chain of decisions.

OpenAI also highlights stronger cyber capability as reasoning increases. That can be useful for authorized security testing and vulnerability analysis, but it also makes access controls, logging, sandboxing, and human approval more important.

Terra aims for the practical middle

Terra targets the broadest category of work: document analysis, content production, application development, research synthesis, and operational support. If Sol is the specialist called for the hardest problem, Terra is the strong team member expected to work throughout the day without making every request unnecessarily expensive.

A marketing team could use Terra to read market reports, extract insights, build an outline, and draft several content variants. A development team could use it for code review, test generation, and tickets with a clear scope. This tier could become the default if its real world quality remains consistent.

Luna prioritizes speed and scale

Luna is designed for low latency and lower cost. Classification, conversation summaries, field extraction, drafting, and ticket routing do not always require the strongest model. In these cases, response time and total operating cost matter more than maximum reasoning capability.

Fast does not mean suitable for everything. If a task requires source verification, a long plan, or a code change with a large blast radius, a team should move it to Terra or Sol instead of forcing Luna beyond its intended role.

Claude Fable 5 takes a different route

Anthropic presents Claude Fable 5 as a frontier model for reasoning, software engineering, vision, scientific research, and long horizon agentic work. Instead of emphasizing three product tiers in one generation, Anthropic's message focuses on the capability of a powerful model working inside the Claude ecosystem.

This difference changes deployment decisions.
With GPT-5.6, an engineering team might build a router that sends each request to Sol, Terra, or Luna. With Fable 5, the focus may be on optimizing prompts, tools, context, and reasoning budgets around one primary model. Neither approach is universally better because the answer depends on workload and operational maturity.

A fair comparison: Do not run one prompt and declare a winner. Build a test set covering short tasks, long reasoning, coding, extraction, and recovery from errors. Measure accuracy, latency, the number of human corrections, and the total cost of a completed task.

Coding and agentic work depend on the surrounding tools

Both GPT-5.6 Sol and Claude Fable 5 target complex software work, but the practical experience depends heavily on the system around the model. The ability to read a repository, execute commands, observe results, and correct mistakes can matter as much as a benchmark score. For OpenAI workflows, the Codex page is a useful starting point for understanding how a model participates in coding work.

Fable 5 may be attractive to teams already invested in Claude and long running agentic workflows. Read our Claude Fable 5 coverage for more context on Anthropic's positioning and the types of work it targets.

What early forum experience tells us

Early discussions on Reddit and developer communities focus on how different Sol, Terra, and Luna feel in real work. Some users describe Sol as the better fit for multi step tasks, Terra as the practical option for routine work, and Luna as the interesting choice for speed. These observations match OpenAI's positioning, but they do not establish a precise quality gap.

Forum reports are useful because they reveal the questions real users care about. However, they are self selected evidence. People may use different prompts, access levels, integrations, and preview versions.
A result from a developer platform does not guarantee the same result when a model eventually appears in ChatGPT.

Early positives

The three tiers make it easier to understand which model belongs to which workload.
Luna creates a clear expectation of low latency for high volume systems.
Terra could become a default if it delivers stable quality at a practical cost.
Sol is expected to be stronger for coding, long reasoning, and tasks with several verification steps.

Open questions

How large the practical quality gap between Sol and Terra will be on common workloads.
The total cost after retries, corrections, and human review are included.
How Luna behaves with long prompts and many constraints.
Whether performance remains stable as GPT-5.6 expands beyond preview access.

Forum reports are not benchmarks: Community experience should help you choose test cases, not make a production purchasing decision by itself.

Comparing GPT-5.6 and Fable 5 by workload

Writing and document analysis

Terra appears positioned for most document work because it balances capability and cost. Fable 5 may be attractive when documents are long, questions are complex, and the model must maintain an argument across a large context. A useful evaluation should score citation accuracy, structural consistency, and how much editing is required before publication.

Software development and debugging

Sol and Fable 5 are both candidates for difficult coding tasks. A representative test should include reading existing code, identifying the root cause, producing a minimal fix, writing tests, and explaining risk. Asking a model to create an isolated function from scratch does not reflect how well it works in a real repository.

High volume processing

Luna has the clearest positioning advantage when speed and cost dominate. At thousands of extraction or classification requests per day, a small difference in price and latency can have a large effect.
Fable 5 may be unnecessarily expensive for a workload that only needs short, structured outputs.

Research and long reasoning

Sol and Fable 5 should be compared with tasks that have verifiable outcomes rather than open questions that merely sound impressive. Give both models the same research material and ask them to identify assumptions, detect contradictions, propose an experiment, and explain what evidence is missing. The better model is the one that helps users discover errors faster, not the one that writes the longest answer.

Should you choose Sol, Terra, Luna, or Fable 5?

If you want maximum capability inside the OpenAI ecosystem, Sol is the first model to test. If you need a strong model for regular use, Terra has the more practical position. If your workload contains many short and repetitive tasks, Luna could reduce operating cost. Fable 5 remains relevant for teams invested in Claude or focused on long reasoning and agentic work.

Because GPT-5.6 is still in preview, replacing an entire production workload would be premature. Run the models in parallel on real but sanitized data, record failures, and use the same criteria for every candidate.

A test plan you can use now

Select 20 tasks that represent real work, including easy and difficult cases.
Run each task on Sol, Terra, Luna, and Fable 5 when access allows.
Score accuracy, response time, total cost, and required human correction.
Track severe failures separately instead of relying only on averages.
Choose a model for each workload category rather than forcing one model to do everything.

Is GPT-5.6 worth switching to now?

The most important change in GPT-5.6 may not be Sol's raw capability. It is OpenAI's decision to turn one model generation into three operational tiers. That could help organizations control cost, but only if they can classify workloads and route requests intelligently.

The practical next step is to build a small benchmark from your own data.
If Sol wins difficult tasks, Terra is good enough for routine work, and Luna handles high volume requests reliably, the three tier architecture has real value. If Fable 5 remains more consistent on long reasoning, a multi model strategy may still be better than committing to one provider.
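
The test plan above can be turned into a small scoring script. The sketch below is one possible shape, not a measured result: the Run fields, the 0.8 accuracy bar, and every number in the sample data are assumptions chosen for illustration, and no model API is called. It scores each model per workload category, counts severe failures separately from the averages, and then picks the cheapest qualifying model for each category, which is effectively a first routing table.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import mean


@dataclass
class Run:
    category: str          # workload category, e.g. "coding"
    model: str             # label only; nothing is called in this sketch
    correct: bool
    seconds: float
    cost: float            # total cost of the completed task, retries included
    severe_failure: bool = False


def summarize(runs):
    """Group runs by (category, model) and compute the plan's scores."""
    grouped = defaultdict(list)
    for run in runs:
        grouped[(run.category, run.model)].append(run)
    summary = {}
    for key, items in grouped.items():
        summary[key] = {
            "accuracy": mean(1.0 if r.correct else 0.0 for r in items),
            "avg_seconds": mean(r.seconds for r in items),
            "avg_cost": mean(r.cost for r in items),
            # Counted on its own so an average cannot hide a bad outcome.
            "severe_failures": sum(r.severe_failure for r in items),
        }
    return summary


def choose_per_category(summary, min_accuracy=0.8):
    """Cheapest model per category that meets the accuracy bar and has
    no severe failures. None means no candidate qualified."""
    choice = {}
    for (category, model), stats in summary.items():
        choice.setdefault(category, None)
        if stats["severe_failures"] or stats["accuracy"] < min_accuracy:
            continue
        current = choice[category]
        if current is None or stats["avg_cost"] < summary[(category, current)]["avg_cost"]:
            choice[category] = model
    return choice


# Made-up sample results, purely to exercise the functions.
runs = [
    Run("coding", "Sol", True, 40, 0.90),
    Run("coding", "Sol", True, 50, 1.10),
    Run("coding", "Luna", False, 5, 0.05, severe_failure=True),
    Run("coding", "Luna", True, 6, 0.05),
    Run("extraction", "Sol", True, 20, 0.50),
    Run("extraction", "Sol", True, 22, 0.50),
    Run("extraction", "Luna", True, 2, 0.02),
    Run("extraction", "Luna", True, 3, 0.02),
]

summary = summarize(runs)
routing = choose_per_category(summary)
print(routing)
```

Required human correction is left out here for brevity. It could be added as one more field on Run and averaged the same way.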

Liên•

9 Jul, 2026

How to control Codex from your phone with ChatGPT app

You're out and suddenly remember a small detail in your project that needs fixing. You don't have to open your laptop or remote desktop in. With the right connection set up, the ChatGPT app on your phone can become a control panel for Codex, while your computer at home or the office keeps running the actual code.

ChatGPT app doesn't run Codex on your phone

The easiest thing to misunderstand is thinking Codex is running directly on your phone. In reality, your phone only sends prompts, replies, approvals and follow-up messages, while the actual working environment lives on your Mac or Windows machine running Codex. In other words, the ChatGPT app is the remote controller, and the host machine is where your repo, terminal, credentials, plugins, MCP servers and other tools actually live.

This makes complete sense because codebases typically live on your development machine, not your phone. When you send a request like fixing a TypeScript error, running tests or checking a diff, Codex processes it inside the selected project on the host and sends results back for you to review. If you want to understand the foundation before using remote access, check out What is Codex and how to use Codex to get a clear picture of where this tool fits in your workflow.

What do you need before connecting ChatGPT app to Codex?

According to the latest Codex documentation from OpenAI, the ChatGPT app supports controlling Codex on both macOS and Windows, though Linux is not supported yet. Notably, this feature works with all ChatGPT account types, including Free and Go, so no paid plan is required. You only need to make sure you're signed into the same account or workspace on both devices: ChatGPT mobile (latest version on iOS or Android) and Codex (latest version on your host machine, online and running).

Your host machine must stay on and Codex must keep running for the entire time you're controlling it remotely.
If the machine goes to sleep, loses its connection or Codex is closed, the connection from your phone drops immediately and any tasks in progress may be interrupted.

What's worth noting is that the entire setup process starts from Codex App on the host machine and is surprisingly simple: just scan a QR code and you're done. Inside Codex App, select the mobile setup option in the sidebar, scan the QR code with your phone, then complete the confirmation in the ChatGPT app. For enterprise workspaces, an admin may need to enable Remote Control permissions before you can connect. This QR code grants control over your computer, so keep it private and never share it with anyone to avoid unauthorized access to your machine.

To summarize, connecting ChatGPT app to Codex is straightforward:

Host machine must be online and running Codex
ChatGPT app and Codex must be signed into the same account or workspace
Generate the QR code in Codex on the host and complete setup on your phone
MFA, SSO or passkey requirements may still apply depending on your workspace

What can you do once connected?

Once the host appears in Codex on your phone, you can start a new thread inside a project on the host or pick up an existing one. This is where the experience becomes genuinely useful: you can send follow-ups, answer Codex's questions, approve commands, view output, check diffs, review test results and even receive notifications when a task finishes or needs your attention.

A real example: you're at a coffee shop and remember the login form has a validation bug. You open the ChatGPT app, select the connected host, and ask Codex to check the auth flow, fix the email validation error and run the related tests. Codex works directly on the repo sitting on your host machine, while you review the results, approve actions when needed and decide whether to request further changes.
This is also why people are starting to think of Codex and other AI-powered IDEs as a colleague working inside a real environment, not just a code suggestion tool anymore. Its strength lies in reading files, running commands, editing code and maintaining context across multiple rounds of back-and-forth.

Limitations to keep in mind when using Codex from your phone

Remote control depends entirely on the host machine: if your computer goes to sleep, loses its connection, closes Codex or gets signed out of the workspace, your phone loses its working environment immediately. That said, if only your phone's connection drops while Codex is mid-task, Codex will continue running on the host and notify you once your phone reconnects, so there's less to worry about if your phone suddenly loses signal during a running task. One more thing to note: on Windows, tasks using Computer Use require an appropriate foreground session, so this setup is not a complete replacement for sitting directly in front of your machine.

It also helps to draw a clear line between handing off a focused task and reviewing large changes. Your phone works well for small bugs, running tests, quick questions about a specific file, reviewing short tasks or checking task status. However, anything requiring a high level of attention should still be reviewed on a larger screen to avoid missing details.

How to use it effectively in practice

The most effective approach is to hand off tasks with a clear scope and specific expected outcomes. Instead of saying "fix the login", describe exactly where the error occurs, what the expected behavior should be after the fix, which tests to run and which parts of the codebase to leave untouched. Codex performs better when it knows the boundaries of a task, especially since working remotely from a phone means each feedback loop takes longer than when you're sitting right at your machine.
A clean working rhythm might look like this: describe the task in detail, whether it is small or medium-sized, ask Codex to read the relevant files, let it propose a solution, approve only when necessary and wait for the result report. Once you get used to this rhythm, you'll find that idle time away from your desk can be used for real work, while the final decision stays firmly in your hands.

Compared to Claude Code Remote and Telegram bot

There are many ways to control an AI coding agent from your phone, though the three most common approaches each serve a different need.

| Criteria | ChatGPT app + Codex | Claude Code Remote | Telegram + Codex |
| --- | --- | --- | --- |
| Natural conversation | ✅ Excellent | ✅ Good | ❌ Requires exact syntax |
| Granular control | Moderate | Highest | Low |
| Connection stability | Stable | Stable | Frequent drops |
| Mobile UI | Well optimized | Not fully optimized | Uses existing Telegram app |
| Initial setup | Easy, scan QR | Easy | Requires manual bot configuration |
| Computer must stay on | ✅ Required | ✅ Required | ✅ Required |

Claude Code Remote Control offers the strongest level of control: you get direct terminal output, can intervene mid-task and generally feel much closer to what the agent is doing. That said, the UI on small phone screens isn't fully optimized yet, and some interactions are still difficult to perform without a physical keyboard.

Telegram bot has the advantage of not requiring a separate app and is easy to get started with, but the real-world experience has clear limits. It's prone to slowdowns and occasional silent disconnections mid-task, and because it lacks genuine AI context, anything slightly more complex than a simple command quickly falls apart, forcing you to type precise instructions rather than describe what you need naturally.

ChatGPT app + Codex sits at the best balance point for most users: smooth enough, smart enough, quick to set up with a QR scan and no new syntax to learn before you can get to work.
Connecting ChatGPT app to Codex doesn't turn your phone into a development machine — it turns your phone into a control surface for a development machine that's already ready to work. As long as the host stays on, permissions are configured correctly and the task is scoped tightly enough, this is the most practical way to handle real coding work when you're away from your laptop.

Nam • 22 Jun, 2026

Microsoft launches 7 new AI models to challenge OpenAI

Microsoft just dropped seven new AI models at Build 2026, with MAI-Thinking-1 boasting 35 billion active parameters and trained entirely on clean data. For the first time, the software giant is openly challenging the position of its own strategic partner, OpenAI, on the AI model battlefield.

MAI-Thinking-1 and Microsoft's reasoning ambitions

The centerpiece of Build 2026 was MAI-Thinking-1, Microsoft's first reasoning AI model developed entirely in-house. With approximately 35 billion active parameters, the model is designed to handle multi-step reasoning tasks, work with long contexts, and support complex coding, all at a lower cost than many large-scale AI models currently available.

The most notable claim is that Microsoft trained MAI-Thinking-1 on clean data without using distillation from third-party AI models. In other words, this is a clear statement that Microsoft has the independent AI research capability to build competitive models without "borrowing" knowledge from GPT or any other model.

According to Microsoft's published evaluations, MAI-Thinking-1 achieves competitive performance on coding benchmarks and is rated on par with many leading AI models in blind evaluation tests. The 35-billion parameter count also signals that Microsoft is prioritizing efficiency over raw scale, as many competitor models have significantly more parameters but may not necessarily deliver better output quality.

From coding to voice: a complete AI ecosystem

Beyond reasoning, Microsoft introduced six additional AI models to build a complete AI ecosystem serving both individual users and enterprises. From coding and image generation to voice synthesis, every piece of the puzzle now has a dedicated model.

Smarter coding with MAI-Code-1-Flash

For developers, MAI-Code-1-Flash is significant news. This model specializes in code generation and software development support, optimized for real-world programming tasks.
More importantly, it will be integrated directly into GitHub Copilot and Visual Studio Code, two tools used daily by millions of developers. This means code suggestions and automated coding experiences will be significantly upgraded within familiar development environments.

Images and voice: the missing pieces

In the creative content space, Microsoft announced MAI-Image-2.5 alongside MAI-Image-2.5-Flash. These are next-generation image creation and editing models, with the Flash version optimized for fast response times, making it suitable for real-time applications like live photo editing or on-demand illustration generation.

In the audio domain, Microsoft introduced two important models:

- MAI-Voice-2 with more natural voice synthesis capabilities and support for additional languages
- MAI-Transcribe-1.5 for speech-to-text conversion with significantly faster processing speeds than the previous generation

Additionally, Microsoft has developed optimized variants specifically for the Microsoft Foundry platform, helping enterprises easily build and deploy their own AI applications.

The strategy to reduce OpenAI dependence

Where Microsoft was previously seen mainly as an infrastructure partner and deployment platform for OpenAI, Build 2026 shows the company is steadily acquiring all the essential components of a full AI ecosystem. Microsoft now has its own reasoning model, coding model, image generation model, voice synthesis model, and speech recognition model, all connected directly to the Azure, Copilot, and Microsoft Foundry ecosystem.

This strategy gives Microsoft greater autonomy in developing core technology while reducing risk from dependence on external partners. More specifically, owning proprietary AI models allows Microsoft to control its product roadmap, optimize operational costs, and customize models for specific service needs without waiting for or negotiating with third parties.

Where does the AI model race go from here?
The simultaneous launch of seven new AI models shows Microsoft is investing heavily in foundational technologies to compete directly with major players like OpenAI, Google, and Anthropic. When OpenAI's largest partner decides to build its own AI models, that is the clearest signal that the AI race has entered a new phase where no one wants to place the future of their technology in someone else's hands. For developers and enterprises, now is the time to closely watch Microsoft Foundry and the Azure AI ecosystem, as tools that were previously only available through OpenAI will soon appear within Microsoft's familiar ecosystem. Build 2026 may well be remembered as the moment Microsoft officially declared its vision for an independent, comprehensive AI ecosystem with its own distinctive identity.

Nam • 4 Jun, 2026

Related Articles

How to combine Codex and Claude Code with one plugin

GPT-5.6 vs Claude Fable 5: What Is New?

How to control Codex from your phone with ChatGPT app

Microsoft launches 7 new AI models to challenge OpenAI