What is Claude Project? How to use it effectively

Published on 28 April, 2026

Quick Summary

Claude Memory remembers who you are, while Project remembers what you're working on — and that's exactly why these two features can't replace each other, even now that Memory has been opened up for free. This article walks you through setting up a Claude Project from scratch: writing custom instructions so Claude understands your style and expectations, uploading documents to the knowledge base, combining Projects with Skills, and organizing separate Projects for different clients or tasks. It also clearly explains how a Project on Claude.ai differs from a Project in Cowork, so you know which one to use for the right job.

Claude Memory is now free for all users, which means Claude can automatically remember your name, profession, and a few preferences from previous conversations. That sounds useful enough — until you're running three projects in parallel, each with its own set of documents, writing styles, and requirements. As context builds up, Memory won't help much at all. That's when Project becomes the thing you actually need.

How are Memory and Project different?

Claude Memory works like Claude's personal knowledge about you — it records general information that carries across every conversation: who you are, what you do, what communication style you prefer. This is an identity layer, not a work context layer.

Project is a specialized context layer for each specific piece of work. You can have one Memory about yourself and ten different Projects, each containing its own documents, its own instructions, and its own conversation history — completely independent from one another.

Think of it this way: Memory is like your ID card, helping Claude always know who you are. Project is like a separate work folder for each job, and when you open a specific Project, Claude knows exactly the context for that work without mixing it up with anything else.

A practical example: Memory helps Claude know you're a marketing professional, but the "Client A Website" Project holds the marketing materials, project brief, and specific technical decisions for that job — things Memory could never store because they belong to the project, not to you.

What is a Claude Project?

A Project is a dedicated workspace inside Claude where you can store documents, write custom instructions, and keep conversation history organized by topic or task. Instead of every conversation starting as a blank slate, a Project lets Claude come in already knowing the context of what you're working on before you type your first message.

If Memory is what Claude knows about you, then Project is what Claude knows about the specific work you're doing — and the combination of both is what creates an AI experience that genuinely understands you.

Limits by plan

Free accounts can create up to 5 Projects. Paid plans (Pro, Max, Team, Enterprise) get unlimited Projects, plus RAG functionality — meaning when you upload enough documents to exceed the context window limit, Claude automatically switches to intelligent search mode, extending capacity up to 10 times without any drop in response quality. Team and Enterprise accounts also include Project sharing and member permission settings.

How to set up a Project so Claude understands you better

Step 1: Write custom instructions

This is the most important part — and the part most people skip. Custom instructions are a passage you write once, and Claude reads it before every conversation inside that Project. A good set of instructions isn't a long list of rules; it's a concise picture of who you are and what you expect.

Example instructions for a content creator:

With instructions like these, every time you say "write an article about Claude Opus 4.7," Claude doesn't need to ask about format, length, or tone — it already knows.

Example instructions for a developer:

Step 2: Upload documents to the knowledge base

Projects let you upload documents in PDF, DOCX, CSV, TXT, HTML, and many other formats, with a maximum size of 30MB per file. Claude will read and reference these documents in every conversation within the Project.

What to upload depends on how you're using the Project:

Writing projects: Your style guide, sample articles you want Claude to learn from, SEO keyword lists, product or service information you frequently reference.
Research projects: Reference materials, background reports, a list of trusted sources, notes from previous reading sessions.
Development projects: API documentation you're using, the project README, recorded architecture decisions, a log of bugs encountered and how they were resolved.
Personal projects: Information about yourself — your goals, schedule, work habits, and current focus areas — so Claude can give you more relevant advice.

Can you add a Skill to a Project?

Yes — and this is how many advanced users are combining the two features. A Skill in Claude is a packaged set of instructions that teaches Claude how to handle a specific type of task, such as a skill for writing SEO-optimized articles, a skill for analyzing code, or a skill for summarizing legal documents.

When you enable a Skill inside a Project, Claude has both the specific context of your work (from the knowledge base and custom instructions) and the specialized process (from the Skill). The two layers complement each other rather than conflict — the Skill defines how to do something, the Project defines the context it's done in.

A practical example: if you have a Skill for writing in the AIDA framework and enable it inside your content Project, Claude will automatically apply the structure from the Skill while also drawing on the style guide, keyword list, and sample articles you've uploaded to the Project — without you needing to explain any of it.

Three of the most effective ways to use Projects

A "about me" Project to use Claude as a personal assistant

This is a use case most people don't think of, but it delivers real value. Create a Project called "About Me" and fill it with information Claude needs to support you well: your current job, active projects, short- and long-term goals, skills you're learning, your working style, and even weaknesses you're trying to improve.

With this Project in place, you can ask very specific questions like "Given my schedule this week, what should I prioritize learning?" or "Suggest how to balance project A and project B" — without having to explain who you are or what situation you're in from scratch each time.

A Project per client or per initiative

If you work across multiple clients or projects in parallel, each Project becomes an independent workspace. Upload the project brief, client information, key conversations from before, and specific requirements. When you need to work on something for that client, open the corresponding Project and Claude immediately understands the context — no recap needed.

A learning and research Project

When studying a new subject — AI agents, behavioral economics, programming — create a dedicated Project for it. Upload the materials you're reading, your notes, and a running list of unanswered questions. Claude inside this Project becomes a guide who knows exactly where you are in your learning journey and can pick up from where you left off last time.

Frequently asked questions about Claude Projects

How is a Project in Claude different from a Project in Cowork?

This is the most common source of confusion because Anthropic uses the word "Project" for two different things. A Project in Claude.ai (in the browser) is a chat space with memory and a knowledge base — you upload documents, write instructions, and Claude retains that context across every conversation inside it. But it's still just chat, and Claude cannot create actual files, run code, or automate tasks.

A Project in Cowork (the desktop app) is the next level: Claude doesn't just remember context — it actually does the work, including creating Word, Excel, and PDF files, running code, controlling the browser, and scheduling automated tasks. If a Claude.ai Project is "an assistant with better memory," a Cowork Project is closer to "an AI employee who gets things done for you."

A practical example: in a Claude.ai Project you can ask "analyze this month's revenue report" and Claude replies in text. In a Cowork Project, Claude reads your actual Excel file, produces a new analysis table, and saves it as a PDF — no copying and pasting required.

If you need advice, writing help, and context-rich conversation, a Claude.ai Project is enough. If you want AI to actually process work and produce output files, Cowork Project is the right choice.

How long should custom instructions be?

Five to eight sentences is usually enough — and more effective than a 500-word description. Claude reads concise, clearly stated instructions best, not overly detailed ones that risk contradicting each other.

Example of a short, effective instruction: "I'm a content manager for an AI website, writing for non-technical readers, using approachable English, default article length 1,000–1,200 words in HTML format."

How should I name my Projects for easy management?

Avoid generic names like "Project 1" or "Work" — as your number of Projects grows, you won't remember which is which. Name Projects by purpose and time period so they're easy to find later.

Good examples: "AIDA Content — April 2026," "Next.js website for Client ABC," "AI agent research — Q2 2026."

When should I delete or update documents in a Project?

Outdated or irrelevant information will introduce noise into Claude's responses because it will keep trying to reference things that are no longer accurate. Review your knowledge base every four to six weeks, remove anything that's expired, and add newer materials — especially when the project direction has changed significantly.

Example: if you're changing focus because an earlier direction is now outdated, remove the old documents and upload the updated ones so Claude is working from the right foundation.

Is a Project actually better than a regular chat?

The real difference isn't any single technical feature — it's accumulation over time. A new chat is a blank page, while a Project built up consistently over three months produces noticeably better results because every document and instruction you add is another layer of context helping Claude understand you and your work more deeply.

Example: after three months using a research Project on AI, Claude knows which materials you've read, which direction your research is heading, and what kind of reasoning you tend to use — making its answers far more specific and connected than if you asked the same question in an empty chat. And it gets even more useful when it can synthesize everything you've learned and accomplished over those three months.

Discussion (0)

No comments yet. Be the first!

Comparing Hermes Agent, OpenClaw, and Claude Cowork

Hermes Agent, OpenClaw, and Claude Cowork are all called AI agents because they do more than answer questions. They can break an objective into multiple steps, call tools, read data, and produce a complete result. However, comparing these three products using only a feature table can easily lead to the wrong choice. Hermes Agent is designed as an agent that can learn how you work. OpenClaw is designed as a personal assistant that is always available through messaging channels, while Claude Cowork is intended for users who want to delegate office work in natural language within an environment managed by Anthropic. Therefore, the important question is not which tool is the most powerful, but how much you want to manage yourself and where you want the agent to appear in your daily workflow. Three products with different designs The differences among these three AI agent tools do not lie only in the model that performs the work. They also come from the framework surrounding the model, which manages tools, memory, access permissions, and the execution loop. This concept is explained in detail in our article What is an agent harness?, which helps explain why three products that are all called AI agents can behave so differently. Hermes Agent prioritizes a learning loop and execution environments The notable point about Hermes is that skills are not merely a list of skills that have already been installed. After completing a task, the agent can extract a useful process, save it, and improve it the next time. Our article What is Hermes Agent? explains this self learning mechanism separately. The accumulated value of this mechanism grows over time when users have recurring tasks such as analyzing projects, monitoring information sources, standardizing reports, or operating a chain of internal tools. Hermes also supports several types of sandboxes, including local execution, Docker, SSH, Singularity, and Modal. A sandbox is an isolated environment in which the agent executes commands and works with files. This flexibility lets users choose among speed, control, and isolation, but it also requires an understanding of infrastructure, access permissions, and secret management. OpenClaw uses the Gateway as its coordination center In OpenClaw, the Gateway is the control layer between the agent, devices, and communication channels. A message can become a request for the agent to read a calendar, process a file, call a service, or respond in the correct conversation. This approach feels natural for people who want to message an assistant from their phone without needing to remember where the server is running. OpenClaw is most suitable when the agent needs to react as soon as work appears, without requiring the user to open a computer or enter a separate application. Instead of waiting for you to start a work session, it remains available in the messaging channels you already use and begins processing as soon as a message arrives or a configured event is triggered. Claude Cowork provides a managed workspace Cowork reduces the amount of infrastructure that users must manage themselves. In the desktop application, users can grant access to a local folder and ask Claude to read, organize, or create files. With remote sessions, work takes place in an isolated environment on Anthropic servers, which suits long tasks that do not require a personal computer to remain active continuously. In return, the level of customization and control over the execution layer is not as broad as in a self hosted project. Cowork is better suited to people who want quick results within the Claude ecosystem and do not want to maintain a server or design a Gateway themselves. How the memory of the three tools works differently Memory in an agent should not be understood simply as storing every conversation. A useful system must know which information is worth retaining, which information matters only in the current session, and when old data should be retrieved. If it stores too little, the agent must ask the same questions repeatedly. If it stores too much, costs will certainly increase and sensitive data can easily be used in the wrong context. Hermes stands out by combining persistent memory with skills that can improve. Memory records preferences and context, while a skill records how to complete a type of task. These two layers make the agent feel as if it increasingly understands the user, but quality still depends on whether the user reviews what has been stored and removes processes that are no longer appropriate. OpenClaw runs across several channels at once, and that is also its most complicated aspect. Remembering conversation content is only one part of the problem. The harder issue is distinguishing who is speaking, which channel they are using, and which scope the work belongs to. A command sent in a company Slack group should not automatically pull in private context previously discussed on Telegram. If session configuration and identity policies should be established clearly from the beginning, even a strong model cannot rescue a system when everything remains ambiguous. Cowork limits context to each work session, reads only the files for which you grant access, and uses only the connections you allow. For people who are not accustomed to building systems, this approach is easier to control because the boundaries of each task are relatively clear. However, clear boundaries do not mean automatic understanding. You still need to explain what you want, what completion should look like, and where the data should come from. Cowork cannot infer your company context unless you actively provide it. Which type of work each tool automates best Hermes includes web tools, terminal access, MCP, scheduled runs, and subagents. MCP is a connection standard that helps an agent communicate with external data sources or applications through a consistent interface. By combining MCP with skills, users can turn an experiment into a repeatable process, such as collecting data each morning, analyzing changes, and sending a summary. OpenClaw is strong at workflows that begin with a message or an event. For example, a user can send an invoice to a private channel, after which the agent extracts the information and updates a storage system. Another example is receiving a service alert, gathering additional diagnostic data, and returning a summary directly to the operations group. Its value comes from reducing the gap between the moment a need appears and the moment the agent begins acting. Cowork suits structured office outputs. It can research a topic, synthesize data, create a document, and continue revising it according to feedback. Long running or scheduled tasks help Cowork move beyond short question and answer interactions. Even so, organizations need to inspect each connector and its access permissions before allowing the agent to work with real data stores. When deep integration with private infrastructure is required, Hermes and OpenClaw generally provide more room. When the priority is reducing the time from a request to a finished document, Cowork usually has an advantage. This is the difference between a platform intended for assembly and a product that has already been packaged. How secure are these three AI agents? There is no simple answer to the question of which one is safer because the security risks of each tool come from completely different areas. Hermes Agent: Self hosting does not automatically mean safety. The greatest risk comes from automatically generated skills because, in essence, they are pieces of code that the agent writes and then runs by itself. If they are not reviewed before scheduled execution, a skill with terminal access or permission to send data externally can do things without your knowledge. In addition, API keys and sensitive folders should not appear in prompts or be mounted directly into a sandbox when the skill does not actually need them. OpenClaw: The more channels you connect, the wider the attack surface becomes. The point most easily overlooked is sender authentication. If the Gateway trusts only a display name or a channel that has not been properly secured, a compromised messaging account may be enough for someone to issue commands to your agent. The list of people allowed to send commands and the permissions of each bot need to be reviewed whenever you add a new channel. Claude Cowork: The most concerning risk is prompt injection, which occurs when the agent reads a document or webpage containing hidden instructions intended to redirect it away from your original request. Anthropic provides safeguards and asks for confirmation before sensitive actions, but those measures do not replace your own review of the results or the need to avoid granting broader permissions than the task actually requires. Note: With any agent, do not grant permission to delete files, send external messages, or perform sensitive transactions. Start with read only mode, enable complete logging, and retain human approval for actions that require human judgment. Should you choose Hermes Agent, OpenClaw, or Claude Cowork? Every tool has its own strengths and weaknesses, so selecting the most suitable one depends on the user and the work that needs to be done. Choose Hermes Agent when you want the agent to understand how you work increasingly well Hermes suits developers, researchers, and technical teams that want an agent to learn their own processes and run on flexible infrastructure. It is particularly worth considering when tasks recur often enough for skills to create accumulated value. You need to be prepared to read logs, review skills, and manage execution environments. Best suited when: You want the agent to remember and improve work processes through repeated use. You can manage sandboxes, select models, and control access permissions yourself. Choose OpenClaw when work requires continuous communication through messages OpenClaw is suitable when the assistant needs to be present on Telegram, WhatsApp, Slack, Zalo, or similar channels. It is useful for alerts, rapid collection of requests, and automation that begins with a conversation. In return, you must manage identity, channel permissions, and Gateway stability. Best suited when: Requests usually arrive as messages or automated alerts. You need one coordination point for several different communication channels. Choose Claude Cowork when you need quick results without building a system Cowork suits content creators, analysts, and managers who need complete documents, spreadsheets, and slides without wanting to think about servers or Gateways. In return, you should understand the limits of your plan, where data travels, and which connections are enabled before introducing real work. Best suited when: You want to describe the required outcome in natural language and receive a complete output. You prioritize the convenience of a managed service over full control of the infrastructure.

Nam•

14 Jul, 2026

GPT-5.6 vs Claude Fable 5: What Is New?

Sol, Terra, and Luna make GPT-5.6 look more like a product family than a single model. The naming also signals what OpenAI is trying to change: users no longer have to choose only between an expensive flagship and a much smaller model. Instead, they get three capability tiers designed for different workloads. The important caveat is that GPT-5.6 is currently in limited preview, and OpenAI says it is not available in ChatGPT during this preview period.On the other side, Anthropic positions Claude Fable 5 as a frontier model for reasoning, software engineering, scientific research, and long horizon agentic work. The useful question is therefore not simply which model is smarter. It is which product architecture helps a team complete real work with predictable quality, latency, and cost.What GPT-5.6 actually isAccording to OpenAI's preview announcement, GPT-5.6 consists of Sol, Terra, and Luna. Sol is the flagship and most capable option, Terra is a strong lower cost model, and Luna is the fastest and most cost efficient member of the family.The important change is how OpenAI divides demand into three tiers. A research team might use Sol for a difficult reasoning problem, a product team might run most daily work on Terra, and a high volume system might use Luna for thousands of short requests. This looks more like an infrastructure strategy than the launch of a single new chatbot.Availability matters: OpenAI says GPT-5.6 is not available in ChatGPT during the preview. An experience in an API, developer tool, or partner platform should not be treated as the final ChatGPT experience.Sol is designed for difficult, extended workSol is positioned as the strongest GPT-5.6 model for deep reasoning, complex coding, and long multi step tasks. A software team might ask it to understand a repository, identify the cause of a bug, propose a minimal patch, and write regression tests. Sol's value is not answering a short question quickly. It is maintaining the objective while working through a longer chain of decisions.OpenAI also highlights stronger cyber capability as reasoning increases. That can be useful for authorized security testing and vulnerability analysis, but it also makes access controls, logging, sandboxing, and human approval more important.Terra aims for the practical middleTerra targets the broadest category of work: document analysis, content production, application development, research synthesis, and operational support. If Sol is the specialist called for the hardest problem, Terra is the strong team member expected to work throughout the day without making every request unnecessarily expensive.A marketing team could use Terra to read market reports, extract insights, build an outline, and draft several content variants. A development team could use it for code review, test generation, and tickets with a clear scope. This tier could become the default if its real world quality remains consistent.Luna prioritizes speed and scaleLuna is designed for low latency and lower cost. Classification, conversation summaries, field extraction, drafting, and ticket routing do not always require the strongest model. In these cases, response time and total operating cost matter more than maximum reasoning capability.Fast does not mean suitable for everything. If a task requires source verification, a long plan, or a code change with a large blast radius, a team should move it to Terra or Sol instead of forcing Luna beyond its intended role.Claude Fable 5 takes a different routeAnthropic presents Claude Fable 5 as a frontier model for reasoning, software engineering, vision, scientific research, and long horizon agentic work. Instead of emphasizing three product tiers in one generation, Anthropic's message focuses on the capability of a powerful model working inside the Claude ecosystem.This difference changes deployment decisions. With GPT-5.6, an engineering team might build a router that sends each request to Sol, Terra, or Luna. With Fable 5, the focus may be on optimizing prompts, tools, context, and reasoning budgets around one primary model. Neither approach is universally better because the answer depends on workload and operational maturity.A fair comparison: Do not run one prompt and declare a winner. Build a test set covering short tasks, long reasoning, coding, extraction, and recovery from errors. Measure accuracy, latency, the number of human corrections, and the total cost of a completed task.Coding and agentic work depend on the surrounding toolsBoth GPT-5.6 Sol and Claude Fable 5 target complex software work, but the practical experience depends heavily on the system around the model. The ability to read a repository, execute commands, observe results, and correct mistakes can matter as much as a benchmark score. For OpenAI workflows, the Codex page is a useful starting point for understanding how a model participates in coding work.Fable 5 may be attractive to teams already invested in Claude and long running agentic workflows. Read our Claude Fable 5 coverage for more context on Anthropic's positioning and the types of work it targets.What early forum experience tells usEarly discussions on Reddit and developer communities focus on how different Sol, Terra, and Luna feel in real work. Some users describe Sol as the better fit for multi step tasks, Terra as the practical option for routine work, and Luna as the interesting choice for speed. These observations match OpenAI's positioning, but they do not establish a precise quality gap.Forum reports are useful because they reveal the questions real users care about. However, they are self selected evidence. People may use different prompts, access levels, integrations, and preview versions. A result from a developer platform does not guarantee the same result when a model eventually appears in ChatGPT.Early positivesThe three tiers make it easier to understand which model belongs to which workload.Luna creates a clear expectation of low latency for high volume systems.Terra could become a default if it delivers stable quality at a practical cost.Sol is expected to be stronger for coding, long reasoning, and tasks with several verification steps.Open questionsHow large the practical quality gap between Sol and Terra will be on common workloads.The total cost after retries, corrections, and human review are included.How Luna behaves with long prompts and many constraints.Whether performance remains stable as GPT-5.6 expands beyond preview access.Forum reports are not benchmarks: Community experience should help you choose test cases, not make a production purchasing decision by itself.Comparing GPT-5.6 and Fable 5 by workloadWriting and document analysisTerra appears positioned for most document work because it balances capability and cost. Fable 5 may be attractive when documents are long, questions are complex, and the model must maintain an argument across a large context. A useful evaluation should score citation accuracy, structural consistency, and how much editing is required before publication.Software development and debuggingSol and Fable 5 are both candidates for difficult coding tasks. A representative test should include reading existing code, identifying the root cause, producing a minimal fix, writing tests, and explaining risk. Asking a model to create an isolated function from scratch does not reflect how well it works in a real repository.High volume processingLuna has the clearest positioning advantage when speed and cost dominate. At thousands of extraction or classification requests per day, a small difference in price and latency can have a large effect. Fable 5 may be unnecessarily expensive for a workload that only needs short, structured outputs.Research and long reasoningSol and Fable 5 should be compared with tasks that have verifiable outcomes rather than open questions that merely sound impressive. Give both models the same research material and ask them to identify assumptions, detect contradictions, propose an experiment, and explain what evidence is missing. The better model is the one that helps users discover errors faster, not the one that writes the longest answer.Should you choose Sol, Terra, Luna, or Fable 5?If you want maximum capability inside the OpenAI ecosystem, Sol is the first model to test. If you need a strong model for regular use, Terra has the more practical position. If your workload contains many short and repetitive tasks, Luna could reduce operating cost. Fable 5 remains relevant for teams invested in Claude or focused on long reasoning and agentic work.Because GPT-5.6 is still in preview, replacing an entire production workload would be premature. Run the models in parallel on real but sanitized data, record failures, and use the same criteria for every candidate.A test plan you can use nowSelect 20 tasks that represent real work, including easy and difficult cases.Run each task on Sol, Terra, Luna, and Fable 5 when access allows.Score accuracy, response time, total cost, and required human correction.Track severe failures separately instead of relying only on averages.Choose a model for each workload category rather than forcing one model to do everything.Is GPT-5.6 worth switching to now?The most important change in GPT-5.6 may not be Sol's raw capability. It is OpenAI's decision to turn one model generation into three operational tiers. That could help organizations control cost, but only if they can classify workloads and route requests intelligently.The practical next step is to build a small benchmark from your own data. If Sol wins difficult tasks, Terra is good enough for routine work, and Luna handles high volume requests reliably, the three tier architecture has real value. If Fable 5 remains more consistent on long reasoning, a multi model strategy may still be better than committing to one provider.

Liên•

9 Jul, 2026

YC CEO's 6 forcing questions before starting any project

I'd heard a lot about the gstack repo from the CEO of Y Combinator, so I got curious and installed it to try. What surprised me most wasn't the polished workflows — it was the genuinely different mindset behind them. That mindset shows up in the very first command: /office-hours, with six questions that don't ask about code at all, only the things most people haven't thought through before they start building. What is gstack and why did Garry Tan build it gstack is an open-source toolkit by Garry Tan, CEO of Y Combinator, built primarily for Claude Code. The core idea: instead of using AI as a plain code writer, Garry Tan wanted to turn Claude into a small AI agent team, where each member handles a different role — from product direction and security review to testing and release. The entire workflow runs in an ordered loop: Think → Plan → Build → Review → Test → Ship → Reflect. More specifically, gstack splits Claude Code into 23 specialized roles, and the output of each step is automatically passed to the next — no manual handoff needed. Some of the standout commands: /office-hours 6 questions that force you to rethink your feature before writing a single line of code /plan-ceo-review checks whether you're overbuilding or underbuilding relative to what's actually needed /review catches serious bugs that standard automated checks miss /qa opens a real browser, performs real interactions, finds real bugs /cso runs an automated security audit against international standards /ship syncs, tests, pushes code and opens a pull request in a single command How effective is gstack? Garry Tan says his working speed in 2026 is roughly 810 times faster than in 2013, measured by lines of completed code per day (11,417 vs 14). In 60 days, he shipped 3 production services and over 40 features — all while running Y Combinator full-time. Andrej Karpathy, co-founder of OpenAI, confirmed a similar trend, sharing that he hasn't typed a single line of code himself since December 2025 thanks to AI agents. But among all those commands, /office-hours stands out for the opposite reason from the rest, it doesn't help you work faster and it helps you avoid building the wrong thing from the start. Why Garry Tan puts /office-hours first Garry Tan placed /office-hours at the top of the workflow based on a simple observation: most products fail not because of poor code, but because they build the wrong thing. Teams spend weeks on a feature nobody needs, or build the right feature for the wrong audience, or solve a problem users already handle better another way. The command has two modes: Startup mode for founders and people building real products with real users, and Builder mode for side projects, hackathons, and open source. This article focuses on Startup mode, where the 6 questions are most directly applicable. 6 questions that stop you from building the wrong thing These aren't 6 questions to answer quickly and move on. They're designed to make you think honestly, because the more truthful your answers, the more accurately Claude can match what you actually need — saving you a significant amount of time later. You can read the full original prompts at office-hours/SKILL.md.tmpl. Demand reality: Is there a real need? Original question: "Who specifically has this problem? How are they solving it today?" Not "users in general" or "the marketing team" — the goal is to name one real person, ideally by name, who is actively struggling with a specific problem. If you can't name someone like that, you don't yet understand what they actually need. Concrete example: Instead of "users want better task management," it should be: "Minh, a project manager at a 20-person company, copy-pastes between Notion and Google Sheets every Monday morning because the two tools don't sync." Apply this to your own situation accordingly. Status quo: What are they using instead? Original question: "What is their current workaround? How much better do you need to be for them to switch?" Everyone is already solving their problem somehow — whether with Excel, sticky notes, or a WhatsApp group. If their current solution is good enough, they have no reason to migrate their data and learn an entirely new platform. Your solution needs to be meaningfully better before they'll even consider switching. Desperate specificity: Who needs this badly enough? Original question: "Who needs a solution badly enough to use your ugly beta version today?" This is the question that separates nice-to-have from must-have. If you can't find anyone willing to use an incomplete, rough, buggy version right now, the problem you're solving isn't urgent enough. Real early users are people who need a solution badly enough to tolerate an unpolished product — as long as it's moving in the right direction. Narrowest wedge: What is the smallest possible piece? Original question: "What is the smallest thing you could launch tomorrow? Not the full vision — the smallest piece." Not the first full-featured version — something even smaller than that. This question typically cuts 80% of the scope people add because they think "might as well do it while I'm here." It's a trap many builders fall into, including myself. Launch the smallest meaningful piece first, listen to real users, then decide whether to expand. Common mistake: Many people confuse "smallest piece" with "first full-featured version." The narrowest wedge truly means one small thing that solves one specific problem for one specific group of users — nothing more. Observation and surprise: Have you watched real people use it? Original question: "Have you watched real people use your product? Did they use it in ways you didn't expect?" This question is best saved for the second iteration onward, once you have something to test. Rather than asking for feedback through messages or surveys, sit and watch directly — or review screen recordings. The most valuable insights usually don't come from what users say, but from what they do that you didn't design for, or what they skip that you thought was important. Note: If you're in your first iteration and don't have a product yet, you can skip this question and come back after launching the smallest piece in step 4. Future-fit: The 2 to 3 year view Original question: "In 2-3 years, will what you're building still be relevant — or is the trend moving against you?" This isn't about predicting the future precisely. It's about avoiding building something that's already fading. If the trend is making your problem less urgent over the next two years, that's a clear signal to reconsider from the start. That said, if your goal is to move fast and capture the market before big tech ships something similar, this question can reasonably be set aside. A real example: a simple idea completely flipped In the gstack documentation, Garry Tan walks through a practical example. You open /office-hours and say: "I want to build an app that summarizes my daily work calendar." Claude doesn't agree and start executing. Instead, it pushes back: what you just described isn't a calendar summary app — it's actually a full personal AI chief of staff. These are entirely different in scope, technical complexity, and user expectations. From that single opening description, /office-hours helps you see: 5 features you were describing without realizing it 4 assumptions that need to be validated before building 3 different implementation directions with varying levels of complexity 1 recommendation: launch the smallest piece first, treat the rest as a long-term roadmap All of this happens before you write a single line of code. The output is saved as a document that subsequent steps in the workflow automatically pick up and continue from. These 6 questions work even without gstack The 6 questions from /office-hours don't require Claude Code or a gstack installation. They're a way of thinking — the same framework YC partners use to evaluate startups — and you can apply them right now with any AI tool you already have. The difference when using them through gstack is that Claude won't let you give vague answers. It pushes for specifics and won't move forward until your response is grounded enough to be useful. That's why /office-hours tends to be the most uncomfortable command in the entire toolkit — not because it's difficult to use, but because it asks exactly what you've been avoiding. Try it today: Before starting your next project, paste these 6 questions into Claude, Gemini, or ChatGPT along with your idea. Ask it to go through each question one at a time and not let you skip any. The results are often more surprising than you'd expect — even for ideas you've already thought through carefully. gstack currently has over 117k stars on GitHub and is still growing. For me, the most valuable part isn't the technical commands like /review or /ship — it's /office-hours, because it's the only command in the entire toolkit that forces you to stop and think before doing anything else.

Nam•

27 Jun, 2026

How to control Codex from your phone with ChatGPT app

You're out and suddenly remember a small detail in your project that needs fixing — you don't have to open your laptop or remote desktop in. With the right connection set up, ChatGPT app on your phone can become a control panel for Codex, while your computer at home or the office keeps running the actual code. ChatGPT app doesn't run Codex on your phone The easiest thing to misunderstand is thinking Codex is running directly on your phone. In reality, your phone only sends prompts, replies, approvals and follow-up messages, while the actual working environment lives on your Mac or Windows machine running Codex. In other words, ChatGPT app is the remote controller, and the host machine is where your repo, terminal, credentials, plugins, MCP servers and other tools actually live. This makes complete sense because codebases typically live on your development machine, not your phone. When you send a request like fixing a TypeScript error, running tests or checking a diff, Codex processes it inside the selected project on the host and sends results back for you to review. If you want to understand the foundation before using remote access, check out What is Codex and how to use Codex to get a clear picture of where this tool fits in your workflow. What do you need before connecting ChatGPT app to Codex? According to the latest Codex documentation from OpenAI, ChatGPT app supports controlling Codex on both macOS and Windows, though Linux is not supported yet. Notably, this feature works with all ChatGPT account types, including Free and Go — no paid plan required. You only need to make sure you're signed into the same account or workspace on both devices: ChatGPT mobile (latest version on iOS or Android) and Codex (latest version on your host machine, online and running). Your host machine must stay on and Codex must keep running for the entire time you're controlling it remotely. If the machine goes to sleep, loses its connection or Codex is closed, the connection from your phone drops immediately and any tasks in progress may be interrupted. What's worth noting is that the entire setup process starts from Codex App on the host machine and is surprisingly simple — just scan a QR code and you're done. Inside Codex App, select the mobile setup option in the sidebar, scan the QR code with your phone, then complete the confirmation in ChatGPT app. For enterprise workspaces, an admin may need to enable Remote Control permissions before you can connect. This QR code grants control over your computer, so keep it private and never share it with anyone to avoid unauthorized access to your machine. To summarize, connecting ChatGPT app to Codex is straightforward: Host machine must be online and running Codex ChatGPT app and Codex must be signed into the same account or workspace Generate the QR code in Codex on the host and complete setup on your phone MFA, SSO or passkey requirements may still apply depending on your workspace What can you do once connected? Once the host appears in Codex on your phone, you can start a new thread inside a project on the host or pick up an existing one. This is where the experience becomes genuinely useful: you can send follow-ups, answer Codex's questions, approve commands, view output, check diffs, review test results and even receive notifications when a task finishes or needs your attention. A real example: you're at a coffee shop and remember the login form has a validation bug. You open ChatGPT app, select the connected host, and ask Codex to check the auth flow, fix the email validation error and run the related tests. Codex works directly on the repo sitting on your host machine, while you review the results, approve actions when needed and decide whether to request further changes. This is also why people are starting to think of Codex and other AI-powered IDEs as a colleague working inside a real environment, not just a code suggestion tool anymore. Its strength lies in reading files, running commands, editing code and maintaining context across multiple rounds of back-and-forth. Limitations to keep in mind when using Codex from your phone Remote control depends entirely on the host machine — if your computer goes to sleep, loses its connection, closes Codex or gets signed out of the workspace, your phone loses its working environment immediately. That said, if Codex is mid-task when the connection drops, it will continue running on the host and notify you once your phone reconnects, so there's less to worry about if your phone suddenly loses signal during a running task. One more thing to note: on Windows, tasks using Computer Use require an appropriate foreground session, so this setup is not a complete replacement for sitting directly in front of your machine. It also helps to draw a clear line between handing off a focused task and reviewing large changes. Your phone works well for small bugs, running tests, quick questions about a specific file, reviewing short tasks or checking task status. However, anything requiring a high level of attention should still be reviewed on a larger screen to avoid missing details. How to use it effectively in practice The most effective approach is to hand off tasks with a clear scope and specific expected outcomes. Instead of saying "fix the login", describe exactly where the error occurs, what the expected behavior should be after the fix, which tests to run and which parts of the codebase to leave untouched. Codex performs better when it knows the boundaries of a task, especially since remote mobile means each feedback loop takes longer than when you're sitting right at your machine. A clean working rhythm might look like this: describe the task in detail whether small or medium-sized, ask Codex to read the relevant files, let it propose a solution, only approve when necessary and wait for the result report. Once you get used to this rhythm, you'll find that idle time outside can handle real work — while keeping the final decision firmly in your hands. Compared to Claude Code Remote and Telegram bot There are many ways to control an AI coding agent from your phone, though the three most common approaches each serve a different need. Criteria ChatGPT app + Codex Claude Code Remote Telegram + Codex Natural conversation ✅ Excellent ✅ Good ❌ Requires exact syntax Granular control Moderate Highest Low Connection stability Stable Stable Frequent drops Mobile UI Well optimized Not fully optimized Uses existing Telegram app Initial setup Easy, scan QR Easy Requires manual bot configuration Computer must stay on ✅ Required ✅ Required ✅ Required Claude Code Remote Control offers the strongest level of control — you get direct terminal output, can intervene mid-task and generally feel much closer to what the agent is doing. That said, the UI on small phone screens isn't fully optimized yet, and some interactions are still difficult to perform without a physical keyboard. Telegram bot has the advantage of not requiring a separate app and is easy to get started with, but the real-world experience has clear limits: it's prone to slowdowns, occasional silent disconnections mid-task, and because it lacks genuine AI context, anything slightly more complex than a simple command quickly falls apart — forcing you to type precise instructions rather than describe what you need naturally. ChatGPT app + Codex sits at the best balance point for most users — smooth enough, smart enough, quick to set up with a QR scan and no new syntax to learn before you can get to work. Connecting ChatGPT app to Codex doesn't turn your phone into a development machine — it turns your phone into a control surface for a development machine that's already ready to work. As long as the host stays on, permissions are configured correctly and the task is scoped tightly enough, this is the most practical way to handle real coding work when you're away from your laptop.

Nam•

22 Jun, 2026

Quick Summary

How are Memory and Project different?

What is a Claude Project?

Limits by plan

How to set up a Project so Claude understands you better

Step 1: Write custom instructions

Step 2: Upload documents to the knowledge base

Can you add a Skill to a Project?

Three of the most effective ways to use Projects

A "about me" Project to use Claude as a personal assistant

A Project per client or per initiative

A learning and research Project

Frequently asked questions about Claude Projects

How is a Project in Claude different from a Project in Cowork?

How long should custom instructions be?

How should I name my Projects for easy management?

When should I delete or update documents in a Project?

Is a Project actually better than a regular chat?

Discussion (0)

Related Articles

Comparing Hermes Agent, OpenClaw, and Claude Cowork

GPT-5.6 vs Claude Fable 5: What Is New?

YC CEO's 6 forcing questions before starting any project

How to control Codex from your phone with ChatGPT app