Question 1

How do I reduce Claude Code token usage?

Accepted Answer

Find the biggest per-turn consumers first. claude-devtools breaks down each turn across CLAUDE.md, skills, @-mentions, tool I/O, thinking, team overhead, and user text. Once you can see which category is dominating, you can act — refactor large files, add entries to .claudeignore, switch lazy prompts to explicit @-mentions, split a monolithic CLAUDE.md into layered directory files, or invoke skills explicitly instead of hoping for auto-detection.

Question 2

Why does my Claude Code context fill up so fast?

Accepted Answer

Three usual suspects: heavy MCP responses (some MCPs return 10k+ tokens per call), Read calls on large files (long components, lockfiles, generated artifacts), and a monolithic CLAUDE.md loaded on every turn. The terminal's progress bar hides which one is at fault. claude-devtools' per-turn token attribution shows each category as an explicit number.

Question 3

Does @-mentioning files in Claude Code save tokens?

Accepted Answer

Yes, often substantially. Without an @-mention, Claude typically uses Grep and Read to locate the file you meant, and each step adds its own tool I/O. An @-mention loads the file content directly with no intermediate tool calls. For multi-file tasks, the savings compound.

Question 4

Should I use a layered CLAUDE.md or one big file?

Accepted Answer

Layered. A single large CLAUDE.md is loaded into context on every turn even when most of it is irrelevant to the current task. Splitting it into a small project-root file plus directory-specific CLAUDE.md files (in apps/, packages/, etc.) keeps the loaded-per-turn portion small and scoped.
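
To check whether a split is worth it, it helps to know how large each CLAUDE.md actually is. The following is a minimal sketch, not part of claude-devtools: the function names are hypothetical, and the four-characters-per-token ratio is a rough assumption rather than Claude's real tokenizer, so treat the numbers as relative sizes only.

```python
from pathlib import Path

# Assumption: roughly four characters per token. This is a common rule of
# thumb, not the tokenizer Claude actually uses.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Return a rough token estimate for a piece of text."""
    return len(text) // CHARS_PER_TOKEN


def claude_md_sizes(root: str) -> list[tuple[str, int]]:
    """List every CLAUDE.md under root as (relative path, estimated tokens).

    Results are sorted largest first. Files inside node_modules are skipped.
    """
    root_path = Path(root)
    sizes = []
    for path in root_path.rglob("CLAUDE.md"):
        relative = path.relative_to(root_path)
        if "node_modules" in relative.parts:
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        sizes.append((relative.as_posix(), estimate_tokens(text)))
    return sorted(sizes, key=lambda item: item[1], reverse=True)
```

Calling `claude_md_sizes(".")` from the repository root returns the files largest first. If the root file dwarfs the directory-level ones, it is a likely candidate for splitting.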

Question 5

Are Claude Code custom skills automatically invoked?

Accepted Answer

Sometimes, but not reliably. Automatic skill matching is probabilistic — Claude may skip a skill that would obviously help, or invoke it after exploring the wrong direction first. For token efficiency, invoke skills explicitly in your prompt rather than relying on auto-detection.

Question 6

What is .claudeignore and what should I put in it?

Accepted Answer

A .claudeignore file (similar to .gitignore) tells Claude Code to skip listed paths when reading or globbing. Good candidates: lockfiles (pnpm-lock.yaml, yarn.lock), generated bundles, build artifacts, large auto-generated TypeScript output, vendor directories. If you see Claude Reading these files in claude-devtools and they don't matter for your task, add them.
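
One way to shortlist candidates before confirming them in claude-devtools is to rank files in the repository by size. This is a hedged sketch under stated assumptions: `largest_files` is a hypothetical helper, the bytes-per-token ratio is a rough heuristic rather than a real tokenizer, and only the `.git` directory is skipped.

```python
import os
from pathlib import Path

# Assumption: roughly four bytes per token, a coarse rule of thumb.
BYTES_PER_TOKEN = 4
# Directories that are never useful to report on.
SKIP_DIRS = {".git"}


def largest_files(root: str, top_n: int = 10) -> list[tuple[str, int]]:
    """Return the top_n largest files under root as (relative path, estimated tokens)."""
    results = []
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune skipped directories in place so os.walk does not descend into them.
        dirnames[:] = [name for name in dirnames if name not in SKIP_DIRS]
        for name in filenames:
            full_path = Path(dirpath) / name
            try:
                size_bytes = full_path.stat().st_size
            except OSError:
                continue  # Broken symlink or unreadable file: ignore it.
            relative = full_path.relative_to(root).as_posix()
            results.append((relative, size_bytes // BYTES_PER_TOKEN))
    results.sort(key=lambda item: item[1], reverse=True)
    return results[:top_n]
```

Running `largest_files(".")` from the project root tends to surface lockfiles and generated bundles near the top. Whether a given file belongs in the ignore list still depends on whether your tasks ever need Claude to read it.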

Question 7

How can I see which MCP responses are consuming the most tokens?

Accepted Answer

In claude-devtools, expand the relevant tool call — MCP responses render with their full payload and token cost. The per-turn token breakdown attributes tool I/O as a separate category, so a turn dominated by tool I/O usually points to a heavy MCP response or a large Read.

4 ways I cut Claude Code token usage after actually seeing my context

1. Heavy MCPs and large files crash the context

2. The hidden cost of lazy `@`-mentions

3. Skill activation is probabilistic

4. A layered `CLAUDE.md` beats one giant file

How to find your own patterns

Questions, answered.
