What You Leave Out of the Context Matters More

I used to paste everything into the context. Every relevant document, every prior conversation, every piece of background I thought the model might need. The logic was simple. More information means better output, and the context window is big enough to hold it all. What I got back was long, hedged, generic, and safe. The model was trying to accommodate everything I had given it, which meant it addressed nothing with any depth.

The realization that changed how I work is that context is a budget. Every document I put in the window competes with everything else for the model's attention. Past a certain point, adding more does not make the output better. It makes the output blander, because the model is averaging across more inputs and committing to fewer positions. The window being large enough to hold everything does not mean everything belongs in it. What fits and what helps are two different questions, and only the second one matters.

The model does not weight all context equally. Information near the beginning and near the end gets more attention than material buried in the middle of a long document. This is a measured, documented behavior, and it means that pasting a twenty-page document into the context practically guarantees the model will miss something important on page eleven. The output just gradually gets less specific, less opinionated, and more exhaustive in the unhelpful sense. If I notice the model producing "here are several considerations" instead of a clear answer, the first thing I check is whether I drowned the signal in context it did not need. Nine times out of ten, trimming the context fixes the output without changing the instruction at all.

The skill I have built is curation. Before a complex task, I still read everything, but I do not feed everything. I select the two or three documents most relevant to the specific question, summarize the rest into a paragraph of constraints rather than pasting the source text, and I sequence what I include so the most important framing comes first. A five-page research summary in the context produces worse output than a single paragraph stating what that research concluded and why it matters here. The model needs my judgment about what is important, not the raw material for forming its own opinion about what might be.

Ordering turned out to be the subtlest form of curation. What I put first in the context establishes the frame through which the model reads everything else. If the first thing it sees is the constraints and goals, the later material gets interpreted through that lens. If the first thing it sees is raw background, it builds its own frame from whatever it finds, and that frame may not be mine. I now spend real time deciding what goes at the top, because the top is what the model treats as the premise, and the premise shapes the conclusion more than the evidence does.

The hardest habit to break was the instinct to include the full source just in case. I kept thinking the model might need a detail buried on page four, so I would paste the entire document to be safe. What I learned is that if a detail is important enough to influence the output, I should state it explicitly in my own words rather than hoping the model locates it in a wall of text. Summarizing is doing the interpretive work myself instead of delegating it to a system that has no sense of what I care about. The conclusion I drew from a document is almost always more useful to the model than the document itself.

This is the skill I think will separate people who get consistently good results from people who get generic ones, and it has nothing to do with prompt tricks or model selection. It is the ability to look at a pile of relevant material and decide what the model actually needs for this specific task, right now. That takes domain knowledge. It takes knowing what matters and what is noise. It is the same judgment that makes a good briefing short rather than comprehensive, and it is not something the model can do for itself, because the model has no independent sense of what I am trying to accomplish with its output. The people I have seen get the best results treat context selection as the actual work, and the prompt as the easy part that follows.

So my context preparation now takes longer than writing the instruction. I read everything, form a view about what matters, and then include only that. The model gets a curated, sequenced, deliberately incomplete picture of the problem, and it produces sharper work precisely because of the incompleteness. The window is large. The discipline is keeping most of it empty.