Me and Copilot are done

Every time I use copilot for anything intelligent it collapses in a heap… I get it, it’s an LLM designed to mirror, infer and basically make stuff up so you get drawn in. Good for writing poems and rewording emails but beyond that, I can’t find a use for it (not that I write poems btw)

So, I gave it 1 more shot.. “provide the best GPU solution for AI video generation and creation on my PC”

Copilot (local) came up with “RTX 4090 (24GB) is the correct choice.” but they are now no longer supported! It also went on to claim “The 5070/5080 are “inferior” for AI because NVIDIA wants it that way

So I did my own online search and the opening page statement on the search page was, by Copilot “Copilot Search Branding

The NVIDIA GeForce RTX 5090 is currently the top GPU for AI video generation, offering 32GB VRAM, 10,240 CUDA cores, and advanced multi-frame generation for high-resolution workflows.

I’m now confused and I think me and copilot are done… fairly sure there are smarter folks on here…

You’re probably using a different Copilot than the one you intended to use, and if search is turned off, generative AI often can’t access the latest information.

A practical guide to Copilot, AI consistency, and what to use instead

Copilot does not feel inconsistent because you are imagining it. It feels inconsistent because it is not always looking at the same information, using the same mode, or using the same hidden context. Bing’s Copilot Search is designed to search the web and return summarized answers with cited sources. Microsoft 365 Copilot can work in Web mode or Work mode, which changes what it looks at. On top of that, Microsoft also offers different conversation modes, memory, personalization, and custom instructions. So the label is the same, but the behavior can change quite a bit underneath. (Microsoft)

If you only remember five things

  1. Use Search or Web when you want current facts.
  2. Open Sources before you trust an answer.
  3. Use Work for company files, meetings, and email.
  4. Use a Notebook when you want Copilot to look only at selected files.
  5. If answers feel oddly personalized or unstable, check memory and custom instructions. (Microsoft Support)

1. Why Copilot can give inconsistent answers

Copilot is not one single experience

This is the biggest reason. Copilot Search in Bing is a search product. Microsoft says it gives quick summarized answers with cited sources. Microsoft 365 Copilot Chat is broader. It can work from public web information, or from your work data, depending on the mode and your setup. If you compare those two as if they are the same thing, the differences will look random when they are not. (Microsoft)

It does not always use live web information

When Microsoft 365 Copilot uses web search, Microsoft says you will see a Sources button, and that button lets you inspect the exact query Copilot sent to Bing plus the sources it used. That means there are moments when it is clearly using fresh web search, and moments when it is not doing that in the same way. For fast-moving topics like GPUs, prices, product support, news, or software changes, that difference alone can flip the answer. (Microsoft Support)

The mode changes the style of the answer

Microsoft’s official conversation modes explain this directly. Search is for current answers from the web with citations. Think Deeper spends longer reasoning. Smart uses GPT-5. Quick response is optimized for fast replies. So two answers can differ because you are not really asking the same Copilot in the same way. (Microsoft Support)

Your past chats and settings can quietly affect the result

Microsoft says Copilot personalization can include saved memories, details inferred from chat history, and custom instructions. That is useful for convenience, but it also means two people, or even the same person on different days, may not be starting from the same hidden assumptions. This is one reason AI tools can feel strangely moody or inconsistent. (Microsoft Learn)

Copilot also has built-in decision logic behind the scenes

Microsoft says Microsoft 365 Copilot uses an orchestrator. In plain English, that means there is a layer that helps decide what tools, skills, and actions to use between your prompt and the final answer. So Copilot is not just “one fixed model replying from a blank page.” That makes it more capable, but it can also make it harder to predict. (Microsoft Learn)

The simple takeaway

In your GPU example, the most likely explanation is not “Copilot is dumb.” It is “these answers were produced under different conditions.” One answer was probably more grounded in current web information than the other, or it came from a different mode or product surface. That is the cleanest explanation that fits Microsoft’s own product documentation. (Microsoft)


2. A Copilot-specific guide that actually works

Start by choosing the right surface

Use Search or Web for current facts, prices, product comparisons, and news. Use Work for Microsoft 365 files, meetings, Teams, and email. Use a Notebook when you want Copilot to stay inside a small, chosen set of documents instead of wandering across everything else. Microsoft says Notebook answers are grounded in the notebook’s references, and that general web links are not used the same way there. (Microsoft Support)

Always open Sources

This is the single best Copilot habit. Microsoft says the Sources panel shows the exact Bing query Copilot generated and the sources it used. That lets you check whether Copilot even understood your question correctly before you judge the answer itself. If the hidden search query is wrong, the answer will often be wrong too. (Microsoft Support)

Use Notebooks for serious work

A Notebook is one of Copilot’s best features because it narrows the field. Microsoft says Notebook answers are grounded in the notebook’s references, and that only a limited number of references are used for grounding. It also lets you add custom instructions at the notebook level, such as tone, format, or focus. That makes Notebooks much more stable than a wide-open chat when accuracy matters. (Microsoft Support)

Use Researcher when the job is real research

Microsoft describes Researcher as the tool for in-depth research across multiple sources, including web and work, with citations and a structured report. That makes it better than plain Copilot Chat when you need a serious comparison or a report you can share. (Microsoft Support)

Check memory and customization when answers feel “off”

If Copilot starts sounding too tailored, too sticky, or just oddly biased toward old context, look at memory and custom instructions. Microsoft says these settings affect personalization, and turning them off stops Copilot from applying them going forward, though stored memories may need to be deleted separately. (Microsoft Learn)

Keep Copilot for Windows tasks if that part is useful

Microsoft’s Windows Copilot app can search local files and answer questions about them. That is one area where Copilot still has a natural advantage on Windows itself. So replacing Copilot entirely is not always necessary. Sometimes the better move is to keep it for Windows tasks and use a stronger AI app for thinking and research. (Microsoft Support)

A simple prompt that works well in Copilot

Use this when you want more stable answers:

Goal:
Context:
Constraints:
Output format:
What sources should be used:
Separate confirmed facts from guesses:

That structure matches Microsoft’s own advice that better prompts come from clearly stating the goal, context, expectations, and source. (Microsoft Support)


3. If you move away from Copilot, what is the best replacement?

Best overall replacement: ChatGPT

If you want one Windows AI app that does most things well, ChatGPT is the safest choice. OpenAI offers a Windows app, and its documentation says you can use a Companion Window with Alt + Space to ask questions, upload files, generate images, and start chats quickly. OpenAI also says ChatGPT search can return timely answers with links to relevant web sources. That combination makes it a strong all-round replacement. (OpenAI Help Center)

Best for thoughtful writing and longer reasoning: Claude

If your main use is writing, organizing long thoughts, and careful back-and-forth, Claude is a strong alternative. Anthropic provides a Windows desktop app, and says the desktop version works with your files and apps. Independent model tracking also places Claude Opus 4.6 among the top intelligence models right now. (Claude)

Best for research and source-heavy answers: Perplexity

If what you really want is “search, but better,” Perplexity is a very good fit. Perplexity says it searches the web in real time and gives conversational answers backed by verifiable sources, with citations and links to original material. That makes it especially good for research, comparison shopping, and factual lookups. (Perplexity AI)

Best if you live in Google’s world: Gemini

If you are already deep in Gmail, Google Drive, and Google Search, Gemini is a strong option. Google says Gemini Deep Research can break down complex research tasks, explore web sources, and optionally use your Workspace content. It can also work from uploaded files. (Gemini)

If you care about pure model intelligence

If you care about model rankings as one input, not the only input, Artificial Analysis currently shows Gemini 3.1 Pro Preview and GPT-5.4 at the top of its intelligence leaderboard, followed closely by Claude Opus 4.6. Real product experience still depends on app quality, search, sources, and workflow, but that gives a rough picture of the current top tier. (Artificial Analysis)

The practical answer

If you want one answer, it is this:

  • ChatGPT for best all-round replacement on Windows
  • Perplexity if research is the main job
  • Claude if writing and deep thinking matter most
  • Gemini if your digital life already runs through Google (OpenAI Help Center)

4. A general guide for using AI tools well

This part applies not just to Copilot, but to ChatGPT, Claude, Gemini, Perplexity, and most other AI systems.

First decide what information the AI should use

This is the most important rule. Google describes grounding as using web or chosen sources to improve the answer. Azure describes agentic retrieval as a way to break a question into parts and pull in the right material. In simple terms, this means good AI answers depend heavily on what the system is allowed to look at, not just how smart the model is. (Gemini)

Ask with purpose, constraints, and output format

OpenAI’s prompt guides say clear and specific instructions work better, and Anthropic says the same: be direct, provide structure, and give examples where useful. In practice, that means vague prompts get vague answers, while focused prompts get much better ones. (OpenAI)

A simple reusable format is:

Goal:
Constraints:
Desired output:
What information to use:
If something is uncertain, say so clearly:

Ask the AI to separate facts from guesses

This is one of the best practical tricks. AI tools are very good at making uncertain things sound polished. So tell the model to split its answer into confirmed facts, inference, and uncertain points. That makes the answer much easier to trust or challenge. This is consistent with the broader prompting advice from OpenAI and Anthropic, even though the wording here is a practical simplification. (OpenAI)

For important topics, use a two-step method

First ask broadly. Then ask again using only official or primary sources. This works especially well for products, prices, current events, legal changes, and technical details that can change quickly. Search-grounded tools from OpenAI, Google, Microsoft, and Perplexity all make this easier, but the habit still matters. (OpenAI)

Keep hidden context under control

Long chats can quietly pick up extra assumptions. Azure’s description of agentic retrieval makes clear that chat history can become part of how later queries are handled. Copilot’s memory and personalization features show the same general pattern. So if you need a clean answer, start a fresh chat or narrow the scope with selected documents. (Microsoft Learn)

Show one example when you can

Anthropic’s prompting guidance strongly supports giving examples. Often, one sample output is more useful than a long explanation of what you want. (GitHub)


5. A few important terms, explained simply

Sources means the links or references behind the answer. In Copilot, Sources can also show the exact hidden search query it used. (Microsoft Support)

Memory means the tool remembers some preferences or details from earlier chats and uses them later. Useful, but it can also make answers less predictable. (Microsoft Learn)

RAG is short for a setup where the AI looks at documents or search results while answering. You do not need the acronym to use it well. Just think “AI answering with extra reference material.” (GitHub)

Custom instructions means your own standing preferences or rules for how the AI should respond. Copilot supports this, and so do other tools in different ways. (Microsoft Learn)


6. Good places to read more

If you want practical reading rather than theory, start with these:

  • Microsoft’s pages on Copilot Search, conversation modes, web search and Sources, Notebooks, Researcher, and memory/personalization. (Microsoft)
  • OpenAI’s guidance on prompt engineering and ChatGPT search. (OpenAI)
  • Google’s Gemini Deep Research pages. (Gemini)
  • GitHub lists like Awesome Prompt Engineering, Awesome RAG, and awesome-ai-agents if you want a broader map of the field. (GitHub)

Final takeaway

Copilot feels inconsistent mostly because it is not always using the same sources, the same mode, or the same hidden context. Once you accept that, the tool becomes easier to manage. Use Search/Web for current facts. Use Work for Microsoft 365 data. Use Notebooks for focused work. Open Sources before trusting an answer. And if you want a stronger general replacement on Windows, start with ChatGPT, then consider Perplexity, Claude, or Gemini depending on what you do most. (Microsoft Support)

I use Copilot too but not for much. It’s not good for writing full programs. That’s because Copilot uses the GPT model, and you might have been using an older model GPT. Use the newest they have. An LLM is only as good as the data it was trained on, and older LLMS, even a year old, can have outdated data about industries that move quickly, like technology.

I sometimes use Copilot to get snippets of code but I design the overall program. We are required to use Copilot for AI at work because it has end-to-end encryption and we sometimes will send it a spreadsheet or another file. I have found some light uses cases but nothing I can automate specifically with Copilot Studio.

Also, Microsoft Copilot is supporting Claude Sonnet now but I believe only for Frontier users at this point. (Frontier is Microsoft’s name for a beta tester.) So a regular user may have to enable it, or have their Office 365 admin enable Sonnet.