Start now →

Claude AI Said “Excellent Work”, and Then My Team Found 3 Critical Bugs

By Gapur Kassym · Published March 16, 2026 · 3 min read · Source: Level Up Coding
DeFiAI & Crypto
Claude AI Said “Excellent Work”, and Then My Team Found 3 Critical Bugs

The most common mistakes

Photo by Immo Wegmann on Unsplash

I asked Claude AI to find the root cause of an error. Then I asked it to fix the bug and write tests. When the PR was ready, I asked it to review the changes. Claude said “Excellent work!”.

After my teammates tested and reviewed the same code, they found three critical bugs and several weaknesses in the codebase that I had introduced.

Why did this happen? Because I completely misused the most powerful AI assistant.

One mistake makes Claude praise broken code. Another makes it write tests that pass even when the fix doesn’t work.

The problem wasn’t Claude, but how I was using it.

Here are 7 mistakes that made my experience with Claude AI completely useless.

1 Self-Review Bias

I asked Claude to review code it wrote during the same session. Claude said:

"The implementation looks excellent! Clean code, follows best practices, handles edge cases properly."

Then my teammates found three critical bugs and several code quality issues.

Why:

Claude AI cannot objectively evaluate its own decisions.

How to Fix:

2 Vague Prompts + Missing Context

I asked Claude: “Fix this bug” without providing any requirements, context, or full stack traces.

Claude guessed wrong. Moreover, this fix worked in one place but broke in other places.

Why:

Claude makes assumptions without the necessary context.

How to Fix:

3 Not Specifying Output Format

I asked Claude: “Explain how a specific block of code worked”, but I got a long tutorial.

Why:

Claude doesn’t know your preference.

How to Fix:

4 Not Checking Claude’s Output

I asked Claude: “Write three test cases for my fix”. Then I commented out my fix and ran the tests — all passed. The tests were worthless. They looked like they tested, but didn’t.

Why:

Claude generates tests but don’t actually check the fix.

How to Fix:

5 Not Treating Claude as a Specialized Pair Programmer

I asked Claude: “Why isn’t this working?” and used it as a chatbot or Stack Overflow. But it can read entire files, search codebases, delegate tasks, and create solutions as a developer.

Why:

Claude is not a chatbot. It is a powerful developer.

How to Fix — delegate tasks to a developer:

6 Accepting Claude’s First Answer

I asked Claude: “Create a pull request with description”. It created the PR with unnecessary and verbose details.

Why:

Claude’s solution worked but not optimal.

How to Fix:

7 Reduce hallucinations

I asked Claude: “Show me how to use the AWS SDK library” without mentioning the latest version. Claude confidently offered examples based on knowledge about the outdated version that didn’t work.

Why:

Claude can confidently assert “facts” that are completely wrong. It has gaps in his knowledge that lead to incorrect information, and he can’t know what he doesn’t know. This phenomenon is known as “hallucination”.

How to Fix:

Conclusion

Claude AI will give us exactly what we want — we just have to ask the right questions.

Thanks for reading, I hope you found this piece useful. Happy coding!

Resources

Documentation


Claude AI Said “Excellent Work”, and Then My Team Found 3 Critical Bugs was originally published in Level Up Coding on Medium, where people are continuing the conversation by highlighting and responding to this story.

This article was originally published on Level Up Coding and is republished here under RSS syndication for informational purposes. All rights and intellectual property remain with the original author. If you are the author and wish to have this article removed, please contact us at [email protected].

NexaPay — Accept Card Payments, Receive Crypto

No KYC · Instant Settlement · Visa, Mastercard, Apple Pay, Google Pay

Get Started →