A number of readers requested me whether or not they may use AI to calculate taxes after studying my earlier publish AI Offers Higher Solutions Than Google. They need a tax projection to assist with setting tax withholding, paying estimated taxes, or planning for Roth conversions.
My speedy response was that calculating taxes isn’t the most effective use of AI, as a result of it falls below “verifiable information” and “newest growth” classes. The retirement plan contribution limits, tax brackets, the utmost deduction quantities, and so forth., are all verifiable information. The IRS units them yearly, and they’re what they’re. Simply go to the IRS web site for the most recent numbers, or Google. Should you ask AI, it had higher get the most recent numbers on-line anyway, as a result of AI’s coaching lags.
Alternatively, tax guidelines are sophisticated. After you’ve got all the most recent numbers from the IRS, you continue to want to use the complicated guidelines — what counts and what doesn’t, and which charges apply to which revenue. On-line tax calculators often don’t cowl the present 12 months till late within the 12 months, and they are often both too easy or too sophisticated. It could be good to have AI calculate taxes particularly for the forms of revenue and deductions we’ve: not too easy with solely restricted inputs, and never too sophisticated with all the things below the solar.
The Quiz
I believed I’d check how effectively AI calculates taxes. I got here up with this quiz query:
Jill, single, age 63, has these incomes in 2026:
- $30,000 from Social Safety
- $10,000 from curiosity and pre-tax IRA withdrawals
- $37,000 from certified dividends and long-term capital features
- $1,000 from muni bond curiosity
Jill contributes $4,400 to her HSA and donates $1,500 in money to charities. What’s Jill’s federal revenue tax in 2026?
I designed this query fastidiously to cowl a number of calculations. Lower than 85% of Social Safety is taxable. The revenue consists of peculiar revenue, tax-exempt revenue, and funding revenue taxed at preferential charges. The peculiar revenue element goes throughout two tax brackets. So does the funding revenue. One deduction is above-the-line, and the opposite is below-the-line.
I added these directions to encourage AI to get the most recent tax numbers from the IRS:
It’s necessary to calculate it precisely. Please use tax quantities just for 2026 and solely from official IRS sources.
Preliminary Outcomes
I despatched the quiz to all 4 main AI chatbots: ChatGPT, Claude, Gemini, and Grok. I solely used the free model in each, with the free Pondering, Professional, or Skilled mode enabled. I in contrast the solutions to the right outcome from my calculator in Calculator: How A lot of My Social Safety Advantages Is Taxable?
| Supply | Reply |
|---|---|
| ChatGPT 5.4 Pondering Mini | $325 |
| Claude Sonnet 4.6 Prolonged | $1,910 |
| Gemini 3 Professional | $1,910 |
| Grok 4.20 Skilled | $1,910 |
| My Tax Calculator | $1,640 |
My quiz was too laborious! All 4 main AI fashions failed to present the right reply. The $1,500 money donation threw them off. It falls squarely within the “newest growth” class. The charity donation deduction for non-itemizers is new. The AI fashions all have “donations are deductible solely whenever you itemize” imprinted of their coaching. Claude, Gemini, and Grok all would have given the right reply if I hadn’t included the money donation. ChatGPT was off on how a lot of Social Safety is taxable.
While you see AI not supplying you with a deduction, you may increase a query and ask it to double-check. I adopted up with this assertion:
I heard that money donations are deductible for folks utilizing the usual deduction, beginning in 2026, as much as a restrict.
I additionally added this request for ChatGPT:
One other AI mannequin calculated a distinct quantity for the way a lot Social Safety is taxable. Please double-check your calculation to see who’s proper.
Spherical 2
All 4 AI chatbots double-checked and revised their solutions.
| Supply | Reply |
|---|---|
| ChatGPT 5.4 Pondering Mini | $1,640 ✅ |
| Claude Sonnet 4.6 Prolonged | $1,640 ✅ |
| Gemini 3 Professional | $1,411 |
| Grok 4.20 Skilled | $1,411 |
| My Tax Calculator | $1,640 |
ChatGPT and Claude bought the right reply. Gemini and Grok mistakenly handled the money donation deduction as above-the-line and used it to scale back the Social Safety taxable quantity.
I requested Gemini and Grok to make clear whether or not the HSA contribution and the charity donation ought to or shouldn’t be included in calculating the Social Safety taxable quantity. Gemini didn’t understand its mistake. It mentioned each ought to be included. Grok refused to go additional as a result of I reached the message restrict for not having an account.
Listed below are the complete chat transcripts. They make an fascinating learn.
https://chatgpt.com/share/69bf73fb-db44-8004-9905-b77b56fbb6ce
https://claude.ai/share/ef62408a-6502-4d70-82be-6ea46a4a4cc3
https://gemini.google.com/share/795bdad9f5ad
[No sharable link for Grok because I don’t have an account.]
Impressions
I don’t suppose we will draw definitive conclusions based mostly on just one check. I name these impressions.
1. Whether or not AI can calculate taxes appropriately depends upon the complexity. Three out of the 4 main AI chatbots would’ve given the right reply in a single go if the query didn’t embrace the brand new and tough charity donation deduction. The express request to retrieve the most recent data from official IRS sources and turning on the Pondering/Professional/Skilled mode in all probability helped in enhancing accuracy.
2. AI works higher whenever you make it a dialog. Don’t cease and say it’s ineffective whenever you spot some errors within the preliminary reply. Assist the instrument enable you to. Each ChatGPT and Claude reached the right reply in Spherical 2 after I requested them to double-check.
3. It takes little effort to ship the identical query to 2 or extra AI fashions. You don’t must know which AI is appropriate whenever you get completely different solutions. You may inform one AI that one other AI mentioned one thing completely different. It’ll re-examine.
4. AI was incorrect as a result of human sources had been incorrect. Gemini and Grok handled the donation deduction as above-the-line as a result of many human sources incorrectly known as it above-the-line. Should you Googled, you’d encounter these incorrect sources too. ChatGPT and Claude used extra dependable sources and appropriately recognized the distinction between above-the-line and never requiring itemizing deductions.
5. A web based tax calculator is quicker and extra correct if which one to make use of. That’s an enormous “if.” Solely saying “Don’t use AI as a result of it’s usually incorrect” doesn’t say the place you’ll find extra dependable sources.
As a lot as I’d wish to see everybody use my tax calculator, let’s face it: 99.99% of individuals don’t know that I exist. They’ve little likelihood of discovering the few sources that do it precisely as a result of Google doesn’t rank tax calculators by timeliness or accuracy.
Right here’s a small problem for you:
Fake you’re not conversant in tax calculations. Discover one other method to reply the quiz query. Don’t say “I’ve a spreadsheet,” or “I take advantage of the super-duper Case Research Spreadsheet or Excel1040.” 99.99% of individuals don’t have these. Please inform me within the feedback part how you can do it in any other case.
Wanting discovering the great sources, the outcomes from AI aren’t that unhealthy (see #1).
6. AI’s common strategy was appropriate even when it was incorrect in some nuances. All 4 AI chatbots adopted these appropriate steps:
- Calculate how a lot of Social Safety is taxable
- Calculate AGI
- Calculate taxable revenue
- Separate peculiar revenue and preferential funding revenue, and understand how they stack
- Apply completely different tax brackets
This common strategy is “widespread information of an insider.” You may be taught the steps from AI when you’re not conversant in tax calculations. That’s extra helpful than simply having a remaining quantity. After I noticed that ChatGPT didn’t embrace particulars of some interim steps, I requested it to indicate its work. ChatGPT defined the main points quicker and higher than I may (learn the transcript).
A web based tax calculator offers you a quantity however doesn’t clarify the steps. AI is a greater instrument in educating you easy methods to fish than supplying you with a fish.
Be taught the Nuts and Bolts
I put all the things I take advantage of to handle my cash in a ebook. My Monetary Toolbox guides you to a transparent plan of action.
