by greybox on 3/26/25, 4:54 PM with 138 comments
by Maro on 3/26/25, 6:30 PM
I.e., most managers can't help their team find a hard bug that is causing a massive outage.
Note: I'm a manager, and I spend a lot of time pondering how to spend my time, how to be useful, how to remain relevant, especially in this age of AI.
by vunderba on 3/26/25, 6:04 PM
The seductive promise of solving all your problems is the issue. By reaching for it to solve any problem at an almost instinctual level, you completely fail to cultivate an intrinsically valuable skill - critical reasoning.
That act of manipulating the problem in your head—critical thinking—is ultimately a craft. And the only way to become better at it is by practicing it in a deliberate, disciplined fashion.
This is why it's pretty baffling to me when I see attempts at comparing LLMs to the invention of the calculator. A calculator is still used IN SERVICE of a larger problem you are trying to solve.
by jonahx on 3/26/25, 7:21 PM
> In this paper, we aim to address this gap by conducting a survey of a professionally diverse set of knowledge workers (n = 319), eliciting detailed real-world examples of tasks (936) for which they use GenAI, and directly measuring their perceptions of critical thinking during these tasks
So, they asked people to remember times they used AI, and then asked them about their own perceptions about their critical thinking when they did.
How are we even pretending there is serious scientific discussion to be had about these "results"?
by oneofyourtoys on 3/26/25, 7:04 PM
Common activities provided by these gyms include fixing misconfigured printers, telling a virtual support customer to turn their PC off and back on again, and troubleshooting mysterious NVIDIA driver issues (the company went bankrupt 5 years ago, but its hardware is still in great demand for frustration tolerance training).
by sitkack on 3/26/25, 6:29 PM
> Abstract The rise of Generative AI (GenAI) in knowledge workflows raises questions about its impact on critical thinking skills and practices. We survey 319 knowledge workers to investigate 1) when and how they perceive the enaction of critical thinking when using GenAI, and 2) when and why GenAI affects their effort to do so. Participants shared 936 first-hand examples of using GenAI in work tasks. Quantitatively, when considering both task- and user-specific factors, a user’s task-specific self-confidence and confidence in GenAI are predictive of whether critical thinking is enacted and the effort of doing so in GenAI-assisted tasks. Specifically, higher confidence in GenAI is associated with less critical thinking, while higher self-confidence is associated with more critical thinking. Qualitatively, GenAI shifts the nature of critical thinking toward information verification, response integration, and task stewardship. Our insights reveal new design challenges and opportunities for developing GenAI tools for knowledge work.
It is to be presented at the CHI Conference: https://chi2025.acm.org/
https://en.wikipedia.org/wiki/Conference_on_Human_Factors_in...
by lenerdenator on 3/26/25, 5:30 PM
Fortunately, my ADHD-addled brain doesn't need some fancy AI to make its cognition "Atrophied and Unprepared"; I can do that all on my own, thank you very much.
by greybox on 3/26/25, 4:54 PM
“[A] key irony of automation is that by mechanising routine tasks and leaving exception-handling to the human user, you deprive the user of the routine opportunities to practice their judgement and strengthen their cognitive musculature, leaving them atrophied and unprepared when the exceptions do arise,” the researchers wrote.
by pseudocomposer on 3/26/25, 6:38 PM
Basically, we might need to standardize spending 10-20% of work time “keeping up” the automatable skills that once took up 80+% of work time, in fields where AI-based automation is making things more efficient.
This could even be done within automation platforms themselves, and sold to their customers as an additional feature. I suspect/hope that most employers do not want to see these automatable skills atrophy in their employees, for the sake of long-term efficiency, even if that means a small reduction in short-term efficiency gains from automation.
by nopelynopington on 3/26/25, 6:54 PM
I've also turned to AI in side projects, and it's allowed me to create some very fast MVPs, but the code is worse than spaghetti - it's spaghetti mixed with the hair from the shower drain.
None of the things I've built are beyond my understanding, but I'm lazy and it doesn't seem worth the effort to use my brain to code.
Probably the most use my brain gets every day is Wordle.
by sollewitt on 3/26/25, 5:41 PM
Whereas, you can ask an LLM to speak to you in e.g. Spanish, about whatever topic you're interested in, and be able to stop and ask it to explain any idioms or vocabulary or grammar in English at any time.
I found this to be more like a "cognitive gym". Maybe we're just not using the tools beneficially.
by tunesmith on 3/26/25, 8:30 PM
When using GenAI (and/or "being a manager"), aren't they somewhat inversely related?
I find implementation level programmers to generally be poor at stating specifications. They often phrase problems in terms of lacking their desired solutions. They jump straight to implementation.
But a manager has to get skilled at giving specifications: being clear about what they expect, without stating how to do it. And that's also a skill that needs to be developed quickly to use GenAI well. I think getting good at specifying is definitely worthwhile, and I think GenAI is helping a lot of people get better at it quickly.
Overall, it seems that should very much be considered part of "critical thinking".
by piltdownman on 3/26/25, 5:35 PM
While much is made of the 'diminished skill for independent problem-solving' caused by over-reliance, is there a more salient KPI than some iteration of this 'Synthetic Thinking Effort' by which to baseline and optimise the cost/benefit of AI usage versus traditional cognition?
by labrador on 3/26/25, 7:18 PM
I don't depend on AI for anything. I am not doing corporate work. Could it be that what people are experiencing is that they are becoming less suitable for corporate work as AI and robots replace them? Isn't this a good thing? Shouldn't the focus be on using AI to bring out the innate talents of humans that aren't profit driven?
by tsumnia on 3/26/25, 6:57 PM
That said, the counter to my own counter is "do I really need to memorize that?" Yes, yes, with no internet I'm screwed... but that's such a rare edge case. I am able to quickly find the command, and knowing that it is stored somewhere else may be enough knowledge for me rather than memorization. I can see GenAI falling into a similar design: I don't need to know explicitly how to do something, just that the task can be resolved through an LLM prompt.
Granted, we're still trying to figure out how to communicate with LLMs, and we only really have 3 years of experience. Most of our insights have come from blog posts and a handful of research articles. I agree that GenAI laziness is a growing issue, but I don't think it needs to go full "Idiocracy" sensationalist headline.
by 1vuio0pswjnm7 on 3/27/25, 3:19 PM
"Impact of Gen AI on Critical Thinking: Reduction in Cognitive Effort, Confidence"
"Impact of AI on Critical Thinking: Self-Reported Reductions in Cognitive Effort"
"The Impact of Generative AI on Critical Thinking: Reductions in Cognitive Effort"
Actual title of the paper:
"The Impact of Generative AI on Critical Thinking: Self-Reported Reductions in Cognitive Effort and Confidence Effects From a Survey of Knowledge Workers"
Previous discussion:
10 Feb 2025 17:01:08 UTC https://news.ycombinator.com/item?id=43002458 (1 comment)
10 Feb 2025 22:31:05 UTC https://news.ycombinator.com/item?id=43006140 (0 comments)
11 Feb 2025 11:14:06 UTC https://news.ycombinator.com/item?id=43011483 (0 comments) [dead]
11 Feb 2025 14:13:36 UTC https://news.ycombinator.com/item?id=43012911 (1 comment)
12 Feb 2025 01:47:16 UTC https://news.ycombinator.com/item?id=43020846 (0 comments) [flagged] [dead]
14 Feb 2025 15:54:57 UTC https://news.ycombinator.com/item?id=43049676 (1 comment)
15 Feb 2025 12:06:01 UTC https://news.ycombinator.com/item?id=43057907 (101 comments)
by allenrb on 3/26/25, 7:17 PM
Right now I’m curious to see how long I can keep up with those using AI for more mundane assistance. So far, so good.
by gatinsama on 3/26/25, 6:14 PM
If you don't understand what's happening, you have no way to know if the system is working as intended. And understanding (and deciding) exactly how the system works is the really hard part for any sufficiently complex project.
by 0x20cowboy on 3/26/25, 6:07 PM
I only tend to do that when I am tired or annoyed, but when I do it I can feel myself getting dumber. And it’s a weirdly satisfying feeling.
I just need a chair that doubles as a toilet and I’ll be all set.
by _heimdall on 3/26/25, 6:33 PM
Critical thinking is a skill that requires practice to improve and maintain. Using LLMs pushes the task that would require critical thinking off to something/someone else. Of course the user will get worse at critical thinking when they do it less often.
by jhallenworld on 3/26/25, 7:20 PM
You are tremendously better off getting a bad grade doing your own work than getting a good one using ChatGPT.
by moralestapia on 3/26/25, 5:34 PM
I also believe, however, that humans who are able to reason properly will become much more valuable, for this same reason.
by DrNosferatu on 3/26/25, 6:23 PM
We have the cognitive science to make it happen - or at least to learn how to structure it.
by ChrisArchitect on 3/26/25, 6:04 PM
Some discussion on the study: https://news.ycombinator.com/item?id=43057907
by derefr on 3/26/25, 7:28 PM
I don't build or rely on pre-prompted agents to automate specific problems or workflows. Rather, I only rely on services like ChatGPT or Claude for their generic reasoning, chat, and "has read the entire web at some point" capabilities.
My use-cases break down into roughly equal thirds:
---
1. As natural-language, iteratively-winnowing-the-search-space versions of search engines.
Often, I want to know something — some information that's definitely somewhere out there on the web. But, from 30+ years of interacting with fulltext search systems, I know that traditional search engines have limitations in the sorts of queries that'll actually do anything. There are a lot of "objective, verifiable, and well-cited knowledge" questions that are just outside of the domain of Google search.
One common example of fulltext-search limitations is when you know how to describe a thing you're imagining, a thing that may or may not exist — but you don't know the jargon term for it (if there even is one.) No matter how many words you throw at a regular search engine, they won't dredge up discussions about the thing, because discussions about the thing just use the jargon term — they don't usually bother to define it.
To find answers to these sorts of questions, I would previously have asked a human expert — either directly, or through a forum/chatroom/subreddit/Q&A site/etc.
But now, I've got a new and different kind of search engine — a set of pre-trained base models that, all by themselves, perform vaguely as RAGs over all of the world's public-web-accessible information.
Of course, an LLM won't have crystal clarity in its memory — it'll forget exact figures, forget the exact phrasing of quotations, etc. And if there's any way that it can be fooled or misled by some random thing someone made up somewhere on the web once, it will be.
But ChatGPT et al can sure tell me the right jargon term (or entire search query) to turn what was previously, to me, almost deep-web information, into public-web information.
---
2. As a (fuzzy-logic) expert system in many domains, that learned all its implications from the public information available on the web.
One fascinating thing about high-parameter-count pre-trained base models, is that you don't really need to do any prompting, or supply any additional information, to get them to do a vaguely-acceptable job of diagnosis — whether that be diagnosing your early-stage diabetic neuropathy, or that mysterious rattle in your car.
Sure, the LLM will be wrong sometimes. It's just a distillation of what a bunch of conversations and articles spread across the public web have to say about what are or aren't the signs and symptoms of X.
But those are the same articles you'd read. The LLM will almost always outperform you in "doing your own research" (unless you go as far as to read journal papers — I don't know of any LLM base model that's been trained on arXiv yet...). It won't be as good at medicine as a doctor, or as good at automotive repair as an automotive technician, etc. — but it will be better (i.e. more accurate) at those things than an interested amateur who's watched some YouTube videos and read some pop-science articles.
Which means you can just tell LLMs the "weird things you've noticed lately", and get it to hypothesize for you — and, as long as you're good at being observant, the LLM's hypotheses will serve as great lines of investigation. It'll suggest which experts or specialists you should contact, what tests you can perform yourself to do objective differential diagnostics, etc.
(I don't want to under-emphasize the usefulness of this. ChatGPT figured out my house had hidden toxic mold! My allergies are gone now!)
---
3. As a translator.
Large-parameter-count LLM base models are actually really, really good at translation. To the point that I'm not sure why Google Translate et al haven't been updated to be powered by them. (Google Translate was the origin of the Transformer architecture, yet it seems to have been left in the dust since then by the translation performance of generic LLMs.)
And by "translation", I do literally mean "translating entire documents from one spoken/written human language to another." (My partner, who is a fluently-bilingual writer of both English + [Traditional] Chinese, has been using Claude to translate English instructions / documents into Chinese for her [mostly monolingual Chinese] mother to better understand them; and to translate any free-form responses her mother is required to give, back into English. She used to do these tasks herself "by hand" — systems like Google Translate would provide results that were worse-than-useless. But my partner can verify that, at least for this language pair, modern LLMs are excellent translators, writing basically what she would write herself.)
But I also mean:
• The thing Apple markets as part of Apple Intelligence — translation between writing styles (a.k.a. "stylistic editing.") You don't actually need a LoRA / fine-tune to do this; large-parameter-count models already inherently know how to do it.
• Translating between programming languages. "Rewrite-it-in-Rust" is trivial now. (That's what https://www.darpa.mil/research/programs/translating-all-c-to... is about — trying to build up an agentive framework that relies on both the LLM's translation capabilities, and the Rust compiler's typing errors on declaration change, to brute-force iterate across entire codebases, RiiRing one module at a time, and then recursing to its dependents to rewrite them too.)
• Translating between pseudocode, and/or a rigorous description of code, and actual code. I run a data analytics company; I know far more about the intricacies of ANSI SQL than any man ought to. But even I never manage to remember the pile of syntax features that glom together to form a "loose index scan" query. (WITH RECURSIVE, UNION ALL, separate aliases for the tables used in the base vs inductive cases, and one of those aliases referenced in a dependent subquery... but heck if I recall which one.) I have a crystal-clear picture of what I want to do — but I no longer need to look up the exact grammar the SQL standard decided to use yet again, because now I can dump out, in plain language, my (well-formed) mental model of the query — and rely on the LLM to translate that model into ANSI SQL grammar.
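(For the curious, the loose-index-scan shape described above looks roughly like the sketch below: Postgres-flavored SQL against a hypothetical orders table with an indexed customer_id column, both names made up purely for illustration, emulating SELECT DISTINCT customer_id.)

    -- Loose index scan: walk the distinct values of an indexed column by
    -- repeatedly probing for the next value greater than the last one found,
    -- instead of scanning every row. (orders/customer_id are made-up names.)
    WITH RECURSIVE walk (customer_id) AS (
        -- base case: the smallest value in the index
        (SELECT customer_id FROM orders ORDER BY customer_id LIMIT 1)
      UNION ALL
        -- inductive case: a dependent subquery over a separately aliased
        -- copy of the table fetches the next larger value; note that it is
        -- the recursive alias (w) that gets referenced inside the subquery
        SELECT (SELECT o.customer_id
                  FROM orders o
                 WHERE o.customer_id > w.customer_id
                 ORDER BY o.customer_id
                 LIMIT 1)
          FROM walk w
         WHERE w.customer_id IS NOT NULL
    )
    SELECT customer_id FROM walk WHERE customer_id IS NOT NULL;

(Each recursive step is a single index probe, so the cost scales with the number of distinct values rather than the number of rows; recursion stops once the inner subquery returns NULL, and the final WHERE filters that sentinel row out.)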
by RecycledEle on 3/26/25, 10:48 PM
The same is true of managers. I have had managers who yelled at me to do things they did not understand. They rotted on the inside. Other managers learned every trick I brought to the company. They grew.
by rraghur on 3/26/25, 6:48 PM
In the end, I probably spent more time and learnt nothing. My initial take was that this is the kind of thing I don't care much for, so giving it to an LLM is OK... However, by the end of it I ended up more frustrated, and lost out on the stimulation of working things out as well.
by MrMcCall on 3/26/25, 6:43 PM
Comedians' ability diminishes as they take time off.
Ahnold wasn't lounging around all day.
We should understand that fixing crappy, nonsensical code is not a productive skillset. As Leslie Lamport said the other day, logically developing and coding out proper abstractions is the core skillset, and not one to be delegated to just anything or anyone.
It's ok; the bright side for folks like me is that you're just happily hamstringing yourselves. I've been trying to tell y'all, but I can only show y'all the water, not make you drink.