Ignoring lemmyhate, are programmers really using AI to be more efficient?

bridgeenjoyer@sh.itjust.works · 4 months ago

Ignoring lemmyhate, are programmers really using AI to be more efficient?

Sicklad@lemmy.world · 4 months ago

From my experience it’s great at doing things that have been done 1000x before (which makes sense given the training data), but when it comes to building something novel it really struggles, especially if there’s 3rd party libraries involved that aren’t commonly used. So you end up spending a lot of time and money hand holding it through things that likely would have been quicker to do yourself.

kewjo@lemmy.world · 4 months ago

the 1000x before bit has quite a few sideffects to it as well.

lesser used languages suffer because there’s not enough training data. this gets annoying quickly when it overrides your static tools and suggests nonsense.
larger training sets contain more vulnerabilities as most code is pretty terrible and may just be snippets that someone used once and threw away. owasp has a top 10 for a reason. take input validation for example, if I’m working on parsing a string there’s usually context such as is this trusted data or untrusted? if i don’t have that mental model where I’m thinking about the data i might see generated code and think it looks correct but in reality its extremely nefarious.

mesa@piefed.social · 4 months ago

Its also trained on old stuff.

And because its old, you get some very strange side effects and less maintainability.

MagicShel@lemmy.zip · edit-2 4 months ago

It’s decent at reviewing its own code, especially if you give it different lenses to look though.

“Analyze this code and look for security vulnerabilities.” “Analyze this code and look for ways to reduce complexity.”

And then… think about the response like it’s a random dude online reviewing your code. Lots of times it raises good issues but sometimes it tries too hard to find little shit that is at best a sidegrade.

Abrinoxus@lemmy.today · 4 months ago

this

Eq0@literature.cafe · 4 months ago

The pycharm AI integration completes each line. That’s very useful when you are repeating a well known algorithm and not distracting when you are doing something unusual. So overall, for small things AI is a speed up. I haven’t tried asking chatgpt for bigger coffe chunks, I haven’t had the greatest experience with it up to now and ii don’t want to spend more time debugging than I am already.

ripcord@lemmy.world · 4 months ago

Oh man, the Codeium auto complete in PyCharm has been just awful for me. Slow enough that it doesnt come up fast enough that I ever expect it (and rarely comes up when I pause to wait for it) then goes away instantly when I invariably continue typing when it comes up. Then won’t come back if I backspace, erase the word and start retyping it, etc. And competes with the old school pycharm auto complete sometimes which adds another layer of fun.

Beej Jorgensen@lemmy.sdf.org · 4 months ago

I’m pretty sure every time you use AI for programming your brain atrophies a little, even if you’re just looking something up. There’s value in the struggle.

So they can definitely speed you up, but be careful how you use it. There’s no value in a programmer who can only blindly recite LLM output.

There’s a balance to be struck in there somewhere, and I’m still figuring it out.

0x1C3B00DA@piefed.social · 4 months ago

I’m pretty sure every time you use AI for programming your brain atrophies a little, even if you’re just looking something up. There’s value in the struggle.

I assume you were joking but some studies have come out recently that found this is exactly what happens and for more than just programming. (sorry it was a while ago so I dont have links)

ripcord@lemmy.world · 4 months ago

Doesn’t sound like they’re joking to me.

Honytawk@lemmy.zip · 4 months ago

There are similar studies on the effects of watching a Youtube video instead of reading a manual.

Zexks@lemmy.world · 4 months ago

This is literally the exact same argument made against using books and developing writing.

Naich@lemmings.world · 4 months ago

You can either spend your time generating prompts, tweaking them until you get what you want and then using more prompts to refining the code until you end up with something that does what you want…

or you can just fucking write it yourself. And there’s the bonus of understanding how it works.

AI is probably fine for generating boiler plate code or repetitive simple stuff, but personally I wouldn’t trust it any further than that.

MagicShel@lemmy.zip · 4 months ago

There is a middle ground. I have one prompt I use. I might tweak it a little for different technologies, languages, etc. only so I can fit more standards, documentation and example code in the upload limit.

And I ask it questions rather than asking it to write code. I have it review my code, suggest other ways of doing something, have it explain best practices, ask it to evaluate the maintainability, conformance to corporate standards, etc.

Sometimes it takes me down a rabbit hole when I’m outside my experience (so does Google and stack overflow for what it’s worth), but if you’re executing a task you understand well on your own, it can help you do it faster and/or better.

🇰 🌀 🇱 🇦 🇳 🇦 🇰 🇮 @pawb.social · 4 months ago

I don’t see how it could be more effecient to have AI generate something that you then have to review and make sure actually works over just writing the code yourself, unless you don’t know enough to code it yourself and just accept the AI generated code as-is without further review.

Quibblekrust@thelemmy.club · 4 months ago

I don’t see how it could be more effecient to have [a junior developer write] something that you then have to review and make sure actually works over just writing the code yourself…

iglou@programming.dev · 4 months ago

A junior dev wont be a junior dev their whole career, code reviews also educates them
You can’t trust the quality of a junior’s work, but you can trust that they are able to understand the project and their role in it. LLMs are by definition unable to think and understand. Just pretty good at pretending they are. Which leads to the third point:
When you “vibe code”, you don’t “just” have to review the produced code, you also have to constantly tell the LLM what you want it to do. And fight with it when it fucks up.

Pup Biru@aussie.zone · 4 months ago

if the only point of hiring junior devs were to skill them up so they’d be useful in the future, nobody would hire junior devs

LLMs aren’t the brain: they’re exactly what they are… a fancy auto complete…

type a function header, let if fill the body… as long as you’re descriptive enough and the function is simple enough to understand (as all well structured code should be) it usually gets it pretty right: it’s somewhat of a substitute for libraries, but not for your own structure

let it generate unit tests: doesn’t matter if it gets it wrong because the test will fail; it’ll write a pretty solid test suite using edge cases you may have forgotten

fill lines of data based on other data structures: it can transform text quicker than you can write regex and i’ve never had it fail at this

let it name functions based on a description… you can’t think of the words, but an LLM has a very wide vocabulary and - whilst not knowledge - does have a pretty good handle on synonyms and summary etc

there’s load of things LLMs are good for, but unless you’re just learning something new and you know your code will be garbage anyway, none of those things replace your brain: just repetitive crap you probably hate to start with because you could explain it to a non-programmer and they could carry out the tasks

iglou@programming.dev · 4 months ago

if the only point of hiring junior devs were to skill them up so they’d be useful in the future, nobody would hire junior devs

I never said that, and a single review already will make a junior dev better off the bat

LLMs aren’t the brain: they’re exactly what they are… a fancy auto complete

I agree, but then you say…

type a function header, let if fill the body… as long as you’re descriptive enough and the function is simple enough to understand (as all well structured code should be) it usually gets it pretty right: it’s somewhat of a substitute for libraries, but not for your own structure

…which says the other thing. Implementing a function isn’t for a “fancy autocomplete”, it’s for a brain to do. Unless all you do is reinventing the wheel, then yeah, it can generate a decent wheel for you.

let it generate unit tests: doesn’t matter if it gets it wrong because the test will fail; it’ll write a pretty solid test suite using edge cases you may have forgotten

Fuck no. If it gets the test wrong, it won’t necessarily fail. It might very well pass even when it should fail, and that’s something you won’t know unless you review every single line it spits out. That’s one of the worst areas to use an LLM.

fill lines of data based on other data structures: it can transform text quicker than you can write regex and i’ve never had it fail at this

I’m not sure what you mean by that.

let it name functions based on a description… you can’t think of the words, but an LLM has a very wide vocabulary and - whilst not knowledge - does have a pretty good handle on synonyms and summary etc

I agree with that, naming or even documenting is a good way to use an LLM. With supervision of course, but an imprecise name or documentation is not critical.

Pup Biru@aussie.zone · edit-2 4 months ago

Implementing a function isn’t for a “fancy autocomplete”, it’s for a brain to do. Unless all you do is reinventing the wheel, then yeah, it can generate a decent wheel for you.

pretty much every line of code we write in modern software isn’t unique… we use so many orders of magnitude more lines of other people’s code than our own, we’re really just plumbing pipes together

most functions we write that aren’t business logic specific to the problem domain of our software (and even sometimes then) has been written before… the novel part isn’t in the function body: the low level instructions… the novel part is how those instructions are structured… that may as well be pseudocode, and that pseudocode may as well take the form of function headers

Fuck no. If it gets the test wrong, it won’t necessarily fail. It might very well pass even when it should fail, and that’s something you won’t know unless you review every single line it spits out. That’s one of the worst areas to use an LLM.

write tests, tests fail, write code, tests slowly start to pass until you’re done… this is how we’ve always done TDD because it ensures the tests fail when they should. this is a good idea with or without LLMs because humans fuck up unit tests all the time

I’m not sure what you mean by that.

for example, you have an external API of some kind with an enum expressed via JSON as a string and you want to implement that API including a proper Enum object… an LLM can more easily generate that code than i can, and the longer the list of values the more cumbersome the task gets

especially effective for generating API wrappers because they basically amount to function some_name -> api client -> call /api/someName

this is basically a data transformation problem: translate from some structure to a well-defined chunk of code that matches the semantics of your language of choice

this is annoying for a human, and an LLM can smash out a whole type safe library in seconds based on little more than plain english docs

it might not be 100% right, but the price for failure is an error that you’ll see and can fix before the code hits production

and of course it’s better to generate all this using swagger specs, but they’re not always available and tend not to follow language conventions quite so well

for a concrete example, i wanted to interact with blackmagic pocket cinema cameras via bluetooth in swift on ios: something they don’t provide an SDK for… they do, however document their bluetooth protocols

https://documents.blackmagicdesign.com/UserManuals/BlackmagicPocketCinemaCameraManual.pdf?_v=1742540411000

(page 157 if you’re interested)

it’s incredibly cumbersome, and basically involves packing binary data into a packet that represents a different protocol called SDI… this would have been horrible to try and work out on my own, but with the general idea of how the protocol worked, i structured the functions, wrote some test case using the examples they provided, handed chatgpt the pdf and used it to help me with the bitbanging nonsense and translating their commands and positionally placed binaries into actual function calls

could i have done it? sure, but why would i? chat gpt did in 10 seconds what probably would have taken me at least a few hours of copying data from 7 pages of a table in a pdf - a task i dont enjoy doing, in a language i don’t know very well

Quibblekrust@thelemmy.club · 4 months ago

fill lines of data based on other data structures: it can transform text quicker than you can write regex and i’ve never had it fail at thisI’m

not sure what you mean by that.

Not speaking for them, but I use LLMs for this. You have lines of repetitive code, and you realize you need to swap the order of things within each line. You could brute force it, or you could write a regex search/replace. Instead, you tell the LLM to do it and it saves a lot of time.

Swapping the order of things is just one example. It can change capitalization, insert values, or generate endless amounts of mock data.

iglou@programming.dev · edit-2 4 months ago

Ah! That does seem useful indeed! Even just generating a bunch a dummy data.

Quibblekrust@thelemmy.club · 4 months ago

I was tasked once with writing a front-end for an API that didn’t exist yet, but I had a model. I could have written a loop that generated “Person Man 1”, “Person Man 2”, etc. with all of the associated fields, but instead I gave the LLM my class definition and it spat out 50 people with unique names, phone numbers, emails, and everything. It made it easy to test the paging and especially the filtering. It also took like 30 seconds to ask for and receive.

I originally asked it to make punny names based on celebrities, and it said “I can’t do that.” ☹️

Pup Biru@aussie.zone · 4 months ago

yup! absolutely this too - i provided a different example in my reply, but honestly this is exactly the thing i use it for most… type a couple of lines, it gets the idea of what you’re trying to copy, and then it’s just hitting accept until it’s done… it’s pretty close to 100% accurate, and even if it’s not… fixing it ain’t exactly hard!

leftzero@lemmy.dbzer0.com · 4 months ago

The junior developer can (hopefully) learn and improve.

Honytawk@lemmy.zip · 4 months ago

LLMs are also improving though.

leftzero@lemmy.dbzer0.com · 4 months ago

They’ll never be able to learn, though.

A LLM is merely a statistical model of its training material. Very well indexed but extremely lossy compression.

It will always be outdated. It can never become familiar with your codebase and coding practices. And it’ll always be extremely unreliable, because it’s just a text generator without any semblance of comprehension about what the texts it generates actually mean.

All it’ll ever be able to do is reproduce the standards as they were when its training model was captured.

If we are to compare it to a junior developer, it’d be someone who suffered a traumatic brain injury just after leaving college, which prevents them from ever learning anything new, makes them unaware that they can’t learn, and incapable of realising when they don’t know something, makes them unable to reason or comprehend what they are saying, and causes them to suffer from verbal diarrhoea and excessive sycophancy.

Now, such a tragically brain damaged individual might look like the ideal worker to the average CEO, but I definitely wouldn’t want them anywhere near my code.

Zexks@lemmy.world · 4 months ago

You can type at 300 words per minute with zero mistakes. Youre able to do than on systems youve never worked on before in languages youve never seen. #Doubt

sobchak@programming.dev · 4 months ago

In the grand scheme of things, I think AI code generators make people less efficient. Some studies have come out that indicate this. I’ve tried to use various AI tools, as I do like fields of AI/ML in general, but they would end up hampering my work in various ways.

southernbrewer@lemmy.world · edit-2 4 months ago

I’m enjoying it, mostly. It’s definitely great at some tasks and terrible at orhers. You get a feel for what those are after a while:

Throwaway projects - proof of concepts, one-off static websites, that kind of thing: absolutely ideal. Weeks of dev becomes hours, and you barely need to bother reviewing it if it works.
Research (find a tool for doing XYZ) where you barely know the right search terms: ideal. The research mode on claude.ai is especially amazing at this.
Anything where the language is unfamiliar. AI bootstraps past most of the learning curve. Doesn’t help you learn much, but sometimes you don’t care about learning the codebase layout and you just need to fix something.
Any medium sized project with a detailed up front description.

What it’s not good for:

Debugging in a complex system
Tiny projects (one line change), faster to do it yourself
Large projects (500+ line change) - the diff becomes unreviewable fairly quickly and can’t be trusted (much worse than the same problem with a human where you can at least trust the intent)

iglou@programming.dev · 4 months ago

I’m not against AI use in software development… But you need to understand what the tools you use actually do.

An LLM is not a dev. It doesn’t have the capability to think on a problem and come up with a solution. If you use an LLM as a dev, you are an idiot pressing buttons on a black box you understand nothing about.

An LLM is a predictive tool. So use it as a predictive tool.

Boilerplate code? It can do that, yeah. I don’t like to use it that way, but it can do that.
Implementing a new feature? Maybe, if you’re lucky, it has been trained on enough data that it can put something together. But you need to consider its output completely untrustworthy, and therefore it will require so much reviewing that it’s just better to write it yourself in the first place.
Implementing something that solves a problem not solved before? Just don’t. Use your own brain, for fuck’s sake. That’s what you have been trained on.

The one use of AI, at the moment, that I actually like and actually improves my workflow is JetBrains’ full line completion AI. It very often accurately predicts what I want to write when it’s boilerplate-ish, and shuts up when I write something original.

Zexks@lemmy.world · edit-2 4 months ago

Yes they do have the abikity to think and reason just like you (generally mush faster and slightly better)

https://medium.com/@leucopsis/how-gpt-5-compares-to-claude-opus-4-1-fd10af78ef90

96% on the AIME with zero tools. Only reading the question and reasoning through the answer

https://www.datacamp.com/blog/gpt-5

NewNewAugustEast@lemmy.zip · edit-2 4 months ago

This is not true. They do not think or reason. They have code that appears to reason, but it definitely is not.

Once it gets off track it doesn’t consider that it is obviously wrong.

A simple math problem can fail and it is really obvious to a human for example.

iglou@programming.dev · 4 months ago

Absolutely not. This comment shows you have absolutely zero idea how an LLM works.

speculate7383@lemmy.today · edit-2 4 months ago

No, they can’t think and reason. However, they can replicate and integrate the thinking and reasoning of many people who have written about similar problems. And yes, they can do it must faster than we could read a hundred search result pages. And yes, their output looks slightly better than many of us in many cases, because they are often dispensing best practices by duplicating the writings of experts. (In the best cases, that is.)

https://arxiv.org/pdf/2508.01191

https://arstechnica.com/ai/2025/08/researchers-find-llms-are-bad-at-logical-inference-good-at-fluent-nonsense/

criss_cross@lemmy.world · 4 months ago

From my experience it’s really great at bootstrapping new projects for you. It’s good at getting you sample files and if you’re using cursor just building out a sample project.

It’s decent at being an alternative to google/SO for syntax or previously encountered errors. There’s a few things it hallucinates but generally it can save time as long as you don’t trust it blindly.

It struggles when you give it complex tasks or not-straightforward items. Or things that require a lot of domain knowledge. I once wanted to see what css classes were still in use across a handful of react components and it just shat the bed.

The people who champion AI as a human replacement will build a quick proof of concept with it and proclaim “oh shit this is awesome!” And not realize that that’s the easy part of software engineering.

TabbsTheBat (they/them)@pawb.social · 4 months ago

The truth, as always, is in the middle. Plenty of programmers hate it, and plenty of programmers like it. It just so happens that these kinds of spaces attracts more or the hate crowd, and paid shill articles attract more or the like crowd ¯⁠\⁠_⁠(⁠ツ⁠)⁠_⁠/⁠¯

atzanteol@sh.itjust.works · 4 months ago

The hate is ridiculous as is the hype.

It’s a new tool that is often useful when used correctly. Don’t use it to write entire applications - that’s a recipe for disaster.

But if you’re learning a new language it’s amazing. You have an infinitely patient and immediately available tutor that can teach you a language’s syntax, best practices, etc. I don’t know why that would make you “ill” besides all the shame “real developers” seem to want to lump on anybody who uses AI. If you’re not concerned about passing some “I don’t use an IDE” nerd’s purity test you’ll be fine.

Ledivin@lemmy.world · edit-2 4 months ago

Its an absolute gamechanger, IMO - the research phase of any task is reduced to effectively nothing, and I get massive amounts of work done when I walk away from my desk, because I plan for and keep lists of longer tasks to accomplish during those times.

You need to review every line of code it writes, but that’s no different than it ever was when working with junior devs 🤷‍♂️ but now I get the code in minutes instead of weeks and the agents actually react to my comments.

We’re using this with a massive monorepo containing hundreds of thousands of lines of code, and in tiny tool repos that serve exactly one purpose. If our code quality checks and standards werent as strict as they have been for the past decade, I think it wouldn’t work well with the monorepo.

The important part is that my company is paying for it - I have no clue what these tools cost. I am definitely more productive, there is absolutely no debate there IMO. Is the extra productivity worth the extra cost? I have literally no idea.

fubarx@lemmy.world · 4 months ago

I use it mainly to tweak things I can’t be bothered to dig into, like Jekyll or Wordpress templates. A few times I let it run and do a major refactor of some async back-end code. It botched the whole thing. Fortunately, easy to rewind everything from remote git repo.

Last week I started a brand new project, thought I’d have it write the boilerplate starter code. Described in detail what I was looking for. It sat there for ten minutes saying ‘Thinking’ and nothing happened. Killed it and created it myself. This was with Cursor using Claude. I’ve noticed it’s gotten worse lately, maybe because of the increased costs.

djmikeale@feddit.dk · 4 months ago

Not total bullshit, but it’s not great for all use cases:

For coding tasks the output looks good on the surface but often I end up changing stuff, meaning it would have been faster up do myself.

For coding I know little about (currently writing some GitHub actions), it’s great at explaining alternatives, pros and cons, to give me a rudimentary understanding of stuff

I’ve also used it to transcribe tutorial screencasts, and then afterwards having a secondary LLM use the transcription to generate documentation (include in prompt: "when relevant, generate examples, use markdown tables, generate plantuml etc)

notarobot@lemmy.zip · 4 months ago

Yes. But I’m not paying for premium like some of my cowokres. I use it to avoid the grunt work, and to avoid things I know I’d have to google.

I used some coworkers account for a while and auto complete is amazing. I it guesses wrong you just keep tipping as usual. If its right, hit tab and saves you like 20 seconds.

On the other hand I have cokowkers that do not check the chatgpt output and the PRs make no sense. Example: instead of making a variable type any (which is forbidden in our codebase) they did

Let a : date|number|string|object|(…) = fetchData()

VoterFrog@lemmy.world · 4 months ago

My favorite use is actually just to help me name stuff. Give it a short description of what the thing does and get a list of decent names. Refine if they’re all missing something.

Also useful for finding things quickly in generated documentation, by attaching the documentation as context. And I use it when trying to remember some of the more obscure syntax stuff.

As for coding assistants, they can help quickly fill in boilerplate or maybe autocomplete a line or two. I don’t use it for generating whole functions or anything larger.

So I get some nice marginal benefits out of it. I definitely like it. It’s got a ways to go before it replaces the programming part of my job, though.

leftzero@lemmy.dbzer0.com · 4 months ago

My favorite use is actually just to help me name stuff.

Reverse dictionary lookup, more or less.

Now, that is something LLMs should be actually good at, unlike practically any other thing they’re being sold as being good at.