AI LLM Has Trillion Dollar Use Case By Increasing Computer Programming Productivity

Devin is an AI that was used on the SWE-bench, a challenging benchmark that asks agents to resolve real-world GitHub issues found in open source projects like Django and scikit-learn. Devin correctly resolves 13.86% of the issues end-to-end, far exceeding the previous state-of-the-art of 1.96%. Even when given the exact files to edit, the best previous models can only resolve 4.80% of issues.

Microsoft copilot has also been used to improve programmer productivity. Senior developers in silicon valley have called Microsoft coPilot scary good. They believe that IT projects could soon be performed with an IT staff 10% of the size of pre-AI team sizes.

There is a list of 30 business productivity use cases for copilot.

Global Software and IT Services are a $2.3 trillion industry.

There are also smaller use cases of customer service, call center support and paralegals.

3 thoughts on “AI LLM Has Trillion Dollar Use Case By Increasing Computer Programming Productivity”

  1. >Devin correctly resolves 13.86% of the issues end-to-end

    So use it to help make Devin-2 which can resolve 14% of issues
    Use Devin-2 to help make Devin-3 which can resolve 15% of issues
    ….
    Soon Devin-X will far exceed any and all humans in resolving issues

  2. I believe programming is the killer niche for the LLM. I’ve wanted to make a fuel shuffling/management GUI for years, but never had the patience to spend the time learning how to program radio buttons and drop downs, displays, etc.. Programmers by trade might scoff at such a low barrier (making a basic mouse driven GUI) being prohibitive… Instead, I do the work rather manually with scripts operating on text files and pasting arrays between excel and ascii as needed. I’ve spent a lot of time developing the skills of the trade and could write a specification for the interface I desire as a user. CHAT-GPT does reduce the height of the barrier quite a bit; a customer can describe the desired product conversationally. Of course the customer would need to be proficient enough to debug and adapt the imperfect code given by the “AI”… I figure lots of professions have similar needs…

  3. Think of teams of LMMs that are trained for specific tasks like drop in replacements for most jobs that can be done on screens. They are disembodied robots whose general skills are language and understanding images on screens. Cloud based LMM team robots will start replacing jobs before physical humanoid robots do.

Comments are closed.