You would have to have been living under a rock not to have noticed the hoopla over the launch of ChatGPT, OpenAI's large language model, which signed up over 1 million users in just a few days after launch. Tech pundits rushed out dozens of articles describing in breathless tones the wondrous capabilities of this new AI technology, in applications as diverse as musical composition and resume writing to programming.
In a December 2022 opinion piece, economist Paul Krugman wrote that ChatGPT would affect the demand for knowledge workers (Krugman, Paul (December 6, 2022). "Does ChatGPT Mean Robots Are Coming For the Skilled Jobs?" . The New York Times. Retrieved December 6, 2022.
James Vincent saw the viral success of ChatGPT as evidence that artificial intelligence had gone mainstream (Vincent, James (December 8, 2022). "ChatGPT proves AI is finally mainstream – and things are only going to get weirder"
Only now, in the cold light of the grey January morning are users beginning to wise up to the fact that a lot of what ChatGPT produces is gibberish, dressed up to look like a valid response to the user's prompts and delivered in a highly confident tone. But as for the content itself, well that's another matter entirely.
Dr Darren Hudson Hick, for example, provided an example of ChatGPT-inspired plagiarism in which the essayist
confidently and thoroughly described Hume's views of the paradox of horror in a way that were [sic] thoroughly wrong
In his article Testing ChatGPT in Mathematics — Can ChatGPT do maths?, Hari Prasad found that the system:
failed to admit its mistake, challenged right premises and processed inappropriate requests that it couldn’t or not supposed to handle.
The area where ChatGPT has attracted the most positive reviews is code generation, with many programmers claiming that the system has multiplied their own productivity many-fold. But a recent post in the community by I van Veen (Unexpected answer from ChatGPT for Wolfram language query) illustrates the problem. As I wrote in my response:
the answer given has some elements that appear correct and clearly suggest some grasp of the WL syntax. Consequently, the answer seems plausible, to someone with limited knowledge of WL.
Secondly, however, the answer contains several basic errors.
So what has any of this to do with Wolfram?
I believe that the code-generation capabilities of platform like ChatGPT and GitHub's Copilot represent a serious threat, as well as an opportunity, for Wolfram.
Firstly the threat: one of the arguments made in favor of using WL compared to alternatives is that, despite bring proprietary and (by comparison) expensive, it compensates for that in programming productivity, thanks to the power of the language. However, if I can learn to generate large blocks of code in Python or C++ automatically using a NLP interface, my programming productivity is eventually going to match or exceed anything Wolfram has to offer.
Which bring me to the opportunity, one that I have been banging the drum about for a very long time: automated code generation in the WL. What I am suggesting is not a competitor to ChatGPT, which can generate responses to almost any kind of question, but a more limited NLP interface to rival Copilot and similar products. The Wolfram language is built for this, not only because of its computational power, but also because the language itself is computable, making it an ideal choice for automated expression composition.
So the idea is that you would interact with the interface like this:
How do I speed up this code using parallelization?
Show me how to write a compiled function to do the following...
Write a program to play play tic-tac-toe
Find and fix the bug in this code
Provide 3 examples of widely used ketones, show me their chemical structures and describe their properties and applications
Give me a step-by-step solution for the following integral
Identify all the actors appearing is this video clip
Write a WL api interface from Mathematica to this platform, covering the following functionality...
And so on.
My thesis is that if WR develops a game-changing capability like this it will add rocket-fuel to the productivity of WL developers and attract a cohort of new users who currently have neither the time nor programming skills to navigate the steep terrain of the Wolfram Language.
On the other hand, if WR continues on its current trajectory of incremental releases, it is only a matter of time before AI-assisted programming platforms overtake and surpass the WL in terms of programming productivity. In that case the WL will likely become yet another could-have/should-have story in the history of defunct computer languages.