Message Boards

WOLFRAM COMMUNITY

6797 Views

5 Replies

10 Total Likes

View groups...

Follow this post

Share this post:

GROUPS:

External Programs and Systems Wolfram Language

Unexpected answer from ChatGPT for Wolfram language query

l van Veen

l van Veen, Hewlett-Packard Enterprise

Posted 1 year ago

POSTED BY: l van Veen

5 Replies

Sort By:

Moderation Team

Moderation Team, WOLFRAM

Posted 1 year ago

Stephen Wolfram just published a new article: Wolfram\|Alpha as the Way to Bring Computational Knowledge Superpowers to ChatGPT https://writings.stephenwolfram.com/2023/01/wolframalpha-as-the-way-to-bring-computational-knowledge-superpowers-to-chatgpt We started a new dedicated discussion here: Stephen Wolfram on ChatGPT, Wolfram\|Alpha & Computational Knowledge https://community.wolfram.com/groups/-/m/t/2763581

POSTED BY: Moderation Team

Jonathan Kinlay

Jonathan Kinlay, Systematic Strategies

Posted 1 year ago

I think this nicely illustrates some of the strengths and weakness of ChatGPT, as I understand them. Firstly, the answer given has some elements that appear correct and clearly suggest some grasp of the WL syntax. Consequently, the answer seems plausible, to someone with limited knowledge of WL. Secondly, however, the answer contains several basic errors. For instance, Total[{{1, 2, 3}, {2, 3, 4}}] will not produce the answer 15, as claimed, but rather: {3, 5, 7} More importantly, the bot fails to spot the root cause of the problem, which is the missing & at the end of the anonymous function, as in: Total[# - 1] & /@ {{1, 2, 3}, {2, 3, 4}} {3, 6} So, in summary, ChatGPT's answer is plausible, but completely incorrect and it fails to identify the real issue. Now, imagine that instead of a toy programming problem like this you were faced with finding a bug in a much more complex code base, perhaps involving thousands of lines of WL code. In those circumstances it might be very difficult indeed to validate the answer given; or worse, the given answer might appear to work, but only for some limited set of circumstances (which only becomes apparent much later, perhaps not until the system makes it into production). An even worse scenario: imagine using ChatGPT to try to figure out a proof of some complex mathematical theorem. The proof could be highly complex and take a very long time to validate (as indeed it sometimes does with theorem proofs produced by professional mathematicians). And then, finally, imagine the consequences of using a tool like ChatGPT to diagnose and suggest remedies for "real world" problems. Sounds implausible: but school children using this tool might be fooled into believing that the plausible but flawed solutions offered by the bot are somehow valid. That could quickly become a very slippery road to find ourselves walking down.

POSTED BY: Jonathan Kinlay

Daniel Lichtblau

Daniel Lichtblau, Wolfram Research

Posted 1 year ago

Actually items (2) and (3) in the response are not without merit. They mention "no `Map` function being applied" and "the function being applied ... is not valid". What they fail to mention is that this defect is from lacking an ampersand, properly placed.

POSTED BY: Daniel Lichtblau

Tom Morrison

Posted 1 year ago

True, but from some users' perspectives, Jonathan's response is correct. The bot's response does not "help" the user, it only (arguably: in a confusing manner) identifies the source of the problem. But I might be wrong too. The correctness of the answer depends on the intent of the user - and for ChatGPT to get it right, it would need to have a conversation vs a one off question/issue statement.

POSTED BY: Tom Morrison

Duncan Pettengill

Duncan Pettengill, Wolfram Research

Posted 1 year ago

Hello, as I understand, you've consulted https://chat.openai.com/ with some Wolfram Language questions. This is a chat bot with no specific knowledge of the Wolfram Language and no affiliation with Wolfram Research. If you would like to provide feedback on the chat bot's responses, the OpenAI team is currently gathering feedback at https://cdn.openai.com/chatgpt/ChatGPT_Feedback_Contest_Rules.pdf Please let me know if I've misunderstood something.

POSTED BY: Duncan Pettengill

Reply to this discussion

Reply Preview

Attachments

Remove Add a file to this post

Follow this discussion

or Discard

Group Abstract

Feedback