Message Boards Message Boards

0
|
6797 Views
|
5 Replies
|
10 Total Likes
View groups...
Share
Share this post:

Unexpected answer from ChatGPT for Wolfram language query

enter image description here

POSTED BY: l van Veen
5 Replies

Stephen Wolfram just published a new article:

Wolfram|Alpha as the Way to Bring Computational Knowledge Superpowers to ChatGPT

https://writings.stephenwolfram.com/2023/01/wolframalpha-as-the-way-to-bring-computational-knowledge-superpowers-to-chatgpt

We started a new dedicated discussion here:

Stephen Wolfram on ChatGPT, Wolfram|Alpha & Computational Knowledge
https://community.wolfram.com/groups/-/m/t/2763581

POSTED BY: Moderation Team

I think this nicely illustrates some of the strengths and weakness of ChatGPT, as I understand them.

Firstly, the answer given has some elements that appear correct and clearly suggest some grasp of the WL syntax. Consequently, the answer seems plausible, to someone with limited knowledge of WL.

Secondly, however, the answer contains several basic errors. For instance,

Total[{{1, 2, 3}, {2, 3, 4}}]

will not produce the answer 15, as claimed, but rather:

{3, 5, 7}

More importantly, the bot fails to spot the root cause of the problem, which is the missing & at the end of the anonymous function, as in:

Total[# - 1] & /@ {{1, 2, 3}, {2, 3, 4}}

{3, 6}

So, in summary, ChatGPT's answer is plausible, but completely incorrect and it fails to identify the real issue.

Now, imagine that instead of a toy programming problem like this you were faced with finding a bug in a much more complex code base, perhaps involving thousands of lines of WL code. In those circumstances it might be very difficult indeed to validate the answer given; or worse, the given answer might appear to work, but only for some limited set of circumstances (which only becomes apparent much later, perhaps not until the system makes it into production).

An even worse scenario: imagine using ChatGPT to try to figure out a proof of some complex mathematical theorem. The proof could be highly complex and take a very long time to validate (as indeed it sometimes does with theorem proofs produced by professional mathematicians).

And then, finally, imagine the consequences of using a tool like ChatGPT to diagnose and suggest remedies for "real world" problems. Sounds implausible: but school children using this tool might be fooled into believing that the plausible but flawed solutions offered by the bot are somehow valid.

That could quickly become a very slippery road to find ourselves walking down.

POSTED BY: Jonathan Kinlay

Actually items (2) and (3) in the response are not without merit. They mention "no Map function being applied" and "the function being applied ... is not valid". What they fail to mention is that this defect is from lacking an ampersand, properly placed.

POSTED BY: Daniel Lichtblau
Posted 1 year ago

True, but from some users' perspectives, Jonathan's response is correct. The bot's response does not "help" the user, it only (arguably: in a confusing manner) identifies the source of the problem. But I might be wrong too. The correctness of the answer depends on the intent of the user - and for ChatGPT to get it right, it would need to have a conversation vs a one off question/issue statement.

POSTED BY: Tom Morrison

Hello, as I understand, you've consulted https://chat.openai.com/ with some Wolfram Language questions.

This is a chat bot with no specific knowledge of the Wolfram Language and no affiliation with Wolfram Research. If you would like to provide feedback on the chat bot's responses, the OpenAI team is currently gathering feedback at https://cdn.openai.com/chatgpt/ChatGPT_Feedback_Contest_Rules.pdf

Please let me know if I've misunderstood something.

Reply to this discussion
Community posts can be styled and formatted using the Markdown syntax.
Reply Preview
Attachments
Remove
or Discard

Group Abstract Group Abstract