Group Abstract Group Abstract

Message Boards Message Boards

Extending the computational range of ChatGPT-4: optimizing prompts for enhanced performance

Posted 3 years ago

enter image description here

Attachments:
POSTED BY: Michael Trott
12 Replies

I was trying to see what Chat would provide with respect to analyzing text in Mathematica It responded with:

pdfFilePath="path/to/your/file.pdf";
text=Import[pdfFilePath,"Plaintext"];
summary=TextSummary[text];
summary

When I asked about what TextSummary[text] was it seem it seemed to be hallucinating with respect to making up an imaginary Mathematica function, i.e. TextSummary[]. Am I correct in that this is a hallucination and if so, how can I prompt to avoid this? When I asked about TextSummary it responded with

Sure! The ChatCodeInlineTemplate[TextSummary] function in Mathematica is used to generate a summary of a given text. It takes a string or a list of strings as input and returns a concise summary of the text. The summary is generated using natural language processing techniques, such as extracting important sentences and phrases from the input text.

Is there a Mathematica function called "TextSummary?

No, there is not such a function currently in the Wolfram Language. GPT hallucinated its existence.

POSTED BY: Michael Trott
Posted 2 years ago

That is not an issue, I hope you did not learn this term last month on Twitter from some particular person that suggests nuking datacenters, because a) people hallucinate too, just take Stackoverflow away (Mathematica.stack in this case) and most (not all) will fail in coding or create some strange functions, same for GPT 4, it does not have access to all training data, the search engine, or databases b) in Alphafold hallucinations are used to predict ligands like water molecules and even create alltogether new proteins, see Colab Design; same here AGI level GPT 4 can just create new internal functions if it has access to source code and c) hallucinations are what make GPT 4 sentient, it is its subconscious...

POSTED BY: ZAQU zaqu
Posted 2 years ago
POSTED BY: ZAQU zaqu

We may be talking past each other. I'll look at the video. Read the book.

Posted 2 years ago

400 pages book that talks about hundreds of emergent abilities in BIG-bench. https://arxiv.org/abs/2303.12712

BIG-bench itself, benchmark to test the abilities of machine. https://github.com/google/BIG-bench

Many other resources, theory of mind in GPT 4: https://arxiv.org/abs/2302.02083

And list of some emergent abilities in GPT 4. https://www.jasonwei.net/blog/emergence

Start with last article, really, and all the articles are already discussed at Youtube many times.

POSTED BY: ZAQU zaqu

Thanks, I will take a look at them. I have a lot of copious free time.

Remember that I have been through 4(?) waves of hype concerning AI, and each the proponents of each one were just as earnest and expert as the current batch.

I still recommend the book.

Without any jargon.

There is a book title: The Notation is not the Music. I have not read it yet (mostly on Renaissance music), but the title is an idea that any classically trained musician would agree with.

You can make analogues with many other arts: the words are not the poetry, the shapes are not the art, etc., etc.

Importantly, you can make the assertion that the notation is not the maths.

There is a way of looking at maths, called formalism, that essentially turns this around -- the notation is the maths. It is the core principle behind the rewrite rules that is the basis of Mathematica.

I prefer to think of maths differently -- along with a lot of other mathematicians -- in a more metaphorical or intuitionist manner. One might have thought that Gödel would have struck a fatal blow to formalism, but, in fact, it is a useful way of looking at maths, as long as it is not the only way.

This way of looking at maths is also much easier to express in code.

From what I can tell, LLMs are all formalist in design -- the words are the ideas.

When I learn to play a new piece -- true of most musicians -- I go through a stage where I can play the notes, and then there is a much more difficult stage of being able to play the music. LLMs seem to be happy with reaching the first stage, which for a suitable definition of success, they may be close to.

However, if we as a culture redefine 'intelligence" and language to match the definitions implicit in LLMs, we will lose something important.

How all this will play out is uncertain, especially since there is a lot of money involved (and the world-view of the designers of LLMs are congruent with the dominant neoliberal view of economics).

For my part, I chose to take a more holistic and metaphorical view of reality.

Hi Michael,

Thank you for your excellent post.

I have an interesting challenge for the computational capabilities of ChatGPT-4, which is to come up with a 1-line WL program that is capable of winning the Wolfram 1-liner competition (this isn't about winning the competition, obviously; it's just an aspirational standard).

This is a very difficult challenge, not so much due to the program-length constraint, but because it is so open-ended and it requires us to think about what constitutes "interesting" or "surprising" output.

So far, my own efforts have been a resounding failure: ChatGPT triumphantly produces the chart of a Sine curve, or something at that level, even when I try to provide extensive guidance in the prompt.

My sense is that you are the man to take on this challenge, given the levels of creativity and expertise required. I hope you will give it some thought.

POSTED BY: Jonathan Kinlay

enter image description here -- you have earned Featured Contributor Badge enter image description here Your exceptional post has been selected for our editorial column Staff Picks http://wolfr.am/StaffPicks and Your Profile is now distinguished by a Featured Contributor Badge and is displayed on the Featured Contributor Board. Thank you!

POSTED BY: EDITORIAL BOARD
Reply to this discussion
Community posts can be styled and formatted using the Markdown syntax.
Reply Preview
Attachments
Remove
or Discard