Group Abstract

Message Boards

WOLFRAM COMMUNITY

10.7K Views

18 Replies

15 Total Likes

View groups...

Follow this post

Share this post:

GROUPS:

Wolfram U

[WSG25] Daily Study Group: What is ChatGPT Doing... and Why Does It Work?

Arben Kalziqi

Arben Kalziqi, Wolfram Research

Posted 1 year ago

POSTED BY: Arben Kalziqi

18 Replies

Sort By:

Updating Name

Posted 1 year ago

POSTED BY: Updating Name

Phil Earnhardt

Posted 1 year ago

@Arben , I attended the course live on Friday and am going through the other sessions through the recordings. Good job with the poll questions. They worked great in the BigMarker replays. I was able to understand the poll questions as you asked them. Also, you consistently shared the results on your screen; people listening that way could play along. It's a good quiz, too. I loved your discussion about particles in the universe on Day 1. It tickled me immensely to see an instructor provide a calculation that uses [Wolfram's estimate of] the number of particles in the universe. That discussion reminded me of a short story by Stanislaw Lem in the book The Cyberiad. In the collection, a pair of "constructor" robots create fantastic machines. In the sixth sally, they create a demon of the second kind to work their way out of a problem. The story references a demon of the first kind -- Maxwell's Demon. Maxwell's demon was about thermodynamics; this second demon was about information. For a book that was created over 50 years ago, Lem's machine in this story is a rather astonishing vision of an LLM. Their demon was using a tiny bit of stale air -- a tiny number of particles -- as the source of its information; it could not possibly work as described in the story. OTOH, Maxwell's demon couldn't possibly work, either. Both hypotheticals are quite interesting; they are fine little thought experiments. I don't know if you -- or anyone -- will learn anything from Lem's short story, but you should be amused. Ask an AI to find you a copy.

POSTED BY: Phil Earnhardt

Gerald Oberg

Posted 1 year ago

POSTED BY: Gerald Oberg

Arben Kalziqi

Arben Kalziqi, Wolfram Research

Posted 1 year ago

Hmm... annoying!! I absolutely dropped it in there yesterday. I'll make sure it works this time. As far as DeepSeek goes, as best I can tell there are indeed some architectural improvements, but the overall idea and structure remain the same and most of the improvements are in the training process. This article from MIT Technology Review is insightful! https://www.technologyreview.com/2025/01/31/1110740/how-deepseek-ripped-up-the-ai-playbook-and-why-everyones-going-to-follow-it/

POSTED BY: Arben Kalziqi

Gerald Oberg

Posted 1 year ago

Arben, Digest_Day5 is still not showing up in the Daily Q&A Digests. Did you by any chance put it in the Series 58 ChatGPT rather than the Series 62 ChatGPT? Thanks for the link. The three that I put in the other post are more about the geopolitical implications of AI and LLMs rather than their technology.

POSTED BY: Gerald Oberg

Arben Kalziqi

Arben Kalziqi, Wolfram Research

Posted 1 year ago

Added!! Sorry about that!

POSTED BY: Arben Kalziqi

Gerald Oberg

Posted 1 year ago

Okay, I found it (posted two days ago) using a link from the latest email received two hours ago (3/12 at 11:59AM).

POSTED BY: Gerald Oberg

Zbigniew Kabala

Zbigniew Kabala, Duke University

Posted 1 year ago

Arben, Thanks 10^6 for a wonderful course! Question: Will you be posting an updated Q&A Digest along with an updated notebook for Day 5? We would appreciate it very much.

POSTED BY: Zbigniew Kabala

Arben Kalziqi

Arben Kalziqi, Wolfram Research

Posted 1 year ago

POSTED BY: Arben Kalziqi

Angel Rojas

Posted 1 year ago

I was wondering in the last class two topics related to line of research in Wolfram: 1) Is Wolfram Research developing their own Algorithmic Differentiation framework for programs, a.k.a (parametrized)functions, that depends on loops, conditionals, recursion, and (non)smooth elemental functions? I commented in class this is between numerical and symbolic differentiation. By the way, this is the standard in Scientific Machine Learning and the backbone to make computationally efficient optimizers like ADAM (and their subsequent versions). 2) In case that (1) is negative, are you more focused on developing the discrete techniques to emulate neural networks capacity via Rule Arrays applied to cellular automata?

POSTED BY: Angel Rojas

Gerald Oberg

Posted 1 year ago

Arben, Can you please explain your decision in your communications to abrogate the standard rule of capitalizing the first letter in a sentence? Is that just a strategy to minimize the time to post a response to a question/comment? Is it a standard style followed by certain tech people? Would you do the same in more formal settings, such as published articles? Are you in the vanguard of a movement to change the English language? No disrespect or criticism intended - I am just curious …

POSTED BY: Gerald Oberg

Arben Kalziqi

Arben Kalziqi, Wolfram Research

Posted 1 year ago

POSTED BY: Arben Kalziqi

Gerald Oberg

Posted 1 year ago

In response to the survey question, "How will you use what you learned at this Daily Study Group?" I wrote: I will have a better appreciation and understanding of what is being discussed in the numerous newscasts, podcasts, articles, interviews that one encounters about ChatGPT or other LLMs. Something the course did not address: How could the technology overviewed lead to the catastrophic results people are predicting about AGI? One would not think that even monstrously large matrices could produce malicious consciousness. (I am sometimes reminded of the book I read as a teenager, "Colossus: The Forbin Project", a 1966 science fiction novel by D. F. Jones, about super-computers taking control of mankind.) The things Arben demoed are really impressive, but there is no "mind" (with potential intensions) producing them. I would like to hear Arben's thoughts about these issues. Even more, can you point us somewhere that Stephen Wolfram has discussed these issues (potential threats of AI or AGI)? Here is a good expert discussion: https://drive.google.com/file/d/1JVPc3ObMP1L2a53T5LA1xxKXM6DAwEiC/view Also: https://www.google.com/books/edition/The_Age_of_AI/Y2QwEAAAQBAJ?hl=en&gbpv=1 This could be considered a post-DeepSeek update to the reference above: https://www.csis.org/analysis/deepseek-huawei-export-controls-and-future-us-china-ai-race

POSTED BY: Gerald Oberg

Arben Kalziqi

Arben Kalziqi, Wolfram Research

Posted 1 year ago

POSTED BY: Arben Kalziqi

Laurence Bloxham

Posted 1 year ago

Thanks Arbin, I did not see this aspect of the problem. Very interesting,

POSTED BY: Laurence Bloxham

Laurence Bloxham

Posted 1 year ago

Gradient decent optimization procedures can get trapped at falsw (local) minimums. How does Chat GPT avoid or correct false mionimums? Aside Incredible clas, many thanks Arbin.

POSTED BY: Laurence Bloxham

Arben Kalziqi

Arben Kalziqi, Wolfram Research

Posted 1 year ago

POSTED BY: Arben Kalziqi

Angel Rojas

Posted 1 year ago

POSTED BY: Angel Rojas

Reply to this discussion

Reply Preview

Attachments

Remove Add a file to this post

Follow this discussion

or Discard

Feedback