Thank you, Jon, for bringing this to our attention.
I was looking for alternative ways to run an LLM on my machine as a substitute for endless subscriptions.
Justine's repository no longer works; Mozilla has integrated llamafile into its ecosystem. See this post: https://hacks.mozilla.org/2023/11/introducing-llamafile/
You can find the llamafiles on Mozilla's GitHub page: https://github.com/Mozilla-Ocho/llamafile.
My initial findings from running a llamafile on my machine:
It runs acceptably out of the box in the browser chat on my PC (Intel i7 laptop with an NVIDIA card, Windows 10 64-bit) without additional flags, but it needs plenty of memory (I suspect you need at least 16 GB of RAM).
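Besides the browser chat, the llamafile server also exposes an OpenAI-compatible API on the same local port, so you can script against it. Here is a minimal Python sketch, assuming the server is already running on its default port 8080; the model name and prompt below are just illustrative placeholders:

```python
import json
import urllib.request

# Query the local llamafile server (assumes it is already running
# and listening on the default port 8080; adjust the URL otherwise).
url = "http://localhost:8080/v1/chat/completions"
payload = {
    "model": "local",  # placeholder; the local server serves one model
    "messages": [{"role": "user", "content": "Say hello in one sentence."}],
}
req = urllib.request.Request(
    url,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    reply = json.load(resp)
print(reply["choices"][0]["message"]["content"])
```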
Here is a performance comparison (tokens/sec) of running the llamafile in the browser chat with and without the GPU flag (a rough way to measure this yourself is sketched after the list):
- CPU only: 2.85;
- With GPU flag: 4.96.
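If you want to reproduce a tokens/sec figure outside the browser chat, one crude approach is to time a completion request and divide the completion token count by the elapsed wall-clock time. This is only a rough estimate (it includes prompt processing, so it will undercount pure generation speed), and it assumes the local server is on port 8080 and returns an OpenAI-style usage block:

```python
import json
import time
import urllib.request

# Rough tokens/sec estimate against the local llamafile server.
url = "http://localhost:8080/v1/chat/completions"
payload = {
    "model": "local",
    "messages": [{"role": "user", "content": "Write a short paragraph about llamas."}],
    "max_tokens": 128,  # cap the reply so runs are comparable
}
req = urllib.request.Request(
    url,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
start = time.perf_counter()
with urllib.request.urlopen(req) as resp:
    reply = json.load(resp)
elapsed = time.perf_counter() - start
tokens = reply["usage"]["completion_tokens"]
print(f"{tokens} completion tokens in {elapsed:.1f} s -> {tokens / elapsed:.2f} tokens/sec")
```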
I will play with it in the Wolfram Language and see what I find.