Local LLMs

I did a quick exploration for local llms and I found the following

If you search YouTube for local llms you’d find many videos about the matter. I found this to be the most useful one 6 Ways For Running A Local LLM

Open Table Of Contents

HuggingFace Transformers

Go to HuggingFace.co the largest repository for open-source models
Go to models and choose a conversational
If you want chat download a conversational model like microsoft/dialogGPT-large
Explore and follow instructions to use your model of choice
Having Python, TensorFlow, PyTorch is usually a prerequisite for using the models
Many models provide Python apis to interact with them
The reliance on Python makes this option not the most performant one especially when you want to use models locally (I have an old laptop)
Best for learning

Github Repo: Llamafile
Built on top of llama.cpp
A script that requires no compilcation
Same benefits as llama.cpp with an easier install and UI
Can embed your model in an executable file and share it
Offers a browser interface that lets you tweak the model and chat with your chosen model
My favorite tool as it’s performant, offers a UI for chatting, open-source, and can run .gguf models in contrast to easier to use tools that have a smaller library and options of llms to use

Wesbite: GPT4ALL
Most user friendly UI similar to ChatGPT
Can add your own documents into the application so it indexes them and you can ‘talk’ to them
Models can be tweaked
Very limited amount of models