https://github.com/ggerganov/ggml/blob/master/docs/gguf.md - discu.eu

Linking pages

A Visual Guide to Quantization - by Maarten Grootendorst https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-quantization 29 comments
Using Llamafiles for Embeddings in Local RAG Applications - Mozilla Innovations https://future.mozilla.org/news/llamafiles-for-embeddings-in-local-rag-applications/ 23 comments
GitHub - mukel/llama3.java: Practical Llama 3 inference in Java https://github.com/mukel/llama3.java 4 comments
GitHub - containers/podman-desktop-extension-ai-lab https://github.com/containers/podman-desktop-extension-ai-lab 2 comments
Running Open Source LLMs In Python - A Practical Guide https://christophergs.com/blog/running-open-source-llms-in-python 0 comments
Giving AI Brain Damage https://btr.pm/blog/giving-ai-brain-damage/ 0 comments
One standard to deploy them all - with Ben Firshman of Replicate https://www.latent.space/p/replicate 0 comments
GitHub - trailofbits/ml-file-formats: List of ML file formats https://github.com/trailofbits/ml-file-formats 0 comments
GitHub - antirez/gguf-tools: GGUF implementation in C as a library and a tools CLI program https://github.com/antirez/gguf-tools 0 comments
GitHub - xingyaoww/code-act: Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji. https://github.com/xingyaoww/code-act 0 comments

Related searches:

Search whole site: site:github.com

See how to search.

Submit link to: