Hacker News
- SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12 clouds) https://github.com/skypilot-org/skypilot 0 comments
- [D] Best approach to handle cloud for side projects https://github.com/skypilot-org/skypilot 10 comments machinelearning
Linking pages
- Mistral 7B | Mistral AI | Open source models https://mistral.ai/news/announcing-mistral-7b/ 618 comments
- Hello Qwen2 | Qwen https://qwenlm.github.io/blog/qwen2/ 130 comments
- What I learned from looking at 900 most popular open source AI tools https://huyenchip.com/2024/03/14/ai-oss.html 40 comments
- Qwen2.5: A Party of Foundation Models! | Qwen https://qwenlm.github.io/blog/qwen2.5/ 38 comments
- Finetuning Llama 2 in your own cloud environment, privately | SkyPilot Blog https://blog.skypilot.co/finetuning-llama2-operational-guide/ 13 comments
- GitHub - lm-sys/FastChat: An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. https://github.com/lm-sys/FastChat 4 comments
- AI on Kubernetes Without the Pain | SkyPilot Blog https://blog.skypilot.co/ai-on-kubernetes/ 4 comments
- SkyPilot: ML and Data Science on any cloud with massive cost savings | by Zongheng Yang | Nov, 2022 | Medium https://medium.com/@zongheng_yang/skypilot-ml-and-data-science-on-any-cloud-with-massive-cost-savings-244189cc7c0f 1 comment
- skypilot/examples/llama-llm-chatbots at master · skypilot-org/skypilot · GitHub https://github.com/skypilot-org/skypilot/tree/master/examples/llama-llm-chatbots 1 comment
- Aman's AI Journal • Primers • Overview of Large Language Models https://aman.ai/primers/ai/LLM/ 1 comment
- Scaling AI Robotics on the Cloud | SkyPilot Blog https://blog.skypilot.co/covariant/ 1 comment
- runhouse/examples/fastapi-embeddings-rag at main · run-house/runhouse · GitHub https://github.com/run-house/runhouse/tree/main/examples/fastapi-embeddings-rag 1 comment
- awesome-stars/topics.md at master · maguowei/awesome-stars · GitHub https://github.com/maguowei/awesome-stars/blob/master/topics.md 0 comments
- Run LLaMA LLM chatbots on any cloud with one click | SkyPilot Blog https://blog.skypilot.co/llama-llm-chatbots-on-any-cloud/ 0 comments
- SkyPilot 0.3: LLM support and unprecedented GPU availability across more clouds | SkyPilot Blog https://blog.skypilot.co/announcing-skypilot-0.3/ 0 comments
- #15 Inside S3, Unikernels and Observability cost https://anjulsahu.substack.com/p/15-inside-s3-unikernels-and-observability 0 comments
- GitHub - nlpfromscratch/nlp-llms-resources: Master list of curated resources on NLP and LLMs https://github.com/nlpfromscratch/nlp-llms-resources 0 comments
- Scaling Mixtral LLM Serving with the High GPU Availability and Cost Efficiency | SkyPilot Blog https://blog.skypilot.co/scaling-mixtral-llm-serving-with-the-best-gpu-availability-and-cost-efficiency/ 0 comments
- Serving LLMs on a budget - NOS Docs https://docs.nos.run/docs/blog/serving-llms-on-a-budget.html 0 comments
- Introducing SkyServe: 50% Cheaper AI Serving on Any Cloud with High Availability | SkyPilot Blog https://blog.skypilot.co/introducing-sky-serve/ 0 comments
Linked pages
- Mistral 7B | Mistral AI | Open source models https://mistral.ai/news/announcing-mistral-7b/ 618 comments
- Gemma: Google introduces new state-of-the-art open models https://blog.google/technology/developers/gemma-open-models/ 535 comments
- Introducing Code Llama, a state-of-the-art large language model for coding https://ai.meta.com/blog/code-llama-large-language-model-coding/ 527 comments
- Introducing DBRX: A New State-of-the-Art Open LLM | Databricks https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm 343 comments
- Mixtral of experts | Mistral AI | Open source models https://mistral.ai/news/mixtral-of-experts/ 300 comments
- Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20 · karpathy/llm.c · Discussion #481 · GitHub https://github.com/karpathy/llm.c/discussions/481 117 comments
- Qwen1.5-110B: The First 100B+ Model of the Qwen1.5 Series | Qwen https://qwenlm.github.io/blog/qwen1.5-110b/ 58 comments
- vLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention https://vllm.ai/ 42 comments
- Finetuning Llama 2 in your own cloud environment, privately | SkyPilot Blog https://blog.skypilot.co/finetuning-llama2-operational-guide/ 13 comments
- [2205.07147] The Sky Above The Clouds https://arxiv.org/abs/2205.07147 10 comments
- Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality | LMSYS Org https://lmsys.org/blog/2023-03-30-vicuna/ 7 comments
- GitHub - zetavg/LLaMA-LoRA-Tuner: UI tool for fine-tuning and testing your own LoRA models base on LLaMA, GPT-J and more. One-click run on Google Colab. https://github.com/zetavg/LLaMA-LoRA-Tuner 1 comment
- Serving LLM 24x Faster On the Cloud with vLLM and SkyPilot | SkyPilot Blog https://blog.skypilot.co/serving-llm-24x-faster-on-the-cloud-with-vllm-and-skypilot/ 1 comment
- Scaling AI Robotics on the Cloud | SkyPilot Blog https://blog.skypilot.co/covariant/ 1 comment
- GitHub - predibase/lorax: Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs https://github.com/predibase/lorax 1 comment
- GitHub - OpenAccess-AI-Collective/axolotl: Go ahead and axolotl questions https://github.com/OpenAccess-AI-Collective/axolotl 1 comment
- SkyPilot: An Intercloud Broker for Sky Computing | USENIX https://www.usenix.org/conference/nsdi23/presentation/yang-zongheng 0 comments
- Finetune Llama 3.1 on Your Infra | SkyPilot Blog https://blog.skypilot.co/finetune-llama-3_1-on-your-infra/ 0 comments
- GitHub - ollama/ollama: Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models. https://github.com/ollama/ollama 0 comments