Hacker News
- Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding https://huggingface.co/papers/2404.16710 1 comment
- Llama-3-8B-256k-PoSE https://huggingface.co/winglian/llama-3-8b-256k-PoSE 0 comments
- Llama-3 8B Instruct 262k https://huggingface.co/gradientai/Llama-3-8B-Instruct-262k 2 comments
- Snowflake Arctic Instruct Open LLM https://huggingface.co/Snowflake/snowflake-arctic-instruct 1 comment
- OpenELM: Efficient Language Model Family with Open-Source Training and Inference https://huggingface.co/papers/2404.14619 0 comments
- Apple OpenELM Instruct Models https://huggingface.co/collections/apple/openelm-instruct-models-6619ad295d7ae9f868b759ca 0 comments
- Mixtral-8x22B on HuggingFace https://huggingface.co/mistral-community/Mixtral-8x22B-v0.1 3 comments
- CodeGemma – an official Google release for code LLMs https://huggingface.co/blog/codegemma 2 comments
- Explaining the SDXL Latent Space https://huggingface.co/blog/TimothyAlexisVass/explaining-the-sdxl-latent-space 33 comments
- 10.7B Solar: Elevating Performance with Upstage Depth Up Scaling https://huggingface.co/upstage/SOLAR-10.7B-v1.0 2 comments
- Phi-2 https://huggingface.co/microsoft/phi-2 5 comments
- Mistral-8x7B-Chat https://huggingface.co/mattshumer/mistral-8x7b-chat 69 comments
- Yi-34B-Chat https://huggingface.co/01-ai/Yi-34B-Chat 63 comments
- Switch Transformers C – 2048 experts (1.6T params for 3.1 TB) (2022) https://huggingface.co/google/switch-c-2048 36 comments
- MistralLite by Amazon Web Services https://huggingface.co/amazon/MistralLite 24 comments
- HuggingChat https://huggingface.co/chat/ 17 comments
- StackLlama: A hands-on guide to train LlaMa with RLHF https://huggingface.co/blog/stackllama 38 comments
- Create a GPT3 powered Q&A Chatbot for *any* GitHub repo by posting its link https://huggingface.co/spaces/ysharma/LangchainBot-space-creator 4 comments
- Wordalle – Guess the prompt used to generate a set of images from DalleMini https://huggingface.co/spaces/huggingface-projects/wordalle 15 comments
- Large Language Models: A New Moore's Law? https://huggingface.co/blog/large-language-models 22 comments
- [D] Gemma vs Mistral (and other open models) https://huggingface.co/spaces/lastmileai/gemma-playground 4 comments machinelearning
- [P] Discover AstraQuasar-4B: a NEW LLaMA-based arch | First training implementation of the self layer calling (Duplicate Trick) https://huggingface.co/AstraMindAI/AstraQuasar-4B 4 comments machinelearning
- [P] Pearl-3x7B, an xtraordinary Mixure of Experts (MoE) for data science https://huggingface.co/louisbrulenaudet/Pearl-3x7B 2 comments machinelearning
- PubMedBERT Embeddings - Semantic search and retrieval augmented generation for medical literature https://huggingface.co/NeuML/pubmedbert-base-embeddings 2 comments programming
- Open source instruction training dataset for a joke telling LLM https://huggingface.co/datasets/Middletownbooks/joke_training 2 comments learnmachinelearning
- Question on how the T5 is trained https://huggingface.co/docs/transformers/model_doc/t5#training 2 comments learnmachinelearning
- [R] Blogpost on comparing Chatbots like ChatGPT, LaMDA, Sparrow, BlenderBot 3, and Claude https://huggingface.co/blog/dialog-agents 5 comments machinelearning
- AI Code Debugger https://huggingface.co/spaces/krrishD/stacktrace-QA 3 comments javascript
- [P] Finetuned Diffusion: multiple fine-tuned Stable Diffusion models, trained on different styles https://huggingface.co/spaces/anzorq/finetuned_diffusion 62 comments machinelearning
- Does anyone know where I can get access to a prebuilt general document understanding model? https://huggingface.co/philschmid/layoutlm-funsd 4 comments datascience
- Hosting guidance for a small version of a website like huggingface? https://huggingface.co/ 3 comments selfhosted
- Whisper Web UI, is a general-purpose speech recognition model by OpenAI https://huggingface.co/spaces/openai/whisper 3 comments programming
- Pokémon text to image, Generate new Pokémon from a text description https://huggingface.co/spaces/lambdalabs/text-to-pokemon 10 comments programming
- I used an AI model to generate funny, professional headshots when you upload a selfie/picture of yourself/your friends :D https://huggingface.co/spaces/krrishD/suitify_v2 10 comments sideproject
- Made an AI model that predicts subreddit based on the title of a post https://huggingface.co/spaces/daspartho/predict-subreddit 13 comments internetisbeautiful
- [N] Pull Requests and Discussions on Hugging Face https://huggingface.co/blog/community-update 11 comments machinelearning
- Convert an image to colorful ASCII art based on ascii character density in Python + Web Demo with Gradio https://huggingface.co/spaces/HighCWu/colorful-ascii-art 6 comments python
- [D] What is a good emoji aware pre-trained language model? https://huggingface.co/docs/transformers/model_doc/bertweet 23 comments machinelearning
- [D] NLP has HuggingFace, what does Computer Vision have? https://huggingface.co/ 52 comments machinelearning
- Build ML Products using Python https://huggingface.co/tasks 37 comments python
- [N] Machine Learning for Audio with Hugging Face https://huggingface.co/facebook/fastspeech2-en-ljspeech?text=test 2 comments machinelearning
- [D] Why is DistilRoberta 15x less used than DistilBert ? https://huggingface.co/distilroberta-base 5 comments machinelearning
Linking pages
- President Biden meets with AI CEOs at the White House amid ethical criticism | Ars Technica https://arstechnica.com/information-technology/2023/05/critics-take-aim-at-bidens-ai-meeting-with-ceos-from-google-openai-microsoft/ 952 comments
- Hugging Face raises $235M from investors including Salesforce and Nvidia | TechCrunch https://techcrunch.com/2023/08/24/hugging-face-raises-235m-from-investors-including-salesforce-and-nvidia/ 234 comments
- GitHub - deep-floyd/IF https://github.com/deep-floyd/IF 231 comments
- Machine Learning: The Great Stagnation - by Mark Saroufim https://marksaroufim.substack.com/p/machine-learning-the-great-stagnation 218 comments
- Weird A.I. Yankovic, a cursed deep dive into the world of voice cloning - Waxy.org https://waxy.org/2023/10/weird-ai-yankovic-voice-cloning/ 198 comments
- GitHub - microsoft/LoRA: Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" https://github.com/microsoft/LoRA 156 comments
- GitHub - lucidrains/DALLE2-pytorch: Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch https://github.com/lucidrains/DALLE2-pytorch 152 comments
- Postgres Full Text Search vs the rest https://supabase.com/blog/postgres-full-text-search-vs-the-rest 144 comments
- So you want to build your own open source chatbot… - Mozilla Hacks - the Web developer blog https://hacks.mozilla.org/2023/07/so-you-want-to-build-your-own-open-source-chatbot/ 123 comments
- OpenJourney https://open-journey.github.io/ 121 comments
- GitHub - lucidrains/imagen-pytorch: Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch https://github.com/lucidrains/imagen-pytorch 117 comments
- Generative AI’s Act Two | Sequoia Capital https://www.sequoiacap.com/article/generative-ai-act-two/ 106 comments
- Emerging Architectures for LLM Applications | Andreessen Horowitz https://a16z.com/2023/06/20/emerging-architectures-for-llm-applications/ 95 comments
- GitHub Actions as a time-sharing supercomputer https://blog.alexellis.io/github-actions-timesharing-supercomputer/ 91 comments
- GitHub - ripienaar/free-for-dev: A list of SaaS, PaaS and IaaS offerings that have free tiers of interest to devops and infradev https://github.com/ripienaar/free-for-dev 80 comments
- Hugging Face CEO tells US House open-source AI is 'extremely aligned' with American interests | VentureBeat https://venturebeat.com/ai/hugging-face-ceo-tells-us-house-open-source-ai-is-extremely-aligned-with-american-interests/ 75 comments
- New localllm lets you develop gen AI apps locally, without GPUs | Google Cloud Blog https://cloud.google.com/blog/products/application-development/new-localllm-lets-you-develop-gen-ai-apps-locally-without-gpus 74 comments
- GitHub - EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. https://github.com/EleutherAI/gpt-neox 67 comments
- Notes on training BERT from scratch on an 8GB consumer GPU | sidsite https://sidsite.com/posts/bert-from-scratch/ 67 comments
- Rage of the machine: An AI* makes metal music | by Bernhard Mueller | Medium https://muellerberndt.medium.com/rage-of-the-machine-an-ai-makes-metal-music-f299dc1f706a 62 comments