Accelerating Generative AI with PyTorch II: GPT, Fast | PyTorch - discu.eu

Hacker News

Accelerating Generative AI with PyTorch II: GPT, Fast https://pytorch.org/blog/accelerating-generative-ai-2/ 69 comments 30/11/2023

Linking pages

We Are Running Out of Low-Background Tokens (Nov 2023 Recap) https://www.latent.space/i/139368545/the-concept-of-low-background-tokens 6 comments
Gemlite: Towards Building Custom Low-Bit Fused CUDA Kernels https://mobiusml.github.io/gemlite_blogpost/ 2 comments
GitHub - pytorch-labs/gpt-fast: Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. https://github.com/pytorch-labs/gpt-fast 1 comment
Accelerating Generative AI Part III: Diffusion, Fast | PyTorch https://pytorch.org/blog/accelerating-generative-ai-3/ 0 comments
Faster and Smaller Whisper: A Deep Dive into Quantization and Torch Compilation https://mobiusml.github.io/whisper-static-cache-blog/ 0 comments
Introducing torchchat: Accelerating Local LLM Inference on Laptop, Desktop and Mobile | PyTorch https://pytorch.org/blog/torchchat-local-llm-inference/ 0 comments
Large language model inference optimizations on AMD GPUs — ROCm Blogs https://rocm.blogs.amd.com/artificial-intelligence/llm-inference-optimize/README.html 0 comments
aie-book/resources.md at main · chiphuyen/aie-book · GitHub https://github.com/chiphuyen/aie-book/blob/main/resources.md 0 comments
