- Why is there a meaningful relationship between probabilities of non-ground truth classes in a softmax distribution? https://arxiv.org/abs/1503.02531 5 comments learnmachinelearning
Linking pages
- Towards a Conversational Agent that Can Chat About…Anything – Google AI Blog https://ai.googleblog.com/2020/01/towards-conversational-agent-that-can.html 152 comments
- Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes – Google Research Blog https://blog.research.google/2023/09/distilling-step-by-step-outperforming.html 123 comments
- Attacking Machine Learning with Adversarial Examples https://openai.com/blog/adversarial-example-research/ 102 comments
- Google Research: Themes from 2021 and Beyond – Google AI Blog https://ai.googleblog.com/2022/01/google-research-themes-from-2021-and.html 52 comments
- GitHub - terryum/awesome-deep-learning-papers: The most cited deep learning papers https://github.com/terryum/awesome-deep-learning-papers 47 comments
- Deep learning has a size problem. Shifting from state-of-the-art accuracy… | by Jameson Toole | Heartbeat https://heartbeat.fritz.ai/deep-learning-has-a-size-problem-ea601304cd8 46 comments
- Grammar Correction as You Type, on Pixel 6 – Google AI Blog https://ai.googleblog.com/2021/10/grammar-correction-as-you-type-on-pixel.html 43 comments
- A Recipe for Training Neural Networks http://karpathy.github.io/2019/04/25/recipe/#2-set-up-the-end-to-end-trainingevaluation-skeleton--get-dumb-baselines 39 comments
- Deep-Learning-Papers-Reading-Roadmap/README.md at master · floodsung/Deep-Learning-Papers-Reading-Roadmap · GitHub https://github.com/songrotek/deep-learning-papers-reading-roadmap 29 comments
- GitHub - astorfi/Deep-Learning-Roadmap: Organized Resources for Deep Learning Researchers and Developers https://github.com/astorfi/Deep-Learning-World 22 comments
- Large Transformer Model Inference Optimization | Lil'Log https://lilianweng.github.io/posts/2023-01-10-inference-optimization/ 20 comments
- Neural Language Models Explained – Ofir Press http://ofir.io/Neural-Language-Modeling-From-Scratch?a=1 11 comments
- Efficient LLM inference - by Finbarr Timbers https://www.artfintel.com/p/efficient-llm-inference 11 comments
- Towards a Conversational Agent that Can Chat About…Anything – Google AI Blog https://ai.googleblog.com/2020/01/towards-conversational-agent-that-can.html?m=1 10 comments
- Neural Language Models Explained – Ofir Press http://ofir.io/Neural-Language-Modeling-From-Scratch/ 10 comments
- Attacking Machine Learning with Adversarial Examples https://blog.openai.com/adversarial-example-research/ 8 comments
- GitHub - reiinakano/arbitrary-image-stylization-tfjs: Arbitrary style transfer using TensorFlow.js https://github.com/reiinakano/arbitrary-image-stylization-tfjs 8 comments
- GitHub - floodsung/Deep-Learning-Papers-Reading-Roadmap: Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech! https://github.com/floodsung/Deep-Learning-Papers-Reading-Roadmap 5 comments
- Grandmaster-Level Chess Without Search | Tom Hipwell https://tomhipwell.co/reading/grandmaster_chess_without_search/ 3 comments
- Introducing BodyPix: Real-time Person Segmentation in the Browser with TensorFlow.js | by TensorFlow | TensorFlow | Medium https://medium.com/tensorflow/introducing-bodypix-real-time-person-segmentation-in-the-browser-with-tensorflow-js-f1948126c2a0 2 comments
Search title: [1503.02531] Distilling the Knowledge in a Neural Network