Hacker News
- DeepMind’s New Language Model, Chinchilla https://www.marktechpost.com/2022/04/09/check-out-this-deepminds-new-language-model-chinchilla-70b-parameters-which-significantly-outperforms-gopher-280b-and-gpt-3-175b-on-a-large-range-of-downstream-evaluation-tasks/ 142 comments
- Check Out This DeepMind’s New Language Model, Chinchilla (70B Parameters), Which Significantly Outperforms Gopher (280B) and GPT-3 (175B) on a Large Range of Downstream Evaluation Tasks https://www.marktechpost.com/2022/04/09/check-out-this-deepminds-new-language-model-chinchilla-70b-parameters-which-significantly-outperforms-gopher-280b-and-gpt-3-175b-on-a-large-range-of-downstream-evaluation-tasks/ 4 comments artificial
Linking pages
- Microsoft Researchers Introduce 'Jigsaw': An AI Tool To Augment Large Language Models (GPT-3, Codex, etc.) By Deploying Post-Processing Techniques That Understand The Programs’ Syntax And Semantics - MarkTechPost https://www.marktechpost.com/2022/04/04/microsoft-researchers-introduce-jigsaw-an-ai-tool-to-augment-large-language-models-gpt-3-codex-etc-by-deploying-post-processing-techniques-that-understand-the-programs-syntax-and-se/ 2 comments
- Microsoft AI Team Proposes DeepSpeed MoE Model: An End-to-End MoE Training and Inference Solution as Part of the DeepSpeed Library - MarkTechPost https://www.marktechpost.com/2022/01/21/microsoft-ai-team-proposes-deepspeed-moe-model-an-end-to-end-moe-training-and-inference-solution-as-part-of-the-deepspeed-library/ 2 comments
- Deepmind Researchers Probe Image-Language Transformers and Propose SVO Probes for Verb Understanding - MarkTechPost https://www.marktechpost.com/2022/02/27/deepmind-researchers-probe-image-language-transformers-and-propose-svo-probes-for-verb-understanding/ 0 comments
- In the Latest AI Research Google Explains How It Taps the Full Potential of Datacenter Machine Learning Accelerators with Platform Aware Neural Architecture Search (NAS) - MarkTechPost https://www.marktechpost.com/2022/02/13/in-the-latest-ai-research-google-explains-how-it-taps-the-full-potential-of-datacenter-machine-learning-accelerators-with-platform-aware-neural-architecture-search-nas/ 0 comments
- Stanford Researchers Apply a Combination of Autonomous Drone Technology With Scientific Machine Learning To Find How Fast Will Antarctica’s Ice Sheet Melt and Reduce The Uncertainty of Sea-Level Rise - MarkTechPost https://www.marktechpost.com/2022/03/12/stanford-researchers-apply-a-combination-of-autonomous-drone-technology-with-scientific-machine-learning-to-find-how-fast-will-antarcticas-ice-sheet-melt-and-reduce-the-uncertainty-of-sea-lev/ 0 comments
- Researchers from Tel Aviv Propose Long-Text NLP Benchmark Called SCROLLS - MarkTechPost https://www.marktechpost.com/2022/03/03/researchers-from-tel-aviv-propose-long-text-nlp-benchmark-called-scrolls/ 0 comments
- Google AI’s 'TokenLearner' Can Improve Vision Transformer Efficiency And Accuracy - MarkTechPost https://www.marktechpost.com/2021/12/13/google-ais-tokenlearner-can-improve-vision-transformer-efficiency-and-accuracy/ 0 comments
- Google AI's Latest Research on Language Model Proposes Two Different Sequence-To-Sequence Approaches Toward Zero-Shot Transfer For Dialogue Modeling - MarkTechPost https://www.marktechpost.com/2022/04/17/google-ais-latest-research-on-language-model-proposes-two-different-sequence-to-sequence-approaches-toward-zero-shot-transfer-for-dialogue-modeling/ 0 comments
- UC Sandiego Researchers Propose A Controllable Voice Cloning Method That Allows Fine-Grained Control Over Various Style Aspects Of The Synthesized Speech For An Unseen Speaker - MarkTechPost https://www.marktechpost.com/2022/01/10/uc-sandiego-researchers-propose-a-controllable-voice-cloning-method-that-allows-fine-grained-control-over-various-style-aspects-of-the-synthesized-speech-for-an-unseen-speaker/ 0 comments
- Researchers Propose Mitigation Strategies to Tackle Overinterpretation of Deep Learning Methods - MarkTechPost https://www.marktechpost.com/2022/01/02/researchers-propose-mitigation-strategies-to-tackle-overinterpretation-of-deep-learning-methods/ 0 comments
- Google AI Introduces 'Federated Reconstruction' Framework That Enables Scalable Partially Local Federated Learning - MarkTechPost https://www.marktechpost.com/2021/12/28/google-ai-introduces-federated-reconstruction-framework-that-enables-scalable-partially-local-federated-learning/ 0 comments
Linked pages
- Microsoft Researchers Introduce 'Jigsaw': An AI Tool To Augment Large Language Models (GPT-3, Codex, etc.) By Deploying Post-Processing Techniques That Understand The Programs’ Syntax And Semantics - MarkTechPost https://www.marktechpost.com/2022/04/04/microsoft-researchers-introduce-jigsaw-an-ai-tool-to-augment-large-language-models-gpt-3-codex-etc-by-deploying-post-processing-techniques-that-understand-the-programs-syntax-and-se/ 2 comments
- [2203.15556] Training Compute-Optimal Large Language Models https://arxiv.org/abs/2203.15556 0 comments
- Google AI's Latest Research on Language Model Proposes Two Different Sequence-To-Sequence Approaches Toward Zero-Shot Transfer For Dialogue Modeling - MarkTechPost https://www.marktechpost.com/2022/04/17/google-ais-latest-research-on-language-model-proposes-two-different-sequence-to-sequence-approaches-toward-zero-shot-transfer-for-dialogue-modeling/ 0 comments