Hacker News
- Using linear algebra to convert a large code model https://gist.github.com/moyix/7896575befbe1b99162ccfec8d135566 3 comments
Linked pages
- GitHub Copilot · Your AI pair programmer · GitHub https://github.com/features/copilot 1062 comments
- GitHub - kingoflolz/mesh-transformer-jax: Model parallel transformers in JAX and Haiku https://github.com/kingoflolz/mesh-transformer-jax 146 comments
- [2101.00027] The Pile: An 800GB Dataset of Diverse Text for Language Modeling https://arxiv.org/abs/2101.00027 81 comments
- [2108.09293] Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions https://arxiv.org/abs/2108.09293 45 comments
- GitHub - salesforce/CodeGen: CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. https://github.com/salesforce/CodeGen 2 comments
- [2203.13474] CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis https://arxiv.org/abs/2203.13474 1 comment
- Rotary Embeddings: A Relative Revolution | EleutherAI Blog https://blog.eleuther.ai/rotary-embeddings/ 1 comment
- GitHub - NVIDIA/FasterTransformer: Transformer related optimization, including BERT, GPT https://github.com/NVIDIA/FasterTransformer/ 1 comment
- EleutherAI https://www.eleuther.ai/ 0 comments