Hacker News
Linked pages
- [2501.12948] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning https://arxiv.org/abs/2501.12948 1061 comments
- https://openai.com/index/introducing-o3-and-o4-mini/ 497 comments
- https://openai.com/index/introducing-deep-research/ 425 comments
- Announcing the Agent2Agent Protocol (A2A) - Google Developers Blog https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/ 285 comments
- vLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention https://vllm.ai/ 42 comments
- A Little Bit of Reinforcement Learning from Human Feedback https://rlhfbook.com/ 37 comments
- Introduction - Model Context Protocol https://modelcontextprotocol.io/introduction 34 comments
- Tool support · Ollama Blog https://ollama.com/blog/tool-support 24 comments
- Ollama https://ollama.com/ 0 comments
Related searches:
Search whole site: site:timkellogg.me
Search title: Inner Loop Agents - Tim Kellogg