- OpenAI Releases HealthBench: An Open-Source Benchmark for Measuring the Performance and Safety of Large Language Models in Healthcare https://www.marktechpost.com/2025/05/12/openai-releases-healthbench-an-open-source-benchmark-for-measuring-the-performance-and-safety-of-large-language-models-in-healthcare/ 0 comments machinelearningnews
Linked pages
- https://openai.com/index/healthbench/ 150 comments
- AGENTIC AI | AI Mini Conference https://minicon.marktechpost.com/ 5 comments
- AG-UI (Agent-User Interaction Protocol): An Open, Lightweight, Event-based Protocol that Standardizes How AI Agents Connect to Front-End Applications - MarkTechPost https://www.marktechpost.com/2025/05/12/ag-ui-agent-user-interaction-protocol-an-open-lightweight-event-based-protocol-that-standardizes-how-ai-agents-connect-to-front-end-applications/ 2 comments
- PrimeIntellect Releases INTELLECT-2: A 32B Reasoning Model Trained via Distributed Asynchronous Reinforcement Learning - MarkTechPost https://www.marktechpost.com/2025/05/12/primeintellect-releases-intellect-2-a-32b-reasoning-model-trained-via-distributed-asynchronous-reinforcement-learning/ 1 comment
- GitHub - openai/simple-evals https://github.com/openai/simple-evals 0 comments
- A Coding Implementation of Accelerating Active Learning Annotation with Adala and Google Gemini - MarkTechPost https://www.marktechpost.com/2025/05/10/a-coding-implementation-of-accelerating-active-learning-annotation-with-adala-and-google-gemini/ 0 comments
- LightOn AI Released GTE-ModernColBERT-v1: A Scalable Token-Level Semantic Search Model for Long-Document Retrieval and Benchmark-Leading Performance - MarkTechPost https://www.marktechpost.com/2025/05/11/lighton-ai-released-gte-moderncolbert-v1-a-scalable-token-level-semantic-search-model-for-long-document-retrieval-and-benchmark-leading-performance/ 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.