Linking pages
- DeepSeek AI Unveils DeepSeek-V3-0324: Blazing Fast Performance on Mac Studio, Heating Up the Competition with OpenAI - MarkTechPost https://www.marktechpost.com/2025/03/25/deepseek-ai-unveils-deepseek-v3-0324-blazing-fast-performance-on-mac-studio-heating-up-the-competition-with-openai/ 6 comments
- UCLA Researchers Released OpenVLThinker-7B: A Reinforcement Learning Driven Model for Enhancing Complex Visual Reasoning and Step-by-Step Problem Solving in Multimodal Systems - MarkTechPost https://www.marktechpost.com/2025/03/28/ucla-researchers-released-openvlthinker-7b-a-reinforcement-learning-driven-model-for-enhancing-complex-visual-reasoning-and-step-by-step-problem-solving-in-multimodal-systems/ 4 comments
- Google AI Released Gemini 2.5 Pro Experimental: An Advanced AI Model that Excels in Reasoning, Coding, and Multimodal Capabilities - MarkTechPost https://www.marktechpost.com/2025/03/25/google-ai-released-gemini-2-5-pro-experimental-an-advanced-ai-model-that-excels-in-reasoning-coding-and-multimodal-capabilities/ 3 comments
- Fin-R1: A Specialized Large Language Model for Financial Reasoning and Decision-Making - MarkTechPost https://www.marktechpost.com/2025/03/22/fin-r1-a-specialized-large-language-model-for-financial-reasoning-and-decision-making/ 2 comments
- Google DeepMind Researchers Propose CaMeL: A Robust Defense that Creates a Protective System Layer around the LLM, Securing It even when Underlying Models may be Susceptible to Attacks - MarkTechPost https://www.marktechpost.com/2025/03/26/google-deepmind-researchers-propose-camel-a-robust-defense-that-creates-a-protective-system-layer-around-the-llm-securing-it-even-when-underlying-models-may-be-susceptible-to-attacks/ 2 comments
- Meet ReSearch: A Novel AI Framework that Trains LLMs to Reason with Search via Reinforcement Learning without Using Any Supervised Data on Reasoning Steps - MarkTechPost https://www.marktechpost.com/2025/03/31/meet-research-a-novel-ai-framework-that-trains-llms-to-reason-with-search-via-reinforcement-learning-without-using-any-supervised-data-on-reasoning-steps/ 2 comments
- Sea AI Lab Researchers Introduce Dr. GRPO: A Bias-Free Reinforcement Learning Method that Enhances Math Reasoning Accuracy in Large Language Models Without Inflating Responses - MarkTechPost https://www.marktechpost.com/2025/03/22/sea-ai-lab-researchers-introduce-dr-grpo-a-bias-free-reinforcement-learning-method-that-enhances-math-reasoning-accuracy-in-large-language-models-without-inflating-responses/ 1 comment
- Qwen Releases the Qwen2.5-VL-32B-Instruct: A 32B Parameter VLM that Surpasses Qwen2.5-VL-72B and Other Models like GPT-4o Mini - MarkTechPost https://www.marktechpost.com/2025/03/24/qwen-releases-the-qwen2-5-vl-32b-instruct-a-32b-parameter-vlm-that-surpasses-qwen2-5-vl-72b-and-other-models-like-gpt-4o-mini/ 1 comment
- NVIDIA AI Researchers Introduce FFN Fusion: A Novel Optimization Technique that Demonstrates How Sequential Computation in Large Language Models LLMs can be Effectively Parallelized - MarkTechPost https://www.marktechpost.com/2025/03/29/nvidia-ai-researchers-introduce-ffn-fusion-a-novel-optimization-technique-that-demonstrates-how-sequential-computation-in-large-language-models-llms-can-be-effectively-parallelized/ 1 comment
- Microsoft AI Releases RD-Agent: An AI-Driven Tool for Performing R&D with LLM-based Agents - MarkTechPost https://www.marktechpost.com/2025/03/22/microsoft-ai-releases-rd-agent-an-ai-driven-tool-for-performing-rd-with-llm-based-agents/ 0 comments
- Meta AI Researchers Introduced SWEET-RL and CollaborativeAgentBench: A Step-Wise Reinforcement Learning Framework to Train Multi-Turn Language Agents for Realistic Human-AI Collaboration Tasks - MarkTechPost https://www.marktechpost.com/2025/03/22/meta-ai-researchers-introduced-sweet-rl-and-collaborativeagentbench-a-step-wise-reinforcement-learning-framework-to-train-multi-turn-language-agents-for-realistic-human-ai-collaboration-tasks/ 0 comments
- Meet LocAgent: Graph-Based AI Agents Transforming Code Localization for Scalable Software Maintenance - MarkTechPost https://www.marktechpost.com/2025/03/23/meet-locagent-graph-based-ai-agents-transforming-code-localization-for-scalable-software-maintenance/ 0 comments
- TxAgent: An AI Agent that Delivers Evidence-Grounded Treatment Recommendations by Combining Multi-Step Reasoning with Real-Time Biomedical Tool Integration - MarkTechPost https://www.marktechpost.com/2025/03/23/txagent-an-ai-agent-that-delivers-evidence-grounded-treatment-recommendations-by-combining-multi-step-reasoning-with-real-time-biomedical-tool-integration/ 0 comments
- Meet Open Deep Search (ODS): A Plug-and-Play Framework Democratizing Search with Open-source Reasoning Agents - MarkTechPost https://www.marktechpost.com/2025/03/27/meet-open-deep-search-ods-a-plug-and-play-framework-democratizing-search-with-open-source-reasoning-agents/ 0 comments
- Google AI Released TxGemma: A Series of 2B, 9B, and 27B LLM for Multiple Therapeutic Tasks for Drug Development Fine-Tunable with Transformers - MarkTechPost https://www.marktechpost.com/2025/03/27/google-ai-released-txgemma-a-series-of-2b-9b-and-27b-llm-for-multiple-therapeutic-tasks-for-drug-development-fine-tunable-with-transformers/ 0 comments
- PilotANN: A Hybrid CPU-GPU System For Graph-based ANNS - MarkTechPost https://www.marktechpost.com/2025/03/30/pilotann-a-hybrid-cpu-gpu-system-for-graph-based-anns/ 0 comments
- Meta AI Proposes Multi-Token Attention (MTA): A New Attention Method which Allows LLMs to Condition their Attention Weights on Multiple Query and Key Vectors - MarkTechPost https://www.marktechpost.com/2025/04/01/meta-ai-proposes-multi-token-attention-mta-a-new-attention-method-which-allows-llms-to-condition-their-attention-weights-on-multiple-query-and-key-vectors/ 0 comments
- Open AI Releases PaperBench: A Challenging Benchmark for Assessing AI Agents’ Abilities to Replicate Cutting-Edge Machine Learning Research - MarkTechPost https://www.marktechpost.com/2025/04/02/open-ai-releases-paperbench-a-challenging-benchmark-for-assessing-ai-agents-abilities-to-replicate-cutting-edge-machine-learning-research/ 0 comments
- Salesforce AI Introduce BingoGuard: An LLM-based Moderation System Designed to Predict both Binary Safety Labels and Severity Levels - MarkTechPost https://www.marktechpost.com/2025/04/02/salesforce-ai-introduce-bingoguard-an-llm-based-moderation-system-designed-to-predict-both-binary-safety-labels-and-severity-levels/ 0 comments