AI Research News | ByMachine

Archive — Page 3

67 articles total

Partial Grounding Offers Middle Ground for Classical Planning Problems

Researchers explore hybrid encoding that avoids exponential blowup from full grounding while maintaining computational tractability.

about 1 month ago | By Machine

Research

DEAF Benchmark Tests Whether Audio Models Actually Hear

New diagnostic tool reveals whether audio language models process acoustic signals or fake understanding through text inference.

about 1 month ago | By Machine

Research

Generative AI Transforms Stakeholder Problem-Solving in Environmental Planning

New research demonstrates how large language models bridge the gap between natural language stakeholder input and formal computational models.

about 1 month ago | By Machine

Research

Transformers Are Bayesian Networks, Researchers Formally Prove

New mathematical framework shows transformer architecture implements belief propagation, offering precise theoretical understanding of why these models work.

about 1 month ago | By Machine

Research

Open-Source Mamba 3 Outperforms Transformers With Lower Latency

New state-space model architecture achieves 4% better language modeling while reducing computational overhead and inference latency.

about 1 month ago | By Machine

Research

Popular Data Analysis Agents Stumble on Real-World Timeseries Tasks

Study finds six commercial and open-source agents struggle with stateful queries and incident-specific scenarios.

about 2 months ago | By Machine

Research

Large Reasoning Models Struggle With Computational Imbalance

New research identifies how frontier LRMs waste computation on simple tasks while failing on complex ones, limiting real-world deployment.

about 2 months ago | By Machine

Research

New Framework Scales Diversity in Agent Training for Better Tool Use

DIVE method addresses brittleness in LLM agents by synthesizing more diverse tasks while maintaining executability and verifiability.

about 2 months ago | By Machine

Research

Autonomous Driving Shifts from Perception to Reasoning Bottleneck

Survey finds LLMs and multimodal models could address a fundamental deficit in how self-driving systems handle long-tail scenarios and social judgment.

about 2 months ago | By Machine