Latest AI Research
Stay ahead of the curve with our curated collection of the most impactful Artificial Intelligence research papers.
Consumer Attitudes Towards AI in Digital Health: A Mixed-Methods Survey in Australia
AI applications are increasingly being introduced into digital health. While technical performance has advanced rapidly, successful deployment depends largely on consumer attitudes, especially for patient-facing applications.
Iterative Multimodal Retrieval-Augmented Generation for Medical Question Answering
Medical retrieval-augmented generation (RAG) systems typically operate on text chunks extracted from biomedical literature, discarding the rich visual content (tables, figures, structured layouts) of original document pages. We propose MED-VRAG, an iterative multimodal RAG framework that retrieves and reasons over PMC document page images instead of OCR'd text.
Auditing Frontier Vision-Language Models for Trustworthy Medical VQA: Grounding Failures, Format Collapse, and Domain Adaptation
Deploying vision-language models (VLMs) in clinical settings demands auditable behavior under realistic failure conditions, yet the failure landscape of frontier VLMs on specialized medical inputs is poorly characterized. We audit five recent frontier and grounding-aware VLMs, including Gemini 2.
Knowledge Graph Representations for LLM-Based Policy Compliance Reasoning
The risks posed by AI features are increasing as they are rapidly integrated into software applications. In response, regulations and standards for safe and secure AI have been proposed.
Contextual Agentic Memory is a Memo, Not True Memory
Current agentic memory systems (vector stores, retrieval-augmented generation, scratchpads, and context-window management) do not implement memory: they implement lookup. We argue that treating lookup as memory is a category error with provable consequences for agent capability, long-term learning, and security.
Bridging Values and Behavior: A Hierarchical Framework for Proactive Embodied Agents
Current embodied agents are often limited to passive instruction-following or reactive need-satisfaction, lacking a stable, high-order value framework essential for long-term, self-directed behavior and resolving motivational conflicts. We introduce ValuePlanner, a hierarchical cognitive architecture that decouples high-level value scheduling from low-level action execution.
When Agents Evolve, Institutions Follow
Across millennia, complex societies have faced the same coordination problem of how to organize collective action among cognitively bounded and informationally incomplete individuals. Different civilizations developed different political institutions to answer the same basic questions of who proposes, who reviews, who executes, and how errors are corrected.
The TEA Nets framework combines AI and cognitive network science to model targets, events and actors in text
We introduce Target-Event-Agent Networks (TEA Nets) as a computational framework to extract subjects ("Agents"), verbs ("Events"), and objects ("Targets") from texts. Grounded in cognitive network science and artificial intelligence, TEA Nets are implemented as an open-source Python library.
Fairness for distribution network operations and planning
The incorporation of fairness into distribution network (DN) planning and operation has become a key goal of recent studies. The cost of implementing fairness, termed the price of fairness (PoF), captures the efficiency forgone in attaining social cohesion through fair outcomes.
From Context to Skills: Can Language Models Learn from Context Skillfully?
Many real-world tasks require language models (LMs) to reason over complex contexts that exceed their parametric knowledge. This calls for context learning, where LMs directly learn relevant knowledge from the given context.
Optimization before Evaluation: Evaluation with Unoptimised Prompts Can be Misleading
Current Large Language Model (LLM) evaluation frameworks utilize the same static prompt template across all models under evaluation. This differs from the common industry practice of using prompt optimization (PO) techniques to optimize the prompt for each model to maximize application performance.
Generative structure search for efficient and diverse discovery of molecular and crystal structures
Predicting stable and metastable structures is central to molecular and materials discovery, but remains limited by the cost of searching high-dimensional energy landscapes. Deep generative models offer efficient structure sampling, yet their outputs remain shaped by training data and can underexplore minima that are rare but physically relevant.
Political Bias Audits of LLMs Capture Sycophancy to the Inferred Auditor
Large language models (LLMs) are commonly evaluated for political bias based on their responses to fixed questionnaires, which typically place frontier models on the political left. A parallel literature shows that LLMs are sycophantic: they adapt their answers to the views, identities, and expectations of the user.
WaferSAGE: Large Language Model-Powered Wafer Defect Analysis via Synthetic Data Generation and Rubric-Guided Reinforcement Learning
We present WaferSAGE, a framework for wafer defect visual question answering using small vision-language models. To address data scarcity in semiconductor manufacturing, we propose a three-stage synthesis pipeline incorporating structured rubric generation for precise evaluation.
Math Education Digital Shadows for facilitating learning with LLMs: Math performance, anxiety and confidence in simulated students and AIs
To enhance LLMs' impact on math education, we need data on their mathematical prowess and biases across prompts. To fill this gap, we introduce MEDS (Math Education Digital Shadows), a dataset mapping how large language models reason about and report mathematics across human- and AI-like conditions.
Trace-Level Analysis of Information Contamination in Multi-Agent Systems
Reasoning over heterogeneous artifacts (PDFs, spreadsheets, slide decks, etc.) increasingly occurs within structured agent workflows that iteratively extract, transform, and reference external information.
SpatialGrammar: A Domain-Specific Language for LLM-Based 3D Indoor Scene Generation
Automatically generating interactive 3D indoor scenes from natural language is crucial for virtual reality, gaming, and embodied AI. However, existing LLM-based approaches often suffer from spatial errors and collisions, in part because common scene representations (raw coordinates or verbose code) make it difficult for models to reason about 3D spatial relationships and physical constraints.
In-Context Examples Suppress Scientific Knowledge Recall in LLMs
Scientific reasoning rarely stops at what is directly observable; it often requires uncovering hidden structure from data. From estimating reaction constants in chemistry to inferring demand elasticities in economics, this latent structure recovery is what distinguishes scientific reasoning from curve fitting.
Belief-Guided Inference Control for Large Language Model Services via Verifiable Observations
In black-box large language model (LLM) services, response reliability is often only partially observable at decision time, while stronger inference pathways incur substantial computational cost. This induces a budgeted sequential decision problem: for each request, the system must decide whether the default low-cost response is sufficiently reliable or whether additional computation should be allocated to improve response quality. In this paper, we propose Verifiable Observations for Risk-aware Inference Control (VEROIC), a framework for adaptive inference control in black-box LLM settings that formulates request-time control as a partially observable Markov decision process to capture partial observability and sequential budget coupling.
PRTS: A Primitive Reasoning and Tasking System via Contrastive Representations
Vision-Language-Action (VLA) models advance robotic control via strong visual-linguistic priors. However, existing VLAs predominantly frame pretraining as supervised behavior cloning, overlooking the fundamental nature of robot learning as a goal-reaching process that requires understanding temporal task progress.