**摘要**
Clinical practice is not the selection of an answer from enumerated options: a physician gathers heterogeneous information incrementally and commits to sequential, irreversible decisions under uncertainty. Static benchmarks cannot probe and existing interactive medical benchmarks each compromise on at least one of them. We present ClinEnv, an interactive benchmark that evaluates LLMs as attending
👤 作者: Yuxing Lu, Yushuhong Lin, Wenqi Shi, J. Ben Tamo, Xukai Zhao, Jinzhuo Wang, May Dongmei Wang

---
🔗 **[ClinEnv :代理商的交互式多阶段远景EHR环境](https://arxiv.org/abs/2606.02568v1)**

> ClinEnv: An Interactive Multi-Stage Long Horizon EHR Environment for Agents
🏷️ 来源: ArXiv cs.AI
⏱️ 2026-06-02 14:00