**Abstract**
Multi-shot video generation extends single-shot generation to coherent visual narratives, yet maintaining consistent characters, objects, and locations across shots remains a challenge over long sequences. Existing evaluations typically use independently generated prompt sets with limited entity coverage and simple consistency metrics, making standardized comparison difficult. We introduce EntityB
👤 Authors: Ruozhen He, Meng Wei, Ziyan Yang, Vicente Ordonez

---
🔗 **[EntityBench: Towards Entity-Consistent Long-Range Multi-Shot Video Generation](https://arxiv.org/abs/2605.15199v1)**

> EntityBench: Towards Entity-Consistent Long-Range Multi-Shot Video Generation
🏷️ Source: ArXiv cs.AI
⏱️ 2026-05-16 08:00