Author:

Li, Xuefen | Wang, Bo | Shi, Ge | Feng, Chong | Teng, Jiahao

Indexed by:

EI

Abstract:

Existing video-LLMs capture the overall description of a video well but struggle to understand temporal dynamics and to grasp localized content within the video at a fine-grained level. In this paper, we propose Time-Perception Enhanced Video Grounding via Boundary Perception and Temporal Reasoning, an approach aimed at mitigating LLMs' difficulties in understanding the discrepancies between video and text temporality. Specifically, to address the inherent biases in current datasets, we design a series of boundary-perception tasks that enable LLMs to capture video temporality accurately. To tackle LLMs' insufficient understanding of temporal information, we develop specialized tasks for boundary perception and temporal relationship reasoning that deepen LLMs' perception of video temporality. Our experimental results show significant improvements on three datasets, ActivityNet, Charades, and DiDeMo (up to 11.2% improvement on R@0.3), demonstrating the effectiveness of the proposed temporal awareness-enhanced data construction method. © 2025 Association for Computational Linguistics.
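
The abstract reports results on R@0.3 without defining it. In temporal video grounding this usually denotes the fraction of queries whose predicted segment overlaps the ground-truth segment with a temporal IoU of at least 0.3. The sketch below illustrates that common definition only; it is an assumption that the paper uses this exact form, and the names `temporal_iou` and `recall_at_iou` are hypothetical, not taken from the paper.

```python
from typing import List, Tuple

# A segment is (start_seconds, end_seconds); this representation is an assumption.
Segment = Tuple[float, float]


def temporal_iou(pred: Segment, gt: Segment) -> float:
    """Intersection-over-union of two time intervals."""
    intersection = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - intersection
    return intersection / union if union > 0 else 0.0


def recall_at_iou(preds: List[Segment], gts: List[Segment],
                  threshold: float = 0.3) -> float:
    """Fraction of queries whose prediction reaches the IoU threshold."""
    if len(preds) != len(gts):
        raise ValueError("preds and gts must have the same length")
    if not preds:
        return 0.0
    hits = sum(temporal_iou(p, g) >= threshold for p, g in zip(preds, gts))
    return hits / len(preds)


if __name__ == "__main__":
    # Two illustrative queries: the first overlaps enough, the second not at all.
    predictions = [(0.0, 10.0), (0.0, 1.0)]
    ground_truth = [(5.0, 15.0), (5.0, 10.0)]
    print(recall_at_iou(predictions, ground_truth, threshold=0.3))
```

Under this definition, a lower threshold such as 0.3 rewards roughly correct localization, while stricter thresholds (0.5, 0.7) require tighter boundaries.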

Keyword:

Computational linguistics | Video analysis

Author Community:

  • [ 1 ] [Li, Xuefen]Beijing University of Technology, China
  • [ 2 ] [Wang, Bo]Beijing Institute of Technology, China
  • [ 3 ] [Wang, Bo]Southeast Academy of Information Technology, Beijing Institute of Technology, China
  • [ 4 ] [Shi, Ge]Beijing University of Technology, China
  • [ 5 ] [Feng, Chong]Beijing Institute of Technology, China
  • [ 6 ] [Feng, Chong]Southeast Academy of Information Technology, Beijing Institute of Technology, China
  • [ 7 ] [Teng, Jiahao]Beijing University of Technology, China

Source:

ISSN: 2951-2093

Year: 2025

Volume: Part F206484-1

Page: 9804-9813

Language: English
