Show simple item record

dc.contributor.authorYan, M
dc.contributor.authorZhang, P
dc.contributor.authorZhang, H
dc.contributor.authorHao, R
dc.contributor.authorLiu, J
dc.contributor.authorWang, X
dc.contributor.authorLiu, L
dc.date.accessioned2025-01-06T15:00:20Z
dc.date.issued2024-12-19
dc.date.updated2024-12-26T09:50:07Z
dc.description.abstractThermal infrared (TIR) tracking is pivotal in computer vision tasks due to its all-weather imaging capability. Traditional tracking methods predominantly rely on hand-crafted features, and while deep learning has introduced correlation filtering techniques, these are often constrained by rudimentary correlation operations. Furthermore, transformer-based approaches tend to overlook temporal and coordinate information, which is critical for TIR tracking that lacks texture and color information. In this paper, to address these issues, we apply natural language modeling to TIR tracking and propose a coordinate-aware thermal infrared tracking model called NLMTrack, which enhances the utilization of coordinate and temporal information. NLMTrack applies an encoder that unifies feature extraction and feature fusion, which simplifies the TIR tracking pipeline. To address the challenge of low detail and low contrast in TIR images, on the one hand, we design a multi-level progressive fusion module that enhances the semantic representation and incorporates multi-scale features. On the other hand, the decoder combines the TIR features and the coordinate sequence features using a causal transformer to generate the target sequence step by step. Moreover, we explore an adaptive loss aimed at elevating tracking accuracy and a simple template update strategy to accommodate the target’s appearance variations. Experiments show that NLMTrack achieves state-of-the-art performance on multiple benchmarks.en_GB
dc.description.sponsorshipAeronautical Science Fund, Chinaen_GB
dc.description.sponsorshipNational Natural Science Foundation of Chinaen_GB
dc.identifier.citationVol. 267, article 126012en_GB
dc.identifier.doihttps://doi.org/10.1016/j.eswa.2024.126012
dc.identifier.grantnumber2024Z071080002en_GB
dc.identifier.grantnumber62075031en_GB
dc.identifier.urihttp://hdl.handle.net/10871/139496
dc.identifierORCID: 0000-0001-9332-2700 (Wang, Xiaoyang)
dc.language.isoenen_GB
dc.publisherElsevieren_GB
dc.relation.urlhttps://github.com/ELOESZHANG/NLMTracken_GB
dc.rights© 2024 The author(s). For the purpose of open access, the author has applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising from this submission.en_GB
dc.subjectThermal infrared object trackingen_GB
dc.subjectNatural language modelen_GB
dc.subjectTransformer trackingen_GB
dc.titleCoordinate-aware thermal infrared tracking via natural language modelingen_GB
dc.typeArticleen_GB
dc.date.available2025-01-06T15:00:20Z
dc.identifier.issn0957-4174
exeter.article-number126012
dc.descriptionThis is the author accepted manuscript. The final version is available from Elsevier via the DOI in this recorden_GB
dc.descriptionCode availability: The Code is publicly available at https://github.com/ELOESZHANG/NLMTracken_GB
dc.descriptionData availability: Data will be made available on request.en_GB
dc.identifier.journalExpert Systems with Applicationsen_GB
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_GB
dcterms.dateAccepted2024-11-30
dcterms.dateSubmitted2024-11-04
rioxxterms.versionAMen_GB
rioxxterms.licenseref.startdate2024-12-19
rioxxterms.typeJournal Article/Reviewen_GB
refterms.dateFCD2025-01-06T14:47:53Z
refterms.versionFCDAM
refterms.dateFOA2025-01-06T15:05:33Z
refterms.panelBen_GB
exeter.rights-retention-statementNo


Files in this item

This item appears in the following Collection(s)

Show simple item record

© 2024 The author(s). For the purpose of open access, the author has applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising from this submission.
Except where otherwise noted, this item's licence is described as © 2024 The author(s). For the purpose of open access, the author has applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising from this submission.