Show simple item record

dc.contributor.authorDey, S
dc.contributor.authorRiba, P
dc.contributor.authorDutta, A
dc.contributor.authorLlados, J
dc.contributor.authorSong, Y-Z
dc.date.accessioned2019-10-31T10:00:35Z
dc.date.issued2020-01-09
dc.description.abstractIn this paper, we investigate the problem of zero-shot sketch-based image retrieval (ZS-SBIR), where human sketches are used as queries to conduct retrieval of photos from unseen categories. We importantly advance prior arts by proposing a novel ZS-SBIR scenario that represents a firm step forward in its practical application. The new setting uniquely recognizes two important yet often neglected challenges of practical ZS-SBIR, (i) the large domain gap between amateur sketch and photo, and (ii) the necessity for moving towards large-scale retrieval. We first contribute to the community a novel ZS-SBIR dataset, QuickDraw-Extended, that consists of 330,000 sketches and 204,000 photos spanning across 110 categories. Highly abstract amateur human sketches are purposefully sourced to maximize the domain gap, instead of ones included in existing datasets that can often be semi-photorealistic. We then formulate a ZS-SBIR framework to jointly model sketches and photos into a common embedding space. A novel strategy to mine the mutual information among domains is specifically engineered to alleviate the domain gap. External semantic knowledge is further embedded to aid semantic transfer. We show that, rather surprisingly, retrieval performance significantly outperforms that of state-of-the-art on existing datasets that can already be achieved using a reduced version of our model. We further demonstrate the superior performance of our full model by comparing with a number of alternatives on the newly proposed dataset. The new dataset, plus all training and testing code of our model, will be publicly released to facilitate future research.en_GB
dc.description.sponsorshipEuropean Unionen_GB
dc.description.sponsorshipCERCA Program/Generalitat de Catalunyaen_GB
dc.identifier.citationCVPR 2019: IEEE Conference on Computer Vision and Pattern Recognition, 16-20 June 2019, Long Beach, USAen_GB
dc.identifier.doi10.1109/CVPR.2019.00228
dc.identifier.grantnumber665919en_GB
dc.identifier.grantnumberFPU15/06264en_GB
dc.identifier.grantnumberTIN2015-70924-C2-2-Ren_GB
dc.identifier.urihttp://hdl.handle.net/10871/39424
dc.language.isoenen_GB
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)en_GB
dc.rights© 2020 IEEEen_GB
dc.subjectRecognition: Detection, Retrieval
dc.subjectDeep Learning
dc.subjectVision + Graphics
dc.subjectVision Applications and Systems
dc.titleDoodle to Search: Practical Zero-Shot Sketch-based Image Retrievalen_GB
dc.typeConference paperen_GB
dc.date.available2019-10-31T10:00:35Z
dc.identifier.issn1063-6919
exeter.place-of-publicationLong Beach, USAen_GB
dc.descriptionThis is the author accepted manuscript. The final version is available from IEEE via the DOI in this recorden_GB
dc.rights.urihttp://www.rioxx.net/licenses/all-rights-reserveden_GB
dcterms.dateAccepted2019-02-25
rioxxterms.versionAMen_GB
rioxxterms.licenseref.startdate2019-06-16
rioxxterms.typeConference Paper/Proceeding/Abstracten_GB
refterms.dateFCD2019-10-30T19:31:32Z
refterms.versionFCDAM
refterms.dateFOA2020-02-03T16:06:08Z
refterms.panelBen_GB


Files in this item

This item appears in the following Collection(s)

Show simple item record