Artistic Object Recognition by
Unsupervised Style Adaptation
Christopher Thomas and Adriana Kovashka
University of Pittsburgh

Published in ACCV 2018
Concept Figure
Abstract
Computer vision systems currently lack the ability to reliably recognize artistically rendered objects, especially when such data is limited. In this paper, we propose a method for recognizing objects in artistic modalities (such as paintings, cartoons, or sketches), without requiring any labeled data from those modalities. Our method explicitly accounts for stylistic domain shifts between and within domains. To do so, we introduce a complementary training modality constructed to be similar in artistic style to the target domain, and enforce that the network learns features that are invariant between the two training modalities. We show how such artificial labeled source domains can be generated automatically through the use of style transfer techniques, using diverse target images to represent the style in the target domain. Unlike existing methods which require a large amount of unlabeled target data, our method can work with as few as ten unlabeled images. We evaluate it on a number of cross-domain object and scene classification tasks and on a new dataset we release. Our experiments show that our approach, though conceptually simple, significantly improves the accuracy that existing domain adaptation techniques obtain for artistic object recognition.
Download our ACCV 2018 Paper
Additional Resources:
Download poster
Download supplementary material
Download dataset (1.2 GB)
Dataset Release
We publicly release a new artistic objects dataset as part of our work. We call this dataset CASPA: (Cartoons, Sketches, Paintings). To collect this dataset, we assembled a dataset of 5,047 cartoons by querying Google Image Search with each of the ten animal categories from Microsoft COCO: "bear", "bird", "cat", "cow", "dog", "elephant", "giraffe", "horse", "sheep", and "zebra". We also collected a new dataset of 1,391 paintings which cover eight of our ten categories (except "giraffe" and "zebra"), and annotated these images with 2,834 bounding boxes to crop out the objects of interest. We also include 12,008 sketches from the Sketchy Database. In total, the dataset contains 18,446 artistic images. Our dataset also contains approx. 65,000 cropped photos of animals from COCO. Our dataset is available for download by clicking here.
Acknowledgement of NSF Support
This material is based upon work supported by the National Science Foundation under NSF CISE Award No. 1566270. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.
National Science Foundation Logo