A content-based image retrieval method based on the google cloud vision API and wordnet

  • Shih Hsin Chen
  • , Yi Hui Chen*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

29 Scopus citations

Abstract

Content-Based Image Retrieval (CBIR) method analyzes the content of an image and extracts the features to describe images, also called the image annotations (or called image labels). A machine learning (ML) algorithm is commonly used to get the annotations, but it is a time-consuming process. In addition, the semantic gap is another problem in image labeling. To overcome the first difficulty, Google Cloud Vision API is a solution because it can save much computational time. To resolve the second problem, a transformation method is defined for mapping the undefined terms by using the WordNet. In the experiments, a well-known dataset, Pascal VOC 2007, with 4952 testing figures is used and the Cloud Vision API on image labeling implemented by R language, called Cloud Vision API. At most ten labels of each image if the scores are over 50. Moreover, we compare the Cloud Vision API with well-known ML algorithms. This work found this API yield 42.4% mean average precision (mAP) among the 4,952 images. Our proposed approach is better than three well-known ML algorithms. Hence, this work could be extended to test other image datasets and as a benchmark method while evaluating the performances.

Original languageEnglish
Title of host publicationIntelligent Information and Database Systems - 9th Asian Conference, ACIIDS 2017, Proceedings
EditorsSatoshi Tojo, Le Minh Nguyen, Ngoc Thanh Nguyen, Bogdan Trawinski
PublisherSpringer Verlag
Pages651-662
Number of pages12
ISBN (Print)9783319544717
DOIs
StatePublished - 2017
Externally publishedYes
Event9th Asian Conference on Intelligent Information and Database Systems, ACIIDS 2017 - Kanazawa, Japan
Duration: 03 04 201705 04 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10191 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th Asian Conference on Intelligent Information and Database Systems, ACIIDS 2017
Country/TerritoryJapan
CityKanazawa
Period03/04/1705/04/17

Bibliographical note

Publisher Copyright:
© Springer International Publishing AG 2017.

Keywords

  • Content based image retrieval
  • Google cloud vision API
  • Image annotation
  • Pascal VOC 2007
  • WordNet

Fingerprint

Dive into the research topics of 'A content-based image retrieval method based on the google cloud vision API and wordnet'. Together they form a unique fingerprint.

Cite this