Efficient parallelised search engine based on virtual cluster

  • Che Lun Hung*
  • Chun Yuan Lin
  • *Corresponding author for this work
    • Providence University Taiwan

    Research output: Contribution to journal, Journal Article, peer-review

    4 Scopus citations

    Abstract

    A growing body of research indicates that personalised and parallelised search engines can provide users with fast and accurate information from the internet. Hadoop is a software framework for processing huge datasets of more than a petabyte in size. Virtualisation technology can fully utilise the resources of physical machines. In this paper, we construct a virtual cluster, built from multiple virtual machines and configured as a Hadoop cluster, to run multiple Nutch instances simultaneously. The experimental results show that the proposed virtual cluster architecture for Nutch can retrieve data rapidly and that the performance enhancement is proportional to the number of virtual machines.

    Original language: English
    Pages (from-to): 53-57
    Number of pages: 5
    Journal: International Journal of Computational Science and Engineering
    Volume: 12
    Issue number: 1
    State: Published - 2016

    Bibliographical note

    Publisher Copyright:
    © Copyright 2016 Inderscience Enterprises Ltd.

    Keywords

    • Hadoop
    • MapReduce
    • cloud computing
    • cluster
    • search engine
    • virtual machine
