Skip to content
#

apache

Here are 4,077 public repositories matching this topic...

The proposed system makes use of a crawler to gather information from every document on the website and store this information in the index. The index is a structured system of storing the unstructured data returned by the crawler. In this project, Nutch’s main component named ‘crawler’ is used for indexing and Solr is used for ‘searching’. The …

  • Updated Oct 13, 2018
  • JavaScript

Improve this page

Add a description, image, and links to the apache topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the apache topic, visit your repo's landing page and select "manage topics."

Learn more