nutch
Here are 29 public repositories matching this topic...
Developed as part of an Information Retrieval coursework, this project showcases a search engine that efficiently indexes and retrieves information from a given dataset.
-
Updated
Aug 25, 2023 - Python
DataHarvest: Dockerized Web Crawling, Indexing, and Storage Solution
-
Updated
Jun 19, 2023 - Python
Link ranking with Apache Giraph for Apache Nutch
-
Updated
Apr 14, 2023 - Java
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
-
Updated
Mar 30, 2023 - Java
How to use Apache Nutch without command line
-
Updated
Nov 4, 2022 - Java
Rest Service for Spring/Solr backed search engine.
-
Updated
Aug 22, 2021 - Java
✨ 🧬 Apache Nutch Plugin for Viglet Turing Search
-
Updated
Aug 5, 2021 - Java
A simple web crawler inside a docker container using Apache Nutch 1 and Solr.
-
Updated
Jan 15, 2021 - Dockerfile
Simple crawler using apache nutch and elasticsearch
-
Updated
May 27, 2020 - Shell
Search engine knowledge systems(搜索引擎知识体系).
-
Updated
Feb 22, 2020
Apache Nutch system adapter for ORCA
-
Updated
Sep 19, 2019 - Java
Nutch 1.x Indexer Plugin that runs against ES6.7
-
Updated
Aug 12, 2019 - Java
A OCR Search Engine With Tesseract Nutch Solr And PHP
-
Updated
Jan 25, 2019 - JavaScript
Improve this page
Add a description, image, and links to the nutch topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the nutch topic, visit your repo's landing page and select "manage topics."