Login
Arch Search Engine 1.41
-
License:
Freeware
-
Latest Version:
1.41
-
Editors' Review:
Not yet reviewed
-
Updated:
May 08, 2012
- Publisher:
-
Platform:
Windows, Linux
- Category:
- Subcategory:
-
File size:
13.4 Mb
-
Downloads:
141
Arch Search Engine Description
Arch Search Engine - An open source, high precision corporate search engine based on Apache Nutch
Arch is an open source extension of Apache Nutch (a popular, highly scalable general purpose search engine) for intranet search. Not happy with your corporate search engine? Not surprising, very few people are. To the best of our knowledge, there are no intranet engines that work as well as the Google's global Web search does. There is a fundamental reason for this: the algorithms used by Google on the global Web (or similar) do not work nearly as well on intranets for the lack of statistical data. Arch (finally!) solves this problem. It uses a novel method to deliver high precision search results that works great. Don't believe it? Blind test evaluation tools are included. You can deploy Arch and compare its performance to your current search engine and/or Google (on the public part of your site) using a blind test methodology.
In addition to the excellent search quality, Arch has many features critical for corporate environments:
- Document level security. Users can find only documents that they are authorized to see.
- Inexpensive index updates. Arch is able to keep indexes up to date and avoid regular complete site recrawling.
- 24/7 availabilty. There is always a working index available, even if a crawl fails.
- Support for simultaneous indexing and search of multiple web sites, with ability to search and administer any site separately, if needed. Dynamic adding and removal of web sites is easy.
- An automatically generated site directory.
- Low cost support once deployed.
- Double interface (PHP and Java) for easy deployment and customization. Use the one that better matches your skills.
- An extensive and extensible set of parsers for parsing a variety of file formats: HTML, PHP, PDF, MS Office, Open Office, etc.
- A modular, plugin-based architecture that can be easily customized and extended.
- The source code is included.
- High performance and scalability. Arch can run on computer clusters to index very large data sets.
Arch is an open source extension of Apache Nutch (a popular, highly scalable general purpose search engine) for intranet search. Not happy with your corporate search engine? Not surprising, very few people are. To the best of our knowledge, there are no intranet engines that work as well as the Google's global Web search does. There is a fundamental reason for this: the algorithms used by Google on the global Web (or similar) do not work nearly as well on intranets for the lack of statistical data. Arch (finally!) solves this problem. It uses a novel method to deliver high precision search results that works great. Don't believe it? Blind test evaluation tools are included. You can deploy Arch and compare its performance to your current search engine and/or Google (on the public part of your site) using a blind test methodology.
In addition to the excellent search quality, Arch has many features critical for corporate environments:
- Document level security. Users can find only documents that they are authorized to see.
- Inexpensive index updates. Arch is able to keep indexes up to date and avoid regular complete site recrawling.
- 24/7 availabilty. There is always a working index available, even if a crawl fails.
- Support for simultaneous indexing and search of multiple web sites, with ability to search and administer any site separately, if needed. Dynamic adding and removal of web sites is easy.
- An automatically generated site directory.
- Low cost support once deployed.
- Double interface (PHP and Java) for easy deployment and customization. Use the one that better matches your skills.
- An extensive and extensible set of parsers for parsing a variety of file formats: HTML, PHP, PDF, MS Office, Open Office, etc.
- A modular, plugin-based architecture that can be easily customized and extended.
- The source code is included.
- High performance and scalability. Arch can run on computer clusters to index very large data sets.
Arch Search Engine 1.41 is licensed as Freeware for the Windows, Linux operating system / platform. Arch Search Engine is provided as a free download for all software users (Freeware).
Arch Search Engine User Reviews (0)
No reviews yet, be the first to add a review and we'll give you some extra points.
Arch Search Engine Related Searches
Arch Search Engine Download Notice
Arch Search Engine is periodically updated by FileCluster but you may encounter situations when the software informations are slightly out-of-date, the producers of Arch Search Engine can modify the product without notifying us. Arch Search Engine 1.41 is currently the last updated version of the software. All rights for Arch Search Engine are belong to the developer, CSIRO Astronomy and Space Science.
Any form of support or software problems regarding Arch Search Engine will be addressd to its developers. Please be aware that we do NOT provide Arch Search Engine cracks, serial numbers, registration codes or any forms of pirated software downloads.
Any form of support or software problems regarding Arch Search Engine will be addressd to its developers. Please be aware that we do NOT provide Arch Search Engine cracks, serial numbers, registration codes or any forms of pirated software downloads.
Arch Search Engine Related Software
Free Tcp Port Scanner 1.5.0
Free Tcp Port Scanner is the software that helps to find TCP opened ports.
Free Tcp Port Scanner is the software that helps to find TCP opened ports.
0 / 134


