re-isearch,Project re-Isearch,github

This is the respositiory for re-Isearch.

Project re-isearch:

a novel multimodal search and retrieval engine using mathematical models and algorithms different from the all-too-common inverted index (popularized by Salton in the 1960s). The design allows it to have, in practice, effectively no limits on the frequency of words, term length, number of fields or complexity of structured data and support even overlap--- where fields or structures cross other's boundaries (common examples are quotes, line/sentences, biblical verse, annotations). Its model enables a completely flexible unit of retrieval and modes of search.

Despite being a new project it has a long and esteemed history reaching back into the 1990s. Previous versions were widely adopted and used in hundreds of public search sites, including many high profile projects such as the U.S. Patent and Trademark Office (USPTO) patent search, the Federal Geographic Data Clearinghouse (FGDC), the NASA Global Change Master Directory, the NASA EOS Guide System, the NASA Catalog Interoperability Project, the astronomical pre-print service based at the Space Telescope Science Institute, The PCT Electronic Gazette at the World Intellectual Property Organization (WIPO), the Australian National Genomic Information Service (ANGIS), the SAGE Project of the Special Collections Department at Emory University, Eco Companion Australasia (an environmental geospatial resources catalog), European Space Organization, the Open Directory Project, numerous governmental portals and ...

Featues/Uses

Low-code ETL / "Any-to-Any" architecture
No need for a “middle layer” of content manipulation code. Instead of getting URLs from a search engine, fetching documents, parsing them, and navigating the DOMs to find required elements, it lets you simply request the elements you need and they are returned directly.
Handles a wide range of document formats (from Atom to XML) including “live” data.
Powerful Search (Structure, Objects, Spatial) / Relevancy Engine
NoSQL Datastore
Useful for Analytics
Useful for Recommendation / Autosuggestion
Embeddable in products (comparatively low resource demands)
Customization.
Support Peer-to-Peer and Federated architectures.
Runs on a wide range of hardware and operating systems
Freely available under a permissive software license.

Despite its wealth of features it has a comparatively small memory footprint (previous version have run on 32-bit machines with as little as 8 MB physical RAM) making it suitable for appliances. It has also been designed to try to impose a minimal computing impact on the host. Rather than run multiple threads and a high CPU workload it’s strategy is to be fast but not at the cost of other processes, heat or increased energy consumption.

This Repository

This is the main central repository for re-Isearch development.

It contains the engine as a freely available and completely open-source (and multiplatform) C++ library, bindings for other languages (such as Python) and some reference sample code using the library in some of these languages.

Under doctypes/ one can see the native doctypes supported.

Building, installing, developing

For information on building, installing, developing and using the system please consult the handbook in docs/.

A basic cheat-sheet is in INSTALLATION

In the directory bin/ and lib/ are binaries of standalone tools compiled on Ubuntu 18.04.2 LTS and targetting Intel Skylake or newer processors. They are included solely to enable fast software evaluations.

Copyrights, attributions and acknowledgements

Portions Copyright (c) 1995 CNIDR/MCNC, (c) 1995-2011 BSn/Munich; (c) 2011-2020 NONMONOTONIC Networks; Copyright (c) 2020-22 Edward C. Zimmermann and the re-iSearch project. Is is made available and licensed under the Apache 2.0 license: see LICENSE

The software has a lot of history (as one can see from the above copyright). For the historical last public release: Isearch

This project was funded through the NGI0 Discovery Fund, a fund established by NLnet with financial support from the European Commission's Next Generation Internet programme, under the aegis of DG Communications Networks, Content and Technology under grant agreement No 825322.

The extension of the engine to support IPFS and a number of additional document formats and data types was made possible by the Federal Ministry of Education and Research Germany under grant agreement No 01IS22S32.

re-isearch Goto Github PK

This is the respositiory for re-Isearch.

Project re-isearch:

This Repository

Building, installing, developing

Copyrights, attributions and acknowledgements

Project re-Isearch's Projects

cpp-libp2p

date

date-extension

isearch-1.14

re-isearch

test-data

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent