There is a server that stores a large collection of reference material (pdf, html, txt, doc, etc.). All of it needs to be indexed and searchable through a web interface. Are there ready-made solutions? If not, what is the easiest way to set up such a search engine?

  • Is this server accessible from outside? - Mikhail Vaysman
  • Of course. Otherwise there would be no point. - Ivan Stotsky
  • Then a simple option would be to set up Google or Yandex to index your site. - Mikhail Vaysman
  • The solution is needed on a corporate LAN. I would not want to expose this data to the world. - Ivan Stotsky
  • That is why I asked my first question. - Mikhail Vaysman

1 answer

I use the Apache Lucene search engine in my projects. Lucene is a free, high-performance full-text search library. It can be used for Internet search and for various computational-linguistics tasks. For example, Lucene is used as a component in the decentralized search engine YaCy (free software). Lucene itself is written in Java, and ports exist for other languages, including C++ (CLucene) and C# (Lucene.NET).
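To give a rough idea of what a full-text search library does internally, here is a minimal sketch of an inverted index in plain Python. It is not Lucene's API, only an illustration of the concept; the function names (`tokenize`, `build_index`, `search`) and the sample documents are made up for this example, and it assumes the text has already been extracted from the pdf/doc/html files.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"\w+", text.lower())

def build_index(docs):
    """Map each term to the set of document names that contain it."""
    index = defaultdict(set)
    for name, text in docs.items():
        for term in tokenize(text):
            index[term].add(name)
    return index

def search(index, query):
    """Return the documents that contain every term of the query (AND search)."""
    terms = tokenize(query)
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result

# Hypothetical documents standing in for text already extracted from files.
docs = {
    "manual.txt": "Installing the backup server on Linux",
    "faq.html": "Backup schedule and restore questions",
    "notes.doc": "Meeting notes about the new server room",
}
index = build_index(docs)
```

Calling `search(index, "backup server")` returns only the documents that contain both words. A real library adds ranking, stemming, and on-disk storage on top of this idea, which is why it is usually easier to use one than to write your own.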

The search engine is very fast and flexible: you can configure it for fuzzy matching of words and even phrases, and there are plenty of options for fine-tuning it to your needs.
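As a sketch of what fuzzy (inexact) word matching means, the snippet below maps a misspelled query term to the closest indexed terms using Python's standard `difflib` module. This is only an illustration and not how Lucene implements it; the helper name `fuzzy_terms`, the 0.75 similarity cutoff, and the sample vocabulary are assumptions chosen for the example.

```python
import difflib

def fuzzy_terms(query_term, vocabulary, cutoff=0.75):
    """Return indexed terms similar to query_term, best match first.

    cutoff is a similarity ratio between 0 and 1; the value 0.75 is an
    arbitrary choice for this sketch and would need tuning on real data.
    """
    return difflib.get_close_matches(
        query_term.lower(), vocabulary, n=5, cutoff=cutoff
    )

# Hypothetical list of terms that an index already contains.
vocabulary = ["backup", "server", "schedule", "restore", "linux"]
```

With this vocabulary, a query term with a typo such as `servr` is resolved to `server`, and the corrected term can then be looked up in the index as usual.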

Detailed examples here:

https://www.codeproject.com/Articles/320219/Lucene-Net-ultra-fast-search-for-MVC-or-WebForms

https://www.codeproject.com/Articles/609980/Small-Lucene-NET-Demo-App