Buscar
Social
Ofertas laborales ES
« Struts 1.1 | Main | JRoots application framework »
lunes
jun302003

Alternativa a Lucene: Java Search Engine

Java Search Engine se presenta como una alternativa bastante interesante a Jakarta Lucene. Bajo estas líneas podéis encontrar la lista de características extraida directamente de su página web:





  • Full featured text search engine software for web sites


  • Supports any operating system, completely in Java


  • Includes web crawler


  • It is FREE


  • Complete solution, no additional software required


  • Available as WAR (Web ARchive), servlet, JSP for Tomcat, Resin or other JSP engine


  • Simple installation using web interface


  • Available as EJB (Enterprise JavaBean) on J2EE Application Server


  • HTML, PDF and plain text indexing


  • Available as Java API library


  • Supports incremental update, "hot update" and "hot rescan" (without stopping search), delete pages


  • International encodings support, multiply encodings in one storage, automatic detection


  • Stopwords and word stemming (suffixes stripping) for every configured language


  • Using file system or database (JDBC) for index storage


  • Supports META tags (description and keywords), image ALT tags, BASIC Authentication and forms crawling


  • Can store several separated sites in one index, even multilingual


  • Can group pages and limit search results to some group


  • Customizable page rank with options to boost on word position, number of appearances or by URL


  • Google style output, quotations with highlighted words, or you can use META tags to customize description


  • Google query language, includes AND, OR, "-", phrase, substrings and all possible combinations


  • Can transform results directly into XML, or HTML using XSLT




Quizás los aspectos más interesantes sean que soporta extracción de PDF, HTML y TXT sin tener que crear ningún tipo de plugin, que está disponible como un EJB y que puede pasar los resultados a XML.



¿ Qué os parece ? ¿ superior a lucene ?

Reader Comments

There are no comments for this journal entry. To create a new comment, use the form below.
Comentarios deshabilitados
Comentarios deshabilitados en esta noticia.