Thursday, March 19, 2009

 

Apache Tika 0.3 released

Apache Tika, a subproject of Apache Lucene, is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

Chris Mattmann just announced that the release of version 0.3 is official.

Go grab yourself a copy from a mirror nearby. Tika is also available through the central maven repository.

There is also an article about Tika and Solr Cell at Lucid Imagination web site.

Labels: , , ,



Comments

Post a Comment

Subscribe to Post Comments [Atom]



<< Home

Navigation