There is a new pdfbox lib version

You need to be logged in to post messages in the forums. New users may register here.

Ivo Lukac

Member since:
09 January 2008

Posts: 19

Friday 15 January 2010 6:05:16 am

0.8.0. http://pdfbox.apache.org/download.html

with a lot of bugfixes :)
Up

Paul Borgermans

Member since:
09 January 2008

Posts: 2

Saturday 16 January 2010 3:26:52 pm

Hey Ivo

yes, I know, I have an updated tika.jar including it locally since weeks (I'll commit this and more before the winter conf)

But unfortunately, it still does not solve the asian text extraction properly

So, in general use eztika for anything but pdf, and still use xpdf for pdf

Cheers
Paul

Solr, eZ Find expert consulting and training
http://twitter.com/paulborgermans

Up

You need to be logged in to post messages in the forums. New users may register here.